Question Consider the points: , , , , , , , . a) Compute the distance matrix using Euclidean distance measure. b) Identify the clusters that could be formed using
Question Term frequency matrix for the five articles (A1 to A5) is shown below. Answer the following questions: 1) What is the TF-IDF value for (A4, Corona)? 2)
Question In National Games Championship, a high jump competition is being conducted along with other athletic events. Only n out of m athletes were able to meet the eligibility criteria
Question An organization has million documents in its repository. A document X has term ‘Mining’ occurring 4 times and term ‘Discovery’ occurring for 5 times. Other words occur less frequently.
Question Mike is trying to get into a Medical college for Post-graduation in India. Before applying for any college/university, he needs to take an exam for that particular college/university. Therefore,
Question Consider the distance matrix for data objects. The outlier score of an object is the inverse of density around an object. The density of an object is equal to
Question An FMCG Company training set has 100 records for T (tooth paste) & 400 records for competitor C. P, Q, R denote subsets of attribute values in records which