Unit6 - Subjective Questions

CSE274 • Practice Questions with Detailed Answers

1

Explain the fundamental role of Unsupervised Learning and distinguish it from Supervised Learning.

2

Mathematically define Euclidean Distance and Manhattan Distance. In which scenarios is Manhattan distance preferred over Euclidean?

3

What is Cosine Distance? Derive the formula for Cosine Similarity and explain why it is commonly used in text mining.

4

Describe the K-Means Clustering algorithm step-by-step. What is the objective function it tries to minimize?

5

Explain the 'Elbow Method' for determining the optimal number of clusters () in K-Means.

6

Compare K-Means and K-Medoids (PAM). Why is K-Medoids considered more robust to outliers?

7

Discuss Hierarchical Clustering. Distinguish between Agglomerative and Divisive approaches.

8

Explain the different Linkage Criteria used in Hierarchical Clustering: Single, Complete, Average, and Ward's Linkage.

9

Describe the DBSCAN algorithm. Define the concepts of Core Points, Border Points, and Noise Points.

10

What are the primary advantages of Density-Based Clustering (DBSCAN) over Partitioning methods like K-Means?

11

Explain the concept of Anomaly Detection in the context of Unsupervised Learning.

12

Define Inertia as an evaluation metric for clustering. What are its limitations?

13

Explain the Silhouette Score. How is it calculated and how do you interpret its value?

14

What is the Davies–Bouldin Index? How does it differ from the Silhouette Score in terms of optimization?

15

Discuss the 'Curse of Dimensionality' and its impact on choosing distance metrics for clustering.

16

Compare Partitioning (K-Means), Hierarchical, and Density-based (DBSCAN) clustering methods.

17

What is a Dendrogram? How is it used to determine the number of clusters in Hierarchical Clustering?

18

Why is scaling or normalization of features critical before applying distance-based clustering algorithms?

19

Explain the concept of 'Centroid' in K-Means versus 'Medoid' in K-Medoids.

20

Describe the main challenges associated with Unsupervised Learning compared to Supervised Learning.