Unit6 - Subjective Questions

INT396 • Practice Questions with Detailed Answers

1

Distinguish between Internal and External Clustering Validation Metrics. Provide examples of each.

2

Explain the Silhouette Score. How is it calculated, and how should its values be interpreted?

3

Describe the intuition behind Cohesion and Separation in the context of clustering evaluation.

4

What is Stability-Based Evaluation in unsupervised learning? Explain its general methodology.

5

Discuss the Interpretability Challenges specifically associated with Unsupervised Learning.

6

Analyze the Ethical Considerations involved in Pattern Discovery using unsupervised learning.

7

Present a case study on using Unsupervised Learning for Customer Segmentation.

8

How can visualizing handwritten digits (MNIST) aid in evaluating Unsupervised Learning models? Discuss specific techniques.

9

Compare and contrast the Davies-Bouldin Index and the Dunn Index.

10

Explain the Adjusted Rand Index (ARI) and its significance as an external clustering metric.

11

Define Normalized Mutual Information (NMI). How is it used to evaluate clustering?

12

Describe the Calinski-Harabasz Index (Variance Ratio Criterion).

13

How can the optimal number of clusters be determined using the Silhouette Score in a practical scenario?

14

Explain the concept of 'Curse of Dimensionality' and its impact on distance-based internal clustering validation metrics.

15

Detail a real-world case study of unsupervised learning in anomaly detection.

16

Discuss how domain knowledge plays a crucial role in evaluating customer segmentation models.

17

What is the Purity metric in external clustering validation? Provide its formula and limitations.

18

Explain the concept of 'Data Leakage' in the context of evaluating unsupervised learning models.

19

Describe a scenario where K-Means would yield a high Davies-Bouldin index (poor score) but the clustering is visually correct.

20

Summarize the end-to-end process of addressing Interpretability and Ethical challenges when deploying an unsupervised learning model in healthcare.