1

Distinguish between Supervised, Unsupervised, and Semi-Supervised Learning with appropriate examples.

2

Explain the mathematical problem formulation for Unsupervised Learning.

3

Describe how Unsupervised Learning is applied in Market Segmentation.

4

How is Unsupervised Learning utilized in Anomaly and Fraud Detection?

5

Discuss the role of Unsupervised Learning in discovering patterns in Biological and Social Data.

6

Define Euclidean Distance. Provide its mathematical formula and explain when it is best used.

7

Define Manhattan Distance and explain how it differs from Euclidean Distance.

8

What is Cosine Similarity? Provide its mathematical derivation and explain its significance in Natural Language Processing (NLP).

9

Why does the choice of distance metric matter in Unsupervised Learning?

10

Compare and contrast Customer Behavior Analysis and Market Segmentation using Unsupervised Learning.

11

Explain the concept of Semi-Supervised Learning and how it bridges the gap between Supervised and Unsupervised Learning.

12

What is the 'Curse of Dimensionality', and how does it affect distance calculations in Unsupervised Learning?

13

How does feature scaling impact the choice of Euclidean distance? Justify your answer.

14

Derive the relationship between Euclidean Distance and Cosine Similarity for normalized vectors.

15

A retail company wants to group its stores based on their physical locations (latitude and longitude). Which distance metric should they use and why?

16

Describe a scenario where Manhattan Distance would be strictly better than Euclidean Distance.

17

How would you formulate a Recommender System as an Unsupervised Learning problem using Cosine Similarity?

18

Why is fraud detection considered a difficult Unsupervised Learning problem? Highlight the major challenges.

19

Compare Unsupervised Learning to Supervised Learning regarding input data, expected output, and evaluation metrics.

20

Summarize the overarching goals of Unsupervised Learning and provide a mapping of real-world use cases to specific unsupervised techniques (Clustering, Anomaly Detection, Dimensionality Reduction).

Unit1 - Subjective Questions