1

Define Association Rule Mining (ARM) and explain its primary objective in unsupervised learning.

2

Explain the metrics 'Support' and 'Confidence' in the context of Association Rule Mining. Provide their mathematical formulas.

3

What is the 'Lift' metric in Association Rule Mining? How is it calculated, and how do you interpret its values?

4

Define the 'Conviction' metric in Association Rule Mining. Provide its mathematical formula and explain its significance compared to Confidence.

5

State and explain the Anti-monotone property (Apriori principle). How does it help in reducing computational complexity?

6

Describe the step-by-step working of the Apriori Algorithm for finding frequent itemsets.

7

What is the FP-Growth algorithm? How does the construction of the FP-Tree address the limitations of the Apriori algorithm?

8

Explain the process of mining frequent itemsets from an FP-Tree using the FP-Growth algorithm.

9

What is Market Basket Analysis? Provide two concrete examples of how businesses utilize this technique.

10

Distinguish between Anomaly Detection and Novelty Detection.

11

Explain the core principle behind the Isolation Forest algorithm. Why is it highly efficient for high-dimensional datasets?

12

Detail the calculation and interpretation of the Anomaly Score in the Isolation Forest algorithm.

13

What is Local Outlier Factor (LOF)? Explain the concept of 'Local' in this algorithm.

14

Explain the terms 'Reachability Distance' and 'Local Reachability Density' in the context of the LOF algorithm.

15

Compare Isolation Forest and Local Outlier Factor (LOF). In what scenarios would you choose one over the other?

16

Describe how unsupervised anomaly detection techniques are applied in Fraud Detection. Provide a specific use case.

17

How is Anomaly Detection utilized in Cybersecurity? Explain with the context of Network Intrusion Detection Systems (NIDS).

18

Discuss the trade-off involved when setting the minimum support and minimum confidence thresholds in Association Rule Mining.

19

What are the limitations of Association Rule Mining in real-world business applications?

20

Calculate and interpret Support, Confidence, and Lift given the following scenario: A store has 1,000 transactions. 200 transactions contain Coffee, 100 contain Sugar, and 50 contain both.

Unit5 - Subjective Questions