Unit 2 - Subjective Questions
CSE274 • Practice Questions with Detailed Answers
What is the Variance Threshold method in feature selection, and why is it used?
Variance Threshold is a simple, unsupervised baseline approach to feature selection. It works on the principle that features with low variance likely contain little information needed for a model to distinguish between instances.
Key Characteristics:
- Method: It removes all features whose variance doesn't meet some threshold ($t$). By default, it removes all zero-variance features (features that have the same value in all samples).
- Formula: For Boolean features (Bernoulli random variables), variance is calculated as $\mathrm{Var}[X] = p(1 - p)$, where $p$ is the probability of the feature taking the value 1.
- Usage: It is usually the first step in a feature selection pipeline to eliminate constant or quasi-constant features before applying more complex algorithms.
- Limitation: It does not consider the relationship between features and the target variable.
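A minimal NumPy sketch of the idea (toy data; the threshold value is illustrative):

```python
import numpy as np

# Toy matrix: 4 samples x 3 features; the last feature is constant (zero variance).
X = np.array([[0.0, 2.0, 1.0],
              [1.0, 4.0, 1.0],
              [0.0, 6.0, 1.0],
              [1.0, 8.0, 1.0]])

threshold = 0.0                   # default behaviour: drop only zero-variance features
variances = X.var(axis=0)         # per-feature variance
keep = variances > threshold      # mask of features to retain
X_reduced = X[:, keep]

print(X_reduced.shape)  # (4, 2) — the constant column is gone
```

scikit-learn provides this directly as `sklearn.feature_selection.VarianceThreshold`.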
Explain the concept of Correlation-based Feature Selection. How does handling multicollinearity improve model performance?
Correlation-based Feature Selection involves evaluating the relationship between features (independent variables) and the target (dependent variable), as well as the relationships among the features themselves.
Process:
- Feature-Target Correlation: Select features that have a high correlation with the target variable, as they are strong predictors.
- Feature-Feature Correlation (Multicollinearity): Identify pairs of features that are highly correlated with each other (e.g., Pearson coefficient > 0.8).
Handling Multicollinearity:
- If two features convey almost the same information, one should be removed.
- Benefits: Removing collinear features reduces the complexity of the model, prevents overfitting, and ensures linear models (like Linear Regression) remain stable and interpretable. It reduces the redundancy in the dataset.
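A short pandas/NumPy sketch of the feature-feature step (synthetic data; the 0.8 cutoff follows the text):

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 200
age = rng.normal(40, 10, n)
salary = age * 1000 + rng.normal(0, 500, n)   # nearly collinear with age
debt = rng.normal(0, 1, n)                    # independent of both
df = pd.DataFrame({"age": age, "salary": salary, "debt": debt})

corr = df.corr().abs()
# Keep only the upper triangle so each pair is examined once
upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
to_drop = [col for col in upper.columns if (upper[col] > 0.8).any()]
print(to_drop)  # ['salary'] — one feature of the collinear pair is removed
```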
Distinguish between Forward Selection and Backward Elimination methods.
Both are wrapper methods for feature selection, but they traverse the feature space in opposite directions.
Forward Selection:
- Start: Starts with an empty model (no features).
- Process: Iteratively adds the feature that best improves the model performance.
- Stop: Stops when adding a new feature does not improve performance significantly.
- Pros: Computationally cheaper if the optimal subset is small.
Backward Elimination:
- Start: Starts with a model containing all available features.
- Process: Iteratively removes the least significant feature (the one whose removal hurts performance the least or improves it).
- Stop: Stops when removing a feature significantly degrades performance.
- Pros: Can capture interacting features better than forward selection.
- Cons: Computationally expensive for high-dimensional data.
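The forward-selection loop can be sketched with plain NumPy (synthetic data; the stopping tolerance 1e-3 is an assumed hyperparameter):

```python
import numpy as np

def r2(X, y):
    # Fit ordinary least squares (with intercept) and return R^2 on the same data.
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1 - resid.var() / y.var()

rng = np.random.default_rng(1)
n = 300
X = rng.normal(size=(n, 4))
y = 3 * X[:, 0] + 1.5 * X[:, 2] + rng.normal(scale=0.1, size=n)  # only 0 and 2 matter

selected, remaining = [], list(range(X.shape[1]))
best_score = -np.inf
while remaining:
    scores = {j: r2(X[:, selected + [j]], y) for j in remaining}
    j_best = max(scores, key=scores.get)
    if scores[j_best] - best_score < 1e-3:   # stop: no meaningful improvement
        break
    best_score = scores[j_best]
    selected.append(j_best)
    remaining.remove(j_best)

print(sorted(selected))  # [0, 2]
```

Backward elimination is the mirror image: start with all four features and repeatedly drop the one whose removal hurts the score least.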
How is Tree-based Feature Importance calculated? Give an example using Random Forest.
Tree-based algorithms (like Decision Trees, Random Forests, and Gradient Boosting) provide an intrinsic method to rank features based on how well they improve the purity of the node.
Mechanism:
- Gini Importance (or Mean Decrease in Impurity): Every time a node is split on variable $X_j$, the impurity criterion (Gini or Entropy) for the two descendant nodes is lower than that of the parent node. Summing the weighted impurity decreases over all nodes where $X_j$ is used, averaged over all trees in the forest, gives a fast measure of feature importance.
- Permutation Importance: Alternatively, values of a feature are randomly shuffled. If the feature is important, the model's error will increase significantly after shuffling.
Example: In a Random Forest attempting to predict housing prices, the feature 'Square Footage' might appear near the root of many trees, leading to large impurity decreases, thus receiving a high importance score.
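Permutation importance (the second mechanism above) can be sketched without any tree library — here a least-squares model stands in for the trained model:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 500
X = rng.normal(size=(n, 3))          # columns: e.g. [sqft, age, noise]
y = 5 * X[:, 0] + 1 * X[:, 1] + rng.normal(scale=0.5, size=n)

# Fit a simple least-squares model as the "trained model".
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
base_mse = np.mean((y - X @ coef) ** 2)

importance = []
for j in range(X.shape[1]):
    Xp = X.copy()
    Xp[:, j] = rng.permutation(Xp[:, j])     # break feature j's link to y
    mse = np.mean((y - Xp @ coef) ** 2)
    importance.append(mse - base_mse)        # error increase = importance

ranking = np.argsort(importance)[::-1]
print(ranking)  # the strong predictor (feature 0) ranks first
```

With a Random Forest, the same idea applies: shuffle one column, re-score the forest, and rank features by the resulting error increase.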
Define Feature Extraction. How does it differ from Feature Selection?
Feature Extraction and Feature Selection are both dimensionality reduction techniques, but they approach the problem differently.
Feature Selection:
- Definition: Selecting a subset of the original features without changing them.
- Example: Choosing 'Age' and 'Salary' from a dataset of 10 columns and ignoring the rest.
- Advantage: Preserves the physical meaning of original variables.
Feature Extraction:
- Definition: Transforming the data from a high-dimensional space to a lower-dimensional space. The new features are linear or non-linear combinations of the original features.
- Example: Principal Component Analysis (PCA) creating 'Principal Component 1' which is a mix of 'Age', 'Salary', and 'Debt'.
- Advantage: Often compresses information better than selection, but the new features lose interpretability.
What are Aggregation Features? Provide examples of how they are created from transactional data.
Aggregation Features are new features created by summarizing historical or granular data, often used when transforming one-to-many relationships (like a user having multiple transactions) into a single row per entity.
Creation Process:
Data is grouped by a unique identifier (e.g., CustomerID) and statistical operations are applied to other columns.
Examples:
- Count: Total number of transactions per user.
- Sum/Mean: Total amount spent or Average transaction value.
- Min/Max: Minimum or Maximum purchase value.
- Variance: Variability in transaction amounts.
- Time-based: Days since the last transaction.
These features capture user behavior patterns that raw transactional rows cannot represent directly in standard ML models.
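A typical pandas `groupby` construction (toy transactions; column names are illustrative):

```python
import pandas as pd

tx = pd.DataFrame({
    "customer_id": [1, 1, 1, 2, 2],
    "amount":      [10.0, 30.0, 20.0, 5.0, 15.0],
})

# Collapse the one-to-many transaction table into one row per customer
features = tx.groupby("customer_id")["amount"].agg(
    tx_count="count", tx_sum="sum", tx_mean="mean", tx_max="max"
).reset_index()

print(features)
```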
Explain the Curse of Dimensionality and its impact on Machine Learning models.
The Curse of Dimensionality refers to various phenomena that arise when analyzing and organizing data in high-dimensional spaces (often with hundreds or thousands of features).
Key Impacts:
- Data Sparsity: As dimensions increase, the volume of the space increases exponentially. The available data becomes sparse, making it difficult to find reliable patterns.
- Distance Concentration: In high dimensions, the distance between the nearest and farthest data points becomes negligible. Distance-based algorithms (like KNN or K-Means) fail because "everyone is far from everyone else".
- Overfitting: With more features than observations, models can easily learn noise rather than the signal, leading to poor generalization.
- Computational Cost: Training time and storage requirements increase significantly.
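The distance-concentration effect can be observed directly (uniform random points; the dimensions are chosen for illustration):

```python
import numpy as np

rng = np.random.default_rng(8)
n = 1000
contrasts = {}
for d in (2, 10, 1000):
    X = rng.uniform(size=(n, d))          # n random points in the unit hypercube
    q = rng.uniform(size=d)               # a random query point
    dist = np.linalg.norm(X - q, axis=1)
    # Relative gap between the farthest and nearest point shrinks as d grows
    contrasts[d] = (dist.max() - dist.min()) / dist.min()

print({d: round(c, 2) for d, c in contrasts.items()})
```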
Describe the step-by-step mathematical algorithm for Principal Component Analysis (PCA).
PCA is a linear transformation technique that yields the axes of maximum variance.
Steps:
- Standardization: Scale the data matrix $X$ (dimensions $n \times d$) so each feature has a mean of 0 and variance of 1.
- Covariance Matrix Computation: Calculate the covariance matrix $\Sigma$ to understand how variables vary together.
- Eigen Decomposition: Compute the eigenvectors ($v_i$) and eigenvalues ($\lambda_i$) of the covariance matrix.
- Eigenvectors represent the directions (Principal Components).
- Eigenvalues represent the magnitude of variance in those directions.
- Sort and Select: Sort eigenvalues in descending order. Choose the top $k$ eigenvectors corresponding to the largest eigenvalues to form a projection matrix $W$.
- Projection: Transform the original samples onto the new subspace: $Z = XW$.
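The steps above map directly onto NumPy (random correlated data for illustration):

```python
import numpy as np

rng = np.random.default_rng(3)
# Generate correlated 3-D data via a linear mixing matrix
X = rng.normal(size=(200, 3)) @ np.array([[3, 0, 0], [1, 1, 0], [0, 0, 0.1]])

# 1. Standardize
Xs = (X - X.mean(axis=0)) / X.std(axis=0)
# 2. Covariance matrix (features x features)
C = np.cov(Xs, rowvar=False)
# 3. Eigen decomposition (eigh: C is symmetric)
eigvals, eigvecs = np.linalg.eigh(C)
# 4. Sort descending, keep top k
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]
k = 2
W = eigvecs[:, :k]            # projection matrix (3 x 2)
# 5. Project
Z = Xs @ W

print(Z.shape)  # (200, 2)
```

Note that the projected components are decorrelated: the off-diagonal entries of the covariance of `Z` are (numerically) zero.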
What is the Explained Variance Ratio in PCA, and how is it used to select the number of components?
Explained Variance Ratio indicates the proportion of the dataset's variance that lies along the axis of each principal component.
Calculation:
If $\lambda_i$ is the eigenvalue for the $i$-th component, the explained variance ratio is $\lambda_i / \sum_{j=1}^{d} \lambda_j$.
Selection of Components ($k$):
- Cumulative Variance: Plot the cumulative sum of explained variance ratios.
- Threshold: Select $k$ such that the cumulative variance reaches a desired threshold (e.g., 95% or 99%).
- Scree Plot: Plot eigenvalues against component numbers. Look for the "elbow point" where the drop in variance levels off, indicating that subsequent components add little information.
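A small numeric example (hypothetical eigenvalues; the 95% threshold follows the text):

```python
import numpy as np

eigvals = np.array([4.0, 2.5, 1.0, 0.3, 0.2])   # hypothetical sorted eigenvalues
ratio = eigvals / eigvals.sum()                 # explained variance ratio per component
cumulative = np.cumsum(ratio)

k = int(np.searchsorted(cumulative, 0.95) + 1)  # smallest k reaching 95%
print(cumulative.round(3))
print(k)  # 4
```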
Compare and contrast Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA).
Both are linear transformation techniques used for dimensionality reduction, but they have different objectives.
| Feature | PCA (Principal Component Analysis) | LDA (Linear Discriminant Analysis) |
|---|---|---|
| Type | Unsupervised (ignores class labels). | Supervised (uses class labels). |
| Goal | Maximize the variance of the data. | Maximize the separation between multiple classes. |
| Focus | Preserves the global structure of data. | Preserves the discriminatory information. |
| Axes | Finds directions of maximum spread. | Finds directions that maximize the ratio of between-class variance to within-class variance. |
| Usage | General dimensionality reduction, noise reduction. | Pre-processing for classification tasks. |
Summary: PCA is about spread; LDA is about separation.
Derive the mathematical criterion (Fisher's Criterion) used in Linear Discriminant Analysis (LDA).
LDA seeks a projection vector $w$ that maximizes class separability.
1. Scatter Matrices:
- Within-class Scatter ($S_W$): Measures the spread of data within the same class.
- Between-class Scatter ($S_B$): Measures the separation between class means ($\mu_i$) and the global mean ($\mu$).
2. Fisher's Criterion:
We want to maximize the distance between the projected means (captured by $w^\top S_B w$) and minimize the projected within-class variance ($w^\top S_W w$). The objective function is:
$$J(w) = \frac{w^\top S_B w}{w^\top S_W w}$$
3. Solution:
To maximize $J(w)$, we solve the generalized eigenvalue problem:
$$S_B w = \lambda S_W w \quad\Longleftrightarrow\quad S_W^{-1} S_B w = \lambda w$$
The eigenvectors corresponding to the largest eigenvalues of $S_W^{-1} S_B$ form the new axes.
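A NumPy sketch of the two-class case (synthetic Gaussians; the class means are chosen for illustration):

```python
import numpy as np

rng = np.random.default_rng(4)
# Two 2-D Gaussian classes with different means
X0 = rng.normal([0, 0], 1.0, size=(100, 2))
X1 = rng.normal([4, 4], 1.0, size=(100, 2))
X = np.vstack([X0, X1])
mu0, mu1, mu = X0.mean(0), X1.mean(0), X.mean(0)

# Within-class scatter: sum of per-class scatter matrices
Sw = (X0 - mu0).T @ (X0 - mu0) + (X1 - mu1).T @ (X1 - mu1)
# Between-class scatter: class means around the global mean
Sb = 100 * np.outer(mu0 - mu, mu0 - mu) + 100 * np.outer(mu1 - mu, mu1 - mu)

# Generalized eigenproblem: Sw^{-1} Sb w = lambda w
eigvals, eigvecs = np.linalg.eig(np.linalg.inv(Sw) @ Sb)
w = eigvecs[:, np.argmax(eigvals.real)].real    # top discriminant direction

# The projected class means are well separated along w
sep = abs((mu1 - mu0) @ w)
print(sep)
```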
Discuss the strategies for creating new features from existing continuous and categorical variables.
Creating new features (Feature Construction) can significantly boost model power.
Strategies:
- Polynomial Features: Creating interaction terms ($x_1 x_2$) or power terms ($x^2$, $x^3$) to capture non-linear relationships in linear models.
- Binning/Discretization: Converting continuous variables (e.g., Age) into bins (e.g., Child, Adult, Senior) to handle outliers and non-linearities.
- Domain-Specific Ratios: Combining features based on logic. E.g., in real estate: Price per Square Foot = Price / Area.
- Date/Time Decomposition: Extracting Day, Month, Year, DayOfWeek, or Hour from a timestamp.
- One-Hot/Target Encoding: Converting categorical variables into numerical formats suitable for algorithms.
- Log/Box-Cox Transformations: Applying mathematical functions to normalize skewed distributions.
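Several of these strategies in one pandas sketch (all column names and bin edges are illustrative):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "age":   [8, 35, 70],
    "price": [100.0, 250.0, 900.0],
    "sqft":  [400.0, 500.0, 1000.0],
    "ts":    pd.to_datetime(["2024-01-15", "2024-06-01", "2024-12-31"]),
})

df["age_sq"] = df["age"] ** 2                                  # polynomial term
df["age_bin"] = pd.cut(df["age"], [0, 18, 60, 120],
                       labels=["child", "adult", "senior"])    # binning
df["price_per_sqft"] = df["price"] / df["sqft"]                # domain ratio
df["month"] = df["ts"].dt.month                                # date decomposition
df["log_price"] = np.log1p(df["price"])                        # skew correction

print(df[["age_bin", "price_per_sqft", "month"]])
```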
What is the role of the Covariance Matrix in Dimensionality Reduction?
The Covariance Matrix is central to techniques like PCA.
Role:
- Capturing Relationships: It is a square symmetric matrix ($d \times d$) that expresses how every variable covaries with every other variable.
- Diagonal elements: Variance of individual features.
- Off-diagonal elements: Covariance between two different features.
- Geometry of Data: The covariance matrix defines the shape and orientation of the data cloud. Positive covariance indicates variables move together; negative indicates they move inversely.
- Basis for Transformation: In PCA, diagonalizing the covariance matrix (finding eigenvectors) rotates the axes to align with the directions where the data varies the most (decorrelating the features).
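A quick NumPy illustration (synthetic variables):

```python
import numpy as np

rng = np.random.default_rng(9)
x = rng.normal(size=500)
y = 2 * x + rng.normal(scale=0.5, size=500)   # moves with x (positive covariance)
z = rng.normal(size=500)                      # independent of both

C = np.cov(np.stack([x, y, z]))   # 3 x 3 symmetric matrix; rows are variables
print(C.round(2))
# Diagonal: variances; off-diagonal: covariances (C[0,1] is large, C[0,2] near 0).
```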
Explain the concept of Wrapper Methods in feature selection. What are their advantages and disadvantages?
Wrapper Methods evaluate a subset of features by actually training a machine learning model and measuring its performance (e.g., accuracy, RMSE).
Mechanism:
It treats feature selection as a search problem. Examples include Forward Selection, Backward Elimination, and Recursive Feature Elimination (RFE).
Advantages:
- Interaction: Considers the interaction between features.
- Accuracy: Usually results in the best performing feature subset for that specific model because it optimizes the specific metric.
Disadvantages:
- Computationally Expensive: Requires training the model multiple times (exponential complexity in worst case).
- Overfitting: High risk of overfitting to the training data, especially if the dataset is small.
Why is Dimensionality Reduction considered necessary before applying algorithms like K-Nearest Neighbors (KNN)?
Dimensionality reduction is critical for KNN due to the distance-based nature of the algorithm.
- Distance Calculation: KNN relies on calculating Euclidean (or Manhattan) distances between points.
- Curse of Dimensionality: As dimensions increase, points spread out. The distance between the nearest neighbor and the farthest neighbor converges, making the concept of "nearest" meaningless.
- Noise Reduction: High-dimensional data often contains irrelevant features (noise) that distort distance calculations.
- Computational Efficiency: Calculating distances in $\mathbb{R}^d$ for large $d$ is significantly slower than in a reduced space $\mathbb{R}^k$ with $k \ll d$.
Reducing dimensions compacts the signal and makes distance metrics more robust.
What are the limitations of Principal Component Analysis (PCA)?
While powerful, PCA has specific limitations:
- Linearity Assumption: PCA assumes that the principal components are linear combinations of original features. It fails to unfold non-linear manifolds (e.g., the Swiss Roll dataset). Techniques like t-SNE or Kernel PCA are needed for non-linear data.
- Variance Reliance: It assumes that high variance implies high information (signal) and low variance implies noise. This isn't always true; low variance features might be critical for class separation.
- Orthogonality: PCA forces the new features to be orthogonal (perpendicular). The underlying natural factors might not be strictly orthogonal.
- Interpretability: The resulting Principal Components are mathematical abstractions (e.g., $PC_1 = a_1 x_1 + a_2 x_2 + \dots$) which are hard to interpret in domain terms compared to original features.
Explain Embedded Methods for feature selection with an example.
Embedded Methods perform feature selection during the model training process itself. They combine the qualities of Filter and Wrapper methods.
Characteristics:
- The feature selection is built into the algorithm's objective function.
- They are more efficient than wrappers and more accurate than filters.
Example: LASSO Regression (L1 Regularization)
- LASSO adds a penalty term proportional to the absolute value of the coefficients: $\text{Loss} = \sum_i (y_i - \hat{y}_i)^2 + \alpha \sum_j |w_j|$.
- Selection Mechanism: This penalty forces the coefficients of less important features to shrink to exactly zero.
- Features with non-zero coefficients after training are the "selected" features.
- Other examples include Decision Trees and Random Forests (which select features at node splits).
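The selection mechanism can be sketched with a tiny coordinate-descent LASSO in NumPy (synthetic data; α = 0.3 is an assumed penalty strength):

```python
import numpy as np

def lasso_cd(X, y, alpha, n_iter=200):
    """Minimal LASSO via coordinate descent (features assumed standardized)."""
    n, p = X.shape
    w = np.zeros(p)
    for _ in range(n_iter):
        for j in range(p):
            r = y - X @ w + X[:, j] * w[j]        # partial residual excluding j
            rho = X[:, j] @ r / n
            z = (X[:, j] ** 2).sum() / n
            # Soft-thresholding: small coefficients are driven to exactly zero
            w[j] = np.sign(rho) * max(abs(rho) - alpha, 0) / z
    return w

rng = np.random.default_rng(5)
n = 300
X = rng.normal(size=(n, 5))
y = 4 * X[:, 0] + 2 * X[:, 1] + rng.normal(scale=0.5, size=n)  # features 2-4 irrelevant

w = lasso_cd(X, y, alpha=0.3)
selected = np.nonzero(np.abs(w) > 1e-8)[0]
print(selected)   # only the truly informative features survive
```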
How does Recursive Feature Elimination (RFE) work?
Recursive Feature Elimination (RFE) is a popular greedy optimization algorithm (a type of backward elimination wrapper).
Algorithm Steps:
- Train: Train the model on the full set of features.
- Rank: Compute feature importance (e.g., coefficients in linear regression or feature importance in trees).
- Prune: Identify the least important feature(s) and remove them from the set.
- Repeat: Re-train the model on the remaining features.
- Stop: Continue until the desired number of features remains.
RFE is effective because it repeatedly re-evaluates feature strength in the context of the current subset, capturing dependencies.
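A compact NumPy sketch of the loop, using |coefficient| of a least-squares fit as the importance score (synthetic data):

```python
import numpy as np

rng = np.random.default_rng(6)
n = 400
X = rng.normal(size=(n, 5))
y = 3 * X[:, 0] - 2 * X[:, 3] + rng.normal(scale=0.3, size=n)  # only 0 and 3 matter

features = list(range(X.shape[1]))
n_keep = 2
while len(features) > n_keep:
    coef, *_ = np.linalg.lstsq(X[:, features], y, rcond=None)   # 1. train
    rank = np.abs(coef)                                         # 2. rank by |coefficient|
    weakest = features[int(np.argmin(rank))]                    # 3. least important feature
    features.remove(weakest)                                    # 4. prune, then repeat

print(sorted(features))  # [0, 3]
```

scikit-learn packages this pattern as `sklearn.feature_selection.RFE`.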
What is the distinction between Univariate and Multivariate Feature Selection?
Univariate Feature Selection:
- Approach: Evaluates each feature individually against the target variable.
- Methods: Chi-Square test, ANOVA F-value, Mutual Information.
- Pros: Fast, scalable.
- Cons: Ignores dependencies between features. A feature might be useless on its own but powerful when combined with another.
Multivariate Feature Selection:
- Approach: Evaluates subsets of features together.
- Methods: RFE, Forward Selection, LASSO.
- Pros: Captures feature interactions and redundancies.
- Cons: Computationally intensive and prone to overfitting on small datasets.
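A univariate scoring sketch in NumPy, using squared Pearson correlation as the per-feature score (synthetic data; k = 2 is an assumed number of features to keep):

```python
import numpy as np

rng = np.random.default_rng(7)
n = 500
X = rng.normal(size=(n, 4))
y = 2 * X[:, 1] + rng.normal(scale=0.5, size=n)   # only feature 1 is informative

# Univariate score: each feature is judged against the target in isolation
scores = np.array([np.corrcoef(X[:, j], y)[0, 1] ** 2 for j in range(X.shape[1])])
k = 2
top_k = np.argsort(scores)[::-1][:k]

print(top_k)   # feature 1 ranks first
```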
In the context of LDA, explain the terms Within-class Scatter Matrix ($S_W$) and Between-class Scatter Matrix ($S_B$).
LDA relies on projecting data to a space where classes are well-separated. This is mathematically defined by two matrices:
1. Within-class Scatter Matrix ($S_W$):
- Definition: Represents the scatter (variance) of samples around their respective class means.
- Goal: We want to minimize this. We want samples of Class A to be tightly clustered together.
- Formula: Sum of the scatter matrices of the individual classes: $S_W = \sum_i \sum_{x \in C_i} (x - \mu_i)(x - \mu_i)^\top$.
2. Between-class Scatter Matrix ($S_B$):
- Definition: Represents the scatter of the class means around the overall global mean of the data: $S_B = \sum_i n_i (\mu_i - \mu)(\mu_i - \mu)^\top$.
- Goal: We want to maximize this. We want the center of Class A to be as far as possible from the center of Class B.
The optimal projection in LDA maximizes the ratio $\frac{w^\top S_B w}{w^\top S_W w}$.