Unit2 - Subjective Questions

CSE274 • Practice Questions with Detailed Answers

1

What is the Variance Threshold method in feature selection, and why is it used?

2

Explain the concept of Correlation-based Feature Selection. How does handling multicollinearity improve model performance?

3

Distinguish between Forward Selection and Backward Elimination methods.

4

How is Tree-based Feature Importance calculated? Give an example using Random Forest.

5

Define Feature Extraction. How does it differ from Feature Selection?

6

What are Aggregation Features? Provide examples of how they are created from transactional data.

7

Explain the Curse of Dimensionality and its impact on Machine Learning models.

8

Describe the step-by-step mathematical algorithm for Principal Component Analysis (PCA).

9

What is the Explained Variance Ratio in PCA, and how is it used to select the number of components?

10

Compare and contrast Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA).

11

Derive the mathematical criterion (Fisher's Criterion) used in Linear Discriminant Analysis (LDA).

12

Discuss the strategies for creating new features from existing continuous and categorical variables.

13

What is the role of the Covariance Matrix in Dimensionality Reduction?

14

Explain the concept of Wrapper Methods in feature selection. What are their advantages and disadvantages?

15

Why is Dimensionality Reduction considered necessary before applying algorithms like K-Nearest Neighbors (KNN)?

16

What are the limitations of Principal Component Analysis (PCA)?

17

Explain Embedded Methods for feature selection with an example.

18

How does Recursive Feature Elimination (RFE) work?

19

What is the distinction between Univariate and Multivariate Feature Selection?

20

In the context of LDA, explain the terms Within-class Scatter Matrix () and Between-class Scatter Matrix ().