Viva Questions
INT234
The primary goal of predictive analytics is to assess what is likely to happen in the future based on historical data. It provides a probability score for individual entities to inform or influence organizational processes.
Diagnostic Analytics asks "Why did it happen?" using drill-down techniques and data discovery to understand past events. Predictive Analytics asks "What is likely to happen?" using forecasting and statistical modeling to anticipate future outcomes.
The six phases are: 1. Business Understanding, 2. Data Understanding, 3. Data Preparation, 4. Modeling, 5. Evaluation, and 6. Deployment.
In traditional programming, rules are explicitly coded (e.g., "If X > 5 then Y"). In Machine Learning, algorithms identify patterns in data to create their own rules without being explicitly programmed for specific tasks.
Supervised learning uses a labeled dataset containing both input features and the correct output/answer. Unsupervised learning uses an unlabeled dataset where no correct answer is provided.
Classification is used when the output variable is categorical (e.g., Spam/Not Spam, Yes/No). Regression is used when the output variable is continuous or numerical (e.g., price, temperature).
List-wise deletion involves dropping the entire row if data is missing, which risks losing valuable information. Imputation involves filling the missing data with estimated values, such as the mean, median, or mode.
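For illustration only, a minimal pandas sketch contrasting the two approaches (the toy DataFrame and column names are invented for the example):

```python
import pandas as pd
import numpy as np

# Toy frame with missing values; columns are illustrative.
df = pd.DataFrame({"age": [25, 30, np.nan, 40], "salary": [50, 60, 65, np.nan]})

dropped = df.dropna()                             # list-wise deletion: whole rows removed
imputed = df.fillna(df.mean(numeric_only=True))   # mean imputation: gaps filled with column means

print(dropped)
print(imputed)
```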
The formula is X_scaled = (X - X_min) / (X_max - X_min). It scales features to a fixed range [0, 1] and is useful when algorithms use distance measures (like k-NN) and features have vastly different scales.
Machine learning models require numerical input. One-Hot Encoding converts categorical labels into binary columns (0s and 1s) to prevent the model from misinterpreting the order or magnitude of simple integer labels.
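A small pandas sketch of both preprocessing steps, with made-up values, assuming the min-max formula above:

```python
import pandas as pd

df = pd.DataFrame({"income": [20_000, 50_000, 120_000], "city": ["Delhi", "Pune", "Delhi"]})

# Min-Max scaling: (X - X_min) / (X_max - X_min) maps income into [0, 1].
df["income_scaled"] = (df["income"] - df["income"].min()) / (df["income"].max() - df["income"].min())

# One-Hot Encoding: the categorical column becomes binary indicator columns.
encoded = pd.get_dummies(df, columns=["city"])
print(encoded)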
Splitting data allows us to train the model on one subset and evaluate it on a separate, unseen subset (the test set). This helps assess how well the model generalizes to new data and checks for overfitting.
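A minimal sketch of such a split using scikit-learn (the Iris dataset and the 80/20 ratio are just example choices):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
# Hold out 20% of the rows as an unseen test set; fit the model on the rest.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
print(X_train.shape, X_test.shape)
```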
The objective of OLS is to find the line of best fit by minimizing the Sum of Squared Errors (SSE), which is the sum of the squared vertical differences between the observed values and the predicted values.
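A worked NumPy sketch of the closed-form OLS solution for simple linear regression (the data points are made up):

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 4.3, 5.9, 8.2, 9.8])

# Closed-form OLS for one predictor: slope = cov(x, y) / var(x)
b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()

residuals = y - (b0 + b1 * x)
sse = np.sum(residuals ** 2)   # the quantity OLS minimizes
print(b0, b1, sse)
```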
An r (correlation coefficient) value of -1 indicates a perfect negative linear relationship, meaning as one variable increases, the other decreases in exact proportion.
Multicollinearity occurs when independent variables are highly correlated with each other. It makes it difficult to determine the individual effect of each variable on the dependent variable and can lead to unstable coefficient estimates.
Logistic Regression is a classification algorithm used to predict discrete outcomes (like Yes/No or 0/1) by estimating the probability of an event occurring.
The Sigmoid function maps any real-valued number from a linear equation to a value between 0 and 1. This output represents a probability, which is then used to classify the result based on a threshold.
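A tiny NumPy sketch of the Sigmoid mapping and the usual 0.5 threshold (the z values stand in for outputs of the linear part):

```python
import numpy as np

def sigmoid(z):
    # Maps any real number to (0, 1), interpreted as a probability.
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-3.0, 0.0, 2.5])        # example outputs of w.x + b
probs = sigmoid(z)
labels = (probs >= 0.5).astype(int)   # classify with a 0.5 threshold
print(probs, labels)
```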
Polynomial Regression should be used when the relationship between the independent and dependent variables is non-linear (curved), as a straight line would result in underfitting.
MAE (Mean Absolute Error) takes the average of absolute errors and is robust to outliers. MSE (Mean Squared Error) squares the errors, which heavily penalizes large errors/outliers but makes the unit different from the target variable.
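A short NumPy comparison on invented numbers, where one large error dominates MSE but not MAE:

```python
import numpy as np

y_true = np.array([10.0, 12.0, 11.0, 50.0])   # last point behaves like an outlier
y_pred = np.array([11.0, 11.0, 12.0, 20.0])

mae = np.mean(np.abs(y_true - y_pred))     # linear penalty on each error
mse = np.mean((y_true - y_pred) ** 2)      # squared penalty: the large error dominates
print(mae, mse)
```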
An R² score of 1 indicates a perfect fit, meaning the model explains 100% of the variance in the dependent variable.
Homoscedasticity assumes that the variance of the residual terms (errors) is constant at every level of the independent variable (X). If the variance changes (e.g., errors get larger as X increases), it is called Heteroscedasticity.
It is called lazy learning because it does not generate a model during a training phase. It simply stores the training data and waits until a query is made to perform calculations and classification.
If 'k' is too small, the model becomes highly sensitive to noise and outliers, leading to high variance and overfitting.
The algorithm assumes that the presence of a particular feature in a class is completely independent of the presence of any other feature, which is rarely true in real-world data.
Entropy is a measure of randomness or disorder in a dataset. A value of 0 indicates a perfectly homogeneous node (all same class), while a value of 1 indicates maximum disorder (equally divided classes). The algorithm splits nodes to minimize entropy.
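A minimal NumPy sketch of Shannon entropy on two invented binary label sets, matching the 0 and 1 extremes described above:

```python
import numpy as np

def entropy(labels):
    # Shannon entropy in bits: 0 for a pure node, 1 for a 50/50 binary split.
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(entropy([1, 1, 1, 1]))   # 0.0 -> perfectly homogeneous node
print(entropy([1, 1, 0, 0]))   # 1.0 -> maximum disorder
```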
Support Vectors are the data points closest to the hyperplane. They are critical because they define the position and orientation of the decision boundary; removing other points doesn't change the model, but moving support vectors does.
The Kernel Trick maps input data into a higher-dimensional space where a linear separator can be found. It is used to enable SVM to classify data that is not linearly separable in its original dimension.
A False Positive represents a "False Alarm," where the model predicts the positive class, but the actual value is negative (e.g., predicting a healthy person has a disease).
Recall should be prioritized when the cost of False Negatives is high, such as in cancer detection, where missing a positive case is more dangerous than a false alarm.
The F1 Score is the harmonic mean of Precision and Recall. It is most useful when the class distribution is uneven (imbalanced datasets) and you need a balance between precision and recall.
An AUC of 0.5 indicates that the model has no discrimination capacity and is essentially performing random guessing.
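A small scikit-learn sketch tying together the confusion matrix, Precision, Recall, F1 and AUC answers above (the labels and scores are invented toy data):

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score, f1_score, roc_auc_score

y_true  = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred  = [1, 0, 0, 1, 0, 1, 1, 0]                   # hard class predictions
y_score = [0.9, 0.2, 0.4, 0.8, 0.1, 0.6, 0.7, 0.3]   # predicted probabilities

print(confusion_matrix(y_true, y_pred))   # rows: actual, columns: predicted
print(precision_score(y_true, y_pred))    # penalised by False Positives
print(recall_score(y_true, y_pred))       # penalised by False Negatives
print(f1_score(y_true, y_pred))           # harmonic mean of Precision and Recall
print(roc_auc_score(y_true, y_score))     # 0.5 = random guessing, 1.0 = perfect ranking
```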
The goal is to explore unlabeled data to discover hidden structures, patterns, or groupings (clusters) without any prior training on what the output should look like.
A Centroid is the center point of a cluster. In K-Means, it represents the mean position of all the data points assigned to that specific cluster.
The Elbow Method plots the WCSS against the number of clusters. The optimal k is found at the "elbow" of the curve, where the rate of decrease in variance shifts sharply, indicating diminishing returns for adding more clusters.
It occurs when poor initial placement of centroids leads the algorithm to converge at a local minimum rather than the global minimum. It is solved using K-Means++, which initializes centroids to be far apart.
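A rough scikit-learn sketch of the Elbow Method using K-Means++ initialization (the blob dataset and the range of k values are just example choices; WCSS is exposed as inertia_):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=4, random_state=42)

# WCSS for k = 1..8; plotting these against k reveals the "elbow".
wcss = []
for k in range(1, 9):
    km = KMeans(n_clusters=k, init="k-means++", n_init=10, random_state=42).fit(X)
    wcss.append(km.inertia_)
print(wcss)
```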
A Dendrogram is a tree-like diagram used in hierarchical clustering to visualize the arrangement of clusters and the sequence of merges or splits.
Agglomerative is a "bottom-up" approach starting with individual points as clusters and merging them. Divisive is a "top-down" approach starting with one giant cluster and splitting it recursively.
It is a technique used by retailers to identify items that are frequently purchased together (e.g., Bread and Butter) to optimize store layout or cross-sell products.
Support indicates how frequently an itemset appears in the dataset. It is calculated as the number of transactions containing the item divided by the total number of transactions.
Lift measures the strength of association independent of the popularity of the consequent item. Unlike Confidence, Lift accounts for the baseline probability of the item occurring, revealing if the relationship is stronger than random chance.
A Lift value greater than 1 indicates a positive association, meaning that the two items are likely to be bought together more often than would be expected by chance.
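A hand-counted Python sketch of Support, Confidence and Lift on five invented transactions (no association-rule library is used):

```python
transactions = [
    {"bread", "butter"}, {"bread", "butter", "milk"},
    {"bread"}, {"milk"}, {"bread", "butter"},
]
n = len(transactions)

support_bread        = sum("bread" in t for t in transactions) / n
support_butter       = sum("butter" in t for t in transactions) / n
support_bread_butter = sum({"bread", "butter"} <= t for t in transactions) / n

confidence = support_bread_butter / support_bread   # P(butter | bread)
lift       = confidence / support_butter            # > 1 => positive association
print(support_bread_butter, confidence, lift)
```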
It refers to the problems that arise when analyzing data in high-dimensional spaces (many features), such as increased computational time, difficulty in visualization, and sparse data leading to overfitting.
Principal Components are new, uncorrelated variables derived from the original features. They are ordered so that the first component captures the most variance in the data, the second captures the second most, and so on.
PCA seeks to maximize variance. If features are not scaled (e.g., one ranges 0-1 and another 0-1000), the feature with the larger scale will dominate the variance calculation, skewing the principal components.
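A minimal scikit-learn sketch of scaling before PCA (the Wine dataset and keeping two components are just example choices):

```python
from sklearn.datasets import load_wine
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

X, _ = load_wine(return_X_y=True)

# Standardise first so no single large-scale feature dominates the variance.
X_scaled = StandardScaler().fit_transform(X)

pca = PCA(n_components=2).fit(X_scaled)
print(pca.explained_variance_ratio_)   # share of variance captured by PC1 and PC2
```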
The three layers are the Input Layer, Hidden Layer(s), and the Output Layer.
Activation functions introduce non-linearity to the network. Without them, a neural network, no matter how deep, would act just like a simple linear regression model and couldn't learn complex patterns.
Backpropagation is the learning mechanism where the network calculates the gradient of the loss function with respect to the weights. It propagates the error backward from the output to the input layer to update weights and minimize error.
CNNs preserve the spatial structure of images (2D relationships) and use shared weights (filters) to reduce the number of parameters. MLPs would require flattening the image, losing spatial info and causing a parameter explosion.
A Pooling Layer (e.g., Max Pooling) is used for downsampling. It reduces the spatial dimensions (width and height) of the input to decrease computational load and control overfitting while retaining prominent features.
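A small NumPy sketch of 2x2 Max Pooling with stride 2 (the helper function is illustrative, not a library API):

```python
import numpy as np

def max_pool_2x2(img):
    # Keep the largest value in each non-overlapping 2x2 block.
    h, w = img.shape
    return img[:h - h % 2, :w - w % 2].reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

img = np.array([[1, 3, 2, 0],
                [4, 6, 1, 2],
                [7, 2, 9, 4],
                [1, 0, 3, 5]])
print(max_pool_2x2(img))   # 4x4 input -> 2x2 output, halving width and height
```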
RNNs are designed for sequential data where the order matters, such as time series data, text (NLP), and speech.
In long sequences, gradients calculated during backpropagation can become extremely small as they are multiplied backward. This causes the network to stop learning from earlier time steps, effectively giving it only "short-term" memory.
Bias is the error introduced by approximating a complex real-world problem with a simplified model. High bias leads to underfitting, where the model misses the underlying trend.
Variance refers to how much the model's estimate would change if used on a different training set. High variance implies the model is sensitive to noise in the training data, leading to overfitting.
Underfitting occurs when a model is too simple (High Bias) to capture the underlying structure of the data, resulting in poor performance on both training and test data.
Overfitting occurs when a model is too complex (High Variance) and learns the noise in the training data rather than the signal. It performs well on training data but poorly on unseen test data.
Irreducible error is the noise inherent in the system itself. It is the part of the error that cannot be reduced by any model, regardless of how good the model is.
There is an inverse relationship: As model complexity increases, Bias decreases (better fit), but Variance increases (sensitive to noise). As complexity decreases, Bias increases, but Variance decreases.
The sweet spot is the level of model complexity where the sum of Bias² and Variance is minimized, resulting in the lowest possible Total Error and best generalization.
The expected prediction error consists of: 1. Bias² (squared bias), 2. Variance, and 3. Irreducible Error.
It has High Variance. A fully grown tree is very complex and can perfectly memorize the training data (including noise), making it prone to overfitting.
It has High Bias. A linear model is too simple to capture the curve, leading to systematic error (underfitting).