Viva Questions

INT234 60 Questions
Unit 1 Introduction to Predictive Analytics
1
What is the primary goal of Predictive Analytics?
Unit 1 The Analytics Spectrum
2
Differentiate between Diagnostic Analytics and Predictive Analytics.
Unit 1 CRISP-DM
3
Can you list the six phases of the CRISP-DM process?
Unit 1 Machine Learning Overview
4
How does Machine Learning differ from traditional programming?
Unit 1 Types of Machine Learning
5
What is the fundamental difference between Supervised and Unsupervised learning regarding input data?
Unit 1 Supervised Learning
6
When would you use Classification over Regression?
Unit 1 Data Preprocessing
7
What is the difference between List-wise deletion and Imputation when handling missing values?
Unit 1 Data Transformation
8
Explain the formula for Min-Max Scaling (Normalization) and when it is typically used.
Unit 1 Data Transformation
9
Why do we perform One-Hot Encoding on categorical variables?
Unit 1 Data Splitting
10
Why is it important to split data into Training and Test sets?
Unit 2 Simple Linear Regression
11
In the equation , what do and represent?
Unit 2 Ordinary Least Squares
12
What is the objective of the Ordinary Least Squares (OLS) method?
Unit 2 Correlations
13
If the Pearson Correlation Coefficient () is -1, what does that indicate?
Unit 2 Multiple Linear Regression
14
What is Multicollinearity and why is it a problem in Multiple Linear Regression?
Unit 2 Logistic Regression
15
Despite its name, what is Logistic Regression used for?
Unit 2 Logistic Regression
16
What is the role of the Sigmoid function in Logistic Regression?
Unit 2 Polynomial Regression
17
When should you use Polynomial Regression instead of Simple Linear Regression?
Unit 2 Model Performance
18
What is the difference between MAE and MSE?
Unit 2 Model Performance
19
What does an score of 1 indicate?
Unit 2 OLS Assumptions
20
Define Homoscedasticity in the context of regression assumptions.
Unit 3 k-Nearest Neighbors
21
Why is k-Nearest Neighbors (k-NN) referred to as a "Lazy Learning" algorithm?
Unit 3 k-Nearest Neighbors
22
What happens if the value of 'k' in k-NN is set too small (e.g., k=1)?
Unit 3 Naïve Bayes
23
What is the "Naïve" assumption in the Naïve Bayes classifier?
Unit 3 Decision Trees
24
Explain the concept of Entropy in Decision Trees.
Unit 3 Support Vector Machine
25
What are Support Vectors in SVM?
Unit 3 Support Vector Machine
26
What is the Kernel Trick in SVM and why is it used?
Unit 3 Evaluation Metrics
27
In a Confusion Matrix, what does a False Positive (Type I Error) represent?
Unit 3 Evaluation Metrics
28
When should you prioritize Recall over Precision?
Unit 3 Evaluation Metrics
29
What is the F1 Score and when is it most useful?
Unit 3 Evaluation Metrics
30
What does an AUC (Area Under Curve) of 0.5 indicate?
Unit 4 Introduction to Unsupervised Learning
31
What is the main goal of Unsupervised Learning?
Unit 4 K-Means Clustering
32
What is a Centroid in K-Means clustering?
Unit 4 K-Means Clustering
33
How does the Elbow Method help in determining the optimal number of clusters ()?
Unit 4 K-Means Clustering
34
What is the "Random Initialization Trap" in K-Means and how is it solved?
Unit 4 Hierarchical Clustering
35
What is a Dendrogram?
Unit 4 Hierarchical Clustering
36
Differentiate between Agglomerative and Divisive hierarchical clustering.
Unit 4 Association Rules
37
What is Market Basket Analysis?
Unit 4 Association Rules
38
Define the "Support" metric in Association Rules.
Unit 4 Association Rules
39
Why is the "Lift" metric considered more critical than Confidence?
Unit 4 Association Rules
40
What does a Lift value greater than 1 indicate?
Unit 5 Dimensionality Reduction
41
What is the "Curse of Dimensionality"?
Unit 5 Principal Component Analysis
42
What are Principal Components in PCA?
Unit 5 Principal Component Analysis
43
Why is Standardization (Scaling) a necessary step before performing PCA?
Unit 5 Neural Networks
44
What are the three main types of layers in a Multi-layer Perceptron (MLP)?
Unit 5 Neural Networks
45
Why do Neural Networks require an Activation Function?
Unit 5 Neural Networks
46
Briefly explain the concept of Backpropagation.
Unit 5 Convolutional Neural Networks
47
Why are CNNs preferred over MLPs for image processing?
Unit 5 Convolutional Neural Networks
48
What is the function of a Pooling Layer in a CNN?
Unit 5 Recurrent Neural Networks
49
What type of data are Recurrent Neural Networks (RNNs) designed to handle?
Unit 5 Recurrent Neural Networks
50
What is the Vanishing Gradient problem in RNNs?
Unit 6 Bias-Variance Trade-off
51
Define "Bias" in the context of model error.
Unit 6 Bias-Variance Trade-off
52
Define "Variance" in the context of model error.
Unit 6 Underfitting vs Overfitting
53
What does Underfitting mean?
Unit 6 Underfitting vs Overfitting
54
What does Overfitting mean?
Unit 6 Irreducible Error
55
What is Irreducible Error?
Unit 6 Bias-Variance Trade-off
56
Describe the relationship between Model Complexity and Bias/Variance.
Unit 6 Bias-Variance Trade-off
57
What is the "Sweet Spot" in the Bias-Variance Trade-off?
Unit 6 Error Decomposition
58
What are the three components that make up the expected prediction error?
Unit 6 Bias-Variance Trade-off
59
Does a Decision Tree with no depth limit have high bias or high variance?
Unit 6 Bias-Variance Trade-off
60
Does a Linear Regression model applied to curved data have high bias or high variance?