Unit 3 - Subjective Questions
CSE274 • Practice Questions with Detailed Answers
Explain the Perceptron algorithm. What is the Perceptron Update Rule?
The Perceptron
The Perceptron is one of the simplest artificial neural network architectures, used for binary classification. It models a neuron by taking a weighted sum of inputs and applying a step function (activation function).
Mechanism:
- Input: A vector of features x = (x_1, x_2, ..., x_n).
- Weights: Each feature x_i is multiplied by a weight w_i.
- Summation: Compute the weighted sum z = w · x + b = Σ_i w_i x_i + b.
- Activation: Apply a step function to z. If z ≥ 0, output 1; otherwise, output 0 (or -1).
Perceptron Update Rule
The weights are updated iteratively to minimize classification error. For a training sample (x, y), where y is the true label and ŷ is the predicted label:

w_i := w_i + η(y − ŷ)x_i

Where:
- η (eta) is the learning rate.
- (y − ŷ) is the error term. If the prediction is correct, the term is 0, and the weights don't change.
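The update rule can be sketched in a few lines of Python (a minimal illustration, not part of the notes; the AND dataset and η = 1 are arbitrary choices for demonstration):

```python
def train_perceptron(samples, labels, eta=1.0, epochs=20):
    """Perceptron learning: w_i := w_i + eta * (y - y_hat) * x_i."""
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            y_hat = 1 if z >= 0 else 0          # step activation
            error = y - y_hat                   # 0 when prediction is correct
            w = [wi + eta * error * xi for wi, xi in zip(w, x)]
            b += eta * error
    return w, b

def predict(w, b, x):
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else 0

# Learn the (linearly separable) logical AND function
X = [(0, 0), (0, 1), (1, 0), (1, 1)]
y = [0, 0, 0, 1]
w, b = train_perceptron(X, y)
```

Because AND is linearly separable, the perceptron convergence theorem guarantees the loop stops making mistakes after finitely many updates.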
Describe Logistic Regression. Why is the Sigmoid function used in this model?
Logistic Regression
Despite its name, Logistic Regression is a classification algorithm used to predict the probability that an instance belongs to a specific class. It fits a logistic 'S'-shaped curve to the data rather than a straight line.
The Sigmoid Function
Logistic regression uses the Sigmoid (or Logistic) function to map predicted values to probabilities between 0 and 1.
Formula:

σ(z) = 1 / (1 + e^(−z)), where z = w · x + b
Reasons for use:
- Bounded Output: It squashes any real-valued number into the range (0, 1), making it suitable for probability estimation.
- Differentiability: The function is differentiable, which is crucial for optimization algorithms like Gradient Descent.
- Decision Boundary: We typically classify an input as positive if σ(z) ≥ 0.5 (equivalently, z ≥ 0).
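The sigmoid itself is a one-liner (a minimal sketch using only the standard library):

```python
import math

def sigmoid(z):
    """sigma(z) = 1 / (1 + e^(-z)): squashes any real z into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))
```

Note that sigmoid(0) = 0.5, which is exactly the usual decision threshold: z ≥ 0 corresponds to σ(z) ≥ 0.5.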
Derive the Cost Function (Log Loss) for Logistic Regression.
Derivation of Log Loss
In linear regression, we use Mean Squared Error (MSE), but in logistic regression, the sigmoid function makes MSE non-convex (multiple local minima). Therefore, we use the Log Loss (Cross-Entropy Loss).
For a single training example (x, y):
- If y = 1, we want the hypothesis h(x) to be close to 1. The cost is −log(h(x)).
- If y = 0, we want h(x) to be close to 0. The cost is −log(1 − h(x)).
Combining these into a single equation:

Cost(h(x), y) = −[y log(h(x)) + (1 − y) log(1 − h(x))]

Global Cost Function (J(θ)):
Averaging over m training examples:

J(θ) = −(1/m) Σ_{i=1}^{m} [y^(i) log(h(x^(i))) + (1 − y^(i)) log(1 − h(x^(i)))]
This function is convex, ensuring Gradient Descent converges to the global minimum.
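The averaged cost can be computed directly from the formula (a minimal sketch; the example probabilities are made up for illustration):

```python
import math

def log_loss(y_true, y_prob):
    """Average cross-entropy: -(1/m) * sum(y*log(h) + (1-y)*log(1-h))."""
    m = len(y_true)
    total = sum(y * math.log(h) + (1 - y) * math.log(1 - h)
                for y, h in zip(y_true, y_prob))
    return -total / m

# Confident correct predictions give low loss; confident wrong ones are punished
good = log_loss([1, 0], [0.9, 0.1])   # ~0.105
bad  = log_loss([1, 0], [0.1, 0.9])   # ~2.303
```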
What is the 'Naïve' assumption in the Naïve Bayes Classifier? Explain with the help of Bayes' Theorem.
Bayes' Theorem
Bayes' theorem calculates the posterior probability P(y | X) from the prior probability P(y) and the likelihood P(X | y):

P(y | X) = P(X | y) · P(y) / P(X)

Where X is the feature vector (x_1, x_2, ..., x_n).
The 'Naïve' Assumption
Calculating the exact likelihood P(X | y) = P(x_1, ..., x_n | y) is computationally expensive because it requires modeling the joint distribution of all features.
The Naïve Bayes classifier assumes that all features are conditionally independent given the class label y. This means the presence of one feature does not affect the presence of another.
Mathematical Consequence:

P(X | y) = Π_i P(x_i | y), so P(y | X) ∝ P(y) · Π_i P(x_i | y)
This simplifies the calculation drastically, allowing the model to perform well even with high-dimensional data, despite the assumption rarely being true in real life.
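Under the independence assumption, scoring a class is just a product of per-feature likelihoods (a minimal sketch; the priors and likelihood values below are hypothetical numbers, not from any dataset):

```python
from math import prod  # Python 3.8+

def naive_bayes_score(prior, likelihoods):
    """Unnormalized posterior: P(y) * product of P(x_i | y) over features."""
    return prior * prod(likelihoods)

# Hypothetical two-feature example: pick the class with the larger score
score_spam = naive_bayes_score(0.4, [0.8, 0.6])   # 0.4 * 0.8 * 0.6 = 0.192
score_ham  = naive_bayes_score(0.6, [0.1, 0.3])   # 0.6 * 0.1 * 0.3 = 0.018
```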
Define Confusion Matrix and explain the terms TP, TN, FP, and FN.
Confusion Matrix
A Confusion Matrix is a tabular layout used to evaluate the performance of a classification model. It compares the predicted labels against the actual labels.
Terminology
- True Positive (TP):
- Definition: The model correctly predicted the positive class.
- Example: Identifying a spam email correctly as spam.
- True Negative (TN):
- Definition: The model correctly predicted the negative class.
- Example: Identifying a normal email correctly as not spam.
- False Positive (FP) (Type I Error):
- Definition: The model incorrectly predicted the positive class (it was actually negative).
- Example: Classifying a normal email as spam.
- False Negative (FN) (Type II Error):
- Definition: The model incorrectly predicted the negative class (it was actually positive).
- Example: Classifying a spam email as normal.
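The four counts can be tallied with a single pass over the labels (a minimal sketch, reusing the spam-filter framing from the examples above):

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Tally TP, TN, FP, FN by comparing predictions against actual labels."""
    tp = tn = fp = fn = 0
    for actual, pred in zip(y_true, y_pred):
        if pred == positive:
            if actual == positive:
                tp += 1   # spam correctly flagged as spam
            else:
                fp += 1   # normal mail flagged as spam (Type I error)
        else:
            if actual == positive:
                fn += 1   # spam that slipped through (Type II error)
            else:
                tn += 1   # normal mail correctly left alone
    return tp, tn, fp, fn
```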
Differentiate between Precision and Recall. When is F1-Score used?
Precision vs. Recall
Precision:
- Definition: Out of all the instances predicted as positive, how many were actually positive?
- Formula: Precision = TP / (TP + FP)
- Focus: Useful when the cost of False Positives is high (e.g., email spam detection).
Recall (Sensitivity/TPR):
- Definition: Out of all the actual positive instances, how many did the model predict correctly?
- Formula: Recall = TP / (TP + FN)
- Focus: Useful when the cost of False Negatives is high (e.g., cancer detection).
F1-Score
The F1-Score is the harmonic mean of Precision and Recall: F1 = 2 · (Precision · Recall) / (Precision + Recall). It is used when:
- There is an uneven class distribution (imbalanced dataset).
- We seek a balance between Precision and Recall.
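All three metrics follow directly from the confusion-matrix counts (a minimal sketch; assumes the counts are already available):

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, Recall, and their harmonic mean (F1)."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

The harmonic mean is dominated by the smaller of the two values, so a model cannot hide a poor Recall behind a high Precision (or vice versa).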
Explain the K-Nearest Neighbors (K-NN) algorithm. Why is it called a 'Lazy Learner'?
K-Nearest Neighbors (K-NN)
K-NN is a non-parametric, distance-based supervised learning algorithm used for classification and regression.
Steps:
- Choose the number of neighbors, K.
- For a new data point, calculate the distance (e.g., Euclidean) between this point and all training points.
- Select the K points with the smallest distances.
- Classification: Assign the class that is most frequent (mode) among the neighbors.
Lazy Learner
K-NN is called a Lazy Learner because:
- No Training Phase: It does not learn a discriminative function f(x) from the training data during a training phase.
- Storage: It simply stores the training dataset.
- Delayed Computation: All computations are deferred until the time of prediction (inference). This makes training fast (zero time) but prediction slow compared to Eager Learners like Logistic Regression.
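The steps above can be sketched directly (a minimal illustration; the two-cluster toy dataset is made up for demonstration). Note that "training" is just keeping the data around:

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """'Lazy' prediction: all distance computation happens at inference time."""
    neighbours = sorted((math.dist(x, query), label)
                        for x, label in zip(train_X, train_y))
    top_k = [label for _, label in neighbours[:k]]
    return Counter(top_k).most_common(1)[0][0]   # majority vote

# Two small clusters; no training step -- the stored data itself is the "model"
X = [(0, 0), (0, 1), (1, 0), (5, 5), (6, 5), (5, 6)]
y = ['a', 'a', 'a', 'b', 'b', 'b']
```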
Discuss the effect of the value of 'K' in the K-NN algorithm regarding overfitting and underfitting.
The choice of K significantly impacts the bias-variance trade-off in K-NN.
1. Small K (e.g., K=1 or K=2):
- Behavior: The model becomes very sensitive to local patterns and noise in the data.
- Decision Boundary: The boundary is jagged and complex.
- Outcome: High Variance, Low Bias. This leads to Overfitting.
2. Large K:
- Behavior: The model considers a large number of neighbors, effectively averaging out the noise but potentially missing local patterns.
- Decision Boundary: The boundary becomes smoother and more linear.
- Outcome: Low Variance, High Bias. This leads to Underfitting.
Selection: The optimal K is usually selected via techniques like Cross-Validation.
Explain the concept of Entropy and Information Gain in Decision Trees.
Decision Trees use metrics to determine the best attribute to split the data at each node. The ID3 algorithm uses Entropy and Information Gain.
Entropy
Entropy measures the impurity or randomness in a dataset S. If a dataset contains only one class, entropy is 0. If the classes are perfectly balanced, entropy is 1 (for binary classification).
Formula:

Entropy(S) = −Σ_i p_i log₂(p_i)

Where p_i is the proportion (probability) of class i in dataset S.
Information Gain (IG)
Information Gain measures the reduction in entropy achieved by splitting the dataset S on a specific attribute A. The tree selects the attribute with the highest IG.
Formula:

IG(S, A) = Entropy(S) − Σ_v (|S_v| / |S|) · Entropy(S_v)

Where:
- Entropy(S) is the entropy of the parent set.
- S_v is the subset of S where attribute A has value v.
- We subtract the weighted average entropy of the children from the parent's entropy.
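Both formulas translate directly into code (a minimal sketch; the toy labels in the test below are chosen so that one attribute separates the classes perfectly and another not at all):

```python
import math
from collections import Counter

def entropy(labels):
    """Entropy(S) = -sum(p_i * log2(p_i)) over the classes present in S."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())

def information_gain(values, labels):
    """Parent entropy minus the weighted entropy of each child subset."""
    parent = entropy(labels)
    total = len(labels)
    weighted = 0.0
    for v in set(values):
        subset = [lab for x, lab in zip(values, labels) if x == v]
        weighted += (len(subset) / total) * entropy(subset)
    return parent - weighted
```

A perfectly separating attribute recovers all of the parent's entropy (IG = 1 bit for a balanced binary parent); an uninformative attribute yields IG = 0.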
What is the difference between Pre-pruning and Post-pruning in Decision Trees?
Pruning is a technique to reduce the size of decision trees to prevent overfitting.
| Feature | Pre-pruning (Early Stopping) | Post-pruning |
|---|---|---|
| Definition | Halting the tree construction early before it creates a full tree. | Letting the tree grow to its full depth and then removing insignificant nodes. |
| Mechanism | Stops splitting based on constraints (e.g., max depth, min samples per leaf, min impurity decrease). | Evaluates complete branches. If removing a branch improves/maintains validation accuracy, replace it with a leaf. |
| Pros | Faster training time; simple to implement. | Usually produces better trees; less likely to 'under-prune'. |
| Cons | Can be too aggressive (Horizon effect) - might stop before a good split occurs later. | Computationally expensive as it requires growing the full tree first. |
Explain the core intuition behind Support Vector Machines (SVM). What are Support Vectors?
Support Vector Machine (SVM)
SVM is a supervised learning model used for classification and regression. The core intuition is to find a hyperplane in an N-dimensional space that distinctly classifies the data points.
Goal:
Among many possible hyperplanes that separate classes, SVM chooses the one that maximizes the Margin.
Margin:
The distance between the hyperplane and the nearest data point from either class. Maximizing this margin improves the model's generalization ability.
Support Vectors
- Support vectors are the data points that lie closest to the decision boundary (hyperplane).
- They are the most difficult points to classify.
- These points effectively "support" or define the orientation and position of the hyperplane.
- If we remove other data points, the hyperplane does not change. If we move a support vector, the hyperplane shifts.
What is the 'Kernel Trick' in SVM? Why is it useful?
The Problem
Standard SVM seeks a linear decision boundary. However, many real-world datasets are not linearly separable (e.g., concentric circles of data).
The Kernel Trick
The Kernel Trick is a mathematical technique that allows SVM to classify non-linear data without explicitly transforming the data into higher dimensions.
It computes the dot product of data points in a high-dimensional feature space without actually calculating the coordinates in that space. This avoids the high computational cost (Curse of Dimensionality).
Common Kernels:
- Linear Kernel: For linearly separable data.
- Polynomial Kernel: Maps to polynomial dimensions.
- RBF (Radial Basis Function): Maps to infinite-dimensional space.
Utility:
It allows algorithms like SVM to fit complex, non-linear boundaries efficiently.
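The RBF kernel illustrates the trick concretely (a minimal sketch; gamma = 1.0 is an arbitrary default):

```python
import math

def rbf_kernel(x1, x2, gamma=1.0):
    """K(x1, x2) = exp(-gamma * ||x1 - x2||^2).

    Measures similarity in an implicit infinite-dimensional feature space
    while only ever touching coordinates in the original input space.
    """
    sq_dist = sum((a - b) ** 2 for a, b in zip(x1, x2))
    return math.exp(-gamma * sq_dist)
```

Identical points score 1.0, and the similarity decays smoothly toward 0 as the points move apart; the high-dimensional mapping is never materialized.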
Define ROC Curve and ROC-AUC. How do they help in model evaluation?
ROC Curve (Receiver Operating Characteristic)
The ROC curve is a graphical plot that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied.
- X-axis: False Positive Rate (FPR = FP / (FP + TN))
- Y-axis: True Positive Rate (TPR = TP / (TP + FN))
A curve closer to the top-left corner indicates a better model (High TPR, Low FPR).
ROC-AUC (Area Under the Curve)
- Definition: AUC represents the degree or measure of separability. It tells how much the model is capable of distinguishing between classes.
- Range: 0 to 1.
- AUC = 0.5: Random guessing (no discrimination).
- AUC = 1: Perfect classifier.
- AUC = 0: Reciprocating the classes (predicting 0 as 1 and vice versa).
It helps evaluate model performance independent of the specific threshold chosen for classification.
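Each threshold produces one (FPR, TPR) point; sweeping the threshold from high to low traces the curve (a minimal sketch; the scores below are made-up classifier outputs):

```python
def roc_point(y_true, scores, threshold):
    """One (FPR, TPR) point: predict positive when score >= threshold."""
    tp = fp = tn = fn = 0
    for y, s in zip(y_true, scores):
        pred = 1 if s >= threshold else 0
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 1 and y == 0:
            fp += 1
        elif pred == 0 and y == 1:
            fn += 1
        else:
            tn += 1
    return fp / (fp + tn), tp / (tp + fn)

# Toy scores: sweeping the threshold from 1 down to 0 traces the ROC curve
y_true = [1, 1, 0, 0]
scores = [0.9, 0.8, 0.3, 0.1]
```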
Compare the Precision-Recall (PR) Curve with the ROC Curve. When should you use PR Curve?
Comparison
Axes:
- ROC: Plots TPR (Recall) vs. FPR.
- PR Curve: Plots Precision vs. Recall (TPR).
Baseline:
- ROC: The baseline (random guess) is a diagonal line from (0,0) to (1,1).
- PR: The baseline depends on the class distribution (the ratio of positives).
When to use PR Curve (AUC-PR)?
The ROC curve can be misleading when the dataset is heavily imbalanced (e.g., Fraud Detection where 99% are valid, 1% fraud).
- ROC behavior in imbalance: The False Positive Rate uses True Negatives in the denominator (FPR = FP / (FP + TN)). If TN is huge (many negatives), FPR stays low even if FP increases, making the model look better than it is.
- PR Curve behavior: Precision (TP / (TP + FP)) does not use True Negatives. It focuses strictly on how well the minority class is detected.
Conclusion: Use PR Curve when the positive class is rare or more important than the negative class.
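The effect is easy to see with made-up counts for an imbalanced test set (hypothetical numbers, chosen only to illustrate the argument above):

```python
# Hypothetical imbalanced test set: 10 actual positives, 990 actual negatives
tp, fn = 8, 2      # the classifier finds 8 of the 10 positives
fp, tn = 40, 950   # but also raises 40 false alarms

fpr = fp / (fp + tn)          # ~0.04: looks tiny because TN dominates
precision = tp / (tp + fp)    # ~0.17: most positive calls are actually wrong
```

An ROC reader sees a 4% false-positive rate and is reassured; a PR reader sees that only about 1 in 6 positive predictions is correct.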
What is the problem of Zero Frequency in Naïve Bayes and how does Laplace Smoothing solve it?
Zero Frequency Problem
In Naïve Bayes, if a specific categorical feature value occurs in the test set but was never seen in the training set for a particular class, the probability estimate P(x_i | y) becomes zero.
Since Naïve Bayes multiplies probabilities (P(y | X) ∝ P(y) · Π_i P(x_i | y)), a single zero turns the entire posterior probability to zero, effectively wiping out all the information provided by the other features.
Laplace Smoothing (Additive Smoothing)
To solve this, we add a small smoothing parameter α (usually 1) to the count of every feature value.
Formula:

P(x_i | y) = (count(x_i, y) + α) / (count(y) + α · K)

Where:
- α: Smoothing parameter.
- K: Number of distinct values feature x_i can take.
This ensures no probability is ever exactly zero, allowing the model to make predictions based on other available features.
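The smoothed estimate is one line of arithmetic (a minimal sketch; the counts in the example are hypothetical):

```python
def smoothed_likelihood(count_xi_y, count_y, n_values, alpha=1):
    """P(x_i | y) = (count(x_i, y) + alpha) / (count(y) + alpha * K)."""
    return (count_xi_y + alpha) / (count_y + alpha * n_values)

# A value never seen with this class still gets a small nonzero probability:
# 20 training samples of class y, feature with K = 3 possible values
p_unseen = smoothed_likelihood(0, 20, 3)   # (0 + 1) / (20 + 3) = 1/23
```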
Compare Parametric and Non-parametric models with examples from Unit 3.
Parametric Models
- Definition: These models make strong assumptions about the functional form of the data distribution. They have a fixed number of parameters, regardless of the training data size.
- Pros: Simpler, faster training and inference, less data required.
- Cons: Limited complexity; cannot learn complex patterns if the assumption is wrong (high bias).
- Examples:
- Logistic Regression: Assumes a linear decision boundary separated by a sigmoid curve.
- Linear Perceptron.
Non-parametric Models
- Definition: These models do not make strong assumptions about the mapping function. The number of parameters effectively grows with the amount of training data.
- Pros: Flexible; can fit any functional form (low bias).
- Cons: Requires more data; slower training/inference; higher risk of overfitting.
- Examples:
- K-Nearest Neighbors (KNN): Stores all data points.
- Decision Trees: Can grow very deep/complex.
- SVM with RBF Kernel: Infinite dimensional mapping.
Distinguish between Generative and Discriminative models using Naïve Bayes and Logistic Regression as examples.
Discriminative Models (e.g., Logistic Regression)
- Goal: Model the decision boundary directly.
- Probability: Learns the conditional probability P(y | x) directly from the data.
- Focus: It focuses on finding the boundary that separates the classes.
- Analogy: To distinguish between a cat and a dog, it looks for specific differences (e.g., whiskers, ear shape).
Generative Models (e.g., Naïve Bayes)
- Goal: Model the distribution of individual classes.
- Probability: Learns the joint probability P(x, y) (usually by learning P(x | y) and P(y)) and then uses Bayes' rule to find P(y | x).
- Focus: It tries to understand how the data is generated for each class.
- Analogy: It builds a model of what a cat looks like and what a dog looks like. When a new animal comes, it matches it against both models to see which one fits best.
Explain Gini Impurity and how it differs from Entropy.
Gini Impurity
Gini Impurity is an alternative metric to Entropy used by the CART (Classification and Regression Trees) algorithm for splitting nodes.
It measures the likelihood of an incorrect classification of a new instance of a random variable, if that new instance were randomly classified according to the distribution of class labels from the dataset.
Formula:

Gini(S) = 1 − Σ_i p_i²
Difference from Entropy
- Computation: Entropy involves logarithmic calculations (log₂ p_i), which are computationally more expensive than the squaring operations used in Gini Impurity.
- Range:
- Entropy range: 0 to 1 (for binary classification).
- Gini range: 0 to 0.5 (for binary classification).
- Result: In practice, they usually result in very similar trees. However, Gini tends to isolate the most frequent class in its own branch of the tree, while Entropy tends to produce slightly more balanced trees.
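Placing the two metrics side by side makes the comparison concrete (a minimal sketch; any class labels can be used):

```python
import math
from collections import Counter

def gini(labels):
    """Gini(S) = 1 - sum(p_i^2): no logarithms required."""
    total = len(labels)
    return 1.0 - sum((c / total) ** 2 for c in Counter(labels).values())

def entropy(labels):
    """Entropy(S) = -sum(p_i * log2(p_i)), for comparison."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())
```

A pure node scores 0 on both; a balanced binary node hits each metric's maximum (Gini 0.5, Entropy 1.0), matching the ranges above.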
Briefly explain the concept of Supervised Learning and list the steps involved.
Supervised Learning
Supervised learning is a type of machine learning where the algorithm is trained on a labeled dataset. The model learns a mapping function from input variables X to output variables Y (Y = f(X)).
Goal: Approximate the mapping function so well that when you have new input data x, you can predict the output variable y for that data.
Steps involved:
- Data Collection: Gather a dataset containing inputs (features) and corresponding correct outputs (labels).
- Data Preprocessing: Clean, normalize, and split data into Training and Testing sets.
- Model Selection: Choose an appropriate algorithm (e.g., SVM, Decision Tree) based on the problem type.
- Training: The algorithm processes the training data to minimize a loss function (learning the weights/parameters).
- Evaluation: Test the model on unseen data using metrics like Accuracy, Precision, Recall.
- Tuning: Adjust hyperparameters to improve performance.
What are the advantages and disadvantages of Decision Trees?
Advantages
- Interpretability: They are easy to understand and visualize (white-box model). The rules can be explained to non-technical stakeholders.
- No Normalization: They require very little data preprocessing (scaling or normalization of features is not needed).
- Non-linear relationships: Can handle both numerical and categorical data and model non-linear boundaries effectively.
Disadvantages
- Overfitting: Trees can create overly complex models that do not generalize well (solved by pruning).
- Instability: Small variations in the data can result in a completely different tree being generated (High Variance).
- Bias: Decision tree learners can create biased trees if some classes dominate (solved by balancing the dataset).