Unit3 - Subjective Questions

INT428 • Practice Questions with Detailed Answers

1

Define Supervised Learning and distinguish between its two main sub-categories: Classification and Regression.

2

Explain the concept of Unsupervised Learning. How does it differ from Supervised Learning? Provide two real-world applications.

3

Describe Reinforcement Learning (RL). Define the key components: Agent, Environment, State, Action, and Reward.

4

State Bayes' Theorem mathematically and explain its significance in machine learning.

5

What are Bayesian Networks? Explain their structure and utility in probabilistic reasoning.

6

Explain the role of Linear Algebra in Machine Learning, specifically focusing on Vectors and Matrices.

7

What is Feature Engineering? Discuss two common techniques: Normalization and One-Hot Encoding.

8

Differentiate between Precision and Recall in the context of model evaluation. When should each be prioritized?

9

Explain K-Fold Cross-Validation and why it is preferred over a simple Train-Test split.

10

Explain the Bias-Variance Tradeoff in Machine Learning.

11

Define Overfitting and Underfitting. How can they be detected?

12

Compare Supervised, Unsupervised, and Reinforcement Learning based on Data Type, Feedback, and Goal.

13

Explain the role of Statistics in Machine Learning, specifically focusing on Descriptive Statistics (Mean, Median, Standard Deviation).

14

What is a Confusion Matrix? Draw a layout for a binary classification problem and explain True Positives, False Positives, True Negatives, and False Negatives.

15

How does Probabilistic Reasoning handle uncertainty in AI systems? Explain the concept of Marginalization.

16

Discuss the real-world application of Supervised Learning in Email Spam Detection and Unsupervised Learning in Recommender Systems.

17

Why is Dimensionality Reduction important in Machine Learning? Mention the Curse of Dimensionality.

18

Derive the Naive Bayes classification rule from Bayes' Theorem. Why is it called "Naive"?

19

Explain the concept of F1 Score. Why is it a better metric than Accuracy for imbalanced datasets?

20

What is Imputation in the context of handling missing data? Describe two methods to perform it.