1

Define Vector Space Models (VSM) in the context of Natural Language Processing and explain their primary purpose.

2

Distinguish between sparse vector representations and dense word embeddings. Provide an example of each.

3

Explain the concept of dense word embeddings and discuss two major advantages they offer over traditional count-based representations.

4

Describe the Word2Vec framework. What are the two primary architectures introduced in this framework?

5

Explain the Continuous Bag-of-Words (CBOW) model in detail, including its objective function.

6

Describe the Skip-Gram model architecture and formulate its objective function.

7

Compare the CBOW and Skip-Gram architectures of Word2Vec in terms of learning capabilities and performance.

8

Explain the concept of Negative Sampling in the context of training Word2Vec models. Why is it necessary?

9

Describe the Global Vectors for Word Representation (GloVe) model. How does it utilize the word co-occurrence matrix?

10

Formulate the mathematical objective function of the GloVe model and explain its components.

11

Compare and contrast Word2Vec and GloVe embeddings in terms of underlying principles and training mechanisms.

12

How do word embeddings capture semantic similarity? Explain how Cosine Similarity is used to measure it.

13

Explain how word embeddings exhibit analogy relationships. Provide a mathematical example using classic vector offsets.

14

Why is it important to visualize embedding spaces? Name two common algorithms used for this purpose.

15

Explain how Principal Component Analysis (PCA) can be used to visualize high-dimensional word embeddings.

16

Describe t-Distributed Stochastic Neighbor Embedding (t-SNE) and explain why it is favored over PCA for visualizing word embeddings.

17

Discuss the limitations of static word embeddings like Word2Vec and GloVe.

18

Explain the role of the 'window size' hyperparameter in training Word2Vec models. How does varying it affect the learned embeddings?

19

Define Hierarchical Softmax. How does it improve the efficiency of training word embedding models?

20

Outline the end-to-end pipeline for training and evaluating a Word2Vec model on a custom text corpus.

Unit2 - Subjective Questions