Unit4 - Subjective Questions

CSE472 • Practice Questions with Detailed Answers

1

Explain the standard Encoder-Decoder architecture used in Natural Language Processing.

2

Describe how sequence-to-sequence models are applied to Machine Translation. What are the key steps involved?

3

Discuss the application of sequence-to-sequence models in text summarization. Differentiate between abstractive and extractive summarization in this context.

4

What are the primary limitations of classical sequence-to-sequence models without attention?

5

Explain the concept of 'Attention' in deep NLP. Why was it introduced?

6

Provide the mathematical formulation for computing Soft Attention in an encoder-decoder network.

7

Distinguish between Soft Attention and Hard Attention mechanisms.

8

Explain the Bahdanau Attention mechanism (Additive Attention) in detail, including its alignment score equation.

9

Explain the Luong Attention mechanism (Multiplicative Attention). How does it differ from Bahdanau attention in terms of state usage?

10

What are the three different scoring functions proposed in Luong Attention?

11

Compare Bahdanau Attention and Luong Attention mechanisms.

12

Describe the process of integrating attention into an encoder-decoder network during the decoding phase.

13

What is the BLEU score? Explain how modified n-gram precision is calculated in BLEU.

14

Explain the Brevity Penalty in the context of the BLEU score calculation. Why is it necessary?

15

What are ROUGE scores in NLP evaluation? Differentiate between ROUGE-N and ROUGE-L.

16

Compare BLEU and ROUGE scores. In which NLP tasks is each predominantly used and why?

17

What is Teacher Forcing, and why is it used during the training of sequence-to-sequence models?

18

Discuss the evaluation challenges in generative NLP tasks and how automated metrics address or fail to address them.

19

Explain the concept of ROUGE-S (Skip-Bigram Co-occurrence). How does it differ from standard ROUGE-2?

20

Derive the basic structural flow of how a translation is generated in a Sequence-to-Sequence model with Attention, from input to final probability distribution.