Unit5 - Subjective Questions

INT312 • Practice Questions with Detailed Answers

1

Define Apache HBase. Explain its primary characteristics and why it is classified as a NoSQL database.

2

Compare and contrast Apache HBase with a traditional Relational Database Management System (RDBMS).

3

Distinguish between HDFS and HBase. Why do we need HBase when HDFS is already capable of storing massive datasets?

4

Describe the logical data model of Apache HBase in detail.

5

Explain the significance of the 'Timestamp' and 'Versioning' in the HBase data model.

6

Outline the high-level architecture of Apache HBase. Name its core components.

7

What is the role of the HMaster in HBase architecture? Does it lie in the data read/write path?

8

Describe the responsibilities of a RegionServer in Apache HBase.

9

Explain the Write Path in Apache HBase. How is data physically written?

10

Explain the Read Path in Apache HBase. How does HBase locate a specific row key?

11

What are Minor and Major Compactions in HBase? Why are they necessary?

12

What is Zookeeper's role in an Apache HBase cluster? What happens if Zookeeper goes down?

13

Explain the concept of 'Hotspotting' in HBase. List three techniques to avoid it.

14

What is an HFile? Describe its structure briefly.

15

Discuss the significance of Bloom Filters in Apache HBase.

16

What are Regions in HBase? Explain the process of Region Splitting.

17

Describe any five basic HBase shell commands with their syntax and purpose.

18

What are the best practices for designing a Row Key in Apache HBase?

19

Discuss three common use cases where Apache HBase is the ideal database choice.

20

How does Apache HBase integrate with Hadoop MapReduce? Explain the classes used for mapping and reducing.