Unit 3 - Practice Quiz

INT323

1 Who is widely recognized as the 'Father of Data Warehousing'?

A. Bill Inmon
B. Ralph Kimball
C. E.F. Codd
D. Peter Chen

2 A Data Warehouse is defined as a subject-oriented, integrated, time-variant, and ____ collection of data.

A. Volatile
B. Non-volatile
C. Transient
D. Virtual

3 Which characteristic of a Data Warehouse implies that data is organized around major entities such as customer, product, and sales?

A. Integrated
B. Time-variant
C. Subject-oriented
D. Non-volatile

4 In the context of Data Warehousing, what does 'Integrated' mean?

A. Data is stored in a single file
B. Data is collected from various sources and standardized
C. Data is encrypted for security
D. Data is accessible via the web

5 What is the primary purpose of the 'Staging Area' in Data Warehouse architecture?

A. To store data permanently
B. To provide query access to end-users
C. To clean and transform data before loading it into the warehouse
D. To store meta-data only

6 Which type of data represents the 'Time-variant' nature of a Data Warehouse?

A. Current real-time data only
B. Historical data spanning a specific period
C. Future predictive data
D. Temporary session data

7 What is a 'Data Mart'?

A. A complete enterprise-wide data warehouse
B. A subset of a data warehouse focused on a specific department or function
C. A tool for data mining
D. A backup storage device

8 Which architecture approach advocates building the Enterprise Data Warehouse (EDW) first and then creating Data Marts?

A. Bottom-up Approach
B. Top-down Approach
C. Agile Approach
D. Spiral Approach

9 The 'Bottom-up' approach to Data Warehousing, which starts with Data Marts, is associated with:

A. Bill Inmon
B. Ralph Kimball
C. Larry Ellison
D. Edgar Codd

10 What does OLAP stand for?

A. Online Analytical Processing
B. Online Advanced Programming
C. Offline Analysis Protocol
D. Overlay Analytical Platform

11 Which table in a schema contains the quantitative data (numbers/metrics) for analysis?

A. Dimension Table
B. Fact Table
C. Meta Table
D. Index Table

12 Which table contains descriptive attributes used for query filtering and labeling?

A. Fact Table
B. Dimension Table
C. Measure Table
D. Summary Table

13 In a Star Schema, how are tables arranged?

A. Multiple fact tables sharing dimension tables
B. A central fact table connected to multiple dimension tables
C. A central dimension table connected to multiple fact tables
D. Normalized tables linked in a chain

14 What distinguishes a Snowflake Schema from a Star Schema?

A. The Fact table is normalized
B. The Dimension tables are normalized
C. It uses only one table
D. It does not support SQL

15 A 'Fact Constellation' schema is also known as:

A. Galaxy Schema
B. Star Schema
C. Snowflake Schema
D. Bus Schema

16 What is 'Meta-data' in the context of a Data Warehouse?

A. The actual business data
B. Data about data
C. Deleted data
D. Backup data

17 Which OLAP operation allows moving from summary data to detailed data?

A. Roll-up
B. Drill-down
C. Slice
D. Dice

18 Which OLAP operation performs aggregation on a data cube (e.g., climbing up the hierarchy)?

A. Drill-down
B. Roll-up
C. Pivot
D. Slice

19 Selecting a single value for one dimension to create a subset of the cube is called:

A. Slicing
B. Dicing
C. Pivoting
D. Drilling

20 Which type of OLAP stores data in a multidimensional array-based storage engine?

A. ROLAP
B. MOLAP
C. HOLAP
D. DOLAP

21 ROLAP stands for:

A. Relational Online Analytical Processing
B. Real-time Online Analytical Processing
C. Remote Online Analytical Processing
D. Rapid Online Analytical Processing

22 HOLAP attempts to combine the best features of:

A. Star and Snowflake Schemas
B. ROLAP and MOLAP
C. Data Mining and Data Warehousing
D. OLTP and OLAP

23 Which of the following is NOT a typical application of a Data Warehouse?

A. Decision Support Systems (DSS)
B. Transaction Processing (OLTP)
C. Trend Analysis
D. Reporting and Forecasting

24 What is the acronym ETL short for?

A. Extract, Transform, Load
B. Evaluate, Transfer, Load
C. Extract, Test, Lock
D. Enter, Transform, List

25 In the context of ETL, 'Transformation' includes:

A. Backing up data to tape
B. Cleaning, filtering, and aggregating data
C. Giving users read access
D. Installing database software

26 What is an ODS in Data Warehousing architecture?

A. Offline Data Storage
B. Operational Data Store
C. Online Decision System
D. Object Data Source

27 Which utility is primarily used to refresh a data warehouse with new data?

A. Data Loading Utility
B. Query Manager
C. Index Creator
D. Security Manager

28 What is the 'Grain' of a fact table?

A. The size of the table on disk
B. The level of detail represented by a single row
C. The number of columns
D. The type of database used

29 Which schema results in the highest performance for queries due to fewer joins?

A. Snowflake Schema
B. Star Schema
C. Normalized Schema
D. Hierarchical Schema

30 What type of metadata allows tools to understand how to navigate the data (e.g., mapping source to target)?

A. Business Metadata
B. Technical/Operational Metadata
C. Social Metadata
D. Administrative Metadata

31 A 'Dependent Data Mart' is sourced from:

A. Directly from operational systems
B. The Enterprise Data Warehouse
C. External flat files only
D. User input

32 An 'Independent Data Mart' is sourced from:

A. The Enterprise Data Warehouse
B. Directly from operational source systems
C. Another Data Mart
D. The backup server

33 Which of the following is a security concern in Data Warehousing?

A. Keeping data volatile
B. Ensuring data is not time-variant
C. Controlling access to sensitive aggregated data
D. Reducing data redundancy

34 What is the purpose of a 'Surrogate Key' in a dimension table?

A. To replace the natural primary key with an internal system integer
B. To encrypt the data
C. To link to the internet
D. To store the date

35 In the Three-Tier Data Warehouse Architecture, what constitutes the bottom tier?

A. OLAP Server
B. Front-end tools
C. Data Warehouse Database Server
D. Web Interface

36 In the Three-Tier Data Warehouse Architecture, what is the middle tier?

A. Relational Database
B. OLAP Server
C. Reporting Tools
D. Data Mining Tools

37 Which of the following is a benefit of Data Warehousing?

A. Increases transactional processing speed
B. Enhances business intelligence and decision making
C. Reduces storage requirements
D. Replaces the need for backups

38 What is 'Data Scrubbing'?

A. Deleting data permanently
B. Merging data marts
C. Cleaning data to remove inconsistencies and errors
D. Compressing data

39 The operation of rotating the data axes in an OLAP cube to view data from different perspectives is called:

A. Slice
B. Dice
C. Pivot
D. Drill-up

40 Which backup strategy involves copying the entire data warehouse?

A. Full Backup
B. Incremental Backup
C. Differential Backup
D. Partial Backup

41 Why is backup crucial for a Data Warehouse despite it being a read-mostly system?

A. Data is volatile
B. To recover from hardware failure or data corruption
C. To increase query speed
D. To allow write operations

42 A 'Virtual Data Warehouse' is:

A. A set of views over operational databases
B. A cloud-based physical storage
C. A simulation of a database
D. An empty database

43 Which tool category allows end-users to create reports and graphs from the Data Warehouse?

A. ETL Tools
B. Data Access/Reporting Tools
C. Database Management Systems
D. Operating Systems

44 In a Fact Table, a 'Measure' that can be summed across all dimensions is called:

A. Additive
B. Semi-additive
C. Non-additive
D. Subtractive

45 A 'Semi-additive' measure is one that:

A. Can be summed across all dimensions
B. Can be summed across some dimensions but not others
C. Cannot be summed at all
D. Is always text

46 Which utility is used to monitor the performance of the Data Warehouse?

A. Load Manager
B. System/Performance Monitor
C. Query Manager
D. Warehouse Manager

47 The define-measure-analyze-improve-control cycle is often supported by data from:

A. Text Editors
B. Data Warehouses
C. Network Routers
D. Web Browsers

48 What is the specific challenge of 'Data Marting' when done independently?

A. It is too expensive
B. It creates inconsistent 'islands of information'
C. It is too slow
D. It requires too much storage

49 Which of the following best describes 'Business Metadata'?

A. Physical database schema details
B. Terms, definitions, and ownership info for business users
C. Log files of server activity
D. Programming code for ETL

50 In the context of Informatica and DW, what is a 'Repository'?

A. The hardware where data is stored
B. A central database storing metadata and transformation rules
C. The final report generated
D. The operational source system