Unit 3 - Practice Quiz

INT323 50 Questions
0 Correct 0 Wrong 50 Left
0/50

1 Who is widely recognized as the 'Father of Data Warehousing'?

A. E.F. Codd
B. Peter Chen
C. Bill Inmon
D. Ralph Kimball

2 A Data Warehouse is defined as a subject-oriented, integrated, time-variant, and ____ collection of data.

A. Volatile
B. Non-volatile
C. Transient
D. Virtual

3 Which characteristic of a Data Warehouse implies that data is organized around major entities such as customer, product, and sales?

A. Time-variant
B. Subject-oriented
C. Integrated
D. Non-volatile

4 In the context of Data Warehousing, what does 'Integrated' mean?

A. Data is collected from various sources and standardized
B. Data is accessible via the web
C. Data is encrypted for security
D. Data is stored in a single file

5 What is the primary purpose of the 'Staging Area' in Data Warehouse architecture?

A. To store meta-data only
B. To provide query access to end-users
C. To store data permanently
D. To clean and transform data before loading it into the warehouse

6 Which type of data represents the 'Time-variant' nature of a Data Warehouse?

A. Historical data spanning a specific period
B. Current real-time data only
C. Future predictive data
D. Temporary session data

7 What is a 'Data Mart'?

A. A complete enterprise-wide data warehouse
B. A tool for data mining
C. A backup storage device
D. A subset of a data warehouse focused on a specific department or function

8 Which architecture approach advocates building the Enterprise Data Warehouse (EDW) first and then creating Data Marts?

A. Spiral Approach
B. Agile Approach
C. Bottom-up Approach
D. Top-down Approach

9 The 'Bottom-up' approach to Data Warehousing, which starts with Data Marts, is associated with:

A. Ralph Kimball
B. Bill Inmon
C. Larry Ellison
D. Edgar Codd

10 What does OLAP stand for?

A. Overlay Analytical Platform
B. Offline Analysis Protocol
C. Online Advanced Programming
D. Online Analytical Processing

11 Which table in a schema contains the quantitative data (numbers/metrics) for analysis?

A. Dimension Table
B. Index Table
C. Fact Table
D. Meta Table

12 Which table contains descriptive attributes used for query filtering and labeling?

A. Measure Table
B. Dimension Table
C. Fact Table
D. Summary Table

13 In a Star Schema, how are tables arranged?

A. Multiple fact tables sharing dimension tables
B. A central dimension table connected to multiple fact tables
C. Normalized tables linked in a chain
D. A central fact table connected to multiple dimension tables

14 What distinguishes a Snowflake Schema from a Star Schema?

A. The Dimension tables are normalized
B. The Fact table is normalized
C. It does not support SQL
D. It uses only one table

15 A 'Fact Constellation' schema is also known as:

A. Galaxy Schema
B. Snowflake Schema
C. Star Schema
D. Bus Schema

16 What is 'Meta-data' in the context of a Data Warehouse?

A. The actual business data
B. Data about data
C. Deleted data
D. Backup data

17 Which OLAP operation allows moving from summary data to detailed data?

A. Drill-down
B. Dice
C. Roll-up
D. Slice

18 Which OLAP operation performs aggregation on a data cube (e.g., climbing up the hierarchy)?

A. Slice
B. Roll-up
C. Drill-down
D. Pivot

19 Selecting a single value for one dimension to create a subset of the cube is called:

A. Slicing
B. Pivoting
C. Drilling
D. Dicing

20 Which type of OLAP stores data in a multidimensional array-based storage engine?

A. ROLAP
B. DOLAP
C. HOLAP
D. MOLAP

21 ROLAP stands for:

A. Rapid Online Analytical Processing
B. Real-time Online Analytical Processing
C. Relational Online Analytical Processing
D. Remote Online Analytical Processing

22 HOLAP attempts to combine the best features of:

A. ROLAP and MOLAP
B. OLTP and OLAP
C. Data Mining and Data Warehousing
D. Star and Snowflake Schemas

23 Which of the following is NOT a typical application of a Data Warehouse?

A. Decision Support Systems (DSS)
B. Transaction Processing (OLTP)
C. Trend Analysis
D. Reporting and Forecasting

24 What is the acronym ETL short for?

A. Enter, Transform, List
B. Evaluate, Transfer, Load
C. Extract, Test, Lock
D. Extract, Transform, Load

25 In the context of ETL, 'Transformation' includes:

A. Backing up data to tape
B. Installing database software
C. Cleaning, filtering, and aggregating data
D. Giving users read access

26 What is an ODS in Data Warehousing architecture?

A. Online Decision System
B. Operational Data Store
C. Offline Data Storage
D. Object Data Source

27 Which utility is primarily used to refresh a data warehouse with new data?

A. Index Creator
B. Query Manager
C. Security Manager
D. Data Loading Utility

28 What is the 'Grain' of a fact table?

A. The level of detail represented by a single row
B. The number of columns
C. The size of the table on disk
D. The type of database used

29 Which schema results in the highest performance for queries due to fewer joins?

A. Hierarchical Schema
B. Star Schema
C. Snowflake Schema
D. Normalized Schema

30 What type of metadata allows tools to understand how to navigate the data (e.g., mapping source to target)?

A. Business Metadata
B. Social Metadata
C. Administrative Metadata
D. Technical/Operational Metadata

31 A 'Dependent Data Mart' is sourced from:

A. External flat files only
B. User input
C. Directly from operational systems
D. The Enterprise Data Warehouse

32 An 'Independent Data Mart' is sourced from:

A. Another Data Mart
B. The backup server
C. Directly from operational source systems
D. The Enterprise Data Warehouse

33 Which of the following is a security concern in Data Warehousing?

A. Ensuring data is not time-variant
B. Reducing data redundancy
C. Controlling access to sensitive aggregated data
D. Keeping data volatile

34 What is the purpose of a 'Surrogate Key' in a dimension table?

A. To store the date
B. To replace the natural primary key with an internal system integer
C. To encrypt the data
D. To link to the internet

35 In the Three-Tier Data Warehouse Architecture, what constitutes the bottom tier?

A. Front-end tools
B. Data Warehouse Database Server
C. Web Interface
D. OLAP Server

36 In the Three-Tier Data Warehouse Architecture, what is the middle tier?

A. Data Mining Tools
B. OLAP Server
C. Reporting Tools
D. Relational Database

37 Which of the following is a benefit of Data Warehousing?

A. Increases transactional processing speed
B. Enhances business intelligence and decision making
C. Reduces storage requirements
D. Replaces the need for backups

38 What is 'Data Scrubbing'?

A. Cleaning data to remove inconsistencies and errors
B. Deleting data permanently
C. Compressing data
D. Merging data marts

39 The operation of rotating the data axes in an OLAP cube to view data from different perspectives is called:

A. Dice
B. Slice
C. Drill-up
D. Pivot

40 Which backup strategy involves copying the entire data warehouse?

A. Differential Backup
B. Partial Backup
C. Full Backup
D. Incremental Backup

41 Why is backup crucial for a Data Warehouse despite it being a read-mostly system?

A. To increase query speed
B. Data is volatile
C. To recover from hardware failure or data corruption
D. To allow write operations

42 A 'Virtual Data Warehouse' is:

A. A simulation of a database
B. An empty database
C. A cloud-based physical storage
D. A set of views over operational databases

43 Which tool category allows end-users to create reports and graphs from the Data Warehouse?

A. Database Management Systems
B. ETL Tools
C. Operating Systems
D. Data Access/Reporting Tools

44 In a Fact Table, a 'Measure' that can be summed across all dimensions is called:

A. Semi-additive
B. Non-additive
C. Subtractive
D. Additive

45 A 'Semi-additive' measure is one that:

A. Cannot be summed at all
B. Can be summed across all dimensions
C. Can be summed across some dimensions but not others
D. Is always text

46 Which utility is used to monitor the performance of the Data Warehouse?

A. System/Performance Monitor
B. Load Manager
C. Warehouse Manager
D. Query Manager

47 The define-measure-analyze-improve-control cycle is often supported by data from:

A. Network Routers
B. Data Warehouses
C. Text Editors
D. Web Browsers

48 What is the specific challenge of 'Data Marting' when done independently?

A. It is too expensive
B. It is too slow
C. It requires too much storage
D. It creates inconsistent 'islands of information'

49 Which of the following best describes 'Business Metadata'?

A. Programming code for ETL
B. Physical database schema details
C. Log files of server activity
D. Terms, definitions, and ownership info for business users

50 In the context of Informatica and DW, what is a 'Repository'?

A. The hardware where data is stored
B. The operational source system
C. The final report generated
D. A central database storing metadata and transformation rules