Unit 3 - Practice Quiz

INT323 50 Questions
0 Correct 0 Wrong 50 Left
0/50

1 Who is widely recognized as the 'Father of Data Warehousing'?

A. E.F. Codd
B. Bill Inmon
C. Peter Chen
D. Ralph Kimball

2 A Data Warehouse is defined as a subject-oriented, integrated, time-variant, and ____ collection of data.

A. Volatile
B. Transient
C. Virtual
D. Non-volatile

3 Which characteristic of a Data Warehouse implies that data is organized around major entities such as customer, product, and sales?

A. Integrated
B. Non-volatile
C. Time-variant
D. Subject-oriented

4 In the context of Data Warehousing, what does 'Integrated' mean?

A. Data is stored in a single file
B. Data is collected from various sources and standardized
C. Data is encrypted for security
D. Data is accessible via the web

5 What is the primary purpose of the 'Staging Area' in Data Warehouse architecture?

A. To clean and transform data before loading it into the warehouse
B. To provide query access to end-users
C. To store data permanently
D. To store meta-data only

6 Which type of data represents the 'Time-variant' nature of a Data Warehouse?

A. Future predictive data
B. Historical data spanning a specific period
C. Current real-time data only
D. Temporary session data

7 What is a 'Data Mart'?

A. A backup storage device
B. A tool for data mining
C. A subset of a data warehouse focused on a specific department or function
D. A complete enterprise-wide data warehouse

8 Which architecture approach advocates building the Enterprise Data Warehouse (EDW) first and then creating Data Marts?

A. Agile Approach
B. Spiral Approach
C. Bottom-up Approach
D. Top-down Approach

9 The 'Bottom-up' approach to Data Warehousing, which starts with Data Marts, is associated with:

A. Edgar Codd
B. Larry Ellison
C. Ralph Kimball
D. Bill Inmon

10 What does OLAP stand for?

A. Online Analytical Processing
B. Online Advanced Programming
C. Overlay Analytical Platform
D. Offline Analysis Protocol

11 Which table in a schema contains the quantitative data (numbers/metrics) for analysis?

A. Meta Table
B. Dimension Table
C. Index Table
D. Fact Table

12 Which table contains descriptive attributes used for query filtering and labeling?

A. Measure Table
B. Dimension Table
C. Fact Table
D. Summary Table

13 In a Star Schema, how are tables arranged?

A. A central fact table connected to multiple dimension tables
B. A central dimension table connected to multiple fact tables
C. Multiple fact tables sharing dimension tables
D. Normalized tables linked in a chain

14 What distinguishes a Snowflake Schema from a Star Schema?

A. The Fact table is normalized
B. It uses only one table
C. The Dimension tables are normalized
D. It does not support SQL

15 A 'Fact Constellation' schema is also known as:

A. Bus Schema
B. Galaxy Schema
C. Star Schema
D. Snowflake Schema

16 What is 'Meta-data' in the context of a Data Warehouse?

A. The actual business data
B. Data about data
C. Backup data
D. Deleted data

17 Which OLAP operation allows moving from summary data to detailed data?

A. Roll-up
B. Drill-down
C. Dice
D. Slice

18 Which OLAP operation performs aggregation on a data cube (e.g., climbing up the hierarchy)?

A. Drill-down
B. Roll-up
C. Slice
D. Pivot

19 Selecting a single value for one dimension to create a subset of the cube is called:

A. Pivoting
B. Slicing
C. Dicing
D. Drilling

20 Which type of OLAP stores data in a multidimensional array-based storage engine?

A. MOLAP
B. HOLAP
C. DOLAP
D. ROLAP

21 ROLAP stands for:

A. Relational Online Analytical Processing
B. Real-time Online Analytical Processing
C. Rapid Online Analytical Processing
D. Remote Online Analytical Processing

22 HOLAP attempts to combine the best features of:

A. Data Mining and Data Warehousing
B. Star and Snowflake Schemas
C. ROLAP and MOLAP
D. OLTP and OLAP

23 Which of the following is NOT a typical application of a Data Warehouse?

A. Decision Support Systems (DSS)
B. Transaction Processing (OLTP)
C. Reporting and Forecasting
D. Trend Analysis

24 What is the acronym ETL short for?

A. Extract, Transform, Load
B. Extract, Test, Lock
C. Enter, Transform, List
D. Evaluate, Transfer, Load

25 In the context of ETL, 'Transformation' includes:

A. Giving users read access
B. Cleaning, filtering, and aggregating data
C. Installing database software
D. Backing up data to tape

26 What is an ODS in Data Warehousing architecture?

A. Operational Data Store
B. Object Data Source
C. Online Decision System
D. Offline Data Storage

27 Which utility is primarily used to refresh a data warehouse with new data?

A. Query Manager
B. Security Manager
C. Data Loading Utility
D. Index Creator

28 What is the 'Grain' of a fact table?

A. The number of columns
B. The type of database used
C. The size of the table on disk
D. The level of detail represented by a single row

29 Which schema results in the highest performance for queries due to fewer joins?

A. Star Schema
B. Normalized Schema
C. Hierarchical Schema
D. Snowflake Schema

30 What type of metadata allows tools to understand how to navigate the data (e.g., mapping source to target)?

A. Business Metadata
B. Administrative Metadata
C. Technical/Operational Metadata
D. Social Metadata

31 A 'Dependent Data Mart' is sourced from:

A. Directly from operational systems
B. The Enterprise Data Warehouse
C. External flat files only
D. User input

32 An 'Independent Data Mart' is sourced from:

A. The Enterprise Data Warehouse
B. Another Data Mart
C. Directly from operational source systems
D. The backup server

33 Which of the following is a security concern in Data Warehousing?

A. Keeping data volatile
B. Ensuring data is not time-variant
C. Reducing data redundancy
D. Controlling access to sensitive aggregated data

34 What is the purpose of a 'Surrogate Key' in a dimension table?

A. To encrypt the data
B. To store the date
C. To replace the natural primary key with an internal system integer
D. To link to the internet

35 In the Three-Tier Data Warehouse Architecture, what constitutes the bottom tier?

A. OLAP Server
B. Data Warehouse Database Server
C. Front-end tools
D. Web Interface

36 In the Three-Tier Data Warehouse Architecture, what is the middle tier?

A. Relational Database
B. OLAP Server
C. Reporting Tools
D. Data Mining Tools

37 Which of the following is a benefit of Data Warehousing?

A. Reduces storage requirements
B. Replaces the need for backups
C. Increases transactional processing speed
D. Enhances business intelligence and decision making

38 What is 'Data Scrubbing'?

A. Merging data marts
B. Cleaning data to remove inconsistencies and errors
C. Deleting data permanently
D. Compressing data

39 The operation of rotating the data axes in an OLAP cube to view data from different perspectives is called:

A. Pivot
B. Slice
C. Drill-up
D. Dice

40 Which backup strategy involves copying the entire data warehouse?

A. Partial Backup
B. Full Backup
C. Incremental Backup
D. Differential Backup

41 Why is backup crucial for a Data Warehouse despite it being a read-mostly system?

A. To recover from hardware failure or data corruption
B. Data is volatile
C. To allow write operations
D. To increase query speed

42 A 'Virtual Data Warehouse' is:

A. A cloud-based physical storage
B. A set of views over operational databases
C. An empty database
D. A simulation of a database

43 Which tool category allows end-users to create reports and graphs from the Data Warehouse?

A. Operating Systems
B. ETL Tools
C. Database Management Systems
D. Data Access/Reporting Tools

44 In a Fact Table, a 'Measure' that can be summed across all dimensions is called:

A. Additive
B. Semi-additive
C. Subtractive
D. Non-additive

45 A 'Semi-additive' measure is one that:

A. Is always text
B. Cannot be summed at all
C. Can be summed across all dimensions
D. Can be summed across some dimensions but not others

46 Which utility is used to monitor the performance of the Data Warehouse?

A. Warehouse Manager
B. System/Performance Monitor
C. Load Manager
D. Query Manager

47 The define-measure-analyze-improve-control cycle is often supported by data from:

A. Network Routers
B. Text Editors
C. Web Browsers
D. Data Warehouses

48 What is the specific challenge of 'Data Marting' when done independently?

A. It is too slow
B. It is too expensive
C. It requires too much storage
D. It creates inconsistent 'islands of information'

49 Which of the following best describes 'Business Metadata'?

A. Log files of server activity
B. Physical database schema details
C. Programming code for ETL
D. Terms, definitions, and ownership info for business users

50 In the context of Informatica and DW, what is a 'Repository'?

A. The operational source system
B. The hardware where data is stored
C. The final report generated
D. A central database storing metadata and transformation rules