Unit 5 - Practice Quiz

INT395 50 Questions
0 Correct 0 Wrong 50 Left
0/50

1 Which of the following best describes a Time Series dataset?

A. A dataset where the target variable is categorical and independent of time.
B. A dataset containing only images captured at different locations.
C. A sequence of data points indexed in time order, often with equal intervals.
D. A collection of data points collected from random individuals at a single point in time.

2 In the context of Time Series analysis, what does Stationarity imply?

A. The statistical properties (mean, variance, covariance) are constant over time.
B. The data contains no missing values.
C. The time series has a constant trend increasing over time.
D. The seasonal component of the data is strictly periodic.

3 Which component of a time series represents a long-term increase or decrease in the data?

A. Seasonality
B. Noise (Irregularity)
C. Trend
D. Cyclicality

4 What is the difference between Univariate and Multivariate time series?

A. Univariate is for stationary data; Multivariate is for non-stationary data.
B. Univariate uses regression; Multivariate uses classification.
C. Univariate involves a single variable varying over time; Multivariate involves multiple variables varying over time.
D. Univariate has irregular intervals; Multivariate has regular intervals.

5 When preparing time series data for supervised learning, what is a Lag Feature?

A. Observation from a previous time step used as an input feature (e.g., ).
B. The difference between the current value and the mean.
C. A future value used to predict the past.
D. The error term from a moving average model.

6 Which of the following is a common method to handle Irregular Intervals in time series data before modeling?

A. Ignoring the time index and treating it as cross-sectional data.
B. Using a random number generator to fill gaps.
C. Resampling or Interpolation to a fixed frequency.
D. Dropping all timestamps.

7 Why is K-Fold Cross-Validation with random shuffling inappropriate for time series data?

A. Time series data is always too small for K-Fold.
B. It assumes the data is non-stationary.
C. It is too computationally expensive.
D. It causes data leakage by using future data to predict past data.

8 What is the Walk-Forward Validation (or Time Series Split) strategy?

A. Using a sliding or expanding window where the training set consists of past data and the test set consists of immediate future data.
B. Randomly selecting 20% of the data for testing.
C. Splitting data based on the magnitude of the values.
D. Training on all data and testing on the same data.

9 Which mathematical operation is commonly used to transform a non-stationary series with a trend into a stationary one?

A. Squaring
B. Integration
C. Differencing (e.g., )
D. Multiplication

10 In an Autoregressive (AR) model of order , denoted as AR(), the current value is predicted based on:

A. Past forecast errors.
B. A linear combination of past values.
C. Independent external variables only.
D. A moving average of the entire dataset.

11 The equation represents which model?

A. AR(1)
B. ARIMA(1,1,1)
C. MA(1)
D. Random Walk

12 What does the 'I' in ARIMA stand for?

A. Interpolated
B. Independent
C. Integrated
D. Irregular

13 In the notation ARIMA(p, d, q), what does the parameter represent?

A. The seasonality period.
B. The size of the moving average window.
C. The number of times that the raw observations are differenced.
D. The number of lag observations included in the model.

14 Which plot is most useful for determining the order of a Moving Average (MA) model?

A. Histogram
B. Scatter plot
C. Partial Autocorrelation Function (PACF)
D. Autocorrelation Function (ACF)

15 Which plot is most useful for determining the order of an Autoregressive (AR) model?

A. Autocorrelation Function (ACF)
B. Partial Autocorrelation Function (PACF)
C. Spectral Density Plot
D. Box Plot

16 What distinguishes SARIMA from ARIMA?

A. SARIMA does not require stationarity.
B. SARIMA handles missing values better.
C. SARIMA is only for multivariate data.
D. SARIMA explicitly models Seasonal components.

17 In SARIMA notation , what does represent?

A. The maximum value of the series.
B. The number of time steps for a single seasonal period.
C. The number of features.
D. The order of the polynomial trend.

18 If a time series is a White Noise process, which of the following is true?

A. It can be perfectly predicted using linear regression.
B. It strictly follows a sinusoidal wave.
C. It has zero autocorrelation for all lags .
D. Its mean and variance change rapidly over time.

19 What is the primary risk of using standard Min-Max Scaling on the entire dataset before splitting into train/test in time series?

A. It introduces Lookahead Bias (Data Leakage).
B. It makes the data non-stationary.
C. It changes the time order.
D. It removes the seasonality.

20 Which test is commonly used to check for Stationarity?

A. ANOVA
B. Chi-Square test
C. Augmented Dickey-Fuller (ADF) test
D. T-test

21 Consider the equation: . This is an example of:

A. MA(1) process.
B. Stationary AR(1) process.
C. Explosive AR(1) process.
D. AR(1) with unit root.

22 In the context of Time Series Regression, what are Exogenous Variables?

A. Variables representing the error term.
B. Internal lag variables derived from the target.
C. Variables that are predicted by the model.
D. External variables that influence the target but are not influenced by it.

23 If a time series exhibits a variance that increases over time (heteroscedasticity), which transformation is commonly applied to stabilize variance?

A. Standardization
B. Logarithmic Transformation
C. Adding a constant
D. Differencing

24 What is an Expanding Window split strategy?

A. The test set size grows while the training set shrinks.
B. The training set size remains constant and slides forward.
C. Data is shuffled and split randomly.
D. The training set starts small and grows in size as new data becomes available.

25 What is the consequence of Over-differencing a time series?

A. It increases the variance of the trend.
B. The series becomes perfectly linear.
C. It makes the trend more pronounced.
D. It introduces artificial noise and negative autocorrelation.

26 Which of the following describes a Random Walk model?

A. ARIMA(1, 0, 0)
B. ARIMA(0, 0, 1)
C. ARIMA(1, 1, 1)
D. ARIMA(0, 1, 0)

27 In an AR(1) model , if , the series is:

A. White Noise
B. Stationary
C. A Random Walk (Non-stationary)
D. Mean reverting

28 Which metric provides a penalty for adding parameters to a model, helping to avoid overfitting in ARIMA model selection?

A. R-Squared
B. MSE (Mean Squared Error)
C. MAE (Mean Absolute Error)
D. AIC (Akaike Information Criterion)

29 For a monthly time series with strong yearly seasonality, what is the value of the seasonal period parameter ?

A. 30
B. 12
C. 4
D. 7

30 What is Seasonal Differencing?

A.
B.
C.
D. where is the seasonal period.

31 A model defined as is an:

A. ARMA(1,1)
B. AR(1)
C. AR(2)
D. MA(2)

32 The Ljung-Box Test is applied to the residuals of a time series model to check:

A. If the residuals differ significantly from white noise (i.e., if there is remaining autocorrelation).
B. If the model parameters are statistically significant.
C. If the original data was stationary.
D. If the residuals are normally distributed.

33 How are missing values typically handled in time series if dropping them is not an option?

A. Time-based interpolation (e.g., linear) or Forward Fill.
B. Replace with the dataset maximum.
C. Replace with the dataset standard deviation.
D. Replace with zero.

34 Which component is represented by in SARIMA?

A. Seasonal Autoregressive order
B. Seasonal Moving Average order
C. Trend Differencing order
D. Seasonal Differencing order

35 If the ACF plot decays slowly (linear decay) while the PACF shows a sharp cutoff at lag 1, the series is likely:

A. AR(1)
B. MA(1)
C. White Noise
D. ARMA(1,1)

36 In time series regression, what is a Rolling Window feature?

A. A window that randomly selects features.
B. A feature computed by aggregating (mean, max, etc.) values over a fixed-size window of past observations.
C. A technique to increase the sampling rate.
D. A feature that looks into the future.

37 Which of the following equations represents an ARIMA(1,1,1) model (excluding constant)? Let .

A.
B.
C.
D.

38 When encoding cyclical time features like 'Month of Year' (1-12) for regression, what is the best approach to preserve the cycle (i.e., Dec is close to Jan)?

A. Sine and Cosine transformation
B. One-Hot Encoding
C. Label Encoding (1, 2, ..., 12)
D. Binary Encoding

39 What is Naive Forecasting?

A. Using a complex neural network.
B. Predicting the value at time to be equal to the value at time .
C. Predicting zero for all future steps.
D. Predicting the current value as the average of all past values.

40 What is the key assumption of the Moving Average (MA) model regarding shocks/errors?

A. Current value depends on past values of the variable.
B. Current value depends on current and past unobserved white noise error terms.
C. Errors increase exponentially with time.
D. Errors are correlated with the independent variables.

41 In the context of SARIMA, what is an Exogenous regressor often denoted as?

A. ARIMAE
B. ARIMAS
C. XY-ARIMA
D. SARIMAX

42 If a time series has a unit root, the series is:

A. Stationary
B. Non-Stationary
C. Deterministic
D. Seasonal

43 Which ARIMA model is theoretically equivalent to Simple Exponential Smoothing?

A. ARIMA(0,0,0)
B. ARIMA(0,1,1)
C. ARIMA(1,0,0)
D. ARIMA(2,2,2)

44 What is the purpose of Auto-Regressive terms in a model?

A. To model the relationship between an observation and a lagged observation.
B. To remove seasonality.
C. To model the relationship between an observation and a residual error.
D. To integrate the time series.

45 Why is it important to examine the Residuals of a fitted time series model?

A. To ensure they resemble random white noise.
B. To ensure they have high autocorrelation.
C. To ensure they are all positive.
D. To ensure they follow a clear trend.

46 What implies that a Time Series is Deterministic?

A. It contains a stochastic component.
B. It can be perfectly predicted by a mathematical function of time (e.g., ).
C. It is a random walk.
D. It has infinite variance.

47 In a Multi-step forecast strategy, what is the 'Direct' method?

A. Using only the most recent observation.
B. Building a separate model for each future time step to be predicted.
C. Ideally predicting one step ahead and using that prediction for the next step.
D. Averaging the results of AR and MA models.

48 What is the 'Recursive' (or Iterative) multi-step forecast strategy?

A. Predicting , then using that prediction as input to predict , and so on.
B. Training on future data.
C. Using a different algorithm for every step.
D. Predicting all steps simultaneously using a vector output.

49 Which transformation is the inverse of Differencing?

A. Logarithm
B. Cumulative Sum (Integration)
C. Exponentiation
D. Differentiation

50 When is an ARIMA model considered Invertible?

A. When .
B. When it can be rewritten as an infinite AR process.
C. When the AR polynomial roots lie inside the unit circle.
D. When the MA polynomial roots lie outside the unit circle.