1Which of the following best describes a Time Series dataset?
A.A collection of data points collected from random individuals at a single point in time.
B.A sequence of data points indexed in time order, often with equal intervals.
C.A dataset where the target variable is categorical and independent of time.
D.A dataset containing only images captured at different locations.
Correct Answer: A sequence of data points indexed in time order, often with equal intervals.
Explanation:A time series is a sequence of numerical data points in successive order, usually occurring at uniform intervals (e.g., daily stock prices, hourly temperature).
Incorrect! Try again.
2In the context of Time Series analysis, what does Stationarity imply?
A.The time series has a constant trend increasing over time.
B.The statistical properties (mean, variance, covariance) are constant over time.
C.The data contains no missing values.
D.The seasonal component of the data is strictly periodic.
Correct Answer: The statistical properties (mean, variance, covariance) are constant over time.
Explanation:A stationary time series is one whose statistical properties such as mean, variance, and autocorrelation are all constant over time. This is a key assumption for many models like ARIMA.
Incorrect! Try again.
3Which component of a time series represents a long-term increase or decrease in the data?
A.Seasonality
B.Noise (Irregularity)
C.Trend
D.Cyclicality
Correct Answer: Trend
Explanation:The Trend represents the underlying long-term direction in the data (increasing or decreasing) over a long period, ignoring short-term fluctuations.
Incorrect! Try again.
4What is the difference between Univariate and Multivariate time series?
A.Univariate has irregular intervals; Multivariate has regular intervals.
B.Univariate involves a single variable varying over time; Multivariate involves multiple variables varying over time.
D.Univariate is for stationary data; Multivariate is for non-stationary data.
Correct Answer: Univariate involves a single variable varying over time; Multivariate involves multiple variables varying over time.
Explanation:Univariate time series consists of a single sequence of observations (e.g., just temperature). Multivariate consists of multiple changing variables that may influence each other (e.g., temperature, humidity, and wind speed).
Incorrect! Try again.
5When preparing time series data for supervised learning, what is a Lag Feature?
A.The difference between the current value and the mean.
B.A future value used to predict the past.
C.Observation from a previous time step used as an input feature (e.g., ).
D.The error term from a moving average model.
Correct Answer: Observation from a previous time step used as an input feature (e.g., ).
Explanation:Lag features involve shifting the time series data so that past values (e.g., ) become input features to predict the current value .
Incorrect! Try again.
6Which of the following is a common method to handle Irregular Intervals in time series data before modeling?
A.Dropping all timestamps.
B.Resampling or Interpolation to a fixed frequency.
C.Using a random number generator to fill gaps.
D.Ignoring the time index and treating it as cross-sectional data.
Correct Answer: Resampling or Interpolation to a fixed frequency.
Explanation:To apply standard time series algorithms, data is often resampled (e.g., aggregation) or interpolated (e.g., linear fill) to ensure a constant time interval between observations.
Incorrect! Try again.
7Why is K-Fold Cross-Validation with random shuffling inappropriate for time series data?
A.It is too computationally expensive.
B.It causes data leakage by using future data to predict past data.
C.Time series data is always too small for K-Fold.
D.It assumes the data is non-stationary.
Correct Answer: It causes data leakage by using future data to predict past data.
Explanation:Random shuffling destroys the temporal order. If you train on future data and test on past data (look-ahead bias), the model performance estimates will be overly optimistic and unrealistic.
Incorrect! Try again.
8What is the Walk-Forward Validation (or Time Series Split) strategy?
A.Randomly selecting 20% of the data for testing.
B.Training on all data and testing on the same data.
C.Using a sliding or expanding window where the training set consists of past data and the test set consists of immediate future data.
D.Splitting data based on the magnitude of the values.
Correct Answer: Using a sliding or expanding window where the training set consists of past data and the test set consists of immediate future data.
Explanation:Walk-Forward Validation respects temporal order. The model is trained on a sequence of past data and evaluated on the subsequent time steps, simulating real-world forecasting.
Incorrect! Try again.
9Which mathematical operation is commonly used to transform a non-stationary series with a trend into a stationary one?
A.Integration
B.Differencing (e.g., )
C.Multiplication
D.Squaring
Correct Answer: Differencing (e.g., )
Explanation:Differencing involves computing the change between consecutive observations. It stabilizes the mean of a time series by removing changes in the level of a time series, thus eliminating trend.
Incorrect! Try again.
10In an Autoregressive (AR) model of order , denoted as AR(), the current value is predicted based on:
A.Past forecast errors.
B.A linear combination of past values.
C.A moving average of the entire dataset.
D.Independent external variables only.
Correct Answer: A linear combination of past values.
Explanation:An AR() model predicts the variable of interest using a linear combination of its past values: .
Incorrect! Try again.
11The equation represents which model?
A.AR(1)
B.MA(1)
C.ARIMA(1,1,1)
D.Random Walk
Correct Answer: MA(1)
Explanation:This is a Moving Average model of order 1, MA(1). It models the value as a linear combination of the mean and the current and past error terms (shocks).
Incorrect! Try again.
12What does the 'I' in ARIMA stand for?
A.Independent
B.Interpolated
C.Integrated
D.Irregular
Correct Answer: Integrated
Explanation:The 'I' stands for Integrated. It refers to the differencing of raw observations (subtracting an observation from an observation at the previous time step) to make the time series stationary.
Incorrect! Try again.
13In the notation ARIMA(p, d, q), what does the parameter represent?
A.The number of lag observations included in the model.
B.The size of the moving average window.
C.The number of times that the raw observations are differenced.
D.The seasonality period.
Correct Answer: The number of times that the raw observations are differenced.
Explanation: is the degree of differencing required to make the time series stationary (i.e., how many times we subtract the previous value from the current value).
Incorrect! Try again.
14Which plot is most useful for determining the order of a Moving Average (MA) model?
A.Scatter plot
B.Histogram
C.Partial Autocorrelation Function (PACF)
D.Autocorrelation Function (ACF)
Correct Answer: Autocorrelation Function (ACF)
Explanation:For an MA() process, the ACF cuts off after lag . Thus, the ACF plot is used to identify the order of the MA component.
Incorrect! Try again.
15Which plot is most useful for determining the order of an Autoregressive (AR) model?
A.Autocorrelation Function (ACF)
B.Partial Autocorrelation Function (PACF)
C.Box Plot
D.Spectral Density Plot
Correct Answer: Partial Autocorrelation Function (PACF)
Explanation:For an AR() process, the PACF cuts off after lag , while the ACF tails off. Therefore, PACF is used to determine the AR order.
Explanation:SARIMA (Seasonal ARIMA) extends ARIMA by adding seasonal autoregressive and moving average terms to handle data with repeating cycles (seasonality).
Incorrect! Try again.
17In SARIMA notation , what does represent?
A.The maximum value of the series.
B.The number of features.
C.The number of time steps for a single seasonal period.
D.The order of the polynomial trend.
Correct Answer: The number of time steps for a single seasonal period.
Explanation: refers to the periodicity of the seasonality (e.g., for monthly data with a yearly cycle, for daily data with a weekly cycle).
Incorrect! Try again.
18If a time series is a White Noise process, which of the following is true?
A.It can be perfectly predicted using linear regression.
B.Its mean and variance change rapidly over time.
C.It has zero autocorrelation for all lags .
D.It strictly follows a sinusoidal wave.
Correct Answer: It has zero autocorrelation for all lags .
Explanation:White noise is a sequence of random numbers with constant mean (usually 0), constant variance, and no correlation between values at different times (uncorrelated).
Incorrect! Try again.
19What is the primary risk of using standard Min-Max Scaling on the entire dataset before splitting into train/test in time series?
A.It makes the data non-stationary.
B.It introduces Lookahead Bias (Data Leakage).
C.It changes the time order.
D.It removes the seasonality.
Correct Answer: It introduces Lookahead Bias (Data Leakage).
Explanation:If you scale using the min/max of the whole dataset, the training data contains information about the range of the future test data. Scaling should be fitted only on the training set.
Incorrect! Try again.
20Which test is commonly used to check for Stationarity?
A.T-test
B.Augmented Dickey-Fuller (ADF) test
C.Chi-Square test
D.ANOVA
Correct Answer: Augmented Dickey-Fuller (ADF) test
Explanation:The ADF test checks the null hypothesis that a unit root is present in a time series sample (indicating non-stationarity). A low p-value indicates the series is stationary.
Incorrect! Try again.
21Consider the equation: . This is an example of:
A.AR(1) with unit root.
B.Stationary AR(1) process.
C.Explosive AR(1) process.
D.MA(1) process.
Correct Answer: Stationary AR(1) process.
Explanation:This is an AR(1) process . Since the absolute value of the coefficient , the process is stationary.
Incorrect! Try again.
22In the context of Time Series Regression, what are Exogenous Variables?
A.Variables that are predicted by the model.
B.Internal lag variables derived from the target.
C.External variables that influence the target but are not influenced by it.
D.Variables representing the error term.
Correct Answer: External variables that influence the target but are not influenced by it.
Explanation:Exogenous variables (often denoted as in ARIMAX) are parallel input time series that help explain the behavior of the target variable (e.g., Temperature helping to predict Ice Cream Sales).
Incorrect! Try again.
23If a time series exhibits a variance that increases over time (heteroscedasticity), which transformation is commonly applied to stabilize variance?
A.Differencing
B.Logarithmic Transformation
C.Adding a constant
D.Standardization
Correct Answer: Logarithmic Transformation
Explanation:Taking the log of a time series is a standard technique to linearize exponential trends and stabilize variance (homoscedasticity) when fluctuations grow with the level of the series.
Incorrect! Try again.
24What is an Expanding Window split strategy?
A.The training set size remains constant and slides forward.
B.The training set starts small and grows in size as new data becomes available.
C.The test set size grows while the training set shrinks.
D.Data is shuffled and split randomly.
Correct Answer: The training set starts small and grows in size as new data becomes available.
Explanation:In an expanding window, the training set effectively accumulates all prior data up to the current split point, retaining the entire history, unlike a sliding window which drops old data.
Incorrect! Try again.
25What is the consequence of Over-differencing a time series?
A.The series becomes perfectly linear.
B.It introduces artificial noise and negative autocorrelation.
C.It makes the trend more pronounced.
D.It increases the variance of the trend.
Correct Answer: It introduces artificial noise and negative autocorrelation.
Explanation:Differencing a series that is already stationary creates a new series with increased variance and artificial dependencies (specifically negative autocorrelation at lag 1), complicating modeling.
Incorrect! Try again.
26Which of the following describes a Random Walk model?
A.ARIMA(0, 1, 0)
B.ARIMA(1, 0, 0)
C.ARIMA(0, 0, 1)
D.ARIMA(1, 1, 1)
Correct Answer: ARIMA(0, 1, 0)
Explanation:A Random Walk is defined as , or . This corresponds to an ARIMA model with differencing order and no AR or MA terms ().
Incorrect! Try again.
27In an AR(1) model , if , the series is:
A.Stationary
B.Mean reverting
C.A Random Walk (Non-stationary)
D.White Noise
Correct Answer: A Random Walk (Non-stationary)
Explanation:If the coefficient is 1, a unit root exists. The value depends entirely on the previous value plus a shock, and variances accumulate over time, making it non-stationary.
Incorrect! Try again.
28Which metric provides a penalty for adding parameters to a model, helping to avoid overfitting in ARIMA model selection?
A.MSE (Mean Squared Error)
B.R-Squared
C.AIC (Akaike Information Criterion)
D.MAE (Mean Absolute Error)
Correct Answer: AIC (Akaike Information Criterion)
Explanation:AIC estimates the quality of a model relative to others. It includes a term representing the likelihood and a penalty term for the number of parameters (), discouraging unnecessary complexity.
Incorrect! Try again.
29For a monthly time series with strong yearly seasonality, what is the value of the seasonal period parameter ?
A.4
B.7
C.12
D.30
Correct Answer: 12
Explanation:Monthly data repeats a cycle every year, so the period (or ) is 12.
Incorrect! Try again.
30What is Seasonal Differencing?
A.
B. where is the seasonal period.
C.
D.
Correct Answer: where is the seasonal period.
Explanation:Seasonal differencing subtracts the value from the same season in the previous cycle (e.g., this January minus last January) to remove seasonal components.
Incorrect! Try again.
31A model defined as is an:
A.AR(1)
B.AR(2)
C.MA(2)
D.ARMA(1,1)
Correct Answer: AR(2)
Explanation:This model relies on two previous time steps ( and ), making it an Autoregressive model of order 2.
Incorrect! Try again.
32The Ljung-Box Test is applied to the residuals of a time series model to check:
A.If the residuals are normally distributed.
B.If the residuals differ significantly from white noise (i.e., if there is remaining autocorrelation).
C.If the model parameters are statistically significant.
D.If the original data was stationary.
Correct Answer: If the residuals differ significantly from white noise (i.e., if there is remaining autocorrelation).
Explanation:The Ljung-Box test checks if the autocorrelations of the residuals are non-zero. If p-value < 0.05, residuals are not white noise, implying the model failed to capture some structure.
Incorrect! Try again.
33How are missing values typically handled in time series if dropping them is not an option?
A.Replace with zero.
B.Time-based interpolation (e.g., linear) or Forward Fill.
C.Replace with the dataset maximum.
D.Replace with the dataset standard deviation.
Correct Answer: Time-based interpolation (e.g., linear) or Forward Fill.
Explanation:Since time series data is continuous, methods like Forward Fill (carrying the last valid observation forward) or Linear Interpolation utilize the temporal relationship to estimate missing values.
Incorrect! Try again.
34Which component is represented by in SARIMA?
A.Seasonal Autoregressive order
B.Seasonal Moving Average order
C.Seasonal Differencing order
D.Trend Differencing order
Correct Answer: Seasonal Differencing order
Explanation:Capital represents the order of seasonal differencing applied to the series.
Incorrect! Try again.
35If the ACF plot decays slowly (linear decay) while the PACF shows a sharp cutoff at lag 1, the series is likely:
A.MA(1)
B.AR(1)
C.White Noise
D.ARMA(1,1)
Correct Answer: AR(1)
Explanation:This is the signature behavior of an AR process. An AR() process has a PACF that cuts off at lag and an ACF that tails off.
Incorrect! Try again.
36In time series regression, what is a Rolling Window feature?
A.A feature computed by aggregating (mean, max, etc.) values over a fixed-size window of past observations.
B.A window that randomly selects features.
C.A feature that looks into the future.
D.A technique to increase the sampling rate.
Correct Answer: A feature computed by aggregating (mean, max, etc.) values over a fixed-size window of past observations.
Explanation:Rolling window statistics (e.g., 7-day moving average) summarize recent history to capture local trends or volatility as input features.
Incorrect! Try again.
37Which of the following equations represents an ARIMA(1,1,1) model (excluding constant)? Let .
A.
B.
C.
D.
Correct Answer:
Explanation:ARIMA(1,1,1) implies differencing (). The model is then an ARMA(1,1) on the differenced series .
Incorrect! Try again.
38When encoding cyclical time features like 'Month of Year' (1-12) for regression, what is the best approach to preserve the cycle (i.e., Dec is close to Jan)?
A.Label Encoding (1, 2, ..., 12)
B.One-Hot Encoding
C.Sine and Cosine transformation
D.Binary Encoding
Correct Answer: Sine and Cosine transformation
Explanation:Using Sin/Cos transformations maps the cyclic feature onto a circle, ensuring that the distance between Month 12 and Month 1 is small, preserving the cyclical nature.
Incorrect! Try again.
39What is Naive Forecasting?
A.Using a complex neural network.
B.Predicting the current value as the average of all past values.
C.Predicting the value at time to be equal to the value at time .
D.Predicting zero for all future steps.
Correct Answer: Predicting the value at time to be equal to the value at time .
Explanation:Naive forecasting assumes that the next observation will be exactly the same as the most recent observation (). It is often used as a baseline.
Incorrect! Try again.
40What is the key assumption of the Moving Average (MA) model regarding shocks/errors?
A.Current value depends on past values of the variable.
B.Current value depends on current and past unobserved white noise error terms.
C.Errors are correlated with the independent variables.
D.Errors increase exponentially with time.
Correct Answer: Current value depends on current and past unobserved white noise error terms.
Explanation:MA models conceptualize the series as being driven by a sequence of random shocks (errors). The current value is a weighted sum of recent shocks.
Incorrect! Try again.
41In the context of SARIMA, what is an Exogenous regressor often denoted as?
A.SARIMAX
B.ARIMAS
C.ARIMAE
D.XY-ARIMA
Correct Answer: SARIMAX
Explanation:The 'X' in SARIMAX or ARIMAX stands for eXogenous variables, allowing the inclusion of external predictors.
Incorrect! Try again.
42If a time series has a unit root, the series is:
A.Stationary
B.Non-Stationary
C.Seasonal
D.Deterministic
Correct Answer: Non-Stationary
Explanation:A unit root indicates that the characteristic equation of the AR process has a root of 1, meaning shocks have a permanent effect, resulting in non-stationarity.
Incorrect! Try again.
43Which ARIMA model is theoretically equivalent to Simple Exponential Smoothing?
A.ARIMA(0,1,1)
B.ARIMA(1,0,0)
C.ARIMA(2,2,2)
D.ARIMA(0,0,0)
Correct Answer: ARIMA(0,1,1)
Explanation:Simple Exponential Smoothing is mathematically equivalent to an ARIMA(0,1,1) model (a random walk with MA(1) error).
Incorrect! Try again.
44What is the purpose of Auto-Regressive terms in a model?
A.To model the relationship between an observation and a lagged observation.
B.To model the relationship between an observation and a residual error.
C.To remove seasonality.
D.To integrate the time series.
Correct Answer: To model the relationship between an observation and a lagged observation.
Explanation:Auto-Regressive (AR) terms capture the momentum or mean-reversion by modeling the current value as a function of its own past values.
Incorrect! Try again.
45Why is it important to examine the Residuals of a fitted time series model?
A.To ensure they follow a clear trend.
B.To ensure they have high autocorrelation.
C.To ensure they resemble random white noise.
D.To ensure they are all positive.
Correct Answer: To ensure they resemble random white noise.
Explanation:If residuals are white noise (uncorrelated, mean 0), it indicates the model has successfully captured all the signal (pattern) in the data. Any pattern left in residuals means the model can be improved.
Incorrect! Try again.
46What implies that a Time Series is Deterministic?
A.It can be perfectly predicted by a mathematical function of time (e.g., ).
B.It contains a stochastic component.
C.It is a random walk.
D.It has infinite variance.
Correct Answer: It can be perfectly predicted by a mathematical function of time (e.g., ).
Explanation:A deterministic trend is consistent and predictable (like a perfect line or sine wave), containing no random/stochastic variation.
Incorrect! Try again.
47In a Multi-step forecast strategy, what is the 'Direct' method?
A.Ideally predicting one step ahead and using that prediction for the next step.
B.Building a separate model for each future time step to be predicted.
C.Using only the most recent observation.
D.Averaging the results of AR and MA models.
Correct Answer: Building a separate model for each future time step to be predicted.
Explanation:The Direct strategy builds specific models for specific horizons (e.g., one model predicts , another predicts ), avoiding error accumulation found in recursive methods.
Incorrect! Try again.
48What is the 'Recursive' (or Iterative) multi-step forecast strategy?
A.Predicting , then using that prediction as input to predict , and so on.
B.Training on future data.
C.Using a different algorithm for every step.
D.Predicting all steps simultaneously using a vector output.
Correct Answer: Predicting , then using that prediction as input to predict , and so on.
Explanation:The recursive strategy uses a single model. It predicts the next step, appends the prediction to the history, and uses the updated history to predict the subsequent step.
Incorrect! Try again.
49Which transformation is the inverse of Differencing?
A.Differentiation
B.Cumulative Sum (Integration)
C.Logarithm
D.Exponentiation
Correct Answer: Cumulative Sum (Integration)
Explanation:If data was differenced to achieve stationarity for modeling, the predictions must be integrated (cumulative sum) to return them to the original scale.
Incorrect! Try again.
50When is an ARIMA model considered Invertible?
A.When the AR polynomial roots lie inside the unit circle.
B.When the MA polynomial roots lie outside the unit circle.
C.When it can be rewritten as an infinite AR process.
D.When .
Correct Answer: When it can be rewritten as an infinite AR process.
Explanation:Invertibility is a property of MA processes. An MA process is invertible if it can be written as an AR() process, which requires the roots of the MA characteristic equation to lie outside the unit circle (or absolute coefficients sum < 1 in simple cases).
Incorrect! Try again.
Give Feedback
Help us improve by sharing your thoughts or reporting issues.