Unit 2 - Practice Quiz

MTH302 60 Questions
0 Correct 0 Wrong 60 Left
0/60

1 What is the primary purpose of a scatter plot in statistics?

scatter plots Easy
A. To show the relationship between two quantitative variables
B. To compare parts of a whole
C. To display the frequency distribution of a single variable
D. To show data over a period of time

2 If the points on a scatter plot generally form a pattern from the lower-left to the upper-right, what type of relationship does this suggest?

scatter plots Easy
A. A negative correlation
B. A positive correlation
C. No correlation
D. A non-linear relationship

3 What does a scatter plot with points scattered randomly, showing no clear direction or pattern, suggest about the variables?

scatter plots Easy
A. Little to no correlation
B. A strong negative correlation
C. A strong positive correlation
D. A perfect correlation

4 The value of a correlation coefficient, denoted by , always lies between which two values?

correlation coefficient and its properties Easy
A. -1 and +1
B. -1 and 0
C. 0 and 1
D. 0 and 100

5 A correlation coefficient of indicates which of the following?

correlation coefficient and its properties Easy
A. No linear relationship
B. A perfect positive linear relationship
C. A weak positive linear relationship
D. A perfect negative linear relationship

6 If the correlation coefficient between two variables is 0, what does this imply?

correlation coefficient and its properties Easy
A. There is a strong negative relationship.
B. The variables are perfectly correlated.
C. There is a causal relationship.
D. There is no linear relationship between the variables.

7 How is the correlation coefficient between two variables, and , affected if the units of measurement for both variables are changed (e.g., from meters to centimeters)?

correlation coefficient and its properties Easy
A. It increases
B. It becomes zero
C. It decreases
D. It remains unchanged

8 If two variables have a strong correlation, does this mean one variable causes the other to change?

correlation coefficient and its properties Easy
A. Yes, but only if the correlation is positive.
B. Yes, a strong correlation always implies causation.
C. Not necessarily, as correlation does not imply causation.
D. Yes, but only if the correlation is negative.

9 Karl Pearson's correlation coefficient is best suited for measuring the relationship between which type of variables?

Karl Pearson’s correlation coefficient Easy
A. One quantitative and one categorical variable
B. Two categorical variables
C. Two quantitative variables
D. Two ranked variables

10 What is another common name for Karl Pearson's correlation coefficient?

Karl Pearson’s correlation coefficient Easy
A. Product-moment correlation coefficient
B. Rank correlation coefficient
C. Regression coefficient
D. Coefficient of determination

11 The sign (positive or negative) of Karl Pearson's correlation coefficient indicates the...

Karl Pearson’s correlation coefficient Easy
A. Strength of the relationship
B. Cause of the relationship
C. Direction of the relationship
D. Significance of the relationship

12 A key assumption for interpreting Karl Pearson's correlation coefficient is that the relationship between the variables is...

Karl Pearson’s correlation coefficient Easy
A. Exponential
B. Linear
C. Logarithmic
D. Curvilinear

13 Spearman's rank correlation coefficient is used to measure the strength of association between...

Spearman’s rank correlation coefficient Easy
A. Two independent samples
B. Two nominal variables
C. Two means
D. Two ranked variables

14 What is the first step in the process of calculating Spearman's rank correlation coefficient for a set of data?

Spearman’s rank correlation coefficient Easy
A. Rank the values for each variable separately
B. Create a scatter plot of the data
C. Calculate the mean of each variable
D. Find the difference between the values

15 Unlike Pearson's correlation which assesses linear relationships, Spearman's rank correlation assesses what type of relationship?

Spearman’s rank correlation coefficient Easy
A. Monotonic relationship
B. Causal relationship
C. Random relationship
D. Exponential relationship

16 In which of the following situations would Spearman's rank correlation be more appropriate to use than Pearson's correlation?

Spearman’s rank correlation coefficient Easy
A. When the data is ordinal (ranked)
B. When the data is perfectly linear
C. When the sample size is very small
D. When the data is categorical with no order

17 What is the primary purpose of simple linear regression?

Linear regression and its properties Easy
A. To find the average of a dataset
B. To classify data into different groups
C. To model the relationship between a dependent variable and an independent variable
D. To determine if two variables are correlated

18 In the simple linear regression equation, , what does the coefficient represent?

Linear regression and its properties Easy
A. The correlation coefficient
B. The slope of the regression line
C. The y-intercept of the regression line
D. The predicted value of y

19 The "line of best fit" in a linear regression model is the line that...

Linear regression and its properties Easy
A. Has the steepest possible slope
B. Minimizes the sum of the squared vertical distances of the points from the line
C. Passes through the maximum number of data points
D. Connects the first and last data points

20 In linear regression, what is a "residual"?

Linear regression and its properties Easy
A. The value of the independent variable
B. The difference between the observed value and the predicted value
C. The slope of the regression line
D. The difference between two predicted values

21 If the covariance between two variables X and Y is 15, the variance of X is 25, and the variance of Y is 9, what is the Karl Pearson’s correlation coefficient?

Karl Pearson’s correlation coefficient Medium
A. 0.5
B. 1.0
C. 0.75
D. 1.25

22 In which of the following scenarios would Spearman's rank correlation be more appropriate than Pearson's correlation coefficient?

Spearman’s rank correlation coefficient Medium
A. When the data is perfectly normally distributed.
B. When the relationship between variables is monotonic but not linear.
C. When the sample size is very large.
D. When we want to measure the strength of a linear relationship only.

23 If the regression line of Y on X is given by , and the mean of X is , what is the mean of Y, ?

Linear regression and its properties Medium
A. 5
B. 8
C. 11
D. Cannot be determined

24 A scatter plot of variable Y versus variable X shows points that form a clear U-shape. What can you conclude about the Pearson correlation coefficient (r) for this data?

scatter plots Medium
A. r will be close to -1
B. r will be close to 0
C. r will be undefined
D. r will be close to +1

25 If the correlation coefficient between two variables, height (in meters) and weight (in kg), is 0.8, what will be the correlation coefficient if the height is measured in centimeters and weight is measured in grams?

correlation coefficient and its properties Medium
A. Cannot be determined without the data
B. 0.008
C. 80
D. 0.8

26 A regression analysis of student scores (Y) versus hours studied (X) yielded the equation . What is the correct interpretation of the slope?

Linear regression and its properties Medium
A. A student who studies for 0 hours is predicted to get a score of 5.
B. The average score for all students is 55.
C. For every 5 additional hours studied, the student's score is predicted to increase by 50 points.
D. For each additional hour studied, the student's score is predicted to increase by 5 points.

27 For a set of paired data (X, Y), the ranks are as follows: and . What is the Spearman's rank correlation coefficient?

Spearman’s rank correlation coefficient Medium
A. 1.0
B. -1.0
C. -0.5
D. 0.0

28 If the two regression coefficients are and , what is the correlation coefficient between X and Y?

Karl Pearson’s correlation coefficient Medium
A. 0.6
B. 0.36
C. -0.36
D. -0.6

29 If the correlation coefficient between X and Y is 0.7, what percentage of the variation in Y is explained by the linear relationship with X?

correlation coefficient and its properties Medium
A. 30%
B. 49%
C. 7%
D. 70%

30 Given the regression line of Y on X as , the mean of X values is and the mean of Y values is . If we were to calculate the regression line of X on Y, which of the following points is guaranteed to be on that line?

Linear regression and its properties Medium
A. (4, 1.5)
B. (10, 19)
C. (1.5, 4)
D. (19, 10)

31 For the data points (1, 2), (2, 4), (3, 6), (4, 8), what is the value of Karl Pearson's correlation coefficient?

Karl Pearson’s correlation coefficient Medium
A. 0
B. -1
C. 1
D. 0.5

32 If two variables X and Y have a correlation coefficient of , which statement is most accurate?

correlation coefficient and its properties Medium
A. There is a weak negative linear relationship between X and Y.
B. X causes Y to decrease.
C. 90% of the data points lie on the regression line.
D. There is a strong negative linear relationship between X and Y.

33 Calculate Spearman's rank correlation for the following data on two judges' scores: Judge A: (10, 12, 11), Judge B: (15, 18, 16).

Spearman’s rank correlation coefficient Medium
A. 0.5
B. 0.0
C. 1.0
D. -1.0

34 The regression equation of Y on X is . The slope is calculated as . If , , and , what is the value of the slope ?

Linear regression and its properties Medium
A. 2.0
B. 1.0
C. 0.5
D. 4.0

35 A scatter plot shows a cloud of points that is wide at low values of X and narrow at high values of X, with a general downward trend. This pattern is known as:

scatter plots Medium
A. Homoscedasticity
B. Autocorrelation
C. Multicollinearity
D. Heteroscedasticity

36 If two variables, X and Y, are statistically independent, what is the expected value of their Pearson correlation coefficient?

correlation coefficient and its properties Medium
A. -1
B. 1
C. It depends on the distribution
D. 0

37 Given a regression equation , what is the predicted value of y when x = 10 and the actual observed value was y = 75?

Linear regression and its properties Medium
A. 80
B. 120
C. 20
D. 75

38 For the following paired data (X, Y): (5, 8), (10, 15), (15, 12), (20, 18), what is the sum of squared differences in ranks, , used to calculate Spearman's correlation?

Spearman’s rank correlation coefficient Medium
A. 4
B. 2
C. 0
D. 1

39 If we add 5 to every X value and subtract 10 from every Y value in a dataset, how will the Karl Pearson's correlation coefficient (r) change?

Karl Pearson’s correlation coefficient Medium
A. It will become 0.
B. It will decrease.
C. It will not change.
D. It will increase.

40 If the correlation coefficient 'r' is 0, which of the following statements is true?

correlation coefficient and its properties Medium
A. There is no relationship of any kind between the variables.
B. The slope of the regression line is undefined.
C. There is no linear relationship between the variables.
D. The variables are independent.

41 For a set of data points where , if the relationship is given by for values symmetrically distributed around 0 (e.g., ), what will be the value of the Karl Pearson’s correlation coefficient ?

Karl Pearson’s correlation coefficient Hard
A.
B.
C.
D.

42 A simple linear regression model is fitted, yielding the equation . If the independent variable is rescaled to and the dependent variable is rescaled to , what will be the new regression equation for in terms of ?

Linear regression and its properties Hard
A.
B.
C.
D.

43 Consider a dataset where the variables and have a perfect monotonic, but non-linear relationship, such as . Which of the following statements about the Karl Pearson's correlation coefficient () and Spearman's rank correlation coefficient () is most likely to be true?

Spearman’s rank correlation coefficient Hard
A. while or
B. and or
C.
D.

44 Let the correlation coefficient between two variables and be . Two new variables are defined as and . What is the correlation coefficient between and ?

correlation coefficient and its properties Hard
A. -0.8
B. -1.2
C. 0.8
D. Cannot be determined

45 In a simple linear regression model , let be the residuals. Which of the following statements is mathematically guaranteed to be false for any OLS regression with an intercept?

Linear regression and its properties Hard
A. The correlation between the residuals and the observed values is zero.
B. The sum of the squared residuals, , is minimized.
C. The correlation between the residuals and the predicted values is zero.
D. The correlation between the residuals and the independent variable is zero.

46 The equations of two regression lines are and . What is the Karl Pearson correlation coefficient between and ?

Karl Pearson’s correlation coefficient Hard
A. -3/4
B. 4/3
C. -4/3
D. 3/4

47 A scatter plot for 100 data points shows a weak positive correlation (). A new data point is added at , where is significantly larger than all other values (an x-outlier), and falls exactly on the regression line calculated from the original 100 points. How will this new point, known as a 'high leverage' point, most likely affect the correlation coefficient ?

scatter plots Hard
A. It will have very little effect on .
B. It will push closer to 0.
C. It will significantly increase towards 1.
D. It will significantly decrease towards -1.

48 In a dataset of 10 pairs, the ranks for variable X are and the ranks for variable Y are . The standard formula for Spearman's correlation is . What is the primary issue with using this specific formula here?

Spearman’s rank correlation coefficient Hard
A. The sample size is too small for this formula.
B. The formula is only valid for positive correlation.
C. The presence of tied ranks requires a correction factor, making this formula inaccurate.
D. The formula requires data to be normally distributed.

49 Two different simple linear regression models are fitted. Model A has a correlation coefficient . Model B has a correlation coefficient . The total sum of squares () is the same for both datasets. Which statement is correct about the sum of squared residuals () for the two models?

Linear regression and its properties Hard
A. The relationship cannot be determined.
B.
C.
D.

50 For three variables , , and , it is known that the correlation between and is , and the correlation between and is . What can be concluded about the minimum possible value for the correlation between and , ?

correlation coefficient and its properties Hard
A. must be at least 0.
B. must be at least 0.81.
C. can be as low as -1.
D. must be at least 0.62.

51 You are given four datasets that have nearly identical summary statistics: mean of X, mean of Y, variance of X, variance of Y, and Karl Pearson's correlation coefficient (). However, their scatter plots are drastically different. One plot is a clear linear relationship, another is a perfect non-linear relationship, a third has a major outlier, and a fourth has a high leverage point. What is the most important conclusion from this scenario?

Karl Pearson’s correlation coefficient Hard
A. Visualizing data using scatter plots is a critical step before interpreting correlation or fitting a regression model.
B. Karl Pearson's correlation coefficient is robust to outliers and non-linearity.
C. A high correlation coefficient () always guarantees a useful linear model.
D. Summary statistics, including correlation, are sufficient to understand the relationship between two variables.

52 In a regression analysis, an observation is identified that has a low leverage value but a very large residual. How would you classify this point and describe its likely effect on the regression line?

Linear regression and its properties Hard
A. It is a typical data point and will have minimal effect on the regression.
B. It is an outlier, but likely not an influential point; it will increase the standard error but may not significantly change the slope.
C. It is an influential point that will drastically change the slope of the regression line.
D. It is a high-leverage point that will strongly pull the regression line towards it.

53 For a dataset of size , the ranks of variable X are and the ranks of variable Y are . Calculate the Spearman's rank correlation coefficient .

Spearman’s rank correlation coefficient Hard
A. 0.929
B. 1
C. 0.952
D. 0.976

54 A researcher studies the relationship between hours studied and exam scores for two different subjects, A and B. For subject A, the correlation is . For subject B, the correlation is . When the two datasets are combined, what can be said about the correlation of the aggregate data?

Karl Pearson’s correlation coefficient Hard
A. can be negative, positive, or zero, and is not constrained to be between 0.7 and 0.8.
B. must be positive.
C. must be greater than 0.7.
D. must be between 0.7 and 0.8.

55 A scatter plot of residuals versus predicted values for a linear regression model shows a fan or cone shape, where the vertical spread of the residuals increases as the predicted values increase. What is the primary implication of this pattern?

scatter plots Hard
A. The assumption of constant variance (homoscedasticity) is violated.
B. The relationship between the variables is non-linear.
C. There are significant outliers in the dataset.
D. The independence of errors assumption is violated.

56 An observational study finds a strong positive correlation () between the number of firefighters at a fire and the amount of damage caused by the fire. The conclusion drawn is that sending more firefighters causes more damage. Which statistical concept best explains the flaw in this conclusion?

correlation coefficient and its properties Hard
A. Non-linearity in the relationship.
B. The ecological fallacy.
C. The effect of outliers.
D. Spurious correlation due to a confounding variable.

57 A student calculates summary statistics from a sample of 10 data pairs: , , and . They then calculate the Pearson correlation coefficient . What is the most likely reason for their result?

Karl Pearson’s correlation coefficient Hard
A. The data contains significant outliers.
B. The relationship is strongly non-linear.
C. The sample size is too small.
D. There must be a calculation error in one of the summary statistics.

58 Consider a simple linear regression model where the intercept is forced to be zero (), often called regression through the origin. Which property of the residuals from a standard OLS regression (with an intercept) is NOT guaranteed to hold for this model?

Linear regression and its properties Hard
A. The sum of the residuals, , is zero.
B. The regression line passes through the point .
C. The sum of squared residuals is minimized under the constraint that the line passes through the origin.
D. The slope is calculated as .

59 A dataset for variables and follows a perfect parabolic relationship for values from 0 to 10. What would you expect the Spearman's rank correlation coefficient () to be, approximately?

Spearman’s rank correlation coefficient Hard
A. Close to -1
B. Close to 0
C. Approximately 0.5
D. Close to 1

60 What is the value of the correlation coefficient between a variable (with non-zero variance) and a constant ?

correlation coefficient and its properties Hard
A. 1
B. Undefined
C. -1
D. 0