1In hypothesis testing, what is the null hypothesis ()?
types of error
Easy
A.A statement of no effect or no difference
B.A statement about the sample statistic
C.The hypothesis the researcher wants to prove
D.The same as a Type I error
Correct Answer: A statement of no effect or no difference
Explanation:
The null hypothesis () is a default statement that there is no relationship between two measured phenomena, or no association among groups. It is the hypothesis that is presumed true until statistical evidence indicates otherwise.
Incorrect! Try again.
2What is a Type I error in hypothesis testing?
types of error
Easy
A.Rejecting a true null hypothesis
B.Failing to reject a true null hypothesis
C.Rejecting a false null hypothesis
D.Failing to reject a false null hypothesis
Correct Answer: Rejecting a true null hypothesis
Explanation:
A Type I error, also known as a 'false positive', occurs when the null hypothesis is true, but we incorrectly reject it based on our sample data.
Incorrect! Try again.
3A Type II error occurs when we...
types of error
Easy
A.Reject a true null hypothesis
B.Correctly accept a true null hypothesis
C.Reject a false null hypothesis
D.Fail to reject a false null hypothesis
Correct Answer: Fail to reject a false null hypothesis
Explanation:
A Type II error, also known as a 'false negative', occurs when the null hypothesis is false, but we do not have sufficient evidence to reject it.
Incorrect! Try again.
4The probability of committing a Type I error is denoted by which symbol?
types of error
Easy
A.
B.
C.
D.
Correct Answer:
Explanation:
The symbol (alpha) represents the level of significance, which is the maximum acceptable probability of making a Type I error.
Incorrect! Try again.
5A Z-test for a single mean is most appropriate to use when...
Z-test for single mean and difference of means
Easy
A.The population variance is known
B.The population variance is unknown
C.The sample size is very small (e.g., n < 10)
D.We are comparing three or more means
Correct Answer: The population variance is known
Explanation:
The Z-test is used when the population variance () is known. If the sample size is large (n>30), the sample variance can also be used as a good estimate, but the primary condition is known population variance.
Incorrect! Try again.
6Under the null hypothesis, the Z-test statistic follows which distribution?
Z-test for single mean and difference of means
Easy
A.t-distribution
B.Standard Normal distribution
C.F-distribution
D.Chi-square distribution
Correct Answer: Standard Normal distribution
Explanation:
The Z-test statistic is standardized so that it follows a standard normal distribution, which has a mean of 0 and a standard deviation of 1.
Incorrect! Try again.
7A Z-test for the difference of means is used to compare the...
Z-test for single mean and difference of means
Easy
A.Means of two populations
B.Proportions of two populations
C.Variances of two populations
D.Medians of two populations
Correct Answer: Means of two populations
Explanation:
This test determines if there is a statistically significant difference between the means of two independent groups, typically when population variances are known or sample sizes are large.
Incorrect! Try again.
8A key condition for using a Z-test instead of a t-test for a single mean is that...
Z-test for single mean and difference of means
Easy
A.The sample size is small
B.The null hypothesis is about variance
C.The data must be categorical
D.The population standard deviation () is known
Correct Answer: The population standard deviation () is known
Explanation:
The primary condition for using a Z-test is that the population standard deviation, , is known. If it is unknown, a t-test is the appropriate choice, especially for smaller samples.
Incorrect! Try again.
9A Student's t-test is generally used when the sample size is small and...
Student's t-test for single mean and difference of means
Easy
A.The population variance is unknown
B.We are comparing more than two groups
C.The data is from a non-normal distribution
D.The population variance is known
Correct Answer: The population variance is unknown
Explanation:
The t-test is specifically designed for situations where the population variance is not known and has to be estimated from the sample data, which is common with smaller sample sizes.
Incorrect! Try again.
10As the degrees of freedom increase, the t-distribution approaches which distribution?
Student's t-test for single mean and difference of means
Easy
A.A uniform distribution
B.The Standard Normal distribution
C.The F-distribution
D.The Chi-square distribution
Correct Answer: The Standard Normal distribution
Explanation:
With a large number of degrees of freedom (which corresponds to a large sample size), the t-distribution becomes nearly identical to the standard normal (Z) distribution.
Incorrect! Try again.
11A t-test for the difference of means is used to determine if there is a significant difference between the __ of two independent groups.
Student's t-test for single mean and difference of means
Easy
A.Variances
B.Proportions
C.Means
D.Medians
Correct Answer: Means
Explanation:
Similar to the Z-test for difference of means, the t-test compares the averages (means) of two groups to see if they are statistically different from each other.
Incorrect! Try again.
12To find the critical value for a t-test from a table, you need the significance level () and what other piece of information?
Student's t-test for single mean and difference of means
Easy
A.Population variance
B.Sample mean
C.Degrees of freedom
D.Standard error
Correct Answer: Degrees of freedom
Explanation:
The shape of the t-distribution depends on the degrees of freedom (related to sample size). Therefore, to look up a critical value, you must specify both and the degrees of freedom.
Incorrect! Try again.
13What is the primary purpose of an F-test?
F-test
Easy
A.To compare the means of two populations
B.To test the goodness of fit of a model
C.To compare the variances of two or more populations
D.To compare a sample mean to a population mean
Correct Answer: To compare the variances of two or more populations
Explanation:
The F-test is a statistical test used to assess whether two population variances are equal. It is also the fundamental test used in Analysis of Variance (ANOVA) to compare means by analyzing variances.
Incorrect! Try again.
14The F-statistic is calculated as a ratio of two...
F-test
Easy
A.Standard deviations
B.Variances
C.Means
D.Sample sizes
Correct Answer: Variances
Explanation:
The F-statistic is defined as the ratio of two sample variances, . This ratio is used to test the hypothesis that the population variances are equal.
Incorrect! Try again.
15An F-test is the central component of a widely used statistical method for comparing the means of three or more groups. What is this method called?
F-test
Easy
A.Correlation analysis
B.Analysis of Variance (ANOVA)
C.Z-test
D.Chi-square test
Correct Answer: Analysis of Variance (ANOVA)
Explanation:
ANOVA uses the F-test to determine whether the variability between group means is larger than the variability of the observations within the groups.
Incorrect! Try again.
16The shape of the F-distribution is defined by two parameters. What are they?
F-test
Easy
A.Observed and expected frequencies
B.Numerator and denominator degrees of freedom
C.Sample size and alpha level
D.Mean and variance
Correct Answer: Numerator and denominator degrees of freedom
Explanation:
The shape of the F-distribution is determined by the degrees of freedom for the variance in the numerator and the degrees of freedom for the variance in the denominator.
Incorrect! Try again.
17A Chi-square goodness of fit test is used to determine if...
Chi-square test for goodness of fit
Easy
A.Two population variances are equal
B.Two population means are equal
C.A sample mean is equal to a population mean
D.A sample's frequency distribution fits a hypothesized distribution
Correct Answer: A sample's frequency distribution fits a hypothesized distribution
Explanation:
The goodness of fit test checks whether the observed frequency distribution of a categorical variable matches an expected or theoretical distribution (e.g., 'Are the dice fair?').
Incorrect! Try again.
18What type of data is analyzed using a Chi-square test?
Chi-square test for goodness of fit
Easy
A.Continuous data
B.Categorical (frequency) data
C.Time-series data
D.Ranked data
Correct Answer: Categorical (frequency) data
Explanation:
The Chi-square test is specifically designed for analyzing categorical data, where data is counted and placed into different categories.
Incorrect! Try again.
19The Chi-square test statistic () is calculated by comparing the observed frequencies with the...
Chi-square test for goodness of fit
Easy
A.Mean frequencies
B.Median frequencies
C.Expected frequencies
D.Modal frequencies
Correct Answer: Expected frequencies
Explanation:
The formula for the Chi-square statistic involves summing the squared differences between observed (O) and expected (E) frequencies, divided by the expected frequencies: .
Incorrect! Try again.
20In a Chi-square goodness of fit test, a very small calculated value (close to zero) indicates that...
Chi-square test for goodness of fit
Easy
A.A Type I error has occurred
B.The observed frequencies are very different from the expected frequencies
C.The sample size is too small
D.The observed frequencies are very close to the expected frequencies
Correct Answer: The observed frequencies are very close to the expected frequencies
Explanation:
A small value means there is a small difference between the observed and expected data, suggesting a good fit between the sample data and the hypothesized distribution. This would lead to not rejecting the null hypothesis.
Incorrect! Try again.
21A pharmaceutical company is testing a new drug. The null hypothesis () is that the drug has no effect. A clinical trial is conducted, and the researchers conclude that the drug is effective. However, in reality, the drug has no effect. What type of error has been made?
types of error
Medium
A.No error was made
B.Type I error
C.Type II error
D.Standard error
Correct Answer: Type I error
Explanation:
A Type I error occurs when a true null hypothesis is rejected. In this case, the null hypothesis (drug has no effect) was true, but the researchers rejected it, leading to a false positive conclusion.
Incorrect! Try again.
22A car manufacturer claims its new model gets an average of 35 miles per gallon (mpg). A sample of 49 cars is tested, and the sample mean is 34 mpg. If the population standard deviation is known to be 2.8 mpg, what is the calculated Z-statistic to test the manufacturer's claim?
Z-test for single mean and difference of means
Medium
A.Z = -2.50
B.Z = 2.50
C.Z = -0.36
D.Z = -1.00
Correct Answer: Z = -2.50
Explanation:
The Z-statistic is calculated using the formula: . Plugging in the values: .
Incorrect! Try again.
23A researcher wants to determine if a new teaching method improves test scores. A sample of 16 students has a mean score improvement of 5 points, with a sample standard deviation of 4 points. What is the calculated t-statistic for testing if the mean improvement is significantly greater than 0?
Student's t-test for single mean and difference of means
Medium
A.t = 2.0
B.t = 1.25
C.t = 5.0
D.t = 4.0
Correct Answer: t = 5.0
Explanation:
The t-statistic is calculated as . Here, the null hypothesis is that the mean improvement is 0. So, .
Incorrect! Try again.
24An F-test is conducted to compare the variances of two independent samples. Sample 1 has a size and sample variance . Sample 2 has a size and sample variance . What is the calculated F-statistic and its corresponding degrees of freedom?
F-test
Medium
A.F = 4.8, df = (9, 11)
B.F = 2.4, df = (10, 12)
C.F = 2.4, df = (9, 11)
D.F = 0.417, df = (11, 9)
Correct Answer: F = 2.4, df = (9, 11)
Explanation:
The F-statistic is the ratio of the larger variance to the smaller variance, . The degrees of freedom are for the numerator and for the denominator.
Incorrect! Try again.
25A poll claims that in a city, 50% of voters favor Candidate A, 30% favor Candidate B, and 20% are undecided. A random sample of 200 voters is taken. What is the expected frequency of voters who favor Candidate A?
Chi-square test for goodness of fit
Medium
A.200
B.50
C.60
D.100
Correct Answer: 100
Explanation:
The expected frequency for a category is calculated by multiplying the total sample size by the claimed proportion for that category. Expected frequency for A = .
Incorrect! Try again.
26If the significance level, , of a hypothesis test is decreased from 0.05 to 0.01, what is the effect on the probability of committing a Type I error () and a Type II error ()?
types of error
Medium
A. decreases and increases
B.Both and decrease
C. increases and decreases
D.Both and increase
Correct Answer: decreases and increases
Explanation:
The significance level is the probability of a Type I error, so decreasing it directly decreases the chance of a Type I error. However, making it harder to reject the null hypothesis (decreasing ) increases the chance of failing to reject a false null hypothesis, thus increasing the probability of a Type II error, .
Incorrect! Try again.
27Two brands of light bulbs are tested. Brand X (sample of 100) has a mean life of 1250 hours. Brand Y (sample of 100) has a mean life of 1220 hours. If the population standard deviation for both brands is 100 hours, what is the standard error of the difference between the two sample means?
Z-test for single mean and difference of means
Medium
A.20.00 hours
B.10.00 hours
C.1.41 hours
D.14.14 hours
Correct Answer: 14.14 hours
Explanation:
The standard error of the difference is calculated as . Here, hours.
Incorrect! Try again.
28A paired-sample t-test was conducted on 10 subjects to see if a training program was effective. The calculated t-statistic was 2.50. At a 0.05 significance level, the critical value for a one-tailed test is . What is the appropriate conclusion?
Student's t-test for single mean and difference of means
Medium
A.Reject the null hypothesis; the program is effective.
B.More subjects are needed to make a conclusion.
C.The t-statistic is smaller than the critical value.
D.Fail to reject the null hypothesis; the program is not effective.
Correct Answer: Reject the null hypothesis; the program is effective.
Explanation:
The calculated t-statistic (2.50) is greater than the critical value (1.833). Therefore, the result falls in the rejection region, and we reject the null hypothesis, concluding that the training program has a statistically significant effect.
Incorrect! Try again.
29In a Chi-square test for goodness of fit with 6 categories, the calculated test statistic is . The critical value at the 5% significance level is . What is the conclusion of the test?
Chi-square test for goodness of fit
Medium
A.Reject the null hypothesis, as the degrees of freedom are 6.
B.Fail to reject the null hypothesis, as 12.0 > 11.07.
C.Reject the null hypothesis, as the observed distribution does not fit the expected distribution.
D.Fail to reject the null hypothesis, as the observed distribution fits the expected distribution.
Correct Answer: Reject the null hypothesis, as the observed distribution does not fit the expected distribution.
Explanation:
The degrees of freedom are . Since the calculated statistic (12.0) is greater than the critical value (11.07), we reject the null hypothesis (). This means there is a significant difference between the observed and expected frequencies.
Incorrect! Try again.
30The primary purpose of an F-test in the context of hypothesis testing is to compare:
F-test
Medium
A.The proportion of two populations.
B.The variances of two populations.
C.The means of two populations.
D.A sample mean to a population mean.
Correct Answer: The variances of two populations.
Explanation:
The F-test is specifically designed to test the null hypothesis that two normally distributed populations have equal variances (). It forms the basis for comparing variability between two groups.
Incorrect! Try again.
31When performing a two-sample t-test assuming equal variances, you must first calculate a pooled sample variance. If sample 1 has and sample 2 has , what is the pooled variance ?
Student's t-test for single mean and difference of means
Medium
A.22.50
B.21.11
C.20.00
D.18.33
Correct Answer: 21.11
Explanation:
The formula for pooled variance is . Plugging in the values: .
Incorrect! Try again.
32The power of a statistical test (1 - ) represents the probability of:
types of error
Medium
A.Correctly rejecting a false null hypothesis.
B.Correctly failing to reject a true null hypothesis.
C.Incorrectly rejecting a true null hypothesis.
D.Incorrectly failing to reject a false null hypothesis.
Correct Answer: Correctly rejecting a false null hypothesis.
Explanation:
Power is the probability of making the correct decision when the null hypothesis is false. It is the ability of a test to detect a real effect. Since is the probability of a Type II error (failing to reject a false ), is the probability of avoiding that error.
Incorrect! Try again.
33A study is conducted to compare the IQ scores of men and women. A sample of 50 men has a mean IQ of 102 and a sample of 50 women has a mean IQ of 105. Assume the population standard deviation of IQ scores is 15 for both groups. What is the Z-statistic for the difference between the two means?
Z-test for single mean and difference of means
Medium
A.1.00
B.-1.00
C.-0.10
D.-3.00
Correct Answer: -1.00
Explanation:
The correct option follows directly from the given concept and definitions.
Incorrect! Try again.
34A machine should produce ball bearings with a mean diameter of 10 mm. A sample of 9 bearings yields a mean of 10.04 mm and a sample standard deviation of 0.06 mm. What are the correct degrees of freedom for conducting a t-test on this data?
Student's t-test for single mean and difference of means
Medium
A.1
B.10
C.9
D.8
Correct Answer: 8
Explanation:
For a one-sample t-test, the degrees of freedom (df) are calculated as the sample size minus one. In this case, .
Incorrect! Try again.
35A geneticist expects offspring phenotypes to follow a 9:3:3:1 ratio. In an experiment with 160 offspring, one category had an observed frequency of 35, while its expected frequency was 30. What is the contribution of this single category to the total Chi-square statistic?
Chi-square test for goodness of fit
Medium
A.25.0
B.5.00
C.0.167
D.0.83
Correct Answer: 0.83
Explanation:
The contribution of one category to the Chi-square statistic is calculated as . For this category, the calculation is .
Incorrect! Try again.
36For an F-test to be valid for comparing two variances, which of the following assumptions is most critical?
F-test
Medium
A.The sample sizes must be equal ().
B.The samples must be dependent.
C.The population means must be equal.
D.The two populations from which the samples are drawn are approximately normally distributed.
Correct Answer: The two populations from which the samples are drawn are approximately normally distributed.
Explanation:
The F-test for variances is quite sensitive to departures from normality. Therefore, a critical assumption is that both underlying populations follow a normal distribution. Other assumptions include independence of samples.
Incorrect! Try again.
37The average salary for a certain profession is claimed to be $60,000. A sample of 64 professionals has a mean salary of $63,000. If the population standard deviation is $16,000, what is the p-value for a two-tailed test of this claim? (Use Z-table values: Z=1.5 corresponds to an area of 0.4332)
Z-test for single mean and difference of means
Medium
A.0.0668
B.0.8664
C.1.50
D.0.1336
Correct Answer: 0.1336
Explanation:
The correct option follows directly from the given concept and definitions.
Incorrect! Try again.
38A t-test for the difference in means is performed on two independent samples, one with 15 observations and the other with 10. Assuming the population variances are equal, what are the degrees of freedom for this test?
Student's t-test for single mean and difference of means
Medium
A.23
B.14
C.9
D.25
Correct Answer: 23
Explanation:
For an independent samples t-test with the assumption of equal variances, the degrees of freedom are calculated as . In this case, .
Incorrect! Try again.
39The Chi-square goodness-of-fit test is used to determine if:
Chi-square test for goodness of fit
Medium
A.the variance of a sample is equal to the population variance.
B.two population means are equal.
C.two categorical variables are independent.
D.a sample frequency distribution fits a specific claimed population distribution.
Correct Answer: a sample frequency distribution fits a specific claimed population distribution.
Explanation:
The primary purpose of the Chi-square goodness-of-fit test is to compare observed sample frequencies with the expected frequencies that would occur if the sample were drawn from a population with a specific theoretical distribution (e.g., uniform, normal, or a specific ratio).
Incorrect! Try again.
40If you perform an F-test and calculate an F-statistic of 1.0, what does this value suggest about the sample variances?
F-test
Medium
A.The test is inconclusive.
B.The two population variances are identical.
C.A calculation error has occurred.
D.The two sample variances are identical.
Correct Answer: The two sample variances are identical.
Explanation:
The F-statistic is the ratio of two sample variances, . If this ratio is exactly 1.0, it means that the numerator and denominator are equal, so the sample variances ( and ) are identical. It does not, by itself, prove the population variances are identical.
Incorrect! Try again.
41A research team conducts a study with a small sample size () to detect what they suspect is a small effect. They set their significance level at . The study results in a p-value of 0.08, and they fail to reject the null hypothesis. Which of the following statements is the most accurate analysis of this situation?
types of error
Hard
A.A Type I error has definitely occurred because the p-value is close to .
B.The null hypothesis is likely true because the result is not statistically significant. A larger sample size would almost certainly confirm this.
C.Increasing the sample size would increase the power of the test, but would also necessarily increase the probability of a Type I error ().
D.The study was likely underpowered, meaning there is a high probability of a Type II error () if a real effect exists. Increasing the sample size would decrease while remains fixed by choice.
Correct Answer: The study was likely underpowered, meaning there is a high probability of a Type II error () if a real effect exists. Increasing the sample size would decrease while remains fixed by choice.
Explanation:
A p-value of 0.08 with a small sample size trying to detect a small effect is a classic sign of an underpowered study. Power is the probability of correctly rejecting a false null hypothesis (). Failing to reject H0 doesn't prove it's true. Increasing the sample size () increases power, which means it decreases the probability of a Type II error (). The Type I error rate () is fixed by the researcher and is not affected by sample size.
Incorrect! Try again.
42A 95% confidence interval for a population mean is calculated from a large sample to be [98.5, 101.5]. Based on this interval alone, what can you conclude about two different hypothesis tests: (1) vs at , and (2) vs at ?
Z-test for single mean
Hard
A.Fail to reject for test (1); Reject for test (2).
B.Fail to reject for test (1); Fail to reject for test (2).
C.Reject for test (1); Fail to reject for test (2).
D.Reject for test (1); Reject for test (2).
Correct Answer: Reject for test (1); Reject for test (2).
Explanation:
A two-sided hypothesis test at level will reject if and only if the hypothesized value is outside the % confidence interval. For test (1), the value 102 is outside the 95% CI [98.5, 101.5], so we reject . For test (2), a one-sided test at has the same critical value as a two-sided test at . The sample mean is the center of the CI, . Since 98 is outside the CI and the sample mean is in the direction of (), we reject for the one-tailed test as well.
Incorrect! Try again.
43A researcher compares two independent groups (). They assume equal variances and perform a pooled two-sample t-test. Unknown to them, the population variances are drastically different, with the smaller group having a much larger variance (). What is the most likely consequence of this violation of assumptions?
Student's t-test for difference of means
Hard
A.The Type I error rate will be substantially inflated (much higher than the nominal ), increasing the chance of a false positive.
B.The calculated t-statistic will be largely unaffected, but the degrees of freedom will be incorrect.
C.The Type I error rate will be substantially deflated (much lower than the nominal ), making the test overly conservative.
D.The pooled variance estimate will still be unbiased, leading to a valid test despite the unequal variances.
Correct Answer: The Type I error rate will be substantially inflated (much higher than the nominal ), increasing the chance of a false positive.
Explanation:
The pooled t-test is not robust to violations of the equal variance assumption when sample sizes are unequal. When the group with the smaller sample size also has the larger population variance, the pooled variance estimate is biased. This leads to an underestimation of the true standard error of the difference in means. A smaller denominator in the t-statistic results in a larger |t|, making it easier to reject the null hypothesis. Consequently, the actual Type I error rate becomes much higher than the specified .
Incorrect! Try again.
44An F-test for the equality of two population variances () is conducted using samples of size and . The resulting F-statistic is . What is the most appropriate conclusion?
F-test
Hard
A.We fail to reject the null hypothesis, but this test result's validity is highly dependent on the assumption that both populations are normally distributed.
B.The population variances are almost certainly equal because the F-statistic is very close to 1.
C.We should reject the null hypothesis because even a small deviation from 1 can be significant with these sample sizes.
D.The test is invalid because the F-statistic must be greater than the critical value, which is always much larger than 1.
Correct Answer: We fail to reject the null hypothesis, but this test result's validity is highly dependent on the assumption that both populations are normally distributed.
Explanation:
An F-statistic close to 1 provides evidence in favor of the null hypothesis that the variances are equal, so we would fail to reject . However, a critical caveat of the F-test for equality of variances is its extreme sensitivity to the normality assumption. Even slight departures from normality in the underlying populations can severely compromise the test's validity, making it unreliable in many real-world applications. Therefore, while we don't reject , we must acknowledge this critical limitation.
Incorrect! Try again.
45A biologist counts the number of mutations in 400 DNA samples to see if the distribution follows a Poisson model. She calculates the sample mean to be , which she uses as the estimate for the Poisson parameter . She then creates 6 categories for the Chi-square goodness-of-fit test: '0', '1', '2', '3', '4', and '5 or more'. What are the correct degrees of freedom for this test?
Chi-square test for goodness of fit
Hard
A.3
B.5
C.6
D.4
Correct Answer: 4
Explanation:
The degrees of freedom for a Chi-square goodness-of-fit test are calculated as , where is the number of categories and is the number of parameters estimated from the data used to calculate the expected frequencies. Here, there are categories. The parameter for the Poisson distribution, , was not known beforehand but was estimated from the data using the sample mean. Therefore, . The degrees of freedom are .
Incorrect! Try again.
46A medical researcher is testing a new drug with potentially severe side effects. The null hypothesis is that the drug has no effect. The consequences of a Type I error (false positive) are approving a useless but harmful drug, while the consequences of a Type II error (false negative) are failing to approve a potentially beneficial drug. To minimize patient harm, how should the researcher adjust the significance level and what is the resulting trade-off?
types of error
Hard
A.Keep as it's the standard, but increase the sample size to decrease both and simultaneously.
B.Increase (e.g., to 0.10) to increase power, thus reducing the probability of a Type II error ().
C.Decrease (e.g., to 0.01) to make the test more stringent. This will decrease the probability of a Type I error but will increase the probability of a Type II error ().
D.Decrease (e.g., to 0.01). This will decrease both the Type I and Type II error rates.
Correct Answer: Decrease (e.g., to 0.01) to make the test more stringent. This will decrease the probability of a Type I error but will increase the probability of a Type II error ().
Explanation:
The most severe consequence is a Type I error (approving a harmful drug). To minimize the risk of this error, the researcher should lower the significance level , which is the probability of a Type I error. This makes the criterion for rejecting the null hypothesis stricter. However, there is a trade-off: decreasing for a fixed sample size increases , the probability of a Type II error. The researcher is thus making a conscious choice to be more cautious about false positives at the expense of potentially missing a true effect.
Incorrect! Try again.
47In a study with a very large sample size (), a Z-test for the difference of means yields a p-value of 0.04. The difference in sample means is , and the known population standard deviations are large (). What is the most appropriate interpretation of this result?
Z-test for difference of means
Hard
A.A Type I error must have occurred since the difference in means is tiny compared to the standard deviations.
B.The result is statistically significant, but the effect size may be too small to be practically meaningful.
C.The test is invalid because the p-value is too close to the 0.05 threshold.
D.There is a strong, practically important difference between the two group means.
Correct Answer: The result is statistically significant, but the effect size may be too small to be practically meaningful.
Explanation:
With extremely large sample sizes, even trivial differences can become statistically significant. The p-value of 0.04 indicates statistical significance (if ), meaning we can be confident there is a non-zero difference between the population means. However, the magnitude of this difference (0.5) is very small compared to the variability within the groups (SD=50). This distinction between statistical significance (is there an effect?) and practical significance (is the effect large enough to matter?) is crucial, especially in 'big data' scenarios.
Incorrect! Try again.
48A researcher performs a one-sample t-test on a sample of size . The data comes from a population that is known to be heavily skewed, not normal. Under which condition would the t-test still be considered approximately valid?
Student's t-test for single mean
Hard
A.The t-test is robust against violations of normality, but only for two-tailed tests and large sample sizes (). With and heavy skew, the results are unreliable.
B.The t-test is never valid in this case; a non-parametric test must be used.
C.The t-test is robust to skewness as long as there are no outliers in the sample data.
D.The t-test is only valid if the sample mean is equal to the sample median.
Correct Answer: The t-test is robust against violations of normality, but only for two-tailed tests and large sample sizes (). With and heavy skew, the results are unreliable.
Explanation:
The t-test's validity relies on the assumption that the sampling distribution of the mean is approximately normal. The Central Limit Theorem ensures this for large sample sizes (often cited as ). However, with a small sample size like drawn from a heavily skewed population, the sampling distribution of the mean will also be skewed. This violates the core assumption of the t-test, making its p-value and confidence intervals unreliable. In this scenario, a non-parametric alternative like the Wilcoxon signed-rank test would be more appropriate.
Incorrect! Try again.
49If an F-test is used as the omnibus test in a one-way ANOVA comparing the means of groups, and the null hypothesis is rejected, what does this imply about the population variances of the groups?
F-test
Hard
A.It implies that the population variances are unequal, which is why the means are different.
B.It implies that all population variances must be equal for the test to be valid; this is a core assumption, not a conclusion.
C.It implies that at least one of the population variances is different from the others.
D.It implies nothing about the variances, as the F-test in ANOVA compares means, not variances.
Correct Answer: It implies that all population variances must be equal for the test to be valid; this is a core assumption, not a conclusion.
Explanation:
This question tests the distinction between the F-test for equality of variances and the F-test within an ANOVA. The F-test in ANOVA compares the variance between groups to the variance within groups to make an inference about the population means. A fundamental assumption of ANOVA is homoscedasticity, meaning all groups are assumed to have equal population variances (). Rejecting the null hypothesis in an ANOVA says nothing about these variances; it only suggests that at least two population means are different. The equality of variances is a precondition for the test, not a result of it.
Incorrect! Try again.
50When performing a Chi-square goodness-of-fit test, a researcher finds that 3 out of 8 categories have expected frequencies less than 5. What is the most appropriate corrective action to ensure the validity of the test?
Chi-square test for goodness of fit
Hard
A.Remove the categories with low expected frequencies from the analysis entirely.
B.Combine adjacent or logically related categories to ensure all new categories have expected frequencies of at least 5.
C.Use a Z-test instead, as it does not have requirements for expected frequencies.
D.Proceed with the test, as the rule is only a guideline and is not strict.
Correct Answer: Combine adjacent or logically related categories to ensure all new categories have expected frequencies of at least 5.
Explanation:
The Chi-square test's statistical theory relies on the test statistic following a Chi-square distribution, which is an approximation that holds well when expected frequencies are sufficiently large (a common rule of thumb is at least 5). When this condition is violated, the p-value can be inaccurate. The standard procedure is to collapse or pool categories in a logical manner until the expected frequency in each new, combined category meets the minimum requirement. This reduces the number of categories () and thus the degrees of freedom, but it makes the test valid.
Incorrect! Try again.
51A pharmaceutical company is testing 10 new, completely ineffective drugs against a placebo. For each drug, a separate hypothesis test (: drug has no effect) is conducted at a significance level of . What is the probability that at least one of these tests results in a Type I error, leading the company to falsely conclude that an ineffective drug is effective?
types of error
Hard
A.Exactly 0.05
B.
C.
D.
Correct Answer:
Explanation:
This is the multiple comparisons problem. For a single test, the probability of NOT making a Type I error is . Since the 10 tests are independent, the probability of not making a Type I error in ANY of the 10 tests is . The event of 'at least one Type I error' is the complement of 'no Type I errors'. Therefore, the probability is , which is approximately 0.40. This demonstrates how the family-wise error rate inflates when conducting multiple tests.
Incorrect! Try again.
52A researcher measures the reaction time of 20 subjects before and after consuming caffeine. They incorrectly analyze the data using a two-sample independent t-test instead of a paired t-test. If there is a strong positive correlation between the before and after measurements (i.e., subjects who are fast before are also fast after), how did this analytical error likely affect the outcome?
Student's t-test for difference of means
Hard
A.It would greatly overestimate the standard error of the mean difference, making the test less powerful and increasing the chance of a Type II error.
B.It would have no significant effect on the outcome since the mean difference is the same.
C.It would greatly underestimate the standard error, inflating the t-statistic and increasing the chance of a Type I error.
D.It would result in incorrect degrees of freedom ( instead of ), but the standard error calculation would be approximately correct.
Correct Answer: It would greatly overestimate the standard error of the mean difference, making the test less powerful and increasing the chance of a Type II error.
Explanation:
A paired t-test properly accounts for the inter-subject variability by analyzing the differences within each subject. An independent t-test incorrectly treats this variability as random error. With a strong positive correlation between measurements, the variance of the differences (used in a paired test) is much smaller than the sum of the variances of the two groups (used in an independent test). By using the independent test, the researcher includes the large variability between subjects, which inflates the calculated standard error, reduces the t-statistic, and makes it much harder to detect a true effect (i.e., it decreases the power of the test).
Incorrect! Try again.
53A Chi-square goodness-of-fit test for a discrete uniform distribution with categories results in a test statistic of . The sample size is . What is the most likely interpretation of this result?
Chi-square test for goodness of fit
Hard
A.We fail to reject the null hypothesis, which is an inconclusive result.
B.The null hypothesis is strongly supported, and we can be very confident the data came from a uniform distribution.
C.A calculation error must have occurred, as the test statistic is too low to be possible.
D.The fit is almost perfect, possibly indicating that the data is not genuinely random or was fabricated to fit the model.
Correct Answer: The fit is almost perfect, possibly indicating that the data is not genuinely random or was fabricated to fit the model.
Explanation:
The expected value of a Chi-square distribution is its degrees of freedom, which in this case is . A test statistic of 0.5 is extremely close to zero, indicating that the observed frequencies are much closer to the expected frequencies than one would expect due to random sampling variability. This results in a very high p-value (close to 1.0). While this means we do not reject the null hypothesis, a value this low can be suspicious. It suggests the data fits the model 'too well,' which can be a red flag for issues like data fabrication or a misunderstanding of the random process being studied.
Incorrect! Try again.
54A two-tailed Z-test for a single mean ( vs ) is conducted at . The test yields a statistic of . What would be the p-value and conclusion for a one-tailed test with the alternative hypothesis using the exact same sample data?
Z-test for single mean
Hard
A.The conclusion cannot be determined without knowing the sample mean and standard deviation.
B.p-value = 0.072; Fail to reject .
C.p-value = 0.964; Fail to reject .
D.p-value = 0.036; Reject .
Correct Answer: p-value = 0.036; Reject .
Explanation:
The p-value for a two-tailed test with is . Since , we would fail to reject in the two-tailed test. However, for a one-tailed test with , the p-value is simply the area in the tail corresponding to the test statistic, provided the statistic is in the direction of the alternative. Since is negative, it is in the direction of . Therefore, the one-tailed p-value is . Since , we would reject in the one-tailed test. This shows how the same data can lead to different conclusions depending on the hypothesis.
Incorrect! Try again.
55In an F-test for equality of variances, a researcher calculates the F-statistic as with degrees of freedom. Suppose they had instead calculated the statistic as . How does the upper critical value for this second test, , relate to the critical values of the first test for a two-tailed test at significance level ?
F-test
Hard
A. is the reciprocal of the upper critical value of the first test.
B. is equal to the upper critical value of the first test, but with the degrees of freedom swapped.
C. is unrelated to the critical values of the first test.
D.The upper critical value for the second test is the reciprocal of the lower critical value of the first test.
Correct Answer: The upper critical value for the second test is the reciprocal of the lower critical value of the first test.
Explanation:
The F-distribution is not symmetric. The critical values are denoted . A property of the F-distribution is that . The lower critical value for the first test (with df) is the reciprocal of the upper critical value for an F-distribution with the degrees of freedom swapped. Therefore, the upper critical value for the second test () is the reciprocal of the lower critical value for the first test ().
Incorrect! Try again.
56A 95% confidence interval for a mean is calculated as [50, 60] from a sample of size . Which of the following statements about a 95% prediction interval for a single new observation from the same population is correct?
Student's t-test for single mean
Hard
A.The prediction interval will be wider than [50, 60] because it must account for both the uncertainty in the mean and the individual data point's variability.
B.The prediction interval will be narrower than [50, 60] because it's for a single observation.
C.It is impossible to compare the two intervals without knowing the sample standard deviation.
D.The prediction interval will be identical to the confidence interval, [50, 60].
Correct Answer: The prediction interval will be wider than [50, 60] because it must account for both the uncertainty in the mean and the individual data point's variability.
Explanation:
A confidence interval estimates the range for the population mean. A prediction interval estimates the range for a single future data point. The uncertainty in predicting a single point comes from two sources: the uncertainty in estimating the true population mean (captured by the confidence interval) and the inherent random variability of individual data points around the mean. Because it incorporates this additional source of variance, a prediction interval is always wider than a confidence interval for the mean calculated from the same data.
Incorrect! Try again.
57Two independent samples of sizes and are drawn. The population standard deviations and are unknown. A researcher decides to use a Z-test for the difference of means. In which scenario is this choice most justifiable?
Z-test for difference of means
Hard
A.It is justifiable only if both populations are known to be perfectly normally distributed.
B.This choice is never justifiable; a t-test must always be used when population standard deviations are unknown.
C.It is justifiable if the sample variances and are very close to each other.
D.It is justifiable because the sample sizes are both greater than 30, allowing the sample standard deviations () to be used as reliable estimates for the population standard deviations ().
Correct Answer: It is justifiable because the sample sizes are both greater than 30, allowing the sample standard deviations () to be used as reliable estimates for the population standard deviations ().
Explanation:
While the Student's t-test is technically the correct test when population variances are unknown, the t-distribution converges to the standard normal (Z) distribution as the degrees of freedom increase. With sample sizes of 40 and 50, the degrees of freedom will be large enough (df > 30) that the t-distribution is nearly identical to the Z-distribution. In this case, using the sample standard deviations () as plug-in estimates for the population standard deviations () in the Z-test formula is a common and acceptable approximation.
Incorrect! Try again.
58The power of a statistical test is when testing against , with known and a fixed and . If the true population mean is actually , how would the actual power of the test compare to 0.80?
types of error
Hard
A.The actual power would be less than 0.80.
B.The actual power would be exactly 0.80.
C.The actual power would be greater than 0.80.
D.The power cannot be determined without knowing and .
Correct Answer: The actual power would be greater than 0.80.
Explanation:
Power is the probability of correctly rejecting a false null hypothesis. It depends on the effect size, which is the difference between the true parameter value and the value under the null hypothesis. The original power calculation was based on an effect size corresponding to . If the true mean is , the true effect size is larger ( vs ). A larger effect size is easier to detect, meaning the distribution under the true alternative hypothesis is further from the null distribution. This increases the probability of rejecting the null hypothesis, thus increasing the power of the test to a value greater than 0.80.
Incorrect! Try again.
59A Welch's t-test (unequal variances) for the difference between two independent means results in . The degrees of freedom are calculated using the Welch-Satterthwaite equation to be . How should the p-value be determined?
Student's t-test for difference of means
Hard
A.Use the standard normal (Z) distribution because the degrees of freedom are not an integer.
B.Round the degrees of freedom up to 19 and use the t-distribution with 19 df.
C.Use a t-distribution with the degrees of freedom truncated to the integer below, , as this provides a more conservative estimate of the p-value.
D.The test is invalid because degrees of freedom must be an integer.
Correct Answer: Use a t-distribution with the degrees of freedom truncated to the integer below, , as this provides a more conservative estimate of the p-value.
Explanation:
The Welch-Satterthwaite equation for degrees of freedom often produces a non-integer value. When using a t-distribution table (or software that requires integer df), the standard conservative practice is to truncate the value (round down) to the nearest integer. In this case, using instead of or . The t-distribution has slightly thicker tails for lower degrees of freedom. By using the lower integer df, the calculated p-value will be slightly larger, making the test more conservative and reducing the risk of a Type I error.
Incorrect! Try again.
60For a Chi-square goodness-of-fit test with categories, what is the theoretical maximum possible value of the Chi-square test statistic, ?
Chi-square test for goodness of fit
Hard
A.
B., the degrees of freedom.
C.The value is unbounded and can be infinitely large.
D.The sample size, .
Correct Answer:
Explanation:
The maximum value of the statistic occurs in the most extreme case where one category contains all observations and the other categories contain zero observations. Let's assume the -th category has all observations, so and for . The test statistic becomes . This sum simplifies to . This demonstrates that while the statistic can be very large, it is bounded by a function of the sample size and the number of categories.