1) Which statistical measure indicates the spread of data around the mean?
A) Standard Deviation
2) Which of the following is true about the correlation coefficient (r)?
A) It ranges from -1 to 1
3) What does a p-value help determine in hypothesis testing?
A) Statistical significance
4) Which of the following is not a probability distribution?
A) Gradient
5) What is multicollinearity in multiple regression?
A) Independent variables are correlated
6) Which statistical concept helps reduce overfitting in machine learning?
A) Regularization
7) Which of the following describes homoscedasticity?
A) Equal spread of residuals across all values
8) Which type of variable is used in regression analysis?
A) Continuous
9) Which of the following distributions is used for modeling rare events?
A) Poisson
10) Which of the following is NOT a measure of central tendency?
A) Variance
11) What is the median of the dataset: [4, 6, 8, 10, 12]?
A) 8
12) Which summary statistic is most affected by outliers?
A) Mean
13) Which metric is commonly used to detect outliers?
A) Interquartile Range (IQR)
14) Which probability distribution is symmetric and bell-shaped?
A) Normal distribution
15) What does a box plot NOT show?
A) Mean
16) What does a high variance in a dataset suggest?
A) Data points are widely spread out
17) Which evaluation metric is best for imbalanced classification problems?
A) Precision and Recall
18) Which test would be most appropriate for comparing the means of two groups?
A) T-test
19) Which of the following best describes a positively skewed distribution?
A) Mode < Median < Mean
20) If two variables are highly correlated, what can we infer?
A) They have a linear relationship
