- A negative correlation is one in which _____________. both variables move in opposite directions
- A testing set is ________. a small portion of a data set that is used to see whether a model works
- A training set helps train the model so it will perform well on new data. .csv
- A training set is ________. a large portion of a data set that is used to build a sound model
- If you don’t ensure that data is stored in the correct format (such as numeric or character), what can happen? Valid data can be treated as missing data.
- In a box plot, the interquartile range (IQR) contains: Upper quartile, median, and lower quartile
- To combine functions, use the _____________________. pipe operator
- Visualization, numerical measures for evaluation, and comparing between different models are all ways of doing what? Evaluating whether model results make sense
- What are descriptive statistics? A method for showing some basic features of a data set
- What does a P-score measure? It indicates whether the ANOVA test result is statistically significant.
- What is the purpose of an ANOVA test? It helps find correlations between different groups of a categorical variable.
- Which of the following is NOT a method for evaluating a regression model? Root absolute error (RAE)
- Which of the following is NOT a task facilitated by R? Data generation
- Which of the following is NOT one of the four groups in the tidyverse library? Practical Programming
- Which of the following is NOT true about a model? A model cannot predict a value given only one other value.
- Which of the following is NOT true about a model? The amount of data you have should have no effect on the accuracy of the model.
- Which of the following is NOT true about a regression line? The degree of slope has nothing to do with the correlation.
- Which of the following is NOT true of a scatter plot? It cannot suggest a linear relationship between two variables.
- Which of the following is NOT true? In a data set, data is usually collected from a single source and stored in a single format
- Which of these is NOT a method for normalizing data? Compound Y
Other Links:
See other websites for quiz:
Check on QUIZLET