BA6933 QUIZ SOLUTIONS

  1. How many scales of measurement exist?
a. 8
b. 2
c. 4
d. 6

2. The number of observations will always be the same as the

a. number of variables.
b. number of elements.
c. sample size.
d. population size.

3.The scale of measurement used for variable data that is simply a label for the purpose of identifying the attribute of an element is the

a. ratio scale.
b. interval scale.
c. ordinal scale.
d. nominal scale.

4. descriptive statistics?

a. Underfilling occurs when a plotted value is between the chart’s upper and lower control limits.
b. Underfilling occurs when a plotted value is below the chart’s lower control limit.
c. Underfilling occurs when a plotted value is both above the chart’s upper and below the chart’s lower limits.
d. Underfilling occurs when a plotted value is above the chart’s upper control limit.

5. Ordinary arithmetic operations are meaningful

a. only with quantitative data.
b. with neither quantitative or categorical data.
c. only with categorical data.
d. either with quantitative or categorical data.

6. Ordinary arithmetic operations are meaningful

a. only with quantitative data.
b. with neither quantitative or categorical data.
c. only with categorical data.
d. either with quantitative or categorical data

7. A portion of the population selected to represent the population is called

a. statistical inference.
b. a census.
c. descriptive statistics.
d. a sample.

8. Consider the following data summary.

This is an example of a _____.

a. histogram
b. frequency table
c. line graph
d. box plot

9. The subject of data mining deals with

a. keeping data secure so that unauthorized individuals cannot access the data.
b. computational procedure for data analysis.
c. computing the average for data.
d. methods for developing useful decision-making information from large data bases.

10. A survey to collect data on the entire population is

a. a population.
b. a sample.
c. an inference.
d. a census.

11. Which of the following is not an example of descriptive statistics?

a. The proportion of mailed-out questionnaires that were returned
b. A histogram depicting the age distribution for 30 randomly selected students
c. A table summarizing the data collected in a sample of new-car buyers
d. An estimate of the number of Alaska residents who have visited Canada

12. Which of the following disciplines has contributed the least to the development of data mining procedures?

a. Mathematics
b. Statistics
c. Psychology
d. Computer science

13. In a sample of 800 students in a university, 240 or 30% are Business majors. The 30% is an example of

a. a sample.
b. statistical inference.
c. a population.
d. descriptive statistics.

14. Optimization models, which generate solutions that maximize or minimize some objective subject to a set of constraints, fall into the category of

a. diagnostic analytics.
b. predictive analytics.
c. prescriptive analytics.
d. descriptive analytics.

15. The process of analyzing sample data in order to draw conclusions about the characteristics of a population is called

a. descriptive statistics.
b. data analysis.
c. data summarization.
d. statistical inference.

16. Dr. Kurt Thearling, a leading practitioner in the field, defines data mining as “the _________ extraction of _________ information from databases”.

a. automated, predictive
b. timely, accurate
c. thorough, insightful
d. intentional, useful

17. the field, defines data mining as “the _________ extraction of _________ information from databases”.

a. automated, predictive
b. timely, accurate
c. thorough, insightful
d. intentional, useful

18. Statistical inference

a. is the same as descriptive statistics.
b. is the same as a census.
c. refers to the process of drawing inferences about the sample based on the characteristics of the population.
d. is the process of drawing inferences about the population based on the information taken from the sample.

19. Data

a. are always numeric.
b. are the raw material of statistics.
c. are always non-numeric.
d. are always categorical

20. A population is

a. the collection of all items of interest in a particular study.
b. the same as a sample.
c. the selection of a random sample.
d. always the same size as the sample

21. Which of the following is not an example of descriptive statistics?

a. A histogram depicting the age distribution for 30 randomly selected students
b. The proportion of mailed-out questionnaires that were returned
c. An estimate of the number of Alaska residents who have visited Canada
d. A table summarizing the data collected in a sample of new-car buyers

22. How many scales of measurement exist?

a. 2
b. 4
c. 6
d. 8

23. A statistics professor asked students in a class their ages. On the basis of this information, the professor states that the average age of all the students in the university is 24 years. This is an example of

a. descriptive statistics.
b. an experiment.
c. statistical inference.
d. a census.

24. A characteristic of interest for the elements is called a

a. variable.
b. sample.
c. quality.
d. data set

25. Data measured a nominal scale

a. must rank order the data.
b. must be alphabetic.
c. must be numeric.
d. can be either numeric or nonnumeric.

26. Statistical studies in which researchers control variables of interest are

a. experimental studies.
b. non-experimental studies.
c. control observational studies.
d. observational studies.

27. In a sample of 1,600 registered voters, 912 or 57% approve of the way the President is doing his job. The 57% approval is an example of

a. descriptive statistics.
b. a population.
c. a sample.
d. statistical inference.

28. The set of analytical techniques that yield a best course of action is

a. diagnostic analytics.
b. predictive analytics.
c. descriptive analytics.
d. prescriptive analytics.

29. A sample of 100 individuals in a town was asked how much they paid in property tax per year. On the basis of this information, the reporter states that the average property tax bill of all residents of the town is $1,500. This is an example of _____.

a. descriptive statistics
b. a census
c. an experiment
d. statistical inference

30. The height of a building, measured in feet, is an example of

a. feet data.
b. categorical data.
c. quantitative data.
d. either categorical or quantitative data.

31. Optimization models, which generate solutions that maximize or minimize some objective subject to a set of constraints, fall into the category of

a. diagnostic analytics.
b. prescriptive analytics.
c. descriptive analytics.
d. predictive analytics.

32. The scale of measurement that has an inherent zero value defined is the

a. ratio scale.
b. nominal scale.
c. interval scale.
d. ordinal scale.

33. Which of the following defines the term “statistics”?

a. Statistics are rarely useful and informative.
b. Statistics refers only to the calculation of numbers, such as a mean.
c. Statistics is the art and science of collecting, analyzing, presenting, and interpreting data.
d. Statistics are used only in sports to calculate “stats” for teams and players such as average rushing yards.

34. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The median is

a. 86.
b. 85.
c. 84.
d. 87.

35. skewness is

a. 0.
b. positive.
c. 1.
d. .5.

36. The weights (in pounds) of a sample of 36 individuals were recorded and the following statistics were calculated.

mean = 160range = 60
mode = 165variance = 324
median = 170

The coefficient of variation equals

a. 11.25%.
b. 0.1125%.
c. 0.20312%.
d. 203.12%.

37. The value which has half of the observations above it and half the observations below it is called the

a. mean.
b. range.
c. mode.
d. median.

38. Which of the following is not a measure of variability?

a. Range
b. Mode
c. Interquartile range
d. Standard deviation

39. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The mode is

a. 85.
b. 87.
c. 84.
d. 86

40. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The mode is

a. 85.
b. 87.
c. 84.
d. 86.

41. The coefficient of correlation

a. can be larger than 1.
b. cannot be larger than 1.
c. cannot be negative.
d. is the same as the coefficient of determination.

42. Which of the following symbols represents the size of the sample?

a. n
b. σ2
c. σ
d. N 

43. If a negative relationship exists between two variables, and y, which of the following statements is true?

a. As x decreases, y stays the same.
b. As x decreases, y decreases.
c. As x increases, y decreases.
d. As x increases, y increases.

44. Data that provide labels or names for categories of like items are known as

a. category data.
b. quantitative data.
c. label data.
d. categorical data.

45. Consider the scatter diagram below.

What type of relationship is shown for the number of students and their average score?

a. A positive relationship
b. A quadratic relationship
c. No apparent relationship
d. A negative relationship

46. Suppose a sample of 150 individuals was taken. Their gender and their preferred computer manufacturer was noted. Partial results of the study follow in a crosstabulation of column percentages.

If 80 of those in the study prefer Apple computers, how many males preferred Apple computers?

a. 30
b. 80
c. 64
d. 120

47. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 -920
10 – 1980
20 – 29200
30 – 39100

The relative frequency of students working 10 – 19 hours per week is

a. .25
b. .40
c. .80
d. .20

48. A graphical tool typically associated with the display of key performance indicators is a

a. data dashboard.
b. side-by-side bar chart.
c. stem-and-leaf display.
d. stacked bar chart.

49. Which of the following graphical methods shows the relationship between two variables?

a. Crosstabulation
b. Histogram
c. Pie chart
d. Dot plot

50. Which of the following is a graphical summary of a set of data in which each data value is represented by a dot above the axis?

a. Crosstabulation
b. Histogram
c. Box plot
d. Dot plot

51. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 -920
10 – 1980
20 – 29200
30 – 39100

The cumulative percent frequency for students working less than 20 hours per week is

a. 20%.
b. 80%.
c. 100%.
d. 25%.

52. Information on the number of new teachers hired in a school district for each of four years is given in the table below.

The percent frequency of new hires in 2019 is _____.

a. 25%
b. 10%
c. 80%
d. 40%

53. Histograms based on data on housing prices and salaries typically are

a. symmetric.
b. skewed to the right.
c. stacked.
d. skewed to the left.

54. A survey of 800 college seniors resulted in the following crosstabulation regarding their undergraduate major and whether or not they plan to go to graduate school.

                  Undergraduate Major
Graduate SchoolBusinessEngineeringOthersTotal
Yes7084126280
No182208130520
Total252292256800

Of those students who are majoring in business, what percentage plans to go to graduate school?

a. 8.75
b. 27.78
c. 72.22
d. 70.00

55. A graphical tool typically associated with the display of key performance indicators is a

a. stacked bar chart.
b. side-by-side bar chart.
c. data dashboard.
d. stem-and-leaf display

56. The total number of data items with a value less than the upper limit for the class is given by the

a. relative frequency distribution.
b. cumulative relative frequency distribution.
c. cumulative frequency distribution.
d. frequency distribution.

57. The relative frequency of a class is

a. equal to the frequency of the class multiplied by 100%.
b. equal to the frequency of the class.
c. always equal to 1%.
d. equal to the frequency of the class divided by the total number of observations, n.

58. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 – 920
10 – 1980
20 – 29200
30 – 39100

The percentage of students who work at least 10 hours per week is

a. 5%.
b. 50%.
c. 100%.
d. 95%.

59. Growth factors for the population of Chattanooga in the past two years have been 8 and 12. The geometric mean has a value of

a. √96.
b. 20.
c. 96.
d. √20.

60. The 75th percentile is referred to as the

a. second quartile.
b. third quartile.
c. fourth quartile.
d. first quartile

61. During a cold winter, the temperature stayed below zero for ten days (ranging from -20 to -5). The variance of the temperatures of the ten-day period

a. must be at least zero.
b. can be either negative or positive.
c. cannot be computed since all the numbers are negative.
d. is negative since all the numbers are negative.

62. The correlation coefficient

a. is the same as the covariance.
b. can be larger than 1.
c. cannot be less than zero.
d. can be negative.

63. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The median is

a. 86.
b. 87.
c. 85.
d. 84

64. The pth percentile is a value such that approximately

a. p percent of the observations are less than the value and p percent are more than this value.
b. (100 – p) percent of the observations are less than the value and p percent are more than this value.
c. (100 – p) percent of the observations are less than the value and (100 – p) percent are more than this value.
d. p percent of the observations are less than the value and (100 – p) percent are more than this value.

65. In a five number summary, which of the following is not used for data summarization?

a. The largest value
b. The mean
c. The 25th percentile
d. The smallest value

66. A graphical presentation of the relationship between two quantitative variables is

a. histogram.
b. stem-and-leaf display.
c. scatter diagram.
d. dot plot.

67. A graphical presentation of the relationship between two quantitative variables is

a. histogram.
b. stem-and-leaf display.
c. scatter diagram.
d. dot plot.

68. Before drawing any conclusions about the relationship between two variables shown in a crosstabulation, you should

a. construct a dot plot and look for significant gaps.
b. construct a scatter diagram and find the trendline.
c. develop a relative frequency distribution.
d. investigate whether any hidden variables could affect the conclusions.

69. Before drawing any conclusions about the relationship between two variables shown in a crosstabulation, you should

a. construct a dot plot and look for significant gaps.
b. construct a scatter diagram and find the trendline.
c. develop a relative frequency distribution.
d. investigate whether any hidden variables could affect the conclusions.

70. A survey of 800 college seniors resulted in the following crosstabulation regarding their undergraduate major and whether or not they plan to go to graduate school.

                  Undergraduate Major
Graduate SchoolBusinessEngineeringOthersTotal
Yes7084126280
No182208130520
Total252292256800

Of those students who are majoring in business, what percentage plans to go to graduate school?

a. 72.22
b. 70.00
c. 27.78
d. 8.75

71. Consider the scatter diagram below.

What type of relationship is shown for the number of students and their average score?

a. A negative relationship
b. No apparent relationship
c. A positive relationship
d. A quadratic relationship

72. When the conclusions based upon the unaggregated data can be completely reversed if we look at the aggregated crosstabulation, the occurrence is known as

a. Reverse correlation.
b. Simpson’s paradox.
c. Pareto’s rule.
d. Negative correlation.

73. The proper way to construct a stem-and-leaf display for the data set {62, 67, 68, 73, 73, 79, 91, 94, 95, 97} is to

a. include a stem labeled ‘(8)’ and enter no leaves on the stem.
b. include a stem labeled ‘8’ and enter one leaf value of ‘0’ on the stem.
c. include a stem labeled ‘8’ and enter no leaves on the stem.
d. exclude a stem labeled ‘8

74. Data that indicate how much or how many are known as

a. quantitative data.
b. categorical data.
c. cumulative data.
d. relative data.

75. The approximate class width for a frequency distribution involving quantitative data can be determined using the expression

a. desired number of classes/class midpoint.
b. mean frequency/total frequency.
c. range/desired number of classes.
d. total frequency/class midpoint

76. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 – 920
10 – 1980
20 – 29200
30 – 39100

The class width used in this frequency distribution is

a. 39.
b. 4.5.
c. 9.
d. 10.

77. A sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following distributions would be inappropriate for this data?

a. Relative frequency
b. Frequency
c. Percent frequency
d. Cumulative frequency

78. In a scatter diagram, a line that provides an approximation of the relationship between the variables is known as a

a. correlation axis.
b. trend line.
c. zero-bias line.
d. determination line.

79. sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following displays is most appropriate for this data?

a. Histogram
b. Stacked bar chart
c. Side-by-side bar chart
d. Pie chart

80. The difference between the lower class limits of adjacent classes provides the

a. class width.
b. class limits.
c. class midpoint.
d. number of classes.

81. The percent frequency of a class is computed by

a. dividing the relative frequency by 100.
b. multiplying the relative frequency by 10.
c. adding 100 to the relative frequency.
d. multiplying the relative frequency by 100.

82. A survey of 800 college seniors resulted in the following crosstabulation regarding their undergraduate major and whether or not they plan to go to graduate school.

                  Undergraduate Major
Graduate SchoolBusinessEngineeringOthersTotal
Yes7084126280
No182208130520
Total252292256800

The above crosstabulation shows

a. frequencies.
b. row percentages.
c. column percentages.
d. overall percentages.

83. The numerical value of the variance

a. is always smaller than the numerical value of the standard deviation.
b. is negative if the mean is negative.
c. is always larger than the numerical value of the standard deviation.
d. can be larger or smaller than the numerical value of the standard deviation.

84. variability is overcome by interquartile range?

a. The sum of the range variances is zero
b. The range is influenced too much by extreme values
c. The range is difficult to compute
d. The range is negative

85. A box plot is a graphical representation of data that is based on

a. a histogram.
b. z-scores.
c. a five number summary.
d. the empirical rule

86. What can be concluded from the scatter diagram below for the two variables, years of education and unemployment rate?

a. As years of education increase, the unemployment rate increases; therefore, the correlation coefficient, rxy, will be negative.
b. The correlation coefficient will be equal to zero.
c. As years of education increase, the unemployment rate decreases; therefore, the correlation coefficient, rxy, will be positive.
d. The covariance will be negative.

87. Growth factors for the population of Chattanooga in the past two years have been 8 and 12. The geometric mean has a value of

a. √96.
b. 96.
c. √20.
d. 20

88. Generally, which one of the following is the least appropriate measure of central tendency for a data set that contains outliers?

a. 50th percentile
b. Median
c. 2nd quartile
d. Mean

89. Statements about the proportion of data values that must be within a specified number of standard deviations of the mean can be made using

a. A five-number summary.
b. Percentiles.
c. Chebyshev’s theorem.
d. The empirical rule.

90. The relative frequency of a class is computed by

a. dividing the midpoint of the class by the sample size.
b. dividing the frequency of the class by the sample size.
c. dividing the sample size by the frequency of the class.
d. dividing the frequency of the class by the midpoint.

91. The coefficient of variation is

a. the square of the standard deviation.
b. the same as the variance.
c. the standard deviation divided by the mean times 100.
d. the mean divided by the standard deviation.

92. When a percentage of the smallest and largest values are deleted from a data set, the mean of the remaining data values is the

a. weighted mean.
b. trimmed mean.
c. interquartile mean.
d. geometric mean

93. Since the mode is the most frequently occurring data value, it

a. is always larger than the median.
b. is always larger than the mean.
c. can never be larger than the mean.
d. None of these alternatives are correct.

94. The nth root of the product of the n observations is the

a. product variance.
b. weighted mean.
c. geometric mean.
d. product deviation.

95. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 -920
10 – 1980
20 – 29200
30 – 39100

The cumulative percent frequency for students working less than 20 hours per week is

a. 80%.
b. 25%.
c. 20%.
d. 100%

96. The relative frequency of a class is

a. equal to the frequency of the class divided by the total number of observations, n.
b. equal to the frequency of the class.
c. always equal to 1%.
d. equal to the frequency of the class multiplied by 100%

97. A sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following is the correct relative frequency for McDonalds?

a. .5
b. .6
c. .27
d. .4

98. If several frequency distributions are constructed from the same data set, the distribution with the widest class width will have the

a. most classes.
b. fewest classes.
c. smallest total frequency.
d. largest total frequency.

99. A display used to compare the frequency, relative frequency or percent frequency of two categorical variables is a

a. stacked bar chart.
b. scatter diagram.
c. pie chart.
d. stem-and-leaf display

100. A frequency distribution is a tabular summary of data showing the

a. fraction of items in several classes.
b. relative percentage of items in several classes.
c. percentage of items in several classes.
d. number of items in several classes.

101. When the conclusions based upon the unaggregated data can be completely reversed if we look at the aggregated crosstabulation, the occurrence is known as

a. Pareto’s rule.
b. Negative correlation.
c. Simpson’s paradox.
d. Reverse correlation.

102. unaggregated data can be completely reversed if we look at the aggregated crosstabulation, the occurrence is known as

a. Pareto’s rule.
b. Negative correlation.
c. Simpson’s paradox.
d. Reverse correlation.

103. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The mode is

a. 84.
b. 87.
c. 85.
d. 86.

104. Geometric mean is a measure of

a. variability.
b. dispersion.
c. location.
d. weight.

105. Growth factors for the population of Chattanooga in the past two years have been 8 and 12. The geometric mean has a value of

a. √20.
b. 20.
c. 96.
d. √96.

106. From a population of size 500, a random sample of 50 items is selected. The mode of the sample

a. can be larger, smaller or equal to the mode of the population.
b. must be 500.
c. must be equal to the mean of the population, if the sample is truly random.
d. must be equal to the mode of population, if the sample is truly random.

107. A numerical measure of linear association between two variables is the

a. variance.
b. standard deviation.
c. coefficient of variation.
d. covariance.

108. The geometric mean of 1, 1, 8 is

a. 10.0.
b. 2.0.
c. 3.0.
d. 3.33.

109. Which of the following is not a measure of variability of a single variable?

a. Range
b. Standard deviation
c. Covariance
d. Interquartile range

110. Arithmetic operations provide meaningful results for variables that

a. use any scale of measurement except nominal.
b. are quantitative.
c. appear as non-numerical values.
d. have non-negative values

111. Which scale of measurement can be either numeric or non-numeric?

a. Nominal
b. Quantitative
c. Interval
d. Ratio

112. Which of the following is a categorical variable?

a. Your age on your last birthday
b. Your accounting class start time
c. Your high school graduation year
d. Your cell phone area code

113. Ordinary arithmetic operations are meaningful

a. either with quantitative or categorical data.
b. only with quantitative data.
c. with neither quantitative or categorical data.
d. only with categorical data

114. In a sample of 1,600 registered voters, 912 or 57% approve of the way the President is doing his job. A political pollster estimates: “Fifty-seven percent of all voters approve of the President.” This statement is an example of

a. statistical inference.
b. a sample.
c. a population.
d. descriptive statistics

115. A sample of 100 individuals in a town was asked how much they paid in property tax per year. On the basis of this information, the reporter states that the average property tax bill of all residents of the town is $1,500. This is an example of _____.

a. a census
b. statistical inference
c. descriptive statistics
d. an experiment

116. Income is an example of a variable that uses the

a. nominal scale.
b. ordinal scale.
c. interval scale.
d. ratio scale.

117. The proper way to construct a stem-and-leaf display for the data set {62, 67, 68, 73, 73, 79, 91, 94, 95, 97} is to

a. include a stem labeled ‘8’ and enter no leaves on the stem.
b. exclude a stem labeled ‘8.
c. include a stem labeled ‘(8)’ and enter no leaves on the stem.
d. include a stem labeled ‘8’ and enter one leaf value of ‘0’ on the stem.

118. Which of the following is least useful in making comparisons or showing the relationships of two variables?

a. Stacked bar chart
b. Crosstabulation
c. Scatter diagram
d. Stem-and-leaf display

119. Which of the following is not a recommended guideline for creating an effective graphical display?

a. Give the display a clear and concise title
b. Use three dimensions whenever possible, to give the display depth
c. Label each axis and show the units of measure
d. If colors are used to distinguish categories, use a legend to define them

120. The relative frequency of a class is

a. equal to the frequency of the class.
b. equal to the frequency of the class multiplied by 100%.
c. always equal to 1%.
d. equal to the frequency of the class divided by the total number of observations, n.

121. A survey of 800 college seniors resulted in the following crosstabulation regarding their undergraduate major and whether or not they plan to go to graduate school.

                  Undergraduate Major
Graduate SchoolBusinessEngineeringOthersTotal
Yes7084126280
No182208130520
Total252292256800

Of those students who are majoring in business, what percentage plans to go to graduate school?

a. 8.75
b. 70.00
c. 72.22
d. 27.78

122. A sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following is the correct frequency distribution?

a. McDonalds 4, Friday’s 3, Pizza Hut 1, Mellow Mushroom 4, Luppi’s 3, Taco Bell 1
b. McDonalds 6, Friday’s 1, Pizza Hut 3, Mellow Mushroom 1, Luppi’s 2, Taco Bell 2
c. McDonalds 6, Friday’s 2, Pizza Hut 2, Mellow Mushroom 2, Luppi’s 2, Taco Bell 1
d. None of these alternatives are correct

123. A graphical device for depicting categorical data that have been summarized in a frequency distribution, relative frequency distribution, or percent frequency distribution is a

a. histogram.
b. bar chart.
c. dot plot.
d. stem-and-leaf display.

124. The interquartile range is

a. the difference between the third quartile and the first quartile.
b. the difference between the largest and smallest values.
c. the 50th percentile.
d. another name for the variance.

125. The percentage of data values that must be within one, two, and three standard deviations of the mean for data having a bell-shaped distribution can be determined using

a. Percentiles.
b. A five-number summary.
c. The empirical rule.
d. Chebyshev’s theorem.

126. The most frequently occurring value of a data set is called the

a. median.
b. mode.
c. outlier.
d. mean.

127. The nth root of the product of the n observations is the

a. product variance.
b. weighted mean.
c. product deviation.
d. geometric mean.

128. If the data distribution is symmetric, the skewness is

a. 0.
b. .5.
c. 1.
d. positive.

129. The value of the sum of the deviations from the mean, i.e., Σ(x-x̄) must always be

a. less than the zero.
b. either positive or negative depending on whether the mean is negative or positive.
c. negative.
d. zero.

130. Suppose a sample of 45 measurements gave a data set with a range of –8 to –22. The standard deviation of the measurements

a. must be at least zero.
b. cannot be computed since all the numbers are negative.
c. can be either negative or positive.
d. is negative since all the numbers are negative

131. Suppose a sample of 45 measurements gave a data set with a range of –8 to –22. The standard deviation of the measurements

a. must be at least zero.
b. cannot be computed since all the numbers are negative.
c. can be either negative or positive.
d. is negative since all the numbers are negative

132. The owner of a factory regularly requests a graphical summary of all employees’ salaries. The graphical summary of salaries is an example of

a. descriptive statistics.
b. an experiment.
c. a sample.
d. statistical inference.

133. Data measured a nominal scale

a. can be either numeric or nonnumeric.
b. must be numeric.
c. must be alphabetic.
d. must rank order the data.

134. A characteristic of interest for the elements is called a

a. sample.
b. data set.
c. quality.
d. variable.

135. Dr. Kurt Thearling, a leading practitioner in the field, defines data mining as “the _________ extraction of _________ information from databases”.

a. intentional, useful
b. timely, accurate
c. thorough, insightful
d. automated, predictive

136. Which of the following variables uses the interval scale of measurement?

a. Time duration
b. Standardized test score
c. Vehicle miles-per-gallon
d. Student ID number

137. Statistical studies in which researchers control variables of interest are

a. observational studies.
b. experimental studies.
c. control observational studies.
d. non-experimental studies.

138. An interviewer has made an error in recording the data. This type of error is known as

a. an experimental error.
b. a conglomerate error.
c. a non-experimental error.
d. a data acquisition error.

139. The process of capturing, storing, and maintaining data is known as

a. data mining.
b. data warehousing.
c. big data.
d. data manipulation.

140. Income is an example of a variable that uses the

a. ratio scale.
b. interval scale.
c. ordinal scale.
d. nominal scale.

141.

Different methods of developing useful information from large data bases are dealt with under

a. data mining.
b. data manipulation.
c. data warehousing.
d. big data.

142. The process of analyzing sample data in order to draw conclusions about the characteristics of a population is called

a. data summarization.
b. descriptive statistics.
c. statistical inference.
d. data analysis.

143. A characteristic of interest for the elements is called a(n) _____.

a. observation
b. variable
c. sample
d. data set

144. Which of the following is not a scale of measurement?

a. Primal
b. Interval
c. Nominal
d. Ordinal

145. The average age in a sample of 190 students at City College is 22. As a result of this sample, it can be concluded that the average age of all the students at City College

a. could not be 22.
b. must be less than 22, since the sample is only a part of the population.
c. must be more than 22, since the population is always larger than the sample.
d. is around 22.

146. The subject of data mining deals with

a. computing the average for data.
b. methods for developing useful decision-making information from large data bases.
c. keeping data secure so that unauthorized individuals cannot access the data.
d. computational procedure for data analysis.

147. The process of capturing, storing, and maintaining data is known as

a. data manipulation.
b. data mining.
c. big data.
d. data warehousing.

148. The number of observations will always be the same as the

a. population size.
b. number of elements.
c. sample size.
d. number of variables.

149. The measurement scale suitable for quantitative data is

a. either interval or ratio scale.
b. ordinal scale.
c. nominal scale.
d. only interval scale

150. On a street, the houses are numbered from 300 to 450. The house numbers are examples of

a. neither quantitative nor categorical data.
b. both quantitative and categorical data.
c. categorical data.
d. quantitative dat

151. How many scales of measurement exist?

a. 6
b. 8
c. 4
d. 2

152. All the data collected in a particular study are referred to as the

a. population.
b. variable.
c. data set.
d. inference.

153.

A sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following is the correct percent frequency for McDonalds?

a. 10%
b. 27%
c. 2%
d. 40%

154. The number of sick days taken (per month) by 150 factory workers is summarized below.

The cumulative frequency for the class 11–15 is _____.

a. .10
b. .97
c. 145
d. 15

155. For stem-and-leaf displays where the leaf unit is not stated, the leaf unit is assumed to equal

a. -1.
b. 10.
c. 0.
d. 1.

156. The percent frequency of a class is computed by

a. multiplying the relative frequency by 10.
b. dividing the relative frequency by 100.
c. adding 100 to the relative frequency.
d. multiplying the relative frequency by 100.

157. Data that indicate how much or how many are known as

a. relative data.
b. cumulative data.
c. categorical data.
d. quantitative data.

158. Which of the following is not a measure of variability of a single variable?

a. Interquartile range
b. Standard deviation
c. Covariance
d. Range

159. The geometric mean of 1, 2, 4, and 10 is

a. 4.0.
b. 17.0.
c. 2.99.
d. 4.25

160. The weights (in pounds) of a sample of 36 individuals were recorded and the following statistics were calculated.

mean = 160range = 60
mode = 165variance = 324
median = 170

The coefficient of variation equals

a. 203.12%.
b. 0.1125%.
c. 11.25%.
d. 0.20312%

161. The geometric mean of 2, 4, 8 is

a. 4.0.
b. 4.67.
c. 5.0.
d. 16.

162. The geometric mean of 1, 3, 5, and 6 is

a. 3.08.
b. 15.0.
c. 3.75.
d. 5.0.

163. Suppose a sample of 45 measurements gave a data set with a range of –8 to –22. The standard deviation of the measurements

a. can be either negative or positive.
b. cannot be computed since all the numbers are negative.
c. is negative since all the numbers are negative.
d. must be at least zero.

164. From a population of size 500, a random sample of 50 items is selected. The mode of the sample

a. can be larger, smaller or equal to the mode of the population.
b. must be equal to the mode of population, if the sample is truly random.
c. must be equal to the mean of the population, if the sample is truly random.
d. must be 500.

165. The symbol σ is used to represent

a. the standard deviation of the sample.
b. the standard deviation of the population.
c. the variance of the sample.
d. the variance of the population.

166. Geometric mean is a measure of

a. location.
b. weight.
c. variability.
d. dispersion.

167. A sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following is the correct relative frequency for McDonalds?

a. .5
b. .27
c. .4
d. .6

168. A researcher is gathering data from four geographical areas designated: South = 1; North = 2; East = 3; West = 4. The designated geographical regions represent

a. either categorical or quantitative data.
b. categorical data.
c. quantitative data.
d. crosstabular data

169. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 – 920
10 – 1980
20 – 29200
30 – 39100

The percentage of students who work at least 10 hours per week is

a. 5%.
b. 50%.
c. 100%.
d. 95%.

170. A survey of 800 college seniors resulted in the following crosstabulation regarding their undergraduate major and whether or not they plan to go to graduate school.

                  Undergraduate Major
Graduate SchoolBusinessEngineeringOthersTotal
Yes7084126280
No182208130520
Total252292256800

The above crosstabulation shows

a. frequencies.
b. column percentages.
c. overall percentages.
d. row percentages.

171. A graphical device for depicting categorical data that have been summarized in a frequency distribution, relative frequency distribution, or percent frequency distribution is a

a. histogram.
b. dot plot.
c. bar chart.
d. stem-and-leaf display.

172. A graphical device for depicting categorical data that have been summarized in a frequency distribution, relative frequency distribution, or percent frequency distribution is a

a. histogram.
b. dot plot.
c. bar chart.
d. stem-and-leaf display.

173. In 2000, the average hours of television watched weekly in a household was 5 with a standard deviation of 1.43. In 2002, average hours of television watched weekly in a household was 8 with a standard deviation of 3.12. Which of the following statements is correct?

a. The variance for the number of hours is equal for both years.
b. The number of hours is more variable in the year 2002.
c. The median number of hours of television watched weekly is 6.5.
d. The number of hours is more variable in the year 2000.

174. Which of the following descriptive statistics is not measured in the same units as the data?

a. Standard deviation
b. Variance
c. Interquartile range
d. 35th percentile

175. Since the mode is the most frequently occurring data value, it

a. is always larger than the mean.
b. can never be larger than the mean.
c. is always larger than the median.
d. None of these alternatives are correct

176. When n-1 is used in the denominator to compute variance,

a. the data set is from a census.
b. the data set is a population.
c. the data set could be either a sample or a population.
d. the data set is a sample.

177. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The mode is

a. 85.
b. 87.
c. 84.
d. 86

178. The closing stock price of MNM Corporation for the last 7 trading days is shown below.

DayStock Price
184
287
384
488
585
690
791

The mode is

a. 85.
b. 87.
c. 84.
d. 86.

179. Which of the following symbols represents the size of the population?

a. σ2
b. μ
c. N
d. σ

180. The correlation coefficient

a. can be larger than 1.
b. can be negative.
c. is the same as the covariance.
d. cannot be less than zero.

181. When computing the mean, the smallest value

a. can never be negative.
b. can never be zero.
c. can never be less than the mean.
d. can be any value.

182. The difference between the largest and the smallest data values is the

a. coefficient of variation.
b. interquartile range.
c. range.
d. variance

183. The relative frequency of a class is computed by

a. dividing the frequency of the class by n.
b. dividing the cumulative frequency of the class by n.
c. dividing n by cumulative frequency of the class.
d. dividing the frequency of the class by the number of classes

184. A graphical presentation of the relationship between two quantitative variables is

a. scatter diagram.
b. histogram.
c. dot plot.
d. stem-and-leaf display.

185. A survey of 800 college seniors resulted in the following crosstabulation regarding their undergraduate major and whether or not they plan to go to graduate school.

                  Undergraduate Major
Graduate SchoolBusinessEngineeringOthersTotal
Yes7084126280
No182208130520
Total252292256800

Of those students who are planning on going to graduate school, what percentage are majoring in engineering?

a. 28.8
b. 30.0
c. 40.4
d. 10.5

186. Data that indicate how much or how many are known as

a. quantitative data.
b. cumulative data.
c. categorical data.
d. relative data.

187. The numbers of hours worked (per week) by 400 statistics students are shown below.

Number of hoursFrequency
0 -920
10 – 1980
20 – 29200
30 – 39100

The cumulative percent frequency for students working less than 20 hours per week is

a. 20%.
b. 80%.
c. 25%.
d. 100%

188. A sample of 15 children shows their favorite restaurants:

McDonaldsLuppi’sMellow Mushroom
Friday’sMcDonaldsMcDonalds
Pizza HutTaco BellMcDonalds
Mellow MushroomLuppi’sPizza Hut
McDonaldsFriday’sMcDonalds

Which of the following displays is most appropriate for this data?

a. Stacked bar chart
b. Histogram
c. Pie chart
d. Side-by-side bar chart

189. In a cumulative frequency distribution, the last class will always have a cumulative frequency equal to

a. one.
b. 10.
c. 100%.
d. the total number of elements in the data set.

190. Data that indicate how much or how many are known as

a. relative data.
b. categorical data.
c. quantitative data.
d. cumulative data.

191. In a stem-and-leaf display,

a. one or more digits are used to define each stem, and a single digit is used to define each leaf.
b. one or more digits are used to define each stem, and one or more digits are used to define each leaf.
c. a single digit is used to define each stem, and a single digit is used to define each leaf.
d. a single digit is used to define each stem, and one or more digits are used to define each leaf.

192. A cumulative relative frequency distribution shows

a. the proportion of data items with values less than or equal to the lower limit of each class.
b. the proportion of data items with values less than or equal to the upper limit of each class.
c. the percentage of data items with values less than or equal to the upper limit of each class.
d. the percentage of data items with values less than or equal to the lower limit of each class.

193. During a cold winter, the temperature stayed below zero for ten days (ranging from -20 to -5). The variance of the temperatures of the ten-day period

a. can be either negative or positive.
b. must be at least zero.
c. cannot be computed since all the numbers are negative.
d. is negative since all the numbers are negative.

194. When computing the mean, the smallest value

a. can be any value.
b. can never be zero.
c. can never be negative.
d. can never be less than the mean.

195. If the sample standard deviation for the number of new vehicles sold per month for a sample of 6 car dealerships is 4.2, what is the variance for this set of data?

a. 4.2
b. 17.64
c. 2
d. .7

196. Which of the following symbols represents the standard deviation of the population?

a. σ
b. σ2
c. μ
d. 

197. Consider the following data as well as the corresponding stem-and-leaf display on the annual property taxes for eight residents of a city.

What is the leaf unit for the display?

a. 0.1
b. 1
c. 1000
d. 100

198. For stem-and-leaf displays where the leaf unit is not stated, the leaf unit is assumed to equal

a. 0.
b. -1.
c. 10.
d. 1.

199. The relative frequency of a class is computed by

a. dividing the frequency of the class by the midpoint.
b. dividing the midpoint of the class by the sample size.
c. dividing the sample size by the frequency of the class.
d. dividing the frequency of the class by the sample size.

200. The number of miles from their residence to their place of work for 120 employees is shown below.

The relative frequency of employees who drive 10 miles or less to work is _____.

a. 0.71
b. 0.85
c. 0.85
d. 0.25

Other Links:

Statistics Quiz

Networking Quiz

See other websites for quiz:

Check on QUIZLET

Check on CHEGG

Leave a Reply

Your email address will not be published. Required fields are marked *