STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4
STAT 200 OL4/US2 Sections Final Exam
Fall 2017
The final exam will be posted at 12:01 am on December 15, and it is due at 11:59 pm on December 17, 2017. Eastern Time is our reference time.
This is an open-book exam. You may refer to your text and other course materials for the current course as you work on the exam, and you may use a calculator. You must complete the exam individually. Neither collaboration nor consultation with others is allowed. It is a violation of the UMUC Academic Dishonesty and Plagiarism policy to use unauthorized materials or work from others.
Answer all 20 questions. Make sure your answers are as complete as possible. Show all of your supporting work and reasoning. Answers that come straight from calculators, programs or software packages without any explanation will not be accepted. If you need to use technology (for example, Excel, online or hand- held calculators, statistical packages) to aid in your calculation, you must cite the sources and explain how you get the results. You can only submit the Word doc answer sheet. Final Exam answer sheets must be uploaded before the deadline as they will not be accepted via email.
Record your answers and work on the separate Word doc answer sheet provided.
This exam has 20 questions; 5% for each question.
You must include the Honor Pledge on the title page of your submitted final exam. Exams submitted without the Honor Pledge will not be accepted.
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4
1.
True or False. Justify for full credit.
- (a) If A and B are disjoint, P(A) = 0.4 and P(B) = 0.5, then P(A OR B) = 0.9.
- (b) If the variance for a data set is zero, then all the observations in this data set must be
identical.
- (c) There may be more than one mode in a data set.
- (d) A 90% confidence interval is wider than a 95% confidence interval of the same parameter.
- (e) In a right-tailed test, the value of the test statistic is 2. The test statistic follows a
distribution with the distribution curve shown below. If we know the shaded area is 0.03, then we have sufficient evidence to reject the null hypothesis at 0.05 level of significance.
Choose the best answer. Justify for full credit.
UMUC STAT Club wanted to estimate the study hours of STAT 200 students. One STAT 200 section was randomly selected and all students from that section were asked to fill out the questionnaire. This type of sampling is called:
2.
(a)
(b)
A study was conducted at a local college to analyze the trend of average GPA of all students graduated from the college. According to the Registrar, the average GPA for students with economics major from the class of 2016 is 3.5. The value 3.5 is a
- (i) statistic
- (ii) parameter
- (iii) cannot be determined
(i) (ii) (iii) (iv)
cluster convenience systematic stratified
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 3 of 9
3.
(c)
(d)
(e)
The hotel ratings are usually on a scale from 0 star to 5 stars. The level of this measurement is
- (i) interval
- (ii) nominal
- (iii) ordinal
- (iv) ratio
500 students took a chemistry test. You sampled 100 students to estimate the average score and the standard deviation. How many degrees of freedom were there in the estimation of the standard deviation?
- (i) 99
- (ii) 100
- (iii) 499
- (iv) 500
You choose an alpha level of 0.01 and then analyze your data. What is the probability that you will make a Type I error given that the null hypothesis is true?
- (i) 0.025
- (ii) 0.05
- (iii) 0.01
- (iv) 0.10
A random sample of 500 students was chosen from UMUC STAT 200 classes. The frequency distribution below shows the distribution for study time each week (in hours). (Show all work. Just the answer, without supporting work, will receive no credit.)
Study Time (in hours)
Frequency
Relative Frequency
0.0 – 5.0
40
5.1 – 10.0
100
10.1 – 15.0
0.25
15.1 – 20.0
120
20.1 – 25.0
Total
500
(a)
(b) (c)
Complete the frequency table with frequency and relative frequency. Express the relative frequency to two decimal places.
What percentage of the study times was not more than 15 hours?
In what class interval must the median lie? 5.1 – 10.0, 10.1 -15.0, 15.1 – 20.0, or 20.1 – 25.0? Why?
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 4 of 9
4. The five-number summary below shows the grade distribution of a STAT 200 quiz for a sample of 500 students.
Answer each question based on the given information, and explain your answer in each case.
(a) (b) (c) (d) (e)
5.
(a) (b)
6.
(a) (b)
What is the minimum in the grade distribution?
Which quarter has the smallest spread of data? What is that spread?
Find the interquartile range in the grade distribution.
Are there more students in the score band of 45- 65 or 65 – 85? Why?
Can the average score be determined based on the given information? Why or why not?
A basket contains 1 white balls, 5 yellow balls, and 4 red balls. Consider selecting one ball at a time from the basket. (Show all work. Just the answer, without supporting work, will receive no credit.)
Assuming the ball selection is without replacement. What is the probability that the first ball is white and the second ball is red?
Assuming the ball selection is with replacement. What is the probability that the first ball is red and the second ball is also red?
There are 1000 juniors in a college. Among the 1000 juniors, 400 students are taking STAT200, and 700 students are taking PSYC300. There are 200 students taking both courses. Let S be the event that a randomly selected student takes STAT200, and P be the event that a randomly selected student takes PSYC300. (Show all work. Just the answer, without supporting work, will receive no credit.)
Provide a written description of the complement event of (S OR P). What is the probability of complement event of (S OR P)?
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 5 of 9
7. Consider rolling a fair 6-faced die twice. Let A be the event that the sum of the two rolls is equal to 7, and B be the event that the first one is an odd number.
- (a) What is the probability that the sum of the two rolls is equal to 7 given that the first one is an odd number? Show all work. Just the answer, without supporting work, will receive no credit.
- (b) Are event A and event B independent? Explain.
8. Answer the following two questions. (Show all work. Just the answer, without supporting work, will receive no credit).
- (a) UMUC Stat Club is sending a delegate of 2 members to attend the 2018 Joint Statistical Meeting in Vancouver. There are 10 qualified candidates. How many different ways can the delegate be selected?
- (b) A bike courier needs to make deliveries at 5 different locations. How many different routes can he take?
9. Imagine you are in a game show. There are 20 prizes hidden on a game board with 100 spaces. One prize is worth $50, nine are worth $20, and another ten are worth $10. You have to pay $5 to the host if your choice is not correct. Let the random variable x be the money you get or lose. Show all work. Just the answer, without supporting work, will receive no credit.
- (a) Complete the following probability distribution.
- (b) What is your expected winning or loss in this game? Be specific in your answer whether it’s winning or loss.
10. Mimi joined UMUC basketball team in spring 2017. On average, she is able to score 20% of the field goals. Assume she tries 10 field goals in a game.
- (a) Let X be the number of field goals that Mimi scores in the game. As we know, the distribution of X is a binomial probability distribution. What is the number of trials (n), probability of successes (p) and probability of failures (q), respectively?
- (b) Find the probability that Mimi scores at least 3 of the 10 field goals. (round the answer to 3 decimal places) Show all work. Just the answer, without supporting work, will receive no credit.
x
P(x)
-$5
$10
$20
$50
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 6 of 9
11. A research concludes that the number of hours of exercise per week for adults is normally distributed with a mean of 4 hours and a standard deviation of 2.5 hours. Show all work. Just the answer, without supporting work, will receive no credit.
- (a) What is the probability that a randomly selected adult has at least 5 hours of exercise per week (round the answer to 4 decimal places)
- (b) Find the 75th percentile for the distribution of exercise time per week. (round the answer to 2 decimal places)
12. Assume the SAT Mathematics Level 2 test scores are normally distributed with a mean of 500 and a standard deviation of 100. Show all work. Just the answer, without supporting work, will receive no credit.
- (a) Consider all random samples of 64 test scores. What is the standard deviation of the sample means? (Round your answer to three decimal places)
- (b) What is the probability that 64 randomly selected test scores will have a mean test score that is between 490 and 520? (Round your answer to four decimal places)
- In a study designed to test the effectiveness of garlic for lowering cholesterol, 49 adults were treated with garlic tablets. Cholesterol levels were measured before and after the treatment. The changes in their LDL cholesterol (in mg/dL) have a mean of 3 and standard deviation of 10. Construct a 95% confidence interval estimate of the mean change in LDL cholesterol after the garlic tablet treatment. Show all work. Just the answer, without supporting work, will receive no credit.
- Mimi conducted a survey on a random sample of 100 adults. 75 adults in the sample chose banana as his / her favorite fruit. Construct a 90% confidence interval estimate of the proportion of adults whose favorite fruit is banana. Show all work. Just the answer, without supporting work, will receive no credit.
- A researcher is interested in testing the claim that more than 80% of the adults believe in global warming. She conducted a survey on a random sample of 800 adults. The survey showed that 650 adults in the sample believe in global warming.
Assume the researcher wants to use a 0.05 significance level to test the claim.
- (a) Identify the null hypothesis and the alternative hypothesis.
- (b) Determine the test statistic. Show all work; writing the correct test statistic, without supporting
work, will receive no credit.
- (c) Determine the P-value for this test. Show all work; writing the correct P-value, without
supporting work, will receive no credit.
- (d) Is there sufficient evidence to support the claim that more than 80% of adults believe in global warming? Explain.
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 7 of 9
16.
In a study of memory recall, 5 people were given 10 minutes to memorize a list of 20 words. Each was asked to list as many of the words as he or she could remember both 1 hour and 24 hours later. The result is shown in the following table.
Number of Words Recalled
Subject
1 hour later 24 hours later
1 2 3 4 5
13 13 18 14 12 11 15 14 11 11
(a) (b)
(c) (d)
17.
(a) (b)
(c) (d)
Is there evidence to suggest that the mean number of words recalled after 1 hour exceeds the mean recall after 24 hours? Assume we want to use a 0.05 significance level to test the claim.
Identify the null hypothesis and the alternative hypothesis.
Determine the test statistic. Show all work; writing the correct test statistic, without supporting
work, will receive no credit.
Determine the P-value. Show all work; writing the correct P-value, without supporting work, will receive no credit.
Is there sufficient evidence to support the claim that the mean number of words recalled after 1 hour exceeds the mean recall after 24 hours? Justify your conclusion.
John oversees a bottle-filling machine in a company. The amount of fluid dispensed into each bottle is approximately normally distributed with an unknown population standard deviation. On a particular day, a random sample of 400 bottles yielded a mean of 357.2 ml and a standard deviation of 16.2 ml. John then concluded that the population standard deviation of the fluid dispense amount by the machine is greater than 15 ml.
The lead engineer wants to use a 0.05 significance level to test John’s claim.
Identify the null hypothesis and alternative hypothesis.
Determine the test statistic. Show all work; writing the correct test statistic, without supporting work, will receive no credit.
Determine the P-value for this test. Show all work; writing the correct P-value, without supporting work, will receive no credit.
Is there sufficient evidence to support John’s claim that the population standard deviation of the fluid dispense amount by the machine is greater than 15 ml? Explain.
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 8 of 9
18. The UMUC Daily News reported that the color distribution for plain M&M’s was: 35% brown, 30% yellow, 15% orange, 10% green, and 10% tan. Each piece of candy in a random sample of 200 plain M&M’s was classified according to color, and the results are listed below. Use a 0.05 significance level to test the claim that the published color distribution is correct. Show all work and justify your answer.
- (a) Identify the null hypothesis and the alternative hypothesis.
- (b) Determine the test statistic. Show all work; writing the correct test statistic, without supporting
work, will receive no credit.
- (c) Determine the P-value. Show all work; writing the correct P- value, without supporting work,
will receive no credit.
- (d) Is there sufficient evidence to support the claim that the published color distribution is correct? Justify your answer.
19. A STAT 200 instructor believes that the average quiz score is a good predictor of final exam score. A random sample of 10 students produced the following data where x is the average quiz score and y is the final exam score.
- (a) Find an equation of the least squares regression line. Show all work; writing the correct equation, without supporting work, will receive no credit.
- (b) Based on the equation from part (a), what is the predicted final exam score if the average quiz score is 80? Show all work and justify your answer.
Color
Brown
Yellow
Orange
Green
Tan
Number
80
45
30
30
15
x
72
96
55
65
100
50
85
70
74
85
y
72
98
50
70
96
60
83
75
77
87
STAT 200: Introduction to Statistics Final Examination, Fall 2017 OL4 Page 9 of 9
20. A study of 10 different weight loss programs involved 200 subjects. Each of the 10 programs had 20 subjects in it. The subjects were followed for 12 months. Weight change for each subject was recorded. We want to test the claim that the mean weight loss is the same for the 10 programs.
(a) Complete the following ANOVA table with sum of squares, degrees of freedom, and mean square (Show all work):
Source of Variation
Sum of Squares
(SS)
Degrees of Freedom
(df)
Mean Square
(MS)
Factor (Between)
53.6
Error (Within)
Total
553.05
199
N/A
- (b) Determine the test statistic. Show all work; writing the correct test statistic, without supporting work, will receive no credit.
- (c) Determine the P-value. Show all work; writing the correct P-value, without supporting work, will receive no credit.
- (d) Is there sufficient evidence to support the claim that the mean weight loss is the same for the 10 programs at the significance level of 0.05? Explain.