0 penilaian0% menganggap dokumen ini bermanfaat (0 suara)
164 tayangan4 halaman
A Ph. D. Candidate is collecting data about women in math careers. Office workers were asked how long it took them to travel to work in minutes. A survey was given to all Texas high schools to determine if students favored starting school closer to labor day.
A Ph. D. Candidate is collecting data about women in math careers. Office workers were asked how long it took them to travel to work in minutes. A survey was given to all Texas high schools to determine if students favored starting school closer to labor day.
A Ph. D. Candidate is collecting data about women in math careers. Office workers were asked how long it took them to travel to work in minutes. A survey was given to all Texas high schools to determine if students favored starting school closer to labor day.
AP Statistics Data Analysis Graphical Displays Review
1. A university instructor created a website for her Organic chemistry course. The students in her class were encouraged to use the website as an additional resource for the course. At the end of the semester, the instructor asked each student how many times had he or she visited the website and recorded the counts. Based on the histogram, describe the distribution of the website use.
The distribution of the of the number of visits to the course website by each student for the semester is skewed to the left, with the number of visits ranging from 1 to 15 views. The distribution is centered at about 14 visits, with many students visiting 15 times. There is an outlier in the distribution, two students who visited the site once. The next highest number of visits was 8.
2. A Ph. D. candidate is collecting data about women in math careers. She interviewed 200 female mathematicians and recorded the following data: number of years attending university, math classes taken in high school (algebra, geometry, etc.), gender of high school math teacher, and high school GPA. Tell which variables are qualitative and which variables are quantitative. For the numerical variable state whether they are continuous or discrete.
Quantitative: Number of years attending (discrete), GPA (continuous) Qualitative: Classes taken, Gender of high school teacher.
3. Office workers were asked how long it took them to travel to work one morning in minutes. Provided is a table of their responses. Sketch a stem-and-leaf plot for the data. Without actually calculating the mean or median, would you expect the mean to be greater than or less than median. Justify your answer. Commute Times in Minutes 20 20 20 22 23 24 24 25 27 28 30 32 35 37 41 42 47 48 49 50 52 58 60 65 73
Higher, because the data are skewed to the right. Mean > Median
4. A survey was given to all Texas high schools to determine if students favored starting school closer to Labor Day. The amount of data received was enormous, so the researcher decided to focus on just 5A high schools, like DPHS. Now that the researcher has the data he/she will generalize the results to Texas as a state. Discuss the population of interest, sample, and the branch of statistics described in the last sentence? (in context)
Population of interest: All high schools in the state of Texas. Sample: 5A high schools that returned the survey. Collecting and organizing the data is part of descriptive statistics. One the research determines his/her results and generalizes back to all high schools in Texas this will be inferential statistics. 5. The Kentucky Derby has been run annually since 1900 at Churchill Downs, Louisville, Kentucky. The distance is 1 miles. Since 1900, all winning times have been over 2 minutes, except for the record time of 1 minute and 59.2 seconds run by Secretariat in 1973. The following graph shows seconds over 2 minutes for all winning times.There are 98 data values represented. What percentage of winning times is between 2 minutes 3.15 seconds and 2 minutes 7.15 seconds? 37%
6. The data below gives the cost per ounce (in cents) for 30 shampoos intended for normal hair and 30 shampoos intended for fine hair. Normal 79 63 19 9 37 49 20 16 55 69 23 14 9 7 21 44 13 16 23 20 64 28 18 32 81 5 47 50 8 9 Fine 69 9 23 22 8 12 32 12 18 74 19 63 49 37 55 75 44 8 17 11 23 50 65 51 35 14 20 28 8 27
Both the normal and fine shampoos cost have a right tail distribution which skews the cost to the higher values. This means you have more smaller costs for the shampoos. They both have clusters in the $5-$28 range. The fine shampoo has a range of $67 whereas the normal shampoo has a range of $77. Key 0|5 = $5.00
7. Below is the fastest speeds driven by statistic students as reported on the student surveys. Construct a histogram for the data. Then CUSS about the data.
165 110 105 90 85 110 120 192 130
105 120 70 70 90 70 109 60 130
125 130 90 95 130 80 110 120 90
90 100 90
The distribution is skewed to the right. Meaning more times around the lower values (70-91) were reported. The mean of the distribution is greater than the median. There is a gap in the range of 133-154. The range of the drivers speed is 121 with a maximum reported speed of 192 and a minimum reported speed of 70
9 9 9 8 7 5 0 8 8 8 9 9 8 6 6 4 3 1 1 2 2 4 7 8 9 8 3 3 1 0 0 2 0 2 3 3 7 8 7 2 3 2 5 7 9 7 4 4 4 9 5 0 5 0 1 5 9 4 3 6 3 5 9 9 7 4 5 1 8 Class Frequency R.F 70 x < 91 11 .367 91 x < 112 8 .267 112 x < 133 8 .267 133 x < 154 0 .000 154 x < 175 2 .067 175 x < 196 1 .033 Seconds C u m u l a t i v e
Free Response 9. The graph below displays the scores of 32 students on a recent exam. Scores on this exam ranged from 64 to - 95 points.
a) Describe the shape of this distribution in context of the problem.
The distribution is skewed to the left (or toward the lower scores). This could result from being a harder test.
b) In order to motivate her students, the instructor of the class wants to report that, overall, the classs performance on the exam was high. Which summary statistics, the mean or the median, should the instructor use to report that overall exam performance was high? Explain.
Since the distribution is skewed towards the lower values, the mean will be pulled in that direction. Thus, the instructor should report the median to motivate her students.
c)The midrange is defined as max min 2 imum imum . Compute this value using the data.
64 95 79.5 2 midrange
d) Is the midrange considered a measure of center or a measure of spread? Explain.
The midrange is a measure of center. The maximum provides information about the upper tail, more specifically the upper extreme value. The minimum provides information about the lower tail, more specifically the lower extreme value. By averaging these two values and creating the midrange, we are creating a statistic that provides the halfway point between the two extremes.
10. Most women who have had a mastectomy (removal of breast tissue for medical reasons) can have breast reconstruction surgery. The reconstruction surgery can be performed at the same time as the mastectomy, known as an immediate reconstruction, or after the patient has healed from the mastectomy, commonly referred to as a second surgery reconstruction. The table below shows the percentages of choices regarding reconstruction for three age categories. A graphical display has been added to help visualize the distribution.
Age Under 35 35-50 Over 50 Immediate reconstruction 63% 48% 23% Second surgery reconstruction 31% 34% 41% No reconstruction 6% 18% 36% Total 100% 100% 100%
a) Use the data to sketch a graphical display for the data.
b) From your graphical display and the data, does there appear to be an association between reconstruction and age? Justify your response.
Yes. A higher percentage of older women, especially over 50, who have had mastectomies choose not to have reconstruction surgery. Likewise, a higher percentage of younger patients choose to have immediate reconstruction surgery. It appears that as the age of women have mastectomies increases, the importance of having reconstructive surgeries decreases.