C. (6pts) No Correlation (variables that some might think are related but turn out not to
be):
Pair 1:
Pair 2:
Pair 3:
Pair 1:
Pair 2:
Pair 3:
2. (10pts) Using the fake data provided below, calculate the correlation between peoples
Depression Scores (higher numbers = more severe depression) and the number of hours per week
they spend engaging in social interactions with others. Please do all calculations by hand, and
show all your work. You can round all of your statistics (mean, SD, z-scores, r) to 2 decimal
places.
Participant
1
2
3
4
5
6
Depression Score
45
51
31
60
40
54
7
8
36
35
22
19
*Hint* you need to start by calculating the mean and standard deviation for each variable.
3. (5pts) We all know that correlation is not evidence for causation. Explain the three possible
causal relationships that could exist be between Depression and Hours Spent with Others.
4. (10pts) Using the same data from question 2 above, use linear regression to determine what
depression score would we predict someone to have if we knew they spent 17 hours per week
interacting with others. Show your work.
5. (6pts) In a few sentences, please verbally explain what b0 (the intercept) and b1 (the slope)
mean in this example. In other words, how would you explain what their value means or
represents, if you had to explain it to a friend?
6. Use the Minitab data set BrainSize_IQ to answer the questions below. The data set contains
real data from a study conducted by Willerman, Schultz, Rutledge, and Bigler (1991) called "In
Vivo Brain Size and Intelligence. The study attempted to determine if there is any relationship
between ones physical features, including the size of their brain, and their IQ scores. They
collected data from 40 psychology students on 4 variables: 1) participant IQ score using the
WAIS test, 2) participant body weight in pounds, 3) participant height in inches, and 4)
participant brain size based on the number of pixels used in an fMRI image of the brain. They
also looked at Family Income.
A. (2pts) Copy and paste a scatterplot with the best fitting line to graph the relationship
between IQ and Brain Size and paste it below. Please put Brain Size on the x-axis and IQ
on the y-axis.
B. (4pts) What is the value of r for the relationship between IQ and Brain Size? Is this
positive or negative? Is this a strong, moderate, or weak relationship? In a sentence or
two, explain what this r value tells us about the relationship between peoples IQ and
their Brain Size.
C. (4pts) What is the value of r for the relationship between participants Heights and
participants Weights? Is this positive or negative? Is this a strong, moderate, or weak
relationship? In a sentence or two, explain what this r value tells us about the
relationship between peoples Height and their Weight.
D. (4pts) What is the correlation between participants Family Income and participants IQ?
Is this positive or negative? Strong, moderate or weak? In a sentence or two, explain what
this r value tells us about the relationship between Family Income and IQ. Also, please
copy and paste the scatterplot with the best fit line for Income and IQ.
F. (4pts) For the prediction you calculated in question E, what is the standard error of the
estimate? In a sentence or two, explain what this statistic tells us.
G. (4pts) What is the value of r2 for the relationship between IQ and Brain Size? In a
sentence or two, explain what this statistic tells us.
H. (4pts) What is the value of r2 for the relationship between Height and Weight? In a
sentence or two, explain what this statistic tells us.
9. (2pts) If we use regression to predict the value of Y given a certain known X, then this
statistic tells us, on average, how far off our prediction will be.
a. Standard Deviation
b. Standard error of the estimate
c. R-squared (r2)
d. Correlation coefficient
Answer: _____
10
12
10
6
5
4
6
0
40
50
60
70
80
DOPA MINE
90
100
V AR00004
SCHIZO
SCHIZO
3
4
0
50
60
70
80
100
0
2
10
VAR00005
DOPAMINE
90
10. (2pts) Which of the scatterplots shows the weakest relationship between two variables?
11. (2pts) If we call our two variables X and Y, in which of the scatterplots will a high z-score
on X correspond to a low z-score on Y?