Statistical Methods
Statistical Methods
Correlation
Statistical Methods
Cov( X , Y )
X Y
Usha A. Kumar, IIT Bombay
Illustrations of Correlation
Y
= -1
= -.8
X
Y
=0
Statistical Methods
=0
=1
X
Y
= .8
( x x )( y y )
=
(x x ) ( y y )
i
Statistical Methods
S xy
S xx S yy
Statistical Methods
Example
Income
(X)
(in `00)
Conditional means
of Y
E[Y| X]
80
100
120
140
160
180
200
220
240
260
55,60,65,70,75
65,70,74,80,85,88
79,84,90,94,98
80,93,95,103,108,113,115
102,107,110,116,118,125
110,115,120,130,135,140
120,136,140,144,145
135,137,140,152,157,160,162
137,145,155,165,175,189
150,152,175,178,180,185,191
65
77
89
101
113
125
137
149
161
173
Statistical Methods
Yi = 0 + 1 xi + i
where
Yi is the value of the dependent variable in the ith trial
y= b0 + b1 x
where
b0 is the estimate of 0
b1 is the estimate of 1
Statistical Methods
10
Errors in Regression
Y
the observed data point
yi
Error
yi
ei = yi yi
X
Xi
Statistical Methods
11
i =1
i =1
2
2
e
=
y
y
(
)
SSE = i i
i
Statistical Methods
12
( x x )( y y )
=
(x x )
i
S xy
S xx
b0= y b1 x
Statistical Methods
13
Example
Statistical Methods
Advertising
expenditure (in
ten thousands)
Sales
(in lakhs)
18
55
17
14
36
31
85
21
62
18
11
33
16
41
26
63
29
87
Usha A. Kumar, IIT Bombay
14
Statistical Methods
15
Statistical Methods
16
Residual Analysis
Residuals
Residuals
x or y
x or y
Time
x or y
17
y
(
)
i =
2
2
y
+
y
y
(
)
(
)
i
i i
Coefficient of Determination
r2 = SSR/SST
The proportion of observed y variation that can be
explained by the simple linear regression model.
Statistical Methods
18
Inference in Regression
Analysis
Source of
Variation
Sum of
Squares
Degrees of
Freedom Mean Square F Ratio
Regression
SSR
(1)
MSR
Error
SSE
(n-2)
MSE
Total
SST
(n-1)
MST
Statistical Methods
MSR
MSE
19
Inference in Regression
Analysis
Assumption
The model errors i are normally
2
distributed with mean 0 and variance .
yi N ( 0 + 1 xi , 2 ).
Multivariate Analysis
yi s.
20
=
E (b1 ) 1
0
1
S .E =
(b0 ) s
+
n
x2
n
2
x
x
(
)
i
i =1
S .E (b1 ) =
s
n
2
x
x
(
)
i
i =1
Multivariate Analysis
21
Hypothesis testing
H 0 : 1 = 10
H1 : 1 10
The test statistic is
b1 10
tn 2
S .E (b1 )
Multivariate Analysis
22
Confidence interval
Multivariate Analysis
23
Residual Analysis
Multivariate Analysis
24
Example
Statistical Methods
Advertising
expenditure (in
ten thousands)
Sales
(in lakhs)
18
55
17
14
36
31
85
21
62
18
11
33
16
41
26
63
29
87
25
Statistical Methods
26