Learning objectives
11.1 Apply regression analysis to estimate linear
relationships between two variables.
CHAPTER 11 11.2 Estimate the multiple linear regression model and
interpret the coefficients.
Basics of regression 11.3 Calculate and interpret the standard error of the
estimate.
analysis 11.4 Calculate and interpret the coefficient of
determination R2.
continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-1 Jaggia, Essentials of Business Statistics, 1e 11-2
Updated by: Azizur Rahman Updated by: Azizur Rahman
1 2
continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-3 Jaggia, Essentials of Business Statistics, 1e 11-4
Updated by: Azizur Rahman Updated by: Azizur Rahman
3 4
How are home loan (and other debt Simple linear regression model
repayments) and income related?
LO 11.1 Apply regression analysis to estimate linear
• Economist Madelyn Davis believes that income relationships between two variables.
differences between regions are the primary reason
for disparate mortgage and other debt repayments. • With regression analysis, we explicitly assume that
one variable, called the response variable, is
• She also wonders about the likely effect of influenced by other variables, called the
unemployment. explanatory variables.
• Madelyn would like to use regression analysis to: • Using regression analysis, we may predict the
– Make predictions for debt repayments for given values of response variable given values for our explanatory
income and the unemployment rate
variables.
– Find models that best fit the data, using goodness-of-fit
measures
– Determine the significance of income and unemployment at
the 5% level.
continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-5 Jaggia, Essentials of Business Statistics, 1e 11-6
Updated by: Azizur Rahman Updated by: Azizur Rahman
5 6
1
1/14/2019
7 8
(x − x )
2
distance between ŷ and y represents the residual e. i
b0 = y − b1 x
continued continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-9 Jaggia, Essentials of Business Statistics, 1e 11-10
Updated by: Azizur Rahman Updated by: Azizur Rahman
9 10
11 12
2
1/14/2019
– In the dialog box, for Input Y Range, select the data for
your response variable and, for Input X Range, select the
data for your explanatory variable(s).
continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-13 Jaggia, Essentials of Business Statistics, 1e 11-14
Updated by: Azizur Rahman Updated by: Azizur Rahman
13 14
• For example, we analysed how home loan where x1, x2, …, xk are the explanatory variables and
repayments are influenced by income, but ignored the b j values are the unknown parameters that we
the possible effect of unemployment. will estimate from the data.
• Multiple regression allows us to explore how several • As before, e is the random error term.
variables influence the response variable.
continued continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-15 Jaggia, Essentials of Business Statistics, 1e 11-16
Updated by: Azizur Rahman Updated by: Azizur Rahman
15 16
continued continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-17 Jaggia, Essentials of Business Statistics, 1e 11-18
Updated by: Azizur Rahman Updated by: Azizur Rahman
17 18
3
1/14/2019
continued continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-19 Jaggia, Essentials of Business Statistics, 1e 11-20
Updated by: Azizur Rahman Updated by: Azizur Rahman
19 20
– We simply insert those values into our estimated equation. • To compute the standard error of the estimate, we
ŷ = –202.28 + 1.08(2,000) + 68.45(5) = $2,300.42 first compute the error sum of squares.
n n
SSE = ei2 = ( yi − yˆ i )
2
i =1 i =1
continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-21 Jaggia, Essentials of Business Statistics, 1e 11-22
Updated by: Azizur Rahman Updated by: Azizur Rahman
21 22
• Dividing SSE by the appropriate degrees of freedom, • Example: Household home loan repayments
n – k – 1, yields the mean square error. – This table shows the standard error of the estimate for the
SSE simple linear regression for home loan repayments (Model
MSE = 1) and the multiple linear regression (Model 2).
n − k −1
• The square root of the MSE is the standard error of
the estimate se.
se = MSE =
SSE
=
ei 2
n − k −1 n − k −1
• In general, the less dispersion around the regression – Notice that according to the standard error, adding the
line, the smaller the se, which implies a better fit for unemployment rate as an explanatory variable did improve
the model. goodness-of-fit a little bit (i.e. 201.80 < 205.76).
continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-23 Jaggia, Essentials of Business Statistics, 1e 11-24
Updated by: Azizur Rahman Updated by: Azizur Rahman
23 24
4
1/14/2019
continued continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-25 Jaggia, Essentials of Business Statistics, 1e 11-26
Updated by: Azizur Rahman Updated by: Azizur Rahman
25 26
27 28
• The adjusted R2 penalises the R2 for adding • Example: Household home loan repayments
additional explanatory variables. – Compare the simple linear regression for home loan
repayments (Model 1) with the multiple linear regression
• As with our other goodness-of-fit measures, we (Model 2).
typically use computers to calculate the adjusted R2.
• The adjusted R2 is shown directly below the R2 in
the Excel regression output.
29 30
5
1/14/2019
31 32
33 34
continued continued
Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd Copyright © 2016 McGraw-Hill Education (Australia) Pty Ltd
Jaggia, Essentials of Business Statistics, 1e 11-35 Jaggia, Essentials of Business Statistics, 1e 11-36
Updated by: Azizur Rahman Updated by: Azizur Rahman
35 36
6
1/14/2019
37 38
39 40
41