Lanjutan……
Least Square Principle
• Determining a regression equation by minimizing the
sum of the squares of the squares of the vertical
distances between the actual Y values and the
predicted values of Y
sy
• General form: Y’ =a + bX Slope : b r
sx
Intercept : a Y b X
• r : correlation coefficient
Y : mean of Y
• Sy : standard deviation of Y
• Sx : standard deviation of X X : mean of X
Konsep Residu
x y residu
Buku y^ (y-y^)^2
Page Price y-y^
Sejarah 500 84 73.71 10.29 105.80
Matematika 700 75 84.00 (9.00) 81.00
Psikologi 800 99 89.14 9.86 97.16
Sosiologi 600 72 78.86 (6.86) 47.02
Manajemen 400 69 68.57 0.43 0.18
Biologi 500 81 73.71 7.29 53.08
Musik 600 63 78.86 (15.86) 251.45
Keperawatan 800 93 89.14 3.86 14.88
650.57
a 48
Nilai Kuadrat Terkecil
b 0.05
Kesalahan Standart Estimasi (Standard Error)
0
Y Yˆ
2
s y. x t hitung
n2 se
x y residu
Buku y^ (y-y^)^2
Page Price y-y^
Sejarah 500 84 73.71 10.29 105.80
Matematika 700 75 84.00 (9.00) 81.00
Psikologi 800 99 89.14 9.86 97.16
Sosiologi 600 72 78.86 (6.86) 47.02
Manajemen 400 69 68.57 0.43 0.18
Biologi 500 81 73.71 7.29 53.08
Musik 600 63 78.86 (15.86) 251.45
Keperawatan 800 93 89.14 3.86 14.88
(0.00) 650.57
a 48
b 0.05 Deviasi positif diimbangi 10.4129
dengan deviasi negatif
Asumsi Pokok Regresi Linear
• Memiliki distribusi normal
• Dalam garis regresi terdapat rata-rata
• Memiliki standar kesalahan estimasi yang
sama (sy.x); dan
• Distribusi yang terikat dengan yang lain
• If the values follow a normal distribution:
Y ' s y . x include the middle 68% of observation
Y ' 2s y . x include the middle 95% of observation
Y ' 3s y . x include virtually all the observations
Confidence Interval & Estimation Interval
suatu nilai X n X X 2
1 25 22
2
CI 48.5526 2.306(9.901)
10 760
48.5526 7.6356
Coefficient of Determination
Total var iation Un exp lained var iation
r
2
Y Y Y Y '
2 2
Y Y
2
• E.g. of Y = a + b X
R2 = 0.8, we say that 80% of the variation in
weekly production, Y, is determined by its
linear relationship with X
The Relationships among the coefficient of
correlation, coefficient of determination and
the standard error of estimate
Re gression
SSR Y 'Y
2
Y ' a b1 X 1 b2 X 2 ..... bn X n
Multiple Standard Error of
Estimate
• Multiple standard error of estimate
Y Y ' 2
s y .12...k
n k 1
Assumption about Multiple Regression
and Correlation
1. The independent variables and the dependent variable have a
linear relationship
2. The dependent variable is continous and at least interval scale
3. The variation in the difference between the actual and the
predicted values is the same for all fitted values of Y, so called
homoscedasticity (Y-Y’) must be aproximately same for all
values of Y’
4. The residuals (Y-Y’) are normally distributed with a mean of 0
5. Successive observations of the dependent variable are
uncorrelated, so called autocorrelation
The Relationships among the coefficient of
correlation, coefficient of determination and
the standard error of estimate
Re gression
SSR Y 'Y
2
Source df SS MS F
MSR/
Regression k SSR SSR/k MSE
Error n - (k + 1) SSE SSE/[n - (k+1)]
Total n-1 SS total
Cont’
• Coefficient of Multiple Determination
SSR
R
2
SS total
Df = (3, 16)
It means that some of the
independent variables do have ability
Reject to explain the variation of dependent
H0 variables
3.24
Evaluating Individual Regression
Coefficients
For temperature: For insulation: For furnace age:
H 0 : 1 0 H0 : 2 0 H 0 : 3 0
H 1 : 1 0 H1 : 2 0 H1 : 3 0
0.05
n 20
df n (k 1) 20 (3 1) 16
t table 2.120