Alejandro Bernales
Universidad de Chile
Universidad de Chile
Session 0. Econometric Review Alejandro Bernales
Goals
The term is called the error (and also called as residual or disturbance term)
In essence, it makes the linear relationship between y (dependent variable) and the
xs (independent variable) a random association:
Deviations from y are accepted from the values predicted by the xs
y [0 + 1x1+ 2x2 + + kxk] = y yfit =
represents several types of random influences
Omitted variables
Effects connecting the y to the xs that cannot be capture by a linear
relationship
Measurement errors of the variables involved
Irrational behaviors
Basic point: when is a pure random shock, it will have special, interesting
properties (white noise)
White noise: we will learn its meaning later in this session
Universidad de Chile 4/29
Session 0. Econometric Review Alejandro Bernales
y 0 1 x1 2 x2 ... k xk e
Idea: we can minimize the sum of the squared differences between yn and the fitted
values for n with the objective to find the coefficients 0 and 1
N
min RSS
0 , 1
min
( y n
0 , 1
y n
n 1
) 2
N N
min (
0 , 1
y n
n 1
0
x
1 1
2
) min (
0 , 1
en ) 2
n 1
N 1
2. The adjusted R2 (adjustment for number of regressors employed): R 2 1 (1 R 2 )
N k
where k is the number of regressors
Is it statistically reliable?
This task is very simple
Step 1 - Set up a null hypothesis such as
H0 : 1 = 0
and an alternative hypothesis such as
HA : 1 0 (two-tailed test)
Step 2 - Calculate a t statistic: 1 1
t
1
var( 1 )
is a sample estimate of the variance of the coefficient (i.e. this will be given by the
majority of econometric packages)
Step 3 - Compare the test statistic t1 to the critical value from the statistical table for the t
distribution with (N-k) degrees of freedom and with significant level:
t 1 t / 2, N k
where N is the sample size and with a 100(1- /2) percent critical value
For our example:
H0 : 1 = 0 t1 = 3,129 and t5%/2,(228-2)=1.96
At a standard significance level (size) of 5% (i.e. 100(1- /2) =97.5% critical value), the
10
slope is statistically significant
Universidad de Chile 10/29
Session 0. Econometric Review Alejandro Bernales
What if a lot can be learned about the behavior of yt using the past of the variable?
12
Et[]E [|It] is the conditional expectation at time t, where It is the information set
2y is the variance of y
k is the autocorrelation function (ACF ), which is also know as correlogram, measuring the
correlation between yt and yt-k. This is given by
13
14
15
16
17
18
19
where ut is white noise, 0 is a constant (intercept) term and the i are parameters/ coefficients
A shock to yt today tends to be reversed in time through the terms iyt-i
ACF
21
22
23
As one would expect of such a process, both the acf and the pacf decline geometrically the acf as
a result of the AR part and the pacf as a result of the MA part.
24
25
where zt are iid innovations with zero mean and unit variance.
I will not enter in detail in this type of time series, BUT remember that we will use some
GARCH models later on in the course when we will study risk management.
All these models can be easily calculated with any basic econometric software.
27
y: is human development index (HDI) per country in 2013. You can take the data from:
http://hdr.undp.org/en/content/table-1-human-development-index-and-its-components
x1: Gross domestic product based on purchasing-power-parity (PPP) per capita GDP for 2013.
You can take the data from IMF:
http://www.imf.org/external/pubs/ft/weo/2013/02/weodata/download.aspx
x2: Index of economic freedom in 2013, which you can take from:
http://www.heritage.org/index/download
-Repeat the same exercise using a statistical or econometric softwere (or matlab):
-Show that you obtain the same values for the betas, calculate R^2 and R^2 (adjusted), and
calculate t statistics