Econ 103 Lecture 01 PDF

Lecture 1: Introduction and Review of Probability
and Statistics
Moshe Buchinsky
UCLA
Course Logistics
I Lectures (Econ 103): Tuesdays and Thursdays,

12:30AM-1:45PM, at Broad Art Center, 2160E.
I Lab lectures (Econ 103L): Fridays 11:00-11:50AM, Fowler
Museum, A103B.
I One has to register to BOTH Econ 103 AND Econ 103L.
You cannot take one without the other.
I Office hours: Fridays 1:00-2:00PM, Bunche Hall 8373.
The Teaching Assistants
I Changsu Ko - Head TA
I Changsu Ko (head TA)
I Yun Feng
I Dong Ook Eun
I Kun Hu
I Jeonghwan Kim
I Lu Liu
I Pavel Andreyanov
Textbooks and Other Material
I Principles of Econometrics. Hill, R. C., Griffiths, W. E. and G.

C. Lim, 4th Edition, 2011. Wiley and Sons. (Required!)
I Using Stata for Principles of Econometrics. Lee C. Adkins &
R. Carter Hill, 4th Edition. Wiley and Sons. (Strongly
Recommended)
I Other:
I Basic Econometrics. Gujarati, D. N., 5th Revised Edition,
2010, McGraw-Hill.
I Introductory Econometrics. Wooldridge, J., 6th Edition,2013,
Cengage Learning.
I Probability and Statistical Inference. Hogg, R. V. and E. A.
Tanis, 8th Edition, 2009, MacMillan.
I Lecture notes: Slides will be posted on the class website before
class. I will not bring copies!
I Course outline - see Syllabus
Course Requirements - Problem Sets
I Problem sets account for 20% of the final grade.

I Mix of multiple choice questions, analytic exercises, and
STATA work.
I Grading of problem sets:
I 5 points for completing homework.
I 5 points for correct answer to question chosen randomly by
TAs.
I Schedule: See tentative schedule in the syllabus.
I Due at end of class on due date. No exceptions!
I Must hand in hardcopy.
I Note! You are encouraged to work together, but everyone
must hand in their own hardcopy, and do the computer work
independently.
Course Requirements - Midterm (20% or 0%
I Midterm account for 20% of grade if above the final.

I In class on Thursday, November 9, 2017.
I Very much like the problem sets.
I Optional I will compute the final grade both with and
without the midterm and use the better grade.
I No makeup exam!
Questions, Problems, etc.
I Come to my office hours or your TAs office hours.

I Post questions to the class website discussion board.
I TAs and myself will be monitoring the discussion board daily
to answer such questions.
I Emails to me or TAs on such questions will not be answered.
Regrading
I You can email your TA to request a regrade.

I The entire exam or homework will be regraded.
Please Remember
Please check the online syllabus for any updates

Course Overview
I Econometrics...
I Uses statistical methods to analyze economic data.
I aims to answer quantitative questions.
I Main tool: regression analysis
I We want to determine the causal effect of one variable
(X ) on another variable (Y ).
I Econ 41: statistical analysis of one variable.
I Econ 103: analysis of the relationship between two (or more)
variables.
Examples of questions of interest:
I How does demand change with the price of the good?

I What is the effect of a new marketing campaign on sales?
I How does class size affect education outcomes (e.g. test
scores)?
I How much an additional year of schooling increases wages?
I What is the relationship between credit scores and loan default
rates?
I How much does output grow if the Fed cuts interest rates by
1%?
I Do more policemen reduce crime?
Examples of questions of interest:
I Theory:
I How to interpret results in articles, or the results your
computer gives.
I To understand the caveats of regression analysis.
I Why it is essential to do good empirical work.
I How to do empirical analysis yourself:
I You will work with real datasets.
I You will use statistical software, namely STATA.
Review
Review of Probability and Statistics

(Econ 41)
1. Definitions
I Random Variable (X): takes on different outcomes with certain

probabilities.
I Outcomes ( xi ): mutually exclusive potential results of a
random process.
I Probability of an outcome: proportion of times that the
outcome occurs in the long run.
I Types of Random Variables:
I Discrete Random Variable: takes on a finite number of values.
I Example: coin toss; the number of times a computer crashes.
I Continuous Random Variable: takes on any value in a real
interval, each specific value has zero probability.
I Example: height of an individual; time it takes to commute to
school.
1. Definitions
I The function f that summarizes the information relating the

possible outcomes of a random variable X and the
corresponding probabilities is called the PDF, which stands for:
I the Probability Distribution Function, for discrete variables; or

I the Probability Density Function for continuous functions.
I PDFs must satisfy:
I f (xi ) 0; and
P R
i f (xi ) = 1 (if discrete) or f (x)dx = 1 (if continuous)
I
1. Definitions
I Coin toss: x1 =heads, x2 =tails, f (x1 ) = f (x2 ) = 12 .

I Height of adult men in inches.
Figure: PDFs of Discrete (left) and Continuous (right) Random Variables

1. Definitions
I Cumulative Distribution Function (CDF): describes the

probability that the random variable is less than or equal to a
particular value.
Figure: CDFs of Discrete (left) and Continuous (right) Random Variables

1. Definitions
I Moments: summary statistics of the probability distribution:

I Expected Value or Expectation or Mean
P
i xi f (xi ) if X is discrete
X = E (X ) =
R
xf (x)dx if X is continuous
I Measure of central tendency

I Weighted average of the possible values of X with the
probability f (x) serving as weights
1. Definitions
I Moments (continued):
I Variance
X2 = Var (X ) = E (X X )2

P 2
i (xi X ) f (xi ) if X discrete
=
(x X )2 f (x)dx if X cont.
R
I Measure of dispersion. How the values of X are spread around

its mean
I Standard deviation
2. Definitions - Two Random Variables
I Joint Probability Density Function of X and Y : describes the

probability that the random variables simultaneously take on
certain values, x and y .
I It is the function
fX ,Y (x, y ) = Pr(X = x, Y = y ).
I Marginal Probability Distribution:

X X
fY (y ) = Pr(X = xi , Y = y ) = fX ,Y (x, y )
xi x
I Conditional Probability Distribution of Y given X : is the

probability that Y takes on the value y when X takes on the
value x.
fY |X (y | x) = Pr(Y = y | X = x)
Pr(X = x, Y = y )
=
Pr(X = x)
fX ,Y (x, y )
=
fX (x)
I Example: Men say they will vote for the Republican candidate
rather than the Democratic candidate in their districts by a margin
of 45 percent to 32 percent. The numbers are nearly reversed for
women, with 36 percent saying they will vote Republican and 43
percent saying they will vote Democratic. New York Times,
September 20, 2010
I Assume that there are 50% men and 50% women.

1 Democrat
1 male
X = Y = 2 Republican
2 female
3 other

I The joint PDF of X and Y is:
x1 = 1 x2 = 2
y1 = 1 .16 .215
y2 = 2 .225 .18
y3 = 3 .115 .105
I The marginal distributions of X and Y are in red:

P
x1 = 1 x2 = 2 xi
y1 = 1 .16 .215 .375
y2 = 2 .225 .18 .405
y3 = 3 .115 .105 .22
P
yj .5 .5 1
I The conditional distribution of being a Democrat, given being

female:
Pr(Democrat | Female)= Pr(Y = 1 | X = 2)

Pr(Y = 1, X = 2) .215
= = .43
Pr(X = 2) .5
I The conditional distribution of being a Republican, given being
a female:
Pr(Republican| Female)= Pr(Y = 2 | X = 2)

Pr(Y = 2, X = 2) .18
= = .36
Pr(X = 2) .5
I Covariance: is a measure of linear association between two

random variables, and is defined as
XY = Cov (X , Y ) = E ((X X )(Y Y ))
I Correlation coefficient: is a unit-free measure of linear

association between two random variables, and is defined as
Cov (X , Y )
XY =
X Y
I It can be shown that
1 1
I Independence:
fX ,Y (x, y ) = fX (x)fY (y )
I If two random variables are independent, then the conditional
distribution of each variable coincides with its marginal
distribution, that is
fY |X (y | x) = fY (y ), and
fX |Y (x | y ) = fX (x).
Properties of Expectations, Variance and Covariance
I E (a) = a
I E (aX + b) = aE (X ) + b
I E (X + Y ) = E (X ) + E (Y )
I Var (a) = 0
I Var (aX + b) = a2 Var (X )
I Cov (X , X ) = Var (X )Cov (X , Y ) = E (XY ) E (X )E (Y )
I Var (aX + bY ) = a2 Var (X ) + b 2 Var (Y ) + 2abCov (X , Y )
Properties of Expectations, Variance and Covariance
I If X and Y are independent, then

I E (XY ) = E (X )E (Y )
I Var (X + Y ) = Var (X ) + Var (Y )
I Cov (X , Y ) = 0, but the reverse is not true! Zero covariance
does not imply independence.
Special Probability Distributions - Normal
I Normal distribution: N(, 2 )

I Standard normal distribution: N(0, 1)
I A variable that follows a standard normal distribution is often
denoted by Z .
I Its CDF is often denoted by , that is, Pr(Z c) = (c) for
any c.
X
I Note that if X N(, 2 ), then Z = N(0, 1).
I Note also that if X and Y are normally distributed, so is
X + Y (and any other linear combination).
Special Probability Distributions - Normal
Figure: Standard Normal Density

Special Probability Distributions - Chi-Squared
I Chi-Squared distribution: assume that Zi N(0, 1), i. Then

m
X
W = Zi2 2m ,
i=1
where m is the degrees of freedom.

I 2 distribution with m degrees of freedom.
Special Probability Distributions - Chi-Squared
Figure: Chi-Squared Density

Special Probability Distributions - Student-t
I Student t distribution: assume that Z N(0, 1),

W 2m ,and Z and W are independent. Then
Z
p tm
W /m
I t distribution with m degrees of freedom

I Note: t = N(0, 1)
Special Probability Distributions - Student-t
Figure: Student-t Density

Special Probability Distributions
I Standard Normal vs. Student t distributions
Figure: Standard Normal Density

Special Probability Distributions - F distribution
I Assume that W 2m , V 2n , and W and V are

independent.
I Then
W /m
Fm,n
V /n
I F distribution with m and n degrees of freedom
I Note: mFm, = 2m and F1,n = tn2
Special Probability Distributions - F distribution
Figure: F -Distribution Density

Computing Probabilities
I Assume that adult male heights are normally distributed with

mean 70 inches (178 cm) and standard deviation 4 inches (7.6
cm).
I If you are 65 inches tall (165 cm), what percentage of men are
shorter than you?
I Let X denote the height of men. X N(70, 16).
I Let us standardize X :
X 70
Z= N(0, 1)
4
I We have to do the same manipulation for 65.
65 70
= 1.25
4
I What is the probability that Z 1.25? This is given by its

CDF!
(1.25) = 10.56%
I You can compute this value in Stata using command di
normal(-1.25).
I Alternatively, see Appendix Table 1 in textbook.
Figure: 1.25 in a Standard Normal Distribution

I Computing (1.25) from Standard Normal table:
1 0.8944 = 0.1056!

Econ 103 Lecture 01 PDF

Diunggah oleh

Informasi Dokumen

Judul Asli

Hak Cipta

Format Tersedia

Bagikan dokumen Ini

Bagikan atau Tanam Dokumen

Opsi Berbagi

Apakah menurut Anda dokumen ini bermanfaat?

Apakah konten ini tidak pantas?

Hak Cipta:

Format Tersedia

Econ 103 Lecture 01 PDF

Diunggah oleh

Hak Cipta:

Format Tersedia

Lecture 1: Introduction and Review of Probability

I Lectures (Econ 103): Tuesdays and Thursdays,

I Principles of Econometrics. Hill, R. C., Griffiths, W. E. and G.

I Problem sets account for 20% of the final grade.

I Midterm account for 20% of grade if above the final.

I Come to my office hours or your TAs office hours.

I You can email your TA to request a regrade.

Please check the online syllabus for any updates

I How does demand change with the price of the good?

Review of Probability and Statistics

I Random Variable (X): takes on different outcomes with certain

I The function f that summarizes the information relating the

I the Probability Distribution Function, for discrete variables; or

I Coin toss: x1 =heads, x2 =tails, f (x1 ) = f (x2 ) = 12 .

Figure: PDFs of Discrete (left) and Continuous (right) Random Variables

I Cumulative Distribution Function (CDF): describes the

Figure: CDFs of Discrete (left) and Continuous (right) Random Variables

I Moments: summary statistics of the probability distribution:

I Measure of central tendency

I Measure of dispersion. How the values of X are spread around

I Joint Probability Density Function of X and Y : describes the

I Marginal Probability Distribution:

I Conditional Probability Distribution of Y given X : is the

I The joint PDF of X and Y is:

I The marginal distributions of X and Y are in red:

I The conditional distribution of being a Democrat, given being

Pr(Democrat | Female)= Pr(Y = 1 | X = 2)

Pr(Republican| Female)= Pr(Y = 2 | X = 2)

I Covariance: is a measure of linear association between two

XY = Cov (X , Y ) = E ((X X )(Y Y ))

I Correlation coefficient: is a unit-free measure of linear

I If X and Y are independent, then

I Normal distribution: N(, 2 )

Figure: Standard Normal Density

I Chi-Squared distribution: assume that Zi N(0, 1), i. Then

where m is the degrees of freedom.

Figure: Chi-Squared Density

I Student t distribution: assume that Z N(0, 1),

I t distribution with m degrees of freedom

Figure: Student-t Density

Figure: Standard Normal Density

I Assume that W 2m , V 2n , and W and V are

Figure: F -Distribution Density

I Assume that adult male heights are normally distributed with

I What is the probability that Z 1.25? This is given by its

Figure: 1.25 in a Standard Normal Distribution

I Computing (1.25) from Standard Normal table:

Anda mungkin juga menyukai