Anda di halaman 1dari 3

# .

igma +athNet

1age 1 of 2

## Why we use (n -1) in the denominator of standard deviation formula ?

In short, if x1, x2 ,, xN be independent and identically distributed random variables. The standard deviation of the entire population is given by the formula :

However, if x1, x2 ,, xn

## n ! N " is not a set variables of the entire

#o#ulation, but only a sample of the #o#ulation, then we use the standard deviation formula :

## Why we do like this ?

Not so te\$hni\$al, we use n % 1" be\$ause we &ust li'e to ma'e the (s#read( or deviation" a little larger to refle\$t the fa\$t that, sin\$e we are using a sam#le, not the entire #o#ulation, we have more un\$ertainty.

Degree of freedom
Degrees of freedom is a measure of how mu\$h #re\$ision an estimate of variation has. ) general rule is that the degrees of freedom decrease when we have to estimate more parameters. Before you can compute the standard deviation, we have to first estimate a mean. This \$auses you to lose one degree of freedom and you should divide by n % 1" rather than n. In more \$om#lex situations, li'e )nalysis of *arian\$e and +ulti#le ,inear -egression, we usually have to estimate more than one #arameter. +easures of variation from these #ro\$edures have even smaller degrees of freedom.

n!iased estimator
) more formal way to \$larify the situation is to say that s or the sam#le standard deviation" is an unbiased estimator of , the #o#ulation standard deviation if the denominator of s is n % 1". .u##ose we are trying to estimate the #arameter using an estimator / that is, some fun\$tion of the observed data". Then the bias of / is defined to be 0 /" % , in short, (the ex#e\$ted value of the estimator minus the true value /.(

htt#:33www.6\$.edu.h'3math3)dvan\$ed724,evel3.tandard8deviation.htm

2134232415

Si"ma 5ath6et

4a"e # of \$

If E()

## Variance formula using n instead of (n 1) is biased

Suppose we use the variance formula (s uare of standard deviation)

## Since E(s#) (#).

#,

s# is a biased estimator of

## if we use (\$) instead of

Prerequisite
!e would li%e to clarif& some points before we li%e to prove (') . (irstl&, if ) and * are independent and identicall& distributed, we have+ ,ar (%)) = %# ,ar ()) ,ar () . *) = ,ar ()) . ,ar (*) 0he proofs of (-) and (/) are left to the reader. Secondl&, if is the sample mean and is the population mean, then .... (-) .... (/)

since 12 , 1# , 3, 1n are independent and identicall& distributed and havin" the same variance
#.

Proof of (2)

http+77www. c.edu.h%7math78dvanced9#0:evel7Standard;deviation.htm

#270#7#02'

Sigma MathNet

Page 3 of 3

21 !2 2!1"