Inferential statistics
Statistical inference is concerned with making inferences from one or more samples about the population from which they were drawn. One approach is to estimate the characteristics or parameters of a population from the corresponding characteristics or parameters of the sample(s). It is usually too ambitious to estimate every possible parameter of a population and the simplest parameter that we estimate is the population mean from the mean of a sample. Not only can we use such estimates for predictive purposes but we also use them for monitoring and controlling processes.
Sampling distributions
If we take many random samples then we could calculate the means of all the samples and construct a distribution of the sample means to show how these means are distributed. Such a distribution is called a sampling distribution of the means
Frequency 18 16 14 12 10 8 6 4 2 0
Mean
Modelling the sampling distribution of the means using the Standard Normal distribution
x Zx n
Confidence intervals
So far we can only estimate the population mean if we have taken a large number of random samples and this is potentially very time consuming and expensive. Rather than trying to find a single estimate for the population mean we can estimate an interval in which it lies by taking advantage of the Central Limit Theorem.
95% confidence
s n
< < x + Z*
s n
x - t*
s n
< < x + t*
s n
Hypothesis testing
Suppose that we have some preconceived idea about a population, for example that average life expectancy has increased over the last decade or that average household disposable income has decreased over the last five years. Such preconceptions are called hypotheses. We can test hypotheses by taking random samples to see if there is any evidence to support or refute them. In practice statisticians are very cautious people and do not say that they accept a hypothesis but instead say that a hypothesis cannot be rejected. In general the hypothesis that we set out to test is called the Null Hypothesis, which we label H0. If the Null Hypothesis is rejected then implicitly we cannot reject an alternative to the Null Hypothesis, which we label H1.
Chi-squared distribution
The formula for the Chi-squared distribution is far too complex for most of us to use and we often find tables of frequently used Chisquared probabilities. Like the students t distribution the shape of the Chi-squared distribution is determined by the degrees of freedom. We can see that it becomes similar to the Normal distribution as the number of degrees of freedom increases.
Calculating chi-squared