STATISTICS
IPTHO
ABDUL SUKUR KAMSIR
OUTLINE
Introduction and Definitions
Sampling
External and Internal Validity
Sources of error
Normal distribution
Standard error
Binomial probabilities
Poisson distribution
Statistical tests
Confidence intervals
What is statistics?
A Science where inferences are made
about specific random phenomena on the
basis of a relatively limited sample
material
The word “statistics” can also mean the
analytical tools used in this science i.e. the
calculated figures based on the data
collected.
Two Main Areas
Mathematical statistics – concerns with
the development of new methods of
statistical inference and requires a strong
mathematics knowledge
Applied statistics – applying the methods
of mathematical statistics to specific
subject areas; BIOSTATISTICS is when it
is applied to biological or medical
problems
Definitions
Statistics - Collection of methods for
planning experiments, obtaining data, and
then organizing, summarizing, presenting,
analyzing, interpreting, and drawing
conclusions.
Statistic - Characteristic or measure
obtained from a sample e.g mean,
variance, Chi-square statistic, t-test
statistic etc
Definitions
Inferential Statistics - Generalizing from
samples to populations using probabilities.
Performing hypothesis testing,
determining relationships between
variables, and making predictions.
Descriptive statistics refers to the process
of organizing and summarising collected
information (data) to study the properties
of a variable
Definitions
Population - All subjects possessing a
common characteristic that is being
studied.
Sample - A subgroup or subset of the
population.
Parameter - Characteristic or measure
obtained from a population.
Statistic - Characteristic or measure
obtained from a sample.
Definitions
Variable - Characteristic or attribute that
can assume different values, it is the
fundamental element of statistical
analysis; it is something
measured/counted or identified
Variables
Qualitative - Variables which assume non-
numerical values.
Quantitative - Variables which assume numerical
values.
Discrete - Variables which assume a finite or
countable number of possible values. Usually
obtained by counting.
Continuous - Variables which assume an infinite
number of possible values. Usually obtained by
measurement.
Variables (qualitative/categorical)
Nominal Level - Level of measurement
which classifies data into mutually
exclusive, all inclusive categories in which
no order or ranking can be imposed on the
data.
Ordinal Level - Level of measurement
which classifies data into categories that
can be ranked. Differences between the
ranks do not exist.
Variables (quantitative/numerical)
Interval Level - Level of measurement which
classifies data that can be ranked and
differences are meaningful. However, there is no
meaningful zero, so ratios are meaningless.
(temperature in celcius, fahrenheit etc)
Ratio Level - Level of measurement which
classifies data that can be ranked, differences
are meaningful, and there is a true zero. True
ratios exist between the different units of
measure. (temperature in Kelvin)
SAMPLING
Random - Sampling in which the data is collected using chance
methods or random numbers.
Systematic - Sampling in which data is obtained by selecting every
kth object.
Stratified - Sampling in which the population is divided into groups
(called strata) according to some characteristic. Each of these strata
is then sampled using one of the other sampling techniques.
Cluster - Sampling in which the population is divided into groups
(usually geographically). Some of these groups are randomly
selected, and then all of the elements in those groups are selected.
• Two samples of equal-size fish were taken from the same population
• One group fed with vitamin supplement diet for 300days & the other
untreated control group
• The supplement diet caused 10% increase in length but the difference
is small compared with the variation in growth among individuals
which may obscure any effect of treatment
Because of the natural variability
among living species: