Definition of statistics
Use of statistics
Common terms in biostatistics
Data & Variables
Descriptive & inferential statistics
norainiag@usm.my 2
Learning objectives
3
Definition
An application of statistics to a biological data, such as biology, human,
disease, medicaletc.
Biostatistics provides the tools for the summary and digestion of a lot of
numerical laboratory and clinical data including critical reading and
understanding of scientific literature.
norainiag@usm.my 4
What is statistics?
Mathematical procedures to
collect
organise
analyse, and
interpret
5
Statistics also includes the planning of data collection
in terms of the design of:
survey, and
experiments
6
Use of statistics
Statistics is especially useful in drawing general
conclusions about a set of data from a sample of
data
7
Why is statistics important?
It is part of the quantitative approaches to
knowledge
9
Common terms
Population
A set of individuals, items or data from which statistical sample is taken
Sample
A subset of a population which is selected to represent the population
Population
Sample Sample
n 1
10
- The raw or original measurement of statistics.
- Values taken by the variable
e.g.
10 years old (Data for age variable),
Data
Male (Data for gender variable),
Chinese (Data for race variable),
160oC (Data for temperature variable),
74 kg (Data for body weight variable)
11
Parameter
-used to represent a certain Population characteristics
e.g., population mean
Statistic
-used to represent a certain Sample characteristics
e.g., Sample mean
Variables
-Characteristic of event/object/person.
-Observed/measured
e.g. age, gender, race, height and weight
12
Parametric method/statistic/test
-the method that used on the data from a type of probability distribution
e.g., normal distribution, binomial distribution
Non-parametric method/statistic/test
-the method that used on the data not based on parameterized families of probability
distribution; distribution free
Mathematical procedures to
collect
organise
analyse, and
interpret
14
*Collecting data
Observe = measurement
What to measure?
Variables
15
Types of variable
VARIABLE
16
Qualitative/Categorical Variable
Nominal Data
- Data differentiated by NAME Frequency
- e.g. Blood group, Gender, Race and etc. (%)
Ordinal/Ranked Data
- Data that is RANKED
- e.g. 1st, 4th and 8th winners of a marathon race
Has unequal distance between adjacent points in a scale.
Has underlying continuity, thus may be used as interval data (numerical
data) 17
Quantitative/Numerical Variable
Discrete
- Data with finite values (fixed values)
- e.g. Number of students in PPSK, number of cars in a particular car park
Measure of centrality/
Continuous Data Measure of dispersion
Nominal
Discrete
Ordinal
Variable
Interval
Continuous
Ratio
19
Types of variable- Discrete vs Continuous
Discrete Continuous
Restricted to whole, countable numbers e.g An infinite number of possible values fall
number of patients, frequency of cough at night etc. between any two observed values e.g. body
weight, blood sugar etc.
No values can exist between two
neighbouring categories e.g. 4 or 5 number of Very rare to obtain identical measurement for
patients; no 4.5 patients two different individuals
Observations that differ qualitatively e.g. male Intervals rather than category Two people who
& female both claim to weigh 90 kgs are probably not the same weight.
However they are both around 90 kgs. One person may actually
weigh 89.6 kgs and the other 90.3.
20
Interval
Ordered categories that are all intervals of exactly the same size e.g.
weight in kg, height in cm.
Ordered e.g. 3>2>1,
21
Ratio
Interval scale with an absolute zero point
e.g. 0 kg means no weight
Dependent
Variable/ outcome/response
variable/ y
23
Concepts related to variable
Independent Variable / predictors/covariate/x
A variable that stands alone and isnt change by the variables we are trying to
measure
(Independent variable!!)
.
(Independent variable!!)
Weight-dependent variable
25
Identify each of the following as examples of
qualitative or quantitative variables:
28
Identify each of the following as examples of qualitative (nominal
or ordinal) or quantitative (discrete or continous variables):
Mathematical procedures to
collect
organise
analyse, and
interpret
32
Why we need statistics/biostatistics?
What are the main uses of statistics in health sciences?
1. summarising data
2. calculating probabilities associated with experimental
findings
3. test of significance
33
Analyse data
Collectively, statistical analysis are divided into two:
1. Descriptive statistics
2. Inferential statistics
34
Analyse data
Descriptive statistics
Decribe, summarise e.g. mean (SD), %
35
Analyse data Discuss in next
topic: Descriptive
apply sampling
statistics
techniques
Sample
Population Do statistics
DESCRIBE
SUMMARISE
Descriptive The results confine within sample
statistics
36
Analyse data
Inferential statistics
Infer beyond sample e.g SEM, 95% CI
Population
use sample
statistics
38
Analyse data-various statistics methods
41
What is statistics/biostatistics?
Mathematical procedures to
collect
organise
analyse, and
interpret
Will learn in each topic!
42
Conclusion- Take home message
What is statistics?
Common terms
Types of variables
Descriptive statistics vs Inferential
statistics
43
Thank you.
Please install SPSS in your notebook.
We strongly encourage you to use your own notebook
during the class.
44