• Unique – there is only one and there are infinitely many alternatives
Examples:
• Null Hypothesis 1: Cigarette smoking does not increase risk for idiopathic
pulmonary fibrosis.
• Null Hypothesis 2: Insulin resistance does not increase risk for coronary artery
disease
Real Truth vs Your Sample
• A large p-value indicates that any observed difference MAY be due entirely to
chance, not that it IS due entirely to chance
• P-values DO NOT measure strength of association
• Not a serious problem when the risk estimates by level of confounder, and overall risk estimate,
are all very similar
Identification of a confounder
Denote disease by D, risk factor by F and the third variable by C
If C is to be a confounder it must:
A B A causes B
A is non-casually related
A B to B
Confounding type 1
• Confounder causes both potential risk factor and disease
Grey stroke
F D hair
C age
• Grey hair tends to be related to any disease, such as stroke, which is age-related. This does
not mean that grey hair is an independent risk factor for such diseases!
Confounder Type 2
• Confounder causes disease and is related to risk factor
No of
Breast
children
F D cancer
C age
• Older women have had more children and also tend to have more breast cancer.
Having had more children is not necessarily a cause of breast cancer, but is
confounded with age
Confounder Type 3
• Confounder causes disease and risk factor is related to disease and confounder
Poor
housing CHD
F D
C smoking
• Poor housing is related to CHD, maybe not causally, but through other causal
factors, besides smoking, that are also associated with social disadvantage
Selecting confounders
AND
BUT NOT
• Confidence intervals tell us about how large a role chance may play,
but reveal nothing about bias. Interpretation of point estimates
should consider possible bias.
• Example:
• We compared HPV prevalence in:
• Cases --- all cervical cancer patients in Maryland
• Controls --- age-matched non-cancer patients from the HIV clinic
Information Bias
Method for collecting information on exposure, outcome or other relevant
factors (confounders and modifiers) introduces bias
Different interviewers used for cases vs. controls
Diagnostic test performance changes over time in cohort study
Diseased and non-diseased people remember their exposure history differently
Sources of Information Bias In
Epidemiological Research
• Participant responses
• differential recall (recall bias)
• e.g., mothers of children with birth defects more likely to remember their exposures during
pregnancy vs. mothers with normal deliveries because they want to find a reason for the
defect
• socially desirable answers
• e.g., participants in HIV-prevention trial know that they are not supposed to have risky sex so
may under-report their behavior
• Interviewers
• leading questions
• Interviewers with knowledge of the hypothesis may probe more on primary exposure history
when interviewing people with disease than the comparison group without disease.
• Measurement devices
• calibration
• blood pressure from patients with heart disease is collected at the hospital vs.
collected by Walgreen machine for comparison group without heart disease.
• operator differences
• Laboratory Assays
• sensitivity/specificity
• falsely classify exposure or disease
• sample collection and storage issues
• blood collected from cancer cases in hospital and immediately frozen vs. carried
around in a cooler for 10 hours after community collection from comparison group.
• operator differences
Types and Consequences of
Information Bias
Types
4Non-differential (“noise”)
4 e.g., using an insensitive and non-specific test to classify exposure or disease
4Differential
8 recall bias
8 interviewer probing cases more than comparison group
Consequences
• Sources and methods of data collection should be similar for all study groups