During a 20-year longitudinal study of cognitive change in old age 2,342 of 5,842 participants died and 3,204
dropped out. To study cognitive change as death approaches, we grouped participants by survival, death, dropout,
or dropout followed by death. Linear mixed-effects pattern-mixture models compared rates of cognitive change
before death and dropout from four quadrennial administrations of tests of fluid intelligence, vocabulary, and
verbal learning. After we took into account the significant effects of age, gender, demographics, and recruitment
cohorts, we found that approach to death and dropout caused strikingly similar reductions in mean test scores
and amounts of practice gains between successive quadrennial testing sessions. Participants who neither dropped
out nor died showed significant but slight cognitive declines. These analyses illustrate how neglect of dropout
miscalculates effects of death, of worsening health, and of all other factors affecting rates of cognitive change.
Key Words: Cognitive changeDeathDropout.
METHODS
P271
P272
RABBITT ET AL.
Materials
Some of us have given details of the entire study elsewhere
(Rabbitt, Diggle, Holland, McInnes, Bent, et al., 2004). The
data analyzed here are from tests of general fluid intelligence,
that is, the Heim (1970) AH4-1 intelligence test; of vocabulary,
that is, the Raven (1965) Mill Hill B Vocabulary Test; and
cumulative verbal learning, that is, the Cumulative Verbal
Learning task (CVL task) administered on four successive
occasions at 4-year intervals between 19831985 and 2003.
The AH4-1 test consists of 64 logic, verbal comparisons, and
arithmetic problems. Scores are the percentages of correct
answers given within 10 minutes. The Mill Hill B Vocabulary
Test requires correct definitions for each of 34 words with no
time limit. For the CVL task, 15 three-syllable words matched
for frequency (1/100,000) and concreteness are projected, one
at a time, on a screen at a rate of 1/1.5 s. Participants recall as
many words as possible and the words are then shown three
times in different random orders and recalled without sight of
previous attempts. Scores are percentages of correct answers.
P273
Subset
RESULTS
Table 1 shows details of the demographic and age categories,
and death and withdrawal categories, and mean scores of these
subgroups on the AH4-1 test, the Mill Hill B Vocabulary Test,
and the CVL task.
Analyses
Methodology. Our analyses were based on the models
described by some of us elsewhere (Rabbitt, Diggle, Smith,
Holland, & McInnes, 2001; Rabbitt, Diggle, Holland, McInnes,
Bent, et al., 2004; Rabbitt et al., 2005). We consider age,
gender, socioeconomic status, and whether participants are
taking the test for the first, second, third, or fourth time (the
practice effect). We also include cities of residence and years of
recruitment of cohorts to adjust for unidentified confounding
factors. The model can be considered in two parts: a model for
the average response over time for a subject with given values
of all explanatory variables, and a model for the random
variation about the main response.
The aim is to determine the effect of imminent death on
a participants cognitive performance. However, some participants who withdraw survive beyond the census date whereas
others do not, giving three mechanisms by which they fail to
Manchester (2,615)
Newcastle (3,227)
Men (1,711)
Women (4,131)
Oclass C1 (261)
Oclass C2 (1,849)
Oclass C3(NM)(2,061)
Oclass C3M (771)
Oclass C4 (433)
Oclass C5 (47)
Oclass Na (420)
Age 4960 (1,347)
Age 6170 (2,849)
Age 7180 (1,477)
Age 80 (169)
M(SD) CVL
M(SD) MHB
49.6
47.5
50.9
47.4
61.1
56.2
48.4
39.2
34.4
31.0
40.5
55.4
49.5
41.8
33.7
(17.7)
(17.3)
(17.7)
(17.4)
(14.0)
(16.6)
(14.9)
(15.8)
(14.0)
(16.6)
(18.3)
(17.1)
(16.6)
(16.5)
(15.5)
71.6
67.5
66.2
70.4
72.2
73.1
70.1
63.0
61.7
55.5
62.6
75.5
69.7
63.5
57.5
(13.9)
(14.8)
(15.1)
(14.2)
(13.4)
(13.4)
(13.6)
(14.4)
(14.7)
(17.4)
(18.0)
(12.9)
(13.8)
(14.7)
(16.7)
54.8
45.2
52.1
48.5
63.3
58.3
48.1
40.6
35.3
29.9
42.7
51.7
49.4
48.1
47.8
(17.9)
(17.9)
(18.4)
(18.5)
(15.8)
(16.6)
(16.2)
(16.7)
(16.3)
(16.7)
(20.1)
(18.0)
(18.2)
(19.4)
(20.0)
56.3
43.0
46.7
49.9
53.6
39.4
42.9
48.7
45.2
49.8
51.4
(16.2)
(17.4)
(16.6)
(14.8)
(16.3)
(17.5)
(15.4)
(16.0)
(17.1)
(16.8)
(15.6)
74.9
63.4
65.3
70.1
70.2
62.5
63.2
67.9
69.6
71.7
71.4
(12.1)
(16.8)
(14.5)
(12.7)
(12.9)
(15.9)
(14.8)
(14.1)
(14.3)
(13.2)
(13.3)
53.0
48.3
50.8
52.8
50.4
45.2
47.0
50.5
45.6
50.4
52.9
(17.4)
(19.1)
(19.2)
(18.7)
(18.8)
(19.5)
(19.4)
(19.5)
(18.2)
(17.8)
(16.6)
Pattern
C (1,504)
D1 (365)
D2 (408)
D3 (246)
D4 (115)
WD1 (745)
WD2 (354)
WD3 (109)
W1 (1,013)
W2 (595)
W3 (388)
RABBITT ET AL.
P274
SE
df
53.63
0.77
0.014
2.25
1.42
9.58
6.14
8.56
11.96
14.85
5.42
3.71
5.54
7.22
0.75
0.04
0.00
0.44
0.66
0.96
0.46
0.60
0.74
2.07
0.79
0.18
0.29
0.52
6732
6732
6732
5814
5814
5814
5814
5814
5814
5814
5814
6732
6732
6732
70.68
19.16
5.78
5.09
2.13
9.91
13.26
14.09
16.06
7.14
6.78
20.43
19.08
13.78
,.0001
,.0001
,.0001
,.0001
.03
,.0001
,.0001
,.0001
,.0001
,.0001
,.0001
,.0001
,.0001
,.0001
2.49
1.86
1.16
0.27
1.59
0.55
9.28
1.86
3.08
1.73
0.89
0.85
0.68
1.10
0.74
2.79
0.86
0.82
5814
5814
5814
5814
5814
5814
5814
5814
5814
0.28
2.09
1.36
0.40
1.43
0.74
3.32
2.16
3.72
.7737
.0365
.1713
.6884
.1502
.4552
.0009
.0305
.0002
7.22
5.03
3.71
0.41
9.93
7.24
3.43
8.15
4.71
2.46
0.87
0.79
0.96
1.34
0.69
0.85
1.38
0.60
0.68
0.80
5814
5814
5814
5814
5814
5814
5814
5814
5814
5814
8.21
6.31
3.86
0.31
14.28
8.47
2.48
13.45
6.85
3.07
,.0001
,.0001
.0001
.7569
,.0001
,.0001
.0129
,.0001
,.0001
.0021
0.08
0.04
0.12
0.20
0.03
0.02
0.04
0.07
6732
6732
6732
6732
2.66
1.82
2.62
2.61
.0076
.0686
.0086
.0089
Item
(Intercept)
Age
Age2
Gender
City
Oclass C1
Oclass C2
Oclass C3M
Oclass C4
Oclass C5
Oclass Na
T2
T3
T4
Entry year
1982
1983
1984
1986
1987
1988
1990
1991
1992
Pattern
D1
D2
D3
D4
WD1
WD2
WD3
W1
W2
W3
Interactions
Age
Age
Age
Age
3
3
3
3
Gender
T2
T3
T4
Intercept
Age
Residual
SD
13.086
0.389
5.467
0.05
rA2
rAB
rAB
rB2
and are independent of the Level 1 error, Eij, which follows a univariate normal distribution with a mean of zero and
variance rE2.
This allows us to quantify the mean effect of death or
withdrawal, including also the test occasion by which this has
occurred. It also allows us to make a direct comparison of the
effects of death and of withdrawal, as we shall see.
We have also considered interactions between the various
explanatory variables and these are included in the final models.
Analysis of AH4-1 intelligence test scores. To illustrate
general trends common to all tasks, we give analyses of AH4-1
scores in detail. Scores for vocabulary and cumulative learning
are jointly discussed in the subsequent text. Table 2 shows
results from a linear mixed-effects pattern-mixture model comparing percentages of correct unadjusted AH4-1 scores for the
death and dropout groups after the effects of age, gender,
occupational category, city of residence, recruitment cohort,
and practice are considered. Age was centered at 49, the lowest
recorded in the study. Variance between individuals has
been modeled with random effects because these are longitudinal data and measurements from the same individual are
correlated.
Significant linear and quadratic terms show an accelerating
decline in mean scores with increasing age. Because there is no
Age 3 Death or Age 3 Dropout Group interaction, there is no
evidence that the cognitive effects of death or dropout differ
with the ages at which they occur. There is no interaction
between group and any of the variables of age, gender, city,
occupational class, or cohort (entry year), but there is a
significant constant effect of group at each session time, as seen
in the plot of the raw data. (It is not possible to look for an
interaction with session time because these regression coefficients would be unidentifiable.)
There are, however, some significant interactions of age with
practice gains. The interaction between age and the difference
between scores at T1 and T2 is not significant, but the
interactions between age and the T3 versus T1 difference (p
.0086) and between age and the T4 versus T1 difference (p
.0089) are significant. In survivors, deceased individuals, and
dropouts alike, the effect of age on test scores increases with
the interval over which it is measured. Overall, men (M 50.9,
SD 17.7) scored higher than women did (M 47.4, SD
17.4). The significant Age 3 Sex interaction shows that even
when longevity has been taken into account, women decline
less as they age. Mancunians or persons from Greater
Manchester (M 49.6, SD 17.7) score higher than
Novocastrians or persons from Newcastle (M 47.5, SD
17.3). Scores range from 61.1, SD 14.0, for C1 (the most
advantaged SEA group), to 31.0, SD 16.6, for SEA C5. This
is reflected in the main model, shown in Table 2. There, we can
see that the mean score of C5 is over 14% lower than that of
C3 (both C3N and C3M), the baseline class, and that of C1 is
9.5% higher than C3 (C3N and C3M). There are significant
differences between recruitment waves with performance for
entry years 1990 and 1992 that are greater than those for 1985
(baseline). There is a highly significant practice gain between
D1
D2
D3
D4
W1
W2
W3
.0001 .0001
.0001
.0002
.0001
.0001 .0001
Item
.0001 .0001
.001
(Intercept)
Age
Age2
Gender
City
Oclass C1
Oclass C2
Oclass C3M
Oclass C4
Oclass C5
Oclass Na
T2
T3
T4
.0009
.0001
Note: Values of t statistics are derived from the main model. After correction, p , .001 is taken as the threshold for significance. C control group;
D death and W withdrawal (see text for particular designations).
P275
CE
SE
df
68.96
0.82
0.01
4.18
0.53
3.14
2.93
4.59
6.09
11.44
3.17
0.56
4.79
0.21
0.70
0.04
0.01
0.46
0.76
0.90
0.41
0.55
0.68
1.95
0.93
0.24
0.35
0.60
5603
5603
5603
4774
4774
4774
4774
4774
4774
4774
4774
5603
5603
5603
97.93
19.06
3.95
10.28
0.70
3.48
7.12
8.23
8.87
5.85
3.38
2.32
13.57
0.35
,.0001
,.0001
.0001
,.0001
.4824
.0005
,.0001
,.0001
,.0001
,.0001
.0007
.0202
,.0001
.7237
1.77
5.68
1.23
1.41
0.59
0.48
0.62
0.75
1.53
0.87
0.85
0.59
0.98
0.71
0.97
1.49
4774
4774
4774
4774
4774
4774
4774
4774
1.15
6.52
1.45
2.37
0.60
0.68
0.63
0.50
.2490
,.0001
.1468
.0176
.5479
.4957
.5237
.6138
5.25
4.56
0.78
0.17
7.05
6.06
2.40
4.78
2.56
1.58
0.93
0.77
0.92
1.18
0.68
0.75
1.16
0.58
0.60
0.65
4774
4774
4774
4774
4774
4774
4774
4774
4774
4774
5.64
5.91
0.84
0.14
10.34
8.06
2.06
8.22
4.24
2.41
,.0001
,.0001
.4004
.8811
,.0001
,.0001
.0391
,.0001
,.0001
.0157
0.36
0.08
0.18
0.36
0.03
0.03
0.05
0.09
5603
5603
5603
5603
10.41
2.46
3.25
3.92
,.0001
.0137
.0012
.0001
Entry year
1982
1983
1984
1986
1987
1988
1991
1992
Pattern
D1
D2
D3
D4
WD1
WD2
WD3
W1
W2
W3
Interactions
Age
Age
Age
Age
3
3
3
3
City
T2
T3
T4
Intercept
Age
Residual
SD
10.607
0.444
7.433
0.533
RABBITT ET AL.
P276
D1
D2
D3
D4
W1
W2
W3
C
.0001 .0001
.0001 .0001
.0001 .0001
D1
.0002 .0004
.0066 .0003
D2
.0005 .0009
.0007
D3
D4
W1
.0007 .0001
W2
W3
WD1
.0002
WD2
Note: Values of t statistics are derived from the main model. After correction, p , .001 is taken as the threshold for significance. CVL task Cumulative Verbal Learning task; C control group; D death and W withdrawal
(see text for particular designations).
CE
SE
df
57.88
0.01
0.01
1.02
6.08
12.67
8.91
7.36
11.69
16.88
3.57
0.13
1.40
3.24
0.82
0.04
0.01
0.47
0.72
1.03
0.49
0.65
0.80
2.26
0.86
0.26
0.37
0.66
6797
6797
6797
5837
5837
5837
5837
5837
5837
5837
5837
6797
6797
6797
70.51
0.02
0.87
2.12
8.48
12.24
18.06
11.21
14.46
7.46
4.12
0.51
3.70
4.89
,.0001
0.97
.3789
.0339
,.0001
,.0001
,.0001
,.0001
,.0001
,.0001
,.0001
.6039
.0002
,.0001
1.82
0.07
2.36
1.81
3.75
0.12
1.05
3.34
3.06
1.83
0.95
0.90
0.74
1.18
0.80
2.94
0.93
0.90
5837
5837
5837
5837
5837
5837
5837
5837
5837
0.99
0.08
2.59
2.45
3.18
0.15
0.35
3.56
3.38
.3210
.9331
.0094
.0141
.0015
.8751
.7209
.0004
.0007
5.41
3.57
2.31
1.02
7.90
6.09
1.78
7.26
4.67
1.73
0.98
0.85
1.01
0.73
0.76
0.92
1.47
0.65
0.72
0.84
5837
5837
5837
5837
5837
5837
5837
5837
5837
5837
5.51
4.20
2.27
1.39
10.29
6.61
1.21
11.09
6.48
2.05
,.0001
,.0001
.0227
.4652
,.0001
,.0001
.2239
,.0001
,.0001
.0402
0.01
0.04
0.31
0.03
0.06
0.09
6797
6797
6797
0.41
0.72
3.25
.6755
.4687
.0011
Entry year
GENERAL DISCUSSION
These analyses replicate the main findings of Rabbitt and
colleagues (2005) for AH4-1 intelligence test scores on a much
1982
1983
1984
1986
1987
1988
1990
1991
1992
Pattern
D1
D2
D3
D4
WD1
WD2
WD3
W1
W2
W3
Interaction
Age 3 T2
Age 3 T3
Age 3 T4
Intercept
Age
Residual
SD
13.513
0.097
9.163
0.845
larger sample, extend and compare these to the CVL task and
production vocabulary (Mill Hill B Vocabulary Test), and
examine the time courses of changes in greater detail, concluding that mean scores on all analyzed cognitive tests decline
according to nearness of death or dropout.
P277
D1
D2
D3 D4
.0001 .0001
W1
W2
W3
.0001 .0001
WD1
WD2
WD3
.0001 .0001
.001
.0002 .0001
.001
.0009
.0004
Note: Values of t statistics are derived from the main model. After correction, p , .001 is taken as the threshold for significance. MHB Mill Hill B
Vocabulary Test; C control group; D death and W withdrawal (see text
for particular designations).
Methodological Issues
Most studies have found that particular pathologies that
become common in later life, such as diabetes (e.g., Bent,
Rabbitt, & Metcalf, 2000), hypertension and other cardiovascular problems (Fahlander et al., 2000; Hertzog, Schaie, &
Gribbin, 1978; Lopez et al., 2003), respiratory problems
(Holland & Rabbitt, 1991), and undifferentiated health
problems (McInnes & Rabbitt, 1997), have significant but
surprisingly small effects on cognitive performance. Most
studies have compared patients and healthy controls only at
a single time point, and the few longitudinal studies have
ignored death and dropout. The current analyses suggest that if
individuals in so-called patient groups who die or withdraw
are not included in comparisons against healthy controls, then
the true effects of pathologies must be severely underestimated.
If dropouts and deaths are excluded from analyses then the
P278
RABBITT ET AL.