0002-8614/13/$15.00
JAGS
1171
This study is a secondary analysis of a previously published Cochrane systematic review on ChEIs for AD.7
Although that review included 10 trials, only four of these
provided sufficient data to allow all of the effect size
indices of interest to be computed. For consistency, the
names of the trials used in that review have been kept:
DON-302,8 GAL-INT-Wilcock,9 GAL-USA-Raskind,10
and RIV-B303.11
Multidimensional Analysis
METHODS
1172
PETERS
RESULTS
Individual Domains of Function
The effect size indices for the four RCTs are reported in
Table 1. For the cognition domain, SRD values ranged
from 0.07 to 0.27, NNT values ranged from 14 to 4, and
AUC-positive values ranged from 0.54 to 0.64. The 95%
CI for the SRD in the RIV-B303 trial crossed 0 ( 0.01 to
0.15), indicating a statistically nonsignificant benefit of the
drug for cognition. The corresponding NNT for this effect
is 14, and the 95% CI ( 100 to 7) does not contain this
point estimate. This is a known limitation of the NNT for
effects that are not statistically significant, which complicates its interpretation.17,18 In addition, the AUC-positive
value for the RIV-B303 trial was 0.54, and the 95% CI
included 0.50, which is the value of this statistic when
there is no difference between the two groups. Accordingly, in this case, a SRD of 0.07, a NNT of 14, and an
AUC-positive value of 0.54 are all associated with statistically nonsignificant differences between the two groups.
The SRD values in the global function domain ranged
from 0.01 to 0.17 and were generally lower than those
reported for the cognition domain (the drugs benefitted
cognition more than global function). The CIs for the two
galantamine trials crossed 0, indicating no statistically
JAGS
Multidimensional Analysis
Figure 1 shows the harm:cognition and the harm:global
function ratio scores for each of the four trials. The best
pair of outcomes is observed for the DON-302 trial, both
of the ratios are greater than 1.0. The ratio of 2.5 for
harm:cognition indicates that the NNH (10) is 2.5 times
as great as the NNT (4), and in this trial, the drug benefitted cognition more than global function. The results for
the GAL-INT-Wilcock trial are inconsistent: Although
there is a positive outcome for cognition (the NNH is 2.86
times as great as the NNT), there is more harm than benefit for global function (the NNT is 5 times as great as the
NNH). The other two studies, GAL-USA-Raskind and
RIV-B303, show no net benefit at all or some degree of
harm.
Cognition
Drug responders, n/N
Placebo responders, n/N
SRD (95% CI)
NNT (95% CI)
AUC-positive (95% CI)
Global Function
Drug responders, n/N
Placebo responders, n/N
SRD (95% CI)
NNT (95% CI)
AUC-positive (95% CI)
Harm
Drug withdrawals due to
adverse events, n/N
Placebo withdrawals due
to adverse events, n/N
Harm rate difference (95% CI)
NNH (95% CI)
AUC-negative (95% CI)
DON-302
80/150
41/153
0.27 (0.15 to 0.38)
4 (7 to 3)
0.64 (0.58 to 0.69)
37/149
17/152
0.14 (0.04 to 0.23)
7 (25 to 4)
0.57 (0.52 to 0.62)
GAL-INT-Wilcock
64/220
32/215
0.14 (0.06 to 0.22)
7 (17 to 5)
0.57 (0.53 to 0.61)
36/206
33/203
0.01 ( 0.07 to 0.09)a
100 ( 14 to 11)b
0.51 (0.47 to 0.55)
GAL-USA-Raskind
RIV-B303
67/202
34/207
0.17 (0.08 to 0.26)
6 (13 to 4)
0.59 (0.54 to 0.63)
57/242
39/238
0.07 ( 0.01 to 0.15)a
14 ( 100 to 7)b
0.54 (0.50 to 0.58)
37/186
27/196
0.06 ( 0.02 to 0.14)a
17 ( 50 to 7)b
0.53 (0.49 to 0.57)
80/219
46/230
0.17 (0.08 to 0.25)
6 (13 to 4)
0.59 (0.54 to 0.63)
26/157
31/220
49/212
55/242
11/162
19/215
16/213
16/239
JAGS
1173
Figure 1. Cognition and global function harm:benefit ratios for each study. Ratios equal to 1.00 indicate no difference between
the amount of harm and benefit. Values greater than 1.00 indicate more benefit than harm; values less than 1.00 indicate more
harm than benefit. The x-axis is a logarithmic scale to reflect equal spacing of values above and below 1.00. NNH, number
needed to harm; NNT, number needed to treat.
DISCUSSION
This investigation highlights two important points. First,
effect size indices help to interpret the clinical meaningfulness of RCT results. For example, consider the AUC-positive values reported here; the highest was 0.64. Although in
this trial, the number of responders on the ADAS-Cog in
the drug group was significantly greater than in the placebo
group (SRD = 0.27, 95% CI = 0.150.38), this AUC is not
large considering that a value of 0.50 indicates an equal
probability of success if one were to sample a participant
from each group. The information from the effect size indices provide a meaningful context for interpreting trial
results and therefore complement the traditional approach
of relying on measures of statistical significance alone.
Second, there is value in comparing effect size indices
across outcome domains. Harm:benefit ratios were computed for the cognition and global function domains. This
analysis revealed that only one of the four RCTs showed
consistent evidence of more benefit than harm in these two
domains. In addition, the GAL-USA-Raskind and the RIVB303 studies nicely illustrate the limitations of looking at
one index in isolation. The NNT values in these two studies
ranged from 6 to 17, which would be considered very good
considering that the azidothymidine drug trial for AIDS
was halted early with a NNT of 919 and the Physicians
Health Study examining the benefit of aspirin was also
halted early with a NNT of 130.20 However, in the GALUSA-Raskind and the RIV-B303 studies, the interpretation
of these NNT values changes considerably when one takes
into account the fact that the NNH in these two trials is 6,
and the harm:benefit ratios illustrate this point. This
approach of directly comparing the harm and benefit of a
treatment intervention is more consistent with what physicians do when treating patients; they compare the potential
harm and potential benefit of a medication to inform their
therapeutic decision-making. Rarely do physicians ponder
the probability of whether the results of a given trial were
1174
PETERS
ACKNOWLEDGMENTS
A poster based on this work was presented at the 2012 Alzheimer Association International Conference held in Vancouver, British Columbia, Canada. The author gratefully
acknowledges the assistance of Ms. Jaclyn Orsetto for her
help in preparing the manuscript, and Dr. Peter Whitehouse for helpful comments on an earlier draft of this manuscript.
Conflict of Interest: This study was supported by an
operating grant from the Canadian Institutes of Health
JAGS
REFERENCES
1. Cumming G. Understanding the New Statistics: Effect Sizes, Confidence
Intervals, and Meta-Analysis. New York: Routledge, 2012.
2. Grissom RJ, Kim JJ. Effect Sizes for Research: Univariate and Multivariate
Applications. New York: Routledge, 2012.
3. Kraemer HC, Kupfer DJ. Size of treatment effects and their importance to
clinical research and practice. Biol Psychiatry 2006;59:990996.
4. Molnar FJ, Man-Son-Hing M, Fergusson D. Systematic review of measures
of clinical significance employed in randomized controlled trials of drugs
for dementia. J Am Geriatr Soc 2009;57:536546.
5. Lanctot K, Herrmann N, Yau KK et al. Efficacy and safety of cholinesterase
inhibitors in Alzheimers disease: A meta-analysis. CMAJ 2003;169:
557564.
6. Livingston G, Katona C. How useful are cholinesterase inhibitors in the
treatment of Alzheimers disease? A number needed to treat analysis. Int
J Geriatr Psychiatry 2000;15:203207.
7. Birks J. Cholinesterase inhibitors for Alzheimers disease. Cochrane
Database Syst Rev 2006;25:CD005593.
8. Rogers SL, Farlow MR, Doody RS et al. A 24-week, double-blind, placebo-controlled trial of donepezil in patients with Alzheimers disease.
Donepezil Study Group. Neurology 1998;50:136145.
9. Wilcock GK, Lilienfeld S, Gaens E. Efficacy and safety of galantamine in
patients with mild to moderate Alzheimers disease: Multicentre randomised controlled trial. Galantamine International-1 Study Group. BMJ
2000;321:14451449.
10. Raskind MA, Peskind ER, Wessel T et al. Galantamine in AD: a 6-month
randomized, placebo-controlled trial with a 6-month extension. The Galantamine USA-1 Study Group. Neurology 2000;54:22612268.
11. Rosler M, Anand R, Cicin-Sain A et al. Efficacy and safety of rivastigmine
in patients with Alzheimers disease: International randomised controlled
trial. BMJ 1999;318:633638.
12. Rosen WG, Mohs RC, Davis K. A new rating scale for Alzheimers disease.
Am J Psychiatry 1984;141:13561364.
13. Cook RJ, Sackett DL. The number needed to treat: a clinically useful measure of treatment effect. BMJ 1995;310:452454.
14. R Development Core Team. R: A Language and Environment for Statistical
Computing. Vienna, Austria: R Foundation for Statistical Computing, 2012.
15. Schneider LS, Olin JT, Doody RS et al. Validity and reliability of the
Alzheimers Disease Cooperative Study-Clinical Global Impression of
Change. The Alzheimers Disease Cooperative Study. Alzheimer Dis Assoc
Disord 1997;11(Suppl 2):S22S32.
16. Citrome L. Miracle pills for weight loss: what is the number needed to
treat, number needed to harm and likelihood to be helped or harmed for
naltrexone-bupropion combination? Int J Clin Pract 2010;64:14621465.
17. McAlister FA. The number needed to treat turns 20and continues to
be used and misused. Can Med Assoc J 2008;179:549553.
18. Altman DG. Confidence intervals for the number needed to treat. BMJ
1998;317:13091312.
19. Barnes DM. Promising results halt trial of anti-AIDS drug. Science
1986;234:1516.
20. Steering Committee of the Physicians Health Study Research Group. Preliminary report: findings from the aspirin component of the ongoing physicians health study. N Engl J Med 1988;318:262264.
21. Moher D, Hopewell S, Schulz KF et al. CONSORT 2010 explanation and
elaboration: updated guidelines for reporting parallel group randomised
trials. BMJ 2010;340:c869c897.
22. Karlawish JHT. Making sense of the language of benefit for Alzheimers
disease treatments. In: Ballenger JF, Whitehouse PJ, Lyketsos CG et al.,
eds. Treating Dementia: Do We Have a Pill for It? Baltimore: The Johns
Hopkins University Press, 2009, pp 125133.
23. Charles C, Whelan T, Gafni A. What do we mean by partnership in making decisions about treatment? BMJ 1999;319:780782.
24. Muller-Engelmann M, Keller H, Donner-Banzhoff N et al. Shared decision
making in medicine: The influence of situational treatment factors. Patient
Educ Couns 2011;82:240246.
25. McAlister FA, Straus SE, Guyatt GH et al. Users guides to the medical literature: XX. Integrating research evidence with the care of the individual patient.
Evidence-Based Medicine Working Group. JAMA 2000;283:28292836.