Original article
a r t i c l e i n f o a b s t r a c t
Article history: Background: Antidementia drugs have been associated with an increased risk of cardiovascular events.
Received 2 September 2015 The objective of this study was to identify the predictors for cardiovascular events among patients with
Received in revised form Alzheimer's disease (AD) on antidementia drugs, mining large longitudinal claims data.
27 January 2016
Methods: Using 2006e2011 claims from a 5% random sample of Medicare beneficiaries, I identified
Accepted 31 January 2016
Available online xxx
patients with AD who filled a prescription for an antidementia drug between 2007 and 2011. I followed
them from the initiation of the antidementia drug until a cardiovascular event or December 31, 2011,
censored by death or discontinuation of antidementia drugs. The outcome was the incidence of car-
Keywords:
adverse events
diovascular events, which include acute myocardial infarction, bradycardia, syncope, atrioventricular
antidementia drugs block, QT prolongation, and ventricular tachycardia. Covariates included predefined patient character-
claims data istics and empirical attributes identified from the claims, including International Classification of Diseases,
data mining Ninth Revision (ICD-9) diagnosis codes, Healthcare Common Procedure Coding System codes, and ther-
apeutic classes of all prescriptions filled. After using feature selection to choose the top covariates, a
logistic regression with multivariate variable selection was constructed.
Results: With an accuracy of 83.9% and a sensitivity of 93.3%, the algorithm identified 22 predictors for
cardiovascular events, including a history of ischemic heart disease, congestive heart failure, syncope,
stroke or transient ischemic attack, diabetes, number of other comorbidities, and procedures including
venipuncture and radiologic examinations.
Conclusion: The results of this study can help clinicians identify AD patients with a higher risk of car-
diovascular events who therefore should be prescribed antidementia drugs cautiously.
Copyright © 2016, Asia Pacific League of Clinical Gerontology & Geriatrics. Published by Elsevier Taiwan
LLC. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/
by-nc-nd/4.0/).
http://dx.doi.org/10.1016/j.jcgg.2016.01.002
2210-8335/Copyright © 2016, Asia Pacific League of Clinical Gerontology & Geriatrics. Published by Elsevier Taiwan LLC. This is an open access article under the CC BY-NC-ND
license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Please cite this article in press as: Hernandez I, Risk factors for cardiovascular events of antidementia drugs in Alzheimer's disease patients,
Journal of Clinical Gerontology & Geriatrics (2016), http://dx.doi.org/10.1016/j.jcgg.2016.01.002
2 I. Hernandez / Journal of Clinical Gerontology & Geriatrics xxx (2016) 1e6
Please cite this article in press as: Hernandez I, Risk factors for cardiovascular events of antidementia drugs in Alzheimer's disease patients,
Journal of Clinical Gerontology & Geriatrics (2016), http://dx.doi.org/10.1016/j.jcgg.2016.01.002
I. Hernandez / Journal of Clinical Gerontology & Geriatrics xxx (2016) 1e6 3
procedures and diagnosis and antidementia drugs. A total of 3056 statistic for categorical variables and on the p value of the F
empirical attributes were identified, which included 232 indicator statistic for continuous variables. Variables whose p value
variables for unique therapeutic classes of drugs, 483 indicators for was below the specified threshold were selected.
unique first three digits of ICD-9 diagnosis codes, 794 indicators for (3) Logistic regression with multivariate variable selection. To
unique first three digits of HCPCS codes, and one indicator variable further perform multivariate selection, a logistic regression
for surgery, all defined at baseline; and 242 indicators for unique was built to predict the incidence of cardiovascular events
therapeutic class of drugs, 485 unique first three digits of ICD-9 using forward stepwise variable selection. The initial model
diagnosis codes, 818 unique first three digits of the HCPCS codes, only included the constant and at each step, likelihood ratio
and one indicator variable for surgery, all defined during follow-up tests were performed to determine whether the addition of
period. the most significant covariate among those that had not
entered the model improved the predictive ability of the
model significantly. At each step, the significance of all var-
2.4. Model learning iables in the model was assessed and if any term could be
removed without significantly decreasing the variance
The data set was randomly split, and 50% of the observations explained by the model, it was dropped. The p value for entry
were used for training the algorithm and the remaining 50% for of new attributes in the model was fixed at 0.05 and at 0.1 for
validation. Because of the large number of potential covariates removal.
(3077, 21 predefined covariates and 3056 empirical attributes), I (4) Model validation. To evaluate the performance of the model,
constructed a four-step algorithm to identify the top predictors I validated the final logistic regression in the test set and
(Figure 1). calculated the accuracy, the area under the curve (AUC),
specificity, sensitivity, positive predictive value, and negative
(1) Selection of empirical attributes by prevalence prioritization. predictive value of the model.
This step excluded empirical attributes whose prevalence
was lower than 1%. The 1% threshold was fixed according to
previous studies applying similar claims data mining tech-
niques in the prediction of clinical events.18 From the original 2.5. Parameter adjustment
3056 empirical attributes, 2417 covariates were excluded.
(2) Univariate variable selection. This step ranked the 639 The algorithm required one parameter: the p value for selection
empirical attributes selected and the 21 predefined cova- of covariates by univariate selection. According to previous studies,
riates according to the p value of univariate analysis. The I searched the space of p values that allowed 0.5%, 1%, 1.5%, and 2.5%
ranking was based on the p values of the Pearson Chi-square of the variables to be selected.18 I therefore constructed four
Random Split
Pre-defined Empirical
Covariates Attributes
Final Model
Figure 1. Overview of the algorithm. The figure shows the four steps in the construction and evaluation of the algorithm. First, the original data was randomly split into a training
set that contained 50% of the observations and a test set that contained the remaining 50%. Using the training set, empirical attributes whose prevalence was lower than 1% were
excluded (Step 1). Then, the selected empirical attributes and the predefined covariates were ranked according to the p value of univariate analysis (Step 2). Using the selected
covariates and stepwise multivariate selection method, a logistic regression was built that predicted the incidence of any cardiovascular event (Step 3). This logistic regression model
was validated on the test set (Step 4).
Please cite this article in press as: Hernandez I, Risk factors for cardiovascular events of antidementia drugs in Alzheimer's disease patients,
Journal of Clinical Gerontology & Geriatrics (2016), http://dx.doi.org/10.1016/j.jcgg.2016.01.002
4 I. Hernandez / Journal of Clinical Gerontology & Geriatrics xxx (2016) 1e6
models: each of them included the top 0.5%, 1%, 1.5%, and 2.5% the odds of cardiovascular event by 15% (95% CI, 9e20%). Other risk
covariates selected by univariate selection, respectively. Four factors for cardiovascular events included having a diagnosis for
models were compared using the following performance mea- diabetes mellitus, essential hypertension, and occlusion and ste-
sures: accuracy, AUC, specificity, sensitivity, positive predictive nosis of precerebral arteries, and undergoing radiologic examina-
value, and negative predictive value. tion or pacemaker evaluation, all in the year prior to antidementia
initiation.
3. Results
4. Discussion
3.1. Patient characteristics and incidence of cardiovascular events
Using large longitudinal claims data, I developed an algorithm
The sample included 160,096 patients, of which 31,698 (19.8%) that identifies the risk factors for cardiovascular events among AD
experienced a cardiovascular event during the follow-up period. patients on antidementia drugs. With an accuracy of 84% and a
Syncope was the most common cardiovascular event; specifically, sensitivity of 93%, the algorithm identified 22 predictors for car-
18,710 (11.69%) of the study participants experienced a syncope. diovascular events, including ischemic heart disease, congestive
The mean follow-up period was 2.77 years (interquartile range, heart failure, a history of syncope, a history of stroke or TIA, the
0.92e4.75 years). number of CMS priority comorbidities, and having a medical claim
Table 1 compares patient demographic and clinical character- with ICD-9 diagnosis code 427.XX (cardiac dysrhythmias), 250.XX
istics between those that experienced a cardiovascular event and (diabetes mellitus), 786.XX (symptoms involving the respiratory
those who did not. Patients experiencing a cardiovascular event system or other chest symptoms) or HCPCS code 364XX (veni-
were older, more likely to be male, and have a history of cardio- puncture), 710XX and 721XX (radiologic examinations), or 930XX
vascular disease than those who did not experience an event. (electrocardiogram) in the year prior to antidementia initiation.
One may argue that patients with a history of cardiovascular
3.2. Model evaluation disease should have been excluded from the study sample because
it is not possible to determine whether the occurrence of cardio-
Four logistic regression models were built, each of which used as vascular events among these patients is a result of the use of
input the 0.5%, 1%, 1.5%, and 2.5% top predictors identified by uni- antidementia drugs or just a recurrence of their cardiovascular
variate selection. Using the top 0.5% variables as input, the logistic condition. The study sample included patients with a history of
regression built identified 10 risk factors for cardiovascular events. cardiovascular disease for two reasons: first, as Table 1 shows, pa-
When the percentage for univariate selection was fixed at 1%, 1.5%, tients with a history of cardiovascular disease represent a sub-
and 2.5%, the models identified 13, 17, and 22 risk factors for car- stantial proportion of all the AD patients who used antidementia
diovascular events, respectively. Table 2 compares the performance drugs; and second, the recurrence rate of cardiovascular events was
measures of the four models. The model including the top 2.5% higher among patients using antidementia drugs than in the gen-
predictors was the one with the highest accuracy, AUC, sensitivity, eral population. For instance, the study sample included 7528
positive predictive, power and negative predictive power. Specif- participants who had a history of myocardial infarction prior to the
ically, the accuracy of this model was 83.9%, the AUC 0.872, sensi- initiation of an antidementia drug. The average follow-up time for
tivity 45.7%, and specificity 93.3%. these 7528 participants was 2.5 years. In this period, 800 (10.63%)
experienced a recurrent myocardial infarction. This recurrence rate
3.3. Risk factors for experiencing cardiovascular events is considerably higher than the risk of reinfarction for the general
population of patients surviving a heart attack, which has been
Table 3 shows the risk factors for cardiovascular events identi- estimated at 6e7% in 3 years.19 Because this evidence suggests that
fied by each of the four models. Ischemic heart disease, number of the use of antidementia drugs increases the risk of a cardiovascular
other CMS priority comorbidities, having a medical claim in the event regardless of the history of cardiovascular disease, patients
year prior to treatment initiation with ICD-9 codes 427.XX (cardiac with prior cardiovascular disease were included in the study.
dysrhythmias), 780.XX (general symptoms) or 786.XX (symptoms Nevertheless, this algorithm could be separately trained in future
involving the respiratory system or other chest symptoms), or studies in samples of patients with and without a history of car-
HCPCS codes 364XX (venipuncture), 930XX (electrocardiogram), or diovascular disease.
992XX (hospital care or outpatient visit) were identified by four This study is subject to three main limitations. First, the sensi-
models as risk factors for cardiovascular events. tivity (45.7%) and positive predictive power (62.5%) of the final
Table 4 shows the odds ratio for cardiovascular events for each model were low. However, this limitation does not invalidate the
of the variables identified as risk factors by the model built with the results because the aim of the algorithm was not to predict which
top 2.5% covariates. The odds of experiencing a cardiovascular patients will experience a cardiovascular event but to identify risk
event while using antidementia drugs were 35% (95% CI, 28e42%) factors for cardiovascular events. Second, overfitting is a main
higher among patients with ischemic heart disease than those limitation of most data mining models,20 and it is likely that the
without it. In addition, having a history of stroke or TIA increased model built with the top 2.5% covariates was subject to
Table 2
Comparison of model performance.a
Percentage of attributes allowed in univariate variable selection Accuracy (%) AUC Sensitivity (%) Specificity (%) PPV (%) NPV (%)
AUC ¼ area under the curve; NPV ¼ negative predictive power; PPV ¼ positive predictive power.
a
The performance measures were obtained applying the logistic regression built to the test set.
Please cite this article in press as: Hernandez I, Risk factors for cardiovascular events of antidementia drugs in Alzheimer's disease patients,
Journal of Clinical Gerontology & Geriatrics (2016), http://dx.doi.org/10.1016/j.jcgg.2016.01.002
I. Hernandez / Journal of Clinical Gerontology & Geriatrics xxx (2016) 1e6 5
Table 3
Risk factors for cardiovascular events identified by four models.
2.5% Top attributes 1.5% Top attributes 1% Top attributes 0.5% Top attributes
CMS ¼ Centers for Medicare and Medicaid Services; ICD-9 ¼ International Classification of Diseases, Ninth Revision; HCPCS ¼ Healthcare Common Procedure Coding System;
TIA ¼ transient ischemic attack.
Table 4
Odds ratios for risk factors for cardiovascular events.a
CI ¼ confidence interval; CMS ¼ Centers for Medicare and Medicaid Services; HCPCS ¼ Healthcare Common Procedure Coding
System; ICD-9 ¼ International Classification of Diseases, Ninth Revision; OR ¼ odds ratio; TIA ¼ transient ischemic attack.
a
Results from the model built when the percentage of variables allowed for univariate selection was fixed at 2.5%.
overparametrization. This would explain why some factors were study sample included both patients taking AChEIs and memantine.
identified as risk factors for cardiovascular events, even though Because of their different mechanisms of action, risk factors for
there is no clinical rationale behind such association. Third, the cardiovascular events may differ between patients taking AChEIs
Please cite this article in press as: Hernandez I, Risk factors for cardiovascular events of antidementia drugs in Alzheimer's disease patients,
Journal of Clinical Gerontology & Geriatrics (2016), http://dx.doi.org/10.1016/j.jcgg.2016.01.002
6 I. Hernandez / Journal of Clinical Gerontology & Geriatrics xxx (2016) 1e6
and memantine. In future research, the algorithm could be sepa- 2. Gill SS, Anderson GM, Fischer HD, Bell CM, Li P, Normand SL, et al. Syncope and
its consequences in patients with dementia receiving cholinesterase inhibitors:
rately applied in samples of patients using AChEIs and memantine.
a population-based cohort study. Arch Intern Med 2009;169:867e73.
Nevertheless, this study is an important first step in the use of 3. Howes LG. Cardiovascular effects of drugs used to treat Alzheimer's disease.
large longitudinal claims data in the identification of risk factors for Drug Saf 2014;37:391e5.
side effects of pharmacological treatments. Similar approaches 4. McCain KR, Sawyer TS, Spiller HA. Evaluation of centrally acting cholinesterase
inhibitor exposures in adults. Ann Pharmacother 2007;41:1632e7.
could be easily used in the identification of risk factors for adverse 5. Poluzzi E, Raschi E, Moretti U, De Ponti F. Drug-induced torsades de pointes:
events of other medications for two reasons. First, this algorithm is data mining of the public version of the FDA Adverse Event Reporting System
less computationally intensive than standard stepwise pro- (AERS). Pharmacoepidemiol Drug Saf 2009;18:512e8.
6. Tanaka A, Koga S, Hiramatsu Y. Donepezil-induced adverse side effects of
cedures21; nevertheless, the accuracy and discriminatory power of cardiac rhythm: 2 cases report of atrioventricular block and Torsade de Pointes.
the model are excellent. Second, the results of this algorithm are Intern Med 2009;48:1219e23.
easily interpretable. One of the main limitations of alternative 7. Leitch A, McGinness P, Wallbridge D. Calculate the QT interval in patients
taking drugs for dementia. BMJ (Clin Res Ed) 2007;335:557.
machine learning techniques is their complexity, often producing 8. Fisher AA, Davis MW. Prolonged QT interval, syncope, and delirium with gal-
complicated models whose results are difficult to interpret by cli- antamine. Ann Pharmacother 2008;42:278e83.
nicians. However, the final product of this algorithm was a logistic 9. Fosbol EL, Peterson ED, Holm E, Gislason GH, Zhang Y, Curtis LH, et al.
Comparative cardiovascular safety of dementia medications: a cross-national
regression, which provides easily interpretable odds ratios for the study. J Am Geriatr Soc 2012;60:2283e9.
risk of cardiovascular events for each covariate. 10. Gallini A, Sommet A, Montastruc JL. Does memantine induce bradycardia? A
In conclusion, I built an algorithm that exploits high- study in the French PharmacoVigilance Database. Pharmacoepidemiol Drug Saf
2008;17:877e81.
dimensional claims data to identify risk factors for cardiovascular
11. Igeta H, Suzuki Y, Motegi T, Sasaki A, Yokoyama Y, Someya T. Deterioration in
events among AD patients on antidementia drugs with an accuracy donepezil-induced PR prolongation after a coadministration of memantine in a
of 84% and sensitivity of 93%. I identified 22 predictors for cardio- patient with Alzheimer's disease. Gen Hosp Psychiatry 2013;35:2.
vascular events, including a history of several cardiovascular con- 12. Jane Newby V, Anne Kenny R, McKeith IG. Donepezil and cardiac syncope: case
report. Int J Geriatr Psychiatry 2004;19:1110e2.
ditions and undergoing cardiovascular procedures. These results 13. Bordier P, Lanusse S, Garrigue S, Reynard C, Robert F, Gencel L, et al. Causes of
could guide prescribers in the identification of patients at high risk syncope in patients with Alzheimer's disease treated with donepezil. Drugs
of experiencing cardiovascular events with antidementia drugs and Aging 2005;22:687e94.
14. Kaduszkiewicz H, Zimmermann T, Beck-Bornholdt HP, van den Bussche H.
who should be prescribed these drugs with caution. Cholinesterase inhibitors for patients with Alzheimer's disease: systematic
review of randomised clinical trials. BMJ (Clin Res Ed) 2005;331:321e7.
15. Raina P, Santaguida P, Ismaila A, Patterson C, Cowan D, Levine M, et al. Effec-
Conflicts of interest tiveness of cholinesterase inhibitors and memantine for treating dementia:
evidence review for a clinical practice guideline. Ann Intern Med 2008;148:
None. 379e97.
16. O'Brien JT, Burns A. Clinical practice with anti-dementia drugs: a revised
(second) consensus statement from the British Association for Psychophar-
Acknowledgments macology. J Psychopharmacol 2011;25:997e1019.
17. Chronic Conditions Data Warehouse. 27 Chronic Condition Algorithm. 2014;
https://www.ccwdata.org/cs/groups/public/documents/document/ccw_
I acknowledge funding from the Institute of Medicine condition_categories.pdf [accessed 22.02.15].
(HHSP22320042509XI), the Commonwealth Foundation (No. R01 18. He D, Mathews SC, Kalloo AN, Hutfless S. Mining high-dimensional adminis-
trative claims data to predict early hospital readmissions. J Am Med Inform
HS018657), and “La Caixa” Foundation, Spain. I thank Dr Yuting
Assoc 2014;21:272e9.
Zhang for her advice about the study design and interpretation of 19. Stone GW, Witzenbichler B, Guagliumi G, Peruga JZ, Brodie BR, Dudek D, et al.
results, Drew Michael Donnell for his help with feature selection, Heparin plus a glycoprotein IIb/IIIa inhibitor versus bivalirudin monotherapy
and Dr Jerrold May for his advice about data structuring. and paclitaxel-eluting stents versus bare-metal stents in acute myocardial
infarction (HORIZONS-AMI): final 3-year results from a multicentre, rando-
mised controlled trial. Lancet 2011;377:2193e204.
References 20. Dietterich T. Overfitting and undercomputing in machine learning. ACM Com-
put Surv (CSUR) 1995;27:326e7.
21. Huang SH, Wulsin LR, Li H, Guo J. Dimensionality reduction for knowledge
1. Medicare Beneficiary Prevalence for Chronic Conditions for 2002 Through 2011 discovery in medical claims database: application to antidepressant medication
https://www.ccwdata.org/cs/groups/public/documents/document/wls_ucm1-
utilization study. Comput Methods Programs Biomed 2009;93:115e23.
000774.pdf. [accessed 05.01.15].
Please cite this article in press as: Hernandez I, Risk factors for cardiovascular events of antidementia drugs in Alzheimer's disease patients,
Journal of Clinical Gerontology & Geriatrics (2016), http://dx.doi.org/10.1016/j.jcgg.2016.01.002