research-article2016
Original Article
Abstract
Objective: Self-reported information from questionnaires is frequently used in epidemiological studies, but few of these
studies provide information on the reproducibility of individual items contained in the questionnaire. We studied the test
retest reliability of self-reported diabetes among 33,919 participants in Norwegian Women and Cancer Study.
Methods: The testretest reliability of self-reported type 1 and type 2 diabetes diagnoses was evaluated between three selfadministered questionnaires (completed in 1991, 1998, and 2005 by Norwegian Women and Cancer participants) by kappa
agreement. The time interval between the testretest studies was ~7 and ~14years. Sensitivity of the kappa agreement for
type 1 and type 2 diabetes diagnoses was assessed. Subgroup analysis was performed to assess whether testretest reliability
varies with age, body mass index, physical activity, education, and smoking status.
Results: The kappa agreement for both types of self-reported diabetes diagnoses combined was good (0.65) for all three
testretest studies (19911998, 19912005, and 19982005). The kappa agreement for type 1 diabetes was good (0.73) in
the 19912005 and the 19982005 testretest studies, and very good (0.83) in the 19911998 testretest study. The kappa
agreement for type 2 diabetes was moderate (0.57) in the 19912005 testretest study and good (0.66) in the 19911998
and 19982005 testretest studies. The overall kappa agreement in the 19911998 testretest study was stronger than in
the 19912005 testretest study and the 19982005 testretest study. There was no clear pattern of inconsistency in the
kappa agreements within different strata of age, BMI, physical activity, and smoking. The kappa agreement was strongest
among the respondents with 17 or more years of education, while generally it was weaker among the least educated group.
Conclusion: The testretest reliability of the diabetes was acceptable and there was no clear pattern of inconsistency in
the kappa agreement stratified by age, body mass index, physical activity, and smoking. The study suggests that self-reported
diabetes diagnosis from middle-aged women enrolled in the Norwegian Women and Cancer Study is reliable.
Keywords
Type 2 diabetes, type 1 diabetes, metabolic syndrome, kappa, testretest reliability, reproducibility, questionnaires,
Norway, Norwegian Women and Cancer, Kvinner og kreft
Date received: 1 September 2015; accepted: 17 November 2015
Introduction
Epidemiological studies often rely on self-reported information, as this renders the costs of data collection lower than that
of clinical studies.1 However, the validity and reliability of
the instruments used for data collection are often not reported.2
Commonly, the Cohens kappa coefficient is used to
determine inter-rater agreement for disease (or other categorical outcomes) by comparing self-reported information against a gold standard (diagnostic test, medical
records, physiological measures, etc.). Previous validation studies of self-reported diabetes diagnosis have
Department of Community Medicine, University of Troms, Troms, Norway
Corresponding author:
Mashhood Ahmed Sheikh, Department of Community Medicine,
University of Troms, N-9037 Troms, Norway.
Email: mashhood.a.sheikh@uit.no
Creative Commons CC-BY: This article is distributed under the terms of the Creative Commons Attribution 3.0 License
(http://www.creativecommons.org/licenses/by/3.0/) which permits any use, reproduction and distribution of
the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages
(https://us.sagepub.com/en-us/nam/open-access-at-sage).
Downloaded from smo.sagepub.com by guest on February 3, 2016
Methods
Study cohort and sampling
The NOWAC Study is a prospective nationwide study
which started in 1991,54,55 and contains data from 170,000
women. Participants were randomly selected from the
National Population Register of Norway. The external
validity of the study56 and validity of some measures5759
have been published elsewhere. NOWAC Study participants are assumed to be representative of the female
Norwegian population in the corresponding age groups.56
The detailed characteristics of the participants are described
Sheikh et al.
Diabetes in the 2005 retest study (for comparison against the
1991 test study). Diabetes cases from the 2005 retest study,
for comparison against 1991 test study, were defined as participants who reported a diabetes diagnosis in the 2005 questionnaire, provided they reported a date of diagnosis prior to
1992. The same criteria were applied to women with type 1
or type 2 diabetes.
Diabetes in the 2005 retest study (for comparison against 1998
test study).Diabetes cases from the 2005 retest study, for
comparison against the 1998 test study, were defined as participants with self-reported diabetes in the 2005 questionnaire, provided that they reported a date of diagnosis prior to
1999. The same criteria were applied to women with type 1
or type 2 diabetes.
Covariates. Self-reported information on height and weight
from 1998 study was used to calculate BMI (kg/m2). BMI
was categorized into three groups: normal weight (BMI:
<25kg/m2), overweight (BMI: 2529.9kg/m2), and obese
(BMI: 30kg/m2). Smoking status was derived from the
replies to two questions in the 1998 questionnaire: Have
you ever smoked? (yes, no) and Do you smoke on a daily
basis at the moment? (yes, no). Women who answered no
to the former were categorized as never smokers. Those
who answered yes to the former, and no to the latter,
were categorized as former smokers, and those who
answered yes to both questions were categorized as current smokers. A 10-category scale measured the level of
self-reported physical activity in the 1998 questionnaire, the
validity of which has been reported previously.21 Responses
to questions about physical activity were used to assign a
category of physical activity: low [13], medium [47], and
high [810]. Education (duration in years) was categorized
into four groups: primary/intermediate (09), secondary
(1012), university (1316), and postgraduate and above
(17+). Age (years) was categorized in four groups with
5-year interval.
Statistical analysis
Statistical analysis was performed with SAS version 9.2 and
Stata version 13.1. Means (standard deviation (SD)) were
estimated for all continuous variables, and the percentage of
participants in each category was calculated for all categorical variables. General characteristics of the data are presented
as frequencies, percentages, and means with SDs, respectively (Table 1). Variables for all diabetes diagnoses, as well
as for type 1 and type 2 diabetes separately, were constructed,
and the kappa agreement for the two types of diabetes was
calculated for the 19911998 testretest study, the 19912005
testretest study, and 19982005 testretest study, respectively. The kappa coefficients summarize the total agreement
beyond that expected by chance. 95% confidence intervals
(CIs) for kappa statistic were estimated with analytical
Sensitivity analysis
Since self-reported age at diagnosis was used as the only discriminative criterion for distinguishing between type 1 and
type 2 diabetes, sensitivity analysis was performed by
restricting age at diagnosis <35years for type 1 diabetes and
age at diagnosis >44years for type 2 diabetes (Table 5).
Those reporting age at diagnosis 35-44 were excluded for the
purpose of assessing sensitivity of the kappa agreements
(Table 5).
Subgroup analysis
Subgroup analysis was performed to assess the consistency
of the kappa agreement across stratas of the covariates
(Tables 610).
Ethical approval
The NOWAC Study was approved by the Regional
Committee for Medical and Health Research Ethics. All participating women gave written informed consent.
Results
Table 1 presents the general characteristics of the study sample. Among the 33,919 women participating in 1991, 1998,
and 2005 study, the age distribution was between 40 and 59
(mean: 47.74.3) in 1998. Majority (64.6%) of the respondents had normal weight (BMI: <25kg/m2). Almost 40.3% of
the respondents had some university education or more.
Most (75.5%) of the respondents were classified as having
medium level of physical activity. In this study sample,
28.2% were classified as being current smoker, while 31.2%
were classified as being former smokers.
Table 2 presents the self-reported diabetes diagnosis in
1991 study, and 1998 study by self-reported age at diagnosis
in respective studies. Majority (56%) of the self-reported
diabetics reported age at diagnosis as 30years or over in
1991 study, while over 64.7% reported age at diagnosis as
40years or over in the 1998 study. This may partly be due to
the aging cohort itself.
Tables 3 and 4 present the kappa statistics for the test
retest studies. The agreement for all self-reported diabetes
N (%)
Age (years)
4044
4549
5054
5559
BMIa
Normal weight (<25kg/m2)
Overweight (2529.9kg/m2)
Obese (30kg/m2)
Education level (duration in
years)a
Primary/intermediate (09)
Secondary (1012)
University (1316)
Postgraduate and above (17+)
Physical activity levela
Low
Medium
High
Smoking status
Never smoker
Former smoker
Current smoker
9926 (29.3)
11,382 (33.6)
10,849 (32.0)
1762 (5.2)
21,553 (64.6)
9106 (27.3)
2709 (8.1)
Mean (SD)
47.7 (4.3)
24.4 (3.8)
12.5 (3.2)
6736 (20.1)
12,102 (36.1)
10,226 (30.5)
4460 (13.3)
3686 (11.5)
24,229 (75.5)
4186 (13.0)
13,763 (40.6)
10,582 (31.2)
9574 (28.2)
5.6 (1.7)
Age at diagnosis
aDiabetes
04
59
1014
1519
2024
2529
3034
3539
4044
4549
5054
Total
Diabetic in
1991a
Diabetic in
1998b
n (%)
n (%)
(3.4)c
5
10 (6.8)c
18 (12.2)c
10 (6.8)c
7 (4.7)c
15 (10.1)c
23 (15.5)c
23 (15.5)c
25 (16.9)c
12 (8.1)c
148 (100.0)
4 (1.3)d
6 (1.9)d
17 (5.4)d
11 (3.5)d
12 (3.8)d
12 (3.8)d
19 (6.0)d
30 (9.5)d
75 (23.8)d
70 (22.2)d
59 (18.7)d
315 (100.0)
cases in the 1991 test study were defined as those who reported
having diabetes, and their age at diagnosis in the 1991 study.
bDiabetes cases in the 1998 test study were defined as those who
reported having diabetes, and their age at diagnosis in the 1998 study.
One respondent to the 1998 questionnaire fulfilled the criteria for both
gestational diabetes and type 2 diabetes and was excluded.
cN and % of respondents reporting age at diagnosis in 1991 study.
dN and % of respondents reporting age at diagnosis in 1998 study.
Discussion
In this study, we analyzed the testretest reliability of selfreported diabetes diagnosis in a large sample of middle-aged
women in Norway. We observed that the agreement was
good for all diabetes diagnoses combined in all three test
retest studies. The weakest agreement was found in the
19912005 testretest study. This was to be expected, as the
time interval between these studies was the longest. These
results also suggest that other confounding factors may have
affected self-reported diabetes diagnosis in the 19911998,
or 19982005 testretest studies, as the agreement in these
periods was expected to be more similar. The fact that diabetes diagnosis may change over time could have contributed
to the decreasing agreement observed between the 1991
1998 testretest study and the 19912005 testretest study.
However, looking at the two types of diabetes separately
revealed some differences in the kappa agreement. The
Sheikh et al.
Table 3. Kappa agreements for self-reported diabetes diagnoses (excluding gestational diabetes).
Diabetes
Test study
Retest study
Consistency (%)a
113/148 (76.4)
90/148 (60.8)
209/315 (66.3)
0.75 (0.700.81)
0.65 (0.580.71)
0.70 (0.660.74)
Consistency(%) =
Table 4. Kappa agreements for self-reported type 1 and type 2 diabetes diagnoses.
Diabetes type
Test study
Retest study
Type 1 diabetesb
Type 2 diabetesc
Type 1 diabetesb
Type 2 diabetesc
Type 1 diabetesb
Type 2 diabetesc
Consistency (%)a
83/111 (74.7)
29/37 (78.4)
64/111 (57.6)
21/37 (56.6)
70/111 (63.1)
125/204 (61.3)
0.83 (0.760.89)
0.67 (0.550.79)
0.76 (0.680.84)
0.57 (0.430.71)
0.73 (0.660.81)
0.66 (0.590.72)
Consistency(%) =
bType
cType
Table 5. Sensitivity analysis of kappa agreements for self-reported type 1 (age at diagnosis: <35years) and type 2 diabetes diagnoses
(age at diagnosis: >44years).
Diabetes type
Test study
Retest study
Type 1 diabetesb
Type 2 diabetesc
Type 1 diabetesb
Type 2 diabetesc
Type 1 diabetesb
Type 2 diabetesc
Consistency(%) =
bOnly
cOnly
Consistency (%)a
68/88 (77.3)
6/12 (50.0)
54/88 (61.4)
3/12 (25.0)
57/81 (70.4)
75/129 (58.1)
0.80 (0.650.95)
0.52 (0.270.77)
0.69 (0.510.88)
0.33 (0.050.61)
0.60 (0.380.81)
0.63 (0.560.70)
Table 6. Kappa agreements for self-reported diabetes diagnoses (excluding gestational diabetes) stratified by age groups.
Age
Age
Age
4044
4549
5054
5559
4044
4549
5054
5559
4044
4549
5054
5559
Test study
Retest study
Consistency (%)a
27/38 (71.1)
27/39 (69.2)
48/59 (81.4)
11/12 (91.7)
26/38 (68.4)
24/39 (61.5)
34/59 (57.6)
6/12 (50.0)
42/64 (65.6)
49/75 (65.3)
98/143 (68.5)
20/33 (60.6)
0.78 (0.670.89)
0.70 (0.580.82)
0.77 (0.690.86)
0.76 (0.580.93)
0.76 (0.650.88)
0.65 (0.520.77)
0.60 (0.490.71)
0.52 (0.270.77)
0.69 (0.600.79)
0.69 (0.610.78)
0.71 (0.650.77)
0.65 (0.510.79)
Consistency(%)=
Table 7. Kappa agreements for self-reported diabetes diagnoses (excluding gestational diabetes) stratified by BMI.
BMI
BMI
BMI
Test study
Retest study
Consistency (%)a
49/62 (79.0)
35/44 (79.5)
29/41 (70.7)
44/62 (80.0)
21/44 (47.7)
25/41 (61.0)
74/99 (74.7)
59/99 (59.6)
74/114 (64.9)
0.79 (0.710.87)
0.76 (0.660.86)
0.70 (0.590.82)
0.73 (0.640.82)
0.53 (0.400.66)
0.65 (0.530.78)
0.79 (0.720.85)
0.65 (0.560.73)
0.79 (0.720.85)
Consistency(%) =
bThe
numbers do not add up to 148 due to missing values on height or weight (consequently on BMI).
numbers do not add up to 130 due to missing values on height or weight (consequently on BMI).
dThe numbers do not add up to 315 due to missing values on height or weight (consequently on BMI).
eThe numbers do not add up to 282 due to missing values on height or weight (consequently on BMI).
cThe
Sheikh et al.
Table 8. Kappa agreements for self-reported diabetes diagnoses (excluding gestational diabetes) stratified by physical activity.
Low
Medium
High
Low
Medium
High
Low
Medium
High
Test study
Retest study
Consistency (%)a
19/24 (79.2)
80/106 (75.5)
8/11 (72.7)
18/24 (75.0)
63/106 (59.4)
5/11 (45.5)
43/62 (69.4)
139/209 (66.5)
17/26 (65.4)
0.68 (0.540.82)
0.77 (0.710.84)
0.73 (0.520.94)
0.74 (0.560.85)
0.66 (0.580.73)
0.50 (0.230.77)
0.72 (0.630.81)
0.70 (0.650.75)
0.67 (0.520.82)
Consistency(%) =
bThe
numbers do not add up to 148 due to missing values on physical activity level.
numbers do not add up to 151 due to missing values on physical activity level.
dThe numbers do not add up to 130 due to missing values on physical activity level.
eThe numbers do not add up to 315 due to missing values on physical activity level.
fThe numbers do not add up to 282 due to missing values on physical activity level.
cThe
Table 9. Kappa agreements for self-reported diabetes diagnoses (excluding gestational diabetes) stratified by education level.
Education level
(duration in years)
Education level
(duration in years)
Education level
(duration in years)
Test study
Retest study
33b
14b
Cases in 1991 (n)
35b
63b
24c
15c
Cases in 2005 (n)
32d
53d
21/33 (63.6)
13/14 (92.9)
30d
12d
Cases in 2005 (n)
78f
112f
24/33 (72.7)
10/14 (71.4)
Primary/intermediate (09)
Secondary (1012)
33b
14b
Cases in 1998 (n)
85e
133e
University (1316)
Postgraduate and above (17+)
63e
30e
61f
27f
Primary/intermediate (09)
Secondary (1012)
University (1316)
Postgraduate and above (17+)
Primary/intermediate (09)
Secondary (1012)
University (1316)
Postgraduate and above (17+)
Consistency(%) =
bThe
Consistency (%)a
28/35 (80)
50/63 (79.4)
18/35 (51.4)
37/63 (58.7)
0.75 (0.640.86)
0.75 (0.670.83)
0.74 (0.610.87)
0.90 (0.781.00)
0.54 (0.390.68)
0.64 (0.530.74)
55/85 (64.7)
85/133 (64.0)
0.76 (0.640.88)
0.77 (0.590.95)
0.67 (0.590.75)
0.69 (0.620.76)
45/63 (71.4)
21/30 (70.0)
0.72 (0.640.81)
0.74 (0.610.86)
Table 10. Kappa agreements for self-reported diabetes diagnoses (excluding gestational diabetes) stratified by smoking status.
Smoking status
Smoking status
Smoking status
Never smoker
Former smoker
Current smoker
Never smoker
Former smoker
Current smoker
Never smoker
Former smoker
Current smoker
Test study
Retest study
Consistency (%)a
38/51 (74.5)
37/51 (72.5)
38/46 (82.6)
29/51 (56.9)
31/51 (60.8)
30/46 (65.2)
72/108 (66.7)
62/103 (60.2)
75/104 (72.1)
0.78 (0.680.87)
0.73 (0.630.83)
0.76 (0.670.85)
0.63 (0.510.75)
0.68 (0.570.79)
0.63 (0.520.74)
0.71 (0.640.78)
0.63 (0.550.71)
0.75 (0.680.82)
Consistency(%) =
Sheikh et al.
the investigation tool increases the kappa agreement for the
testretest reliability of the studies using interviews to collect data.
However, a study from Manhattan (New York)80 reported
on the testretest reliability of diabetes diagnosis using telephone interviews. The retest study was conducted within
30days of the test study, and the kappa agreement between
the test and retest studies was found to be 0.48, which is very
low considering the short time interval, and despite the use
of interviews to collect data. This shows that a short time
interval between the test and the retest study and the use of
interviews do not necessarily increase the kappa agreement.
The strength of this study is that, it is the first to assess the
testretest reliability of self-reported diabetes diagnosis separately for type 1 and type 2 diabetes. Other strengths of our
study include a large cohort size, sensitivity of the estimates
by self-reported age at diagnosis, and subgroup analysis
within different covariates. This study provides new insights
into earlier research by providing the reliability of selfreported diagnosis separately for type 1 and type 2 diabetes.
Conclusion
In conclusion, this study shows that the reliability of the
self-reported information on diabetes diagnosis from a
large prospective cohort study with long time interval is
satisfactory.
Acknowledgements
The authors are thankful for the anonymous reviewers' comments
and suggestions to improve the quality of the article.
Authors contributions
This work was completed as part of M.A.S.s Masters in Public
Health thesis, supervised by T.B. M.A.S. performed statistical analyses, data interpretation, and drafted the article. T.B. is the principle
investigator of this study. E.L. is the principle investigator of the
NOWAC study. E.L. and T.B. critically reviewed the article.
Ethics approval
The NOWAC Study was approved by the Regional Committee for
Medical and Health Research Ethics.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Informed consent
All participants gave written informed consent.
References
1. Beckett M, Weinstein M, Goldman N, etal. Do health interview surveys yield reliable data on chronic illness among
older respondents? Am J Epidemiol 2000; 151(3): 315323.
2. Feinstein AR and Horwitz RI. Double standards, scientific
methods, and epidemiologic research. N Engl J Med 1982;
307(26): 16111617.
3. Goldman N, Lin I-f, Weinstein M, etal. Evaluating the quality of self-reports of hypertension and diabetes. Office of
Population Research working paper no. 2002-3, http://westoff.
princeton.edu/papers/opr0203.pdf
4. De Burgos-Lunar C, Salinero-Fort M, Cardenas-Valladolid J,
etal. Validation of diabetes mellitus and hypertension diagnosis in computerized medical records in primary health care.
BMC Med Res Methodol 2011; 11: 146.
5. Robinson JR, Young TK, Roos LL, etal. Estimating the burden of disease: comparing administrative data and self-reports.
Med Care 1997; 35(9): 932947.
6. Helivaara M, Aromaa A, Klaukka T, etal. Reliability and
validity of interview data on chronic diseases. The miniFinland Health Survey. J Clin Epidemiol 1993; 46(2):
181191.
7. Harlow SD and Linet MS. Agreement between questionnaire and medical records. Am J Epidemiol 1989; 129(2):
233248.
8. Huerta JM, Jos Tormo M, Egea-Caparrs JM, etal. Accuracy
of self-reported diabetes, hypertension, and hyperlipidemia in
the adult Spanish population. DINO study findings. Rev Esp
Cardiol 2009; 62(2): 143152.
9. Midthjell K, Holmen J, Bjorndal A, etal. Is questionnaire
information valid in the study of a chronic disease such as
diabetes? The Nord-Trndelag diabetes study. J Epidemiol
Community Health 1992; 46(5): 537542.
10. Kriegsman DMW, Penninx BWJH, Van Eijk JTM, etal. Selfreports and general practitioner information on the presence
of chronic diseases in community dwelling elderly: a study on
the accuracy of patients self-reports and on determinants of
inaccuracy. J Clin Epidemiol 1996; 49(12): 14071417.
11. Parr C, Veierd M, Laake P, etal. Test-retest reproducibility
of a food frequency questionnaire (FFQ) and estimated effects
on disease risk in the Norwegian Women and Cancer Study
(NOWAC). Nutr J 2006; 5(1): 4.
12. Veierd MB, Parr CL, Lund E, etal. Reproducibility of selfreported melanoma risk factors in a large cohort study of
Norwegian women. Melanoma Res 2008; 18(1): 19.
13. Tretli S, Lund-Larsen PG and Foss OP. Reliability of questionnaire information on cardiovascular disease and diabetes:
10
29. Johansen JB, Roe C, Bakke E, etal. Reliability and responsiveness of the Norwegian version of the Neck Disability
Index. Scand J Pain 2014; 5(1): 2833.
30. Nordtorp HL, Nyquist A, Jahnsen R, etal. Reliability of the
Norwegian Version of the Childrens Assessment of Participation
and Enjoyment (CAPE) and Preferences for Activities of Children
(PAC). Phys Occup Ther Pediatr 2013; 33(2): 199212.
31. Lchting I, Grotle M, Storheim K, etal. Individualized quality of
life in patients with low back pain: reliability and validity of the
Patient Generated Index. J Rehabil Med 2014; 46(8): 781787.
32. Iversen M, Espehaug B, Rokne B, etal. Psychometric properties
of the Norwegian version of the Audit of Diabetes-Dependent
Quality of Life. Qual Life Res 2013; 22(10): 28092812.
33. Haldorsen B, Svege I, Roe Y, etal. Reliability and validity of
the Norwegian version of the Disabilities of the Arm, Shoulder
and Hand questionnaire in patients with shoulder impingement syndrome. BMC Musculoskelet Disord 2014; 15: 78.
34. Aaby C and Heimdal J-H. The voice-related quality of life
(V-RQOL) measurea study on validity and reliability of the
Norwegian version. J Voice 2013; 27(2): 258.e29258.e33.
35. Tavoly M, Jelsness-Jrgensen L-P, Wik H, etal. Quality of life
after pulmonary embolism: first cross-cultural evaluation of the
pulmonary embolism quality-of-life (PEmb-QoL) questionnaire
in a Norwegian cohort. Qual Life Res 2015; 24(2): 417425.
36. Kapstad H, Nelson M, vers M, etal. Validation of the
Norwegian short version of the Body Shape Questionnaire
(BSQ-14). Nord J Psychiatry 2015; 69(7): 509514.
37. Klokkerud M, Grotle M, Lchting I, etal. Psychometric properties
of the Norwegian version of the patient generated index in patients
with rheumatic diseases participating in rehabilitation or selfmanagement programmes. Rheumatology 2013; 52: 924932.
38. Amble I, Gude T, Stubdal S, etal. Psychometric properties of
the outcome questionnaire-45.2: the Norwegian version in an
international context. Psychother Res 2014; 24(4): 504513.
39. Agerup T, Lydersen S, Wallander J, etal. Maternal and
paternal psychosocial risk factors for clinical depression
in a Norwegian community sample of adolescents. Nord J
Psychiatry 2015; 69(1): 3541.
40. Skre I, Friborg O, Elgaroy S, etal. The factor structure and
psychometric properties of the Clinical Outcomes in Routine
EvaluationOutcome Measure (CORE-OM) in Norwegian
clinical and non-clinical samples. BMC Psychiatr 2013; 13: 99.
41. Erdvik IB, verby NC and Haugen T. Translating, reliability
testing, and validating a Norwegian Questionnaire to Assess
Adolescents Intentions to be Physically Active After High
School Graduation. SAGE Open. Epub ahead of print 13 April
2015. DOI: 10.1177/2158244015580374.
42. Bergland , Hofoss D, Kirkevold M, etal. Person-centred
ward climate as experienced by mentally lucid residents in
long-term care facilities. J Clin Nurs 2015; 24(34): 406414.
43. sters N, Garratt A, Grotle M, etal. Patient-reported quality of care for osteoarthritis: development and testing of the
osteoarthritis quality indicator questionnaire. Arthritis Care
Res 2013; 65(7): 10431051.
44. Trochim WMK. Types of reliability, http://www.social
researchmethods.net/kb/reltypes.php (2006, accessed 24
December 2011).
45. Andresen EM, Malmstrom TK, Miller DK, etal. Retest reliability of self-reported function, self-care, and disease history.
Med Care 2005; 43(1): 9397.
11
Sheikh et al.
46. Yount BW, Wyrwich KW and Brownson RC. The reliability of a questionnaire-based metabolic syndrome surveillance
tool. Metab Syndr Relat Disord 2007; 5(3): 282289.
47. Brownson RC, Jackson-Thompson J, Wilkerson JC, etal.
Reliability of information on chronic disease risk factors collected in the Missouri Behavioral Risk Factor Surveillance
System. Epidemiology 1994; 5(5): 545549.
48. Starr GJ, Grande ED, Taylor AW, etal. Reliability of selfreported behavioural health risk factors in a South Australian
telephone survey. Australian N Z J Public Health 1999; 23(5):
528530.
49. Bosetti C, Tavani A, Negri E, etal. Reliability of data on medical conditions, menstrual and reproductive history provided
by hospital controls. J Clin Epidemiol 2001; 54(9): 902906.
50. Bowlin SJ, Morrill BD, Nafziger AN, etal. Reliability and
changes in validity of self-reported cardiovascular disease risk
factors using dual response: the behavioral risk factor survey.
J Clin Epidemiol 1996; 49(5): 511517.
51. Khardori R. Type 2 diabetes mellitus, http://emedicine.
medscape.com/article/117853-overview (2011, accessed 24
December 2011).
52. Sheikh MA, Lund E and Braaten T. The predictive effect of
body mass index on type 2 diabetes in the Norwegian women
and cancer study. Lipids Health Dis 2014; 13: 164.
53. Lund E. The Norwegian Women and Cancer study, NOWAC,
http://site.uit.no/nowac/; http://uit.no/ (accessed 14 October 2011).
54. The Norwegian Women and Cancer study (NOWAC). The
Norwegian Women and Cancer study. Troms: Department of
Community medicine, Faculty of medicine, The University of
Troms, 2008 (updated 4 July 2008), http://site.uit.no/nowac/
(accessed 21 December 2015).
55. Lund E, Dumeaux V, Braaten T, etal. Cohort profile: the
Norwegian Women and Cancer StudyNOWACKvinner
og kreft. Int J Epidemiol 2008; 37(1): 3641.
56. Lund E, Kumle M, Braaten T, etal. External validity in a
population-based national prospective studythe Norwegian
Women and Cancer Study NOWAC. Cancer Causes Control
2003; 14(10): 10011008.
57. Skeie G, Mode N, Henningsen M, etal. Validity of selfreported body mass index among middle-aged participants
in the Norwegian Women and Cancer study. Clin Epidemiol
2015; 7: 313323.
58. Hjartker A, Andersen LF and Lund E. Comparison of diet
measures from a food-frequency questionnaire with measures from repeated 24-hour dietary recalls. The Norwegian
Women and Cancer Study. Public Health Nutr 2007; 10(10):
10941103.
59. Brustad M, Skeie G, Braaten T, etal. Comparison of telephone
vs face-to-face interviews in the assessment of dietary intake
by the 24h recall EPIC SOFT programthe Norwegian calibration study. Eur J Clin Nutr 2003; 57(1): 107113.
60. Fleiss JL, Levin B and Paik MC. Statistical methods for rates
and proportions (ed WA Shewart and SS Wilks ). 3rd ed.
Hoboken, NJ: John Wiley & Sons, 2003, 800 pp.
61. Reichenheim ME. Confidence intervals for the kappa statistic.
Stata J 2004; 4(4): 421428.
62. Simon S. What is a kappa coefficient? (Cohens kappa).
Kansas City, MO: Childrens Mercy Hospitals and Clinics,
2008 (updated 14 July 2008), http://www.childrensmercy.org/
stats/definitions/kappa.htm (accessed 24 December 2011).