Monitoring Improvement Using A Patient Rated Depression Scale During Treatment With Anti Depressants in General Practice A Validation Study On The

Scandinavian Journal of Primary Health Care
ISSN: 0281-3432 (Print) 1502-7724 (Online) Journal homepage: http://www.tandfonline.com/loi/ipri20
Monitoring improvement using a patient-rated

depression scale during treatment with anti-
depressants in general practice. A validation study
on the Goldberg Depression Scale
Jørgen Holm, Liselotte Holm, Per Bech
To cite this article: Jørgen Holm, Liselotte Holm, Per Bech (2001) Monitoring improvement using
a patient-rated depression scale during treatment with anti-depressants in general practice. A
validation study on the Goldberg Depression Scale, Scandinavian Journal of Primary Health Care,
19:4, 263-266, DOI: 10.1080/02813430152706819
To link to this article: https://doi.org/10.1080/02813430152706819
© 2001 Informa UK Ltd All rights reserved:

reproduction in whole or part not permitted
Published online: 12 Jul 2009.
Submit your article to this journal
Article views: 458
Citing articles: 8 View citing articles
Full Terms & Conditions of access and use can be found at

http://www.tandfonline.com/action/journalInformation?journalCode=ipri20
ORIGINAL PAPER
Monitoring improvement using a patient-rated

depression scale during treatment with
anti-depressants in general practice
A ×alidation study on the Goldberg Depression Scale
Jørgen Holm1, Liselotte Holm 2 and Per Bech3
1
General Practice and 2Psychological Practice, Svendborg and 3Psychiatric Research Unit, Department of
Psychiatry, Frederiksborg General Hospital, Hillerød, Denmark.
Scand J Prim Health Care 2001;19:263– 266. ISSN 0281-3432 berg Depression Scale was demonstrated. The Loevinger co-
ef cient varied from 0.25 at the time of diagnosis to 0.57, 0.65 and
ObjectiØe – To perform a pilot study on the value of the Goldberg 0.69 by visits two, three and four. Factor analysis identi ed only one
Depression Scale as an instrument for monitoring improvement in general factor explaining 50% or more of the variants, except at visit
depressed patients treated with anti-depressants in general practice. 1. When the Goldberg Depression Scale was correlated to the
Design – A comparative study using simultaneous ratings on the Hamilton Depression Scales, a coef cient of 0.74 was obtained
observer-based 17-item Hamilton Depression Scale and the patient- (p B0.001).
rated Goldberg Depression Scale. Conclusion – This pilot study indicates that the Goldberg Depression
Setting – General practice. Scale is suitable for monitoring improvement in depressed patients
Patients – Twenty-one patients meeting the ICD-10 criteria of a treated in general practice. Further studies are recommended.
moderate depressive episode were assessed at the time of inclusion
and through three follow-up visits. Key words: depression, rating scales, primary health care.
Main outcome measures – Scores on the Goldberg Depression Scale
compared to the Hamilton Depression Scale. Jørgen Holm, General Practice, Klosterplads 9, DK-5700 SØ end-
Results – An acceptable internal and external validity of the Gold- borg, Denmark. E-mail: jorgenholm@dadlnet.dk
General practitioners (GPs) are currently under criti- tient-friendly as possible in ensuring a high degree of
cism for inef ciency in the detection and treatment of applicability in the general practice setting.
the majority of depressed patients (1,2). In this re- The Goldberg Depression Scale (GDS) (8) was the
gard, it may indeed seem strange that, in everyday questionnaire selected as being patient-friendly by the
practice, so little effort is made to monitor the state general practitioner involved in this study. The objec-
of patients during treatment. Few doctors would like tive of this pilot study has been to investigate the
to do without tools when dealing with asthma. value of the Goldberg Depression Scale as an instru-
There are a number of depression rating scales ment for monitoring the improvement of depressed
available today, the Hamilton Depression Scale patients treated in general practice.
(HAM-D) (3) probably being the best known and the
gold standard in research studies. HAM-D is an
observer-rated scale and requires some instruction in METHODS
its use. It has been shown, however, that even experi- Nine general practitioners were recruited (through
enced psychiatrists demonstrate considerable inter- advertising) as clinical investigators and received
observer variation performing a HAM-D score on training in the use of HAM-D.
one and the same patient (4). There are also validated
patient-rated scales available, such as the Zung De- Scales
pression Scale (5) and Beck’s Depression Inventory The Goldberg Depression Scale. The GDS is an
(6), but these have been used as research tools rather 18-item self-rating scale (Fig. 1), with each item rated
than for clinical purposes, and neither has been on a 0 – 5 point Likert scale. The total score can
adopted by general practice to any signi cant extent. therefore range from 0 (complete absence of depres-
In the 1990s, the development of new question- sive symptoms) to 90 (most severe depression). The
naires was concerned, on the one hand, with content GDS was translated into Danish by the authors and
validity covering the DSM-IV dimension of major back-translated into American English. This back-
depression (7) and, on the other, with being as pa- translated version was accepted by Goldberg himself.
Scand J Prim Health Care 2001; 19

264 J. Holm et al.
The Hamilton Depression Scale. The 17-item ver- For questionnaires like the GDS, the test –retest
sion of the HAM-D followed the original version (3), reliability is often discussed but rarely used. Thus, the
but has been further improved. This Danish version test – retest analysis has to rely on a very stable condi-
was endorsed by Hamilton himself (9) and has been tion of the patient between test and retest. In this
used in several Danish and Scandinavian trials pilot study of the acute therapy of depressive patients
(10,11). The score range of this scale is from 0 (no the situation was not suited for a test –retest
depressive symptoms) to 52 (most severe depression); examination.
the cut-off score for major depression is 18, while full
remission is a score of 7 or lower (9). Validity. The validity of a scale covers both inter-
nal and external validity. Internal validity covers the
Patients extent to which the total score is a suf cient statistic.
Patients ful lling the ICD-10 criteria of an at least Factor analysis has been used to measure the internal
moderate depressive episode demanding medical consistency of the GDS. Thus, an overall, general
treatment were included. They were assessed by GDS factor is indicative of internal validity. The factor
and HAM-D at inclusion and over the course of analysis is extremely sensitive to the dispersion of
three follow-up visits. The doctors were blinded to scale scores in the population investigated (13,14).
the GDS scores. Exclusion criteria were conditions The Loevinger coef cient of homogeneity is a test of
precluding the patients from handling a self-rating uni-dimensionality, i.e. of the summed total score
questionnaire, e.g. dementia or visual impairment. A being a suf cient or adequate statistic. Coef cients of
total of around 20 patients was found suf cient for 0.30 or higher indicate homogeneity or uni-dimen-
testing the validity of the GDS in this pilot study. sionality. The GDS has been compared to the HAM-
D for Loevinger coef cients.
Statistical analysis An essential aspect of external validity is respon-
Reliability. For interview-based scales such as the siveness. This is the ability of a scale to measure
HAM-D, inter-rater reliability is important, and so to improvement in patients during a period of treat-
ensure doctor capability in using the HAM-D, a ment. In the acute therapy of major depression a
WHO videotaped interview was used to measure response is de ned as a 50% (or more) reduction of
inter-rater reliability (12). The WHO interview has an the baseline scores at endpoint. The GDS has been
of cially accepted score of 21.09 2.3. compared to the HAM-D in this way in relation to its
Fig. 1. The Goldberg Depression Scale – original version.

Monitoring impro×ement with a patient-rated depression scale 265
Table I. Factor analysis and Loevinger coef cient of homogeneity at the four visits.
Visit 1 Visit 2 Visit 3 Visit 4
Loevinger co-ef cient of homogeneity 0.25 0.57 0.65 0.69

No. of factors with an eigenvalue greater than 1 6 4 4 3
First factor General General General General
Explained variance 28.8% 55.0% 61.7% 66.6%
Second factor Bipolar Bipolar Bipolar Bipolar
Explained variance 15.0% 10.6% 8.3% 10.6%
Scree plot number of factors 3 1 1 1
responsiveness and a correlation coef cient was cal- patients (62%) had a reduction of 50% or more on
culated by use of the Spearman rank-order method. the baseline scores at endpoint. On the GDS, 12 of
the 21 (57%) patients had a reduction of 50% or
more. The difference in responsiveness between 62%
RESULTS and 57% was statistically insigni cant. The correla-
Twenty-four patients were included; 3 dropped out tion of the total GDS and HAM-D scores was 0.74
before completing all four visits, thus leaving 21 (p B 0.001).
patients (5 men) ranging in age from 28 to 69 years
(mean 43 years).
The investigator’s inter-rater reliability in using DISCUSSION
HAM-D was adequate with a mean score of 22.2 and Statement of principal ndings
a standard deviation of 2.7 (absolute values 17– 25). In this study, we identi ed one factor with fairly high
Loevinger coef cients of homogeneity through visits
Internal ×alidity two to four (the dispersion at visit one being too
The internal validity of the GDS was acceptable. small for testing homogeneity). This, combined with
Table I gives the Loevinger coef cient of homogene- the nding of high correlation coef cients versus
ity as well as the results of the factor analysis at the HAM-D showing an acceptable external validity, in-
four visits. At visit one (baseline), the Loevinger dicates that the GDS is a uni-dimensional scale for
coef cient was below 0.40, and six factors were iden- measuring the degree of improvement in depressive
ti ed. However, the rst factor was a general one, i.e. states treated in general practice.
all items had positive loadings. The second factor was
bipolar and the scree plot identi ed three factors.
In contrast, the next three visits had accepted
Loevinger coef cients and the scree plot identi ed
only one general factor explaining 50% of the vari-
ance. This re ects the difference between visit one Table II. Rank order of items. At the top, the most inclusive
milder items. At the bottom the most severe items. In paren-
and the other visits in terms of dispersion of the theses the original number as shown in Fig. 1.
Goldberg Scale. Before treatment (visit one) the pa-
tients had a similar level of depression. During the Fatigue (9)
treatment period (visits two to four) the dispersion Great effort to do things (10)
was greater as the patients responded more heteroge- Dif culty making decisions (5)
Sad and unhappy (7)
neously. The result indicates that the 18 items on the Disturbed sleep (14)
scale have a valid rank order and structure. Table II Do things slowly (1)
gives the rank order. The items of fatigue, sadness, Lost interest in things (6)
disturbed sleep and lack of interests are the most Pleasure gone out of life (4)
inclusive items, i.e. are present even in mild degrees of Dif culty concentrating (3)
Depressed even when good things happen (17)
depression. These items are listed in the ICD-10 as Feeling trapped (16)
the most inclusive items. Among the most exclusive Future seems hopeless (2)
items are hopelessness, feelings of guilt and thoughts Feeling agitated (8)
of suicide. Feeling lifeless (13)
Weight changes (18)
Feeling like a failure (12)
External ×alidity Feeling guilty (11)
The corresponding mean scores on the two scales are Suicidal thoughts (15)
given in Table III. On the HAM-D, 13 of the 21

266 J. Holm et al.
Table III. Mean scores at the various visits. ACKNOWLEDGEMENTS

We thank the investigating general practitioners: Jan
Visit no. Interval (days) HAM-D GDS Harder, Peter Lindegaard, Bente Lützen, Anne-Marie
1 21 56
Munck, Kim Rønhof, Erik Schaumburg (deceased),
11 (7–22) Ida Zacho, Jørn Fisch-Thomsen (the ninth investigator
2 14 45 was author JH). Also, we thank Ove Aaskoven,
11 (6–25) Tolstrup Data, for carrying out the statistical analysis.
3 12 35 The study received nancial support from the Council
13 (3–20)
4 8 26
of Quality Development in the County of Funen.
Strengths and weaknesses of the study

Tools such as rating scales should be tested in the REFERENCES
context of their intended use, as in fact in this study. 1. Badger LW, deGuy FV, Hartman J, Plant MA, Leeper J,
et al. Patient presentation, interview content and the detec-
However, the recruitment of investigators through
tion of depression by primary care physicians. Psychosom
advertising implies a bias of presumed special interest Med 1994;56:128 – 35.
in psychiatry. This would be a major problem had the 2. Rosholm J-U, Gram LF, Damsbo N, Hallas J. Antidepres-
HAM-D been the scale under validation, but in this sive treatment in general practice— an interview study.
Scand J Prim Health Care 1995;13:281 – 6.
situation it merely provides fair condition to the tested 3. Hamilton M. Development of a rating scale for primary
self-rating scale. Although our patient material is large depressive illness. Br J Soc Clin Psychol 1967;6:278 –96.
enough to produce fairly signi cant test results, we still 4. Demitrack MA, Faries D, Herrera JM, DeBrotta D, Poter
feel that only limited conclusions should be drawn. WZ. The problem of measurement error in multisite clini-
cal trials. Psychopharmacol Bull 1998;34:19 – 24.
5. Zung WWK. A self-rating depression scale. Arch Gen
Other studies on self -rating scales Psychiatry 1965;12:63 – 70.
6. Beck AT, Steer RA, Brown GK. Manual for the Beck
To our knowledge, no other studies have been pub- Depression Inventory, 2nd ed. San Antonio, TX: Psycho-
lished on the GDS. The demonstrated statistical values logical Corporation, 1996.
against HAM-D in analysing internal and external 7. Bech P, Rasmussen N-A, Raabaek Olsen L, Noerholm V,
validity, and the rank order of items found in this Abildgaard W. Major Depression Inventory, using the
Present State Examination as the index of diagnostic valid-
study, are similar to those produced in the validation ity. J Affect Disord 2001. In press.
studies of other accepted self-rating scales (15). 8. Goldberg IK. Questions and answers about depression and
its treatment: a consultation with a leading psychiatrist.
Charles Press Publishers, 1993.
Meaning of the study 9. Bech P, Kastrup M, Rafaelsen OJ. Mini-compendium of
This pilot study indicates that the GDS may be a valid rating scales for states of anxiety, depression, mania,
self-rating scale for monitoring improvement in de- schizophrenia with corresponding DSM-III syndromes.
Acta Psychiatr Scand 1986;326(73 Suppl):7– 37.
pressed patients treated in general practice. Modern 10. Danish University Antidepressant Group (DUAG). Citalo-
self-rating scales can serve as documentation for the pram: Clinical effect pro le in comparison with
course of treatment, thus meeting the increasing legal clomipramine. A controlled multicenter study. Psychophar-
demands on the GP dealing with psychiatric patients. macology 1986;90:131 – 8.
11. Fuglum E, Rosenberg C, Damsbo N, Stage K, Lauritzen
Eliminating the inter-observer variation, such scales do L, Bech P and Danish University Antidepressant Group.
have advantages over observer-rated scales when refer- Screening and treating depressed patients. A comparison of
ring the patient or during shared care treatment. two controlled citalopram trials across treatment settings:
hospitalized patients vs. patients treated by their family
doctors. Acta Psychiatr Scand 1996;96:18 – 25.
Unanswered questions and further research 12. Bech P, Gastpar M, Mendlewicz J. The role of training
This study was not designed to investigate the possible courses in multicenter trials: WHO experiences. In: Chagas
C, Josiassen RG, Brider WH, Weiss KJ, Stoff D, Simpson
use of the GDS as a diagnostic tool or its potential as GM, editors. Developments in psychiatry, vol. 7. Elsevier:
a screening instrument in detecting depression. The Amsterdam, 1986:1513 – 5.
ICD-10 criteria for depression have been formulated 13. Mokken RJ. A theory and procedure of scale analysis.
very clearly and should not be questioned as the basis Paris: Mouton, 1971.
14. Molenaar IW, Debets P, Systsma K, et al. User’s Manual
of the diagnosis. We do, however, suggest further MSP: A program for Mokken Scale analysis for polyto
studies of the GDS in a larger population: 1) to con rm mous items (version 3.0). Groningen: lec ProGramma,
the ndings in this study, 2) to investigate patients’ and 1994.
doctors’ choices in comparing the GDS with HAM-D 15. Bech P. Rating Scales for Psychopathology, Health Status
and Quality of life. A compendium on documentation in
for monitoring purposes, and 3) to evaluate the poten- accordance with the DSM-III-R and WHO systems.
tial of the GDS as a screening tool. Heidelberg: Springer, 1993.

Monitoring Improvement Using A Patient Rated Depression Scale During Treatment With Anti Depressants in General Practice A Validation Study On The

Diunggah oleh

Informasi Dokumen

Judul Asli

Hak Cipta

Format Tersedia

Bagikan dokumen Ini

Bagikan atau Tanam Dokumen

Opsi Berbagi

Apakah menurut Anda dokumen ini bermanfaat?

Apakah konten ini tidak pantas?

Hak Cipta:

Format Tersedia

Monitoring Improvement Using A Patient Rated Depression Scale During Treatment With Anti Depressants in General Practice A Validation Study On The

Diunggah oleh

Hak Cipta:

Format Tersedia

Scandinavian Journal of Primary Health Care

ISSN: 0281-3432 (Print) 1502-7724 (Online) Journal homepage: http://www.tandfonline.com/loi/ipri20