James E. Johnson1
with other (or no) abnormalities. Four of the readers who best detected pneumothoraces
William R. Ferrell2 on screen-film examinations performed worse when interpreting computed radiographic
studies; the other four readers detected pneumothoraces similarly with both techniques.
No relationship was found between the size of a pneumothorax and its likelihood of
detection by either technique.
These results raise concerns about implementing computed radiography for compre-
hensive chest imaging.
TABLE 1: Abnormalities Present on Screen-Film and Computed for the 36 x 36 cm imaging plate is 2.5 lp/mm (approximately half
Chest Radiographs the conventional screen-film system), corresponding to a pixel size
of 0.2 mm (Fig. 2).
Number of Cases
Fig. 1.-Conventional screen-film (A), computed (B), and frequency-modified computed (C) radiographs show a right apical pneumothorax (arrows).
a) 06
U)
C
Results
mothorax less than 1 0%, measurements were recalculated if they computed radiography, % Diff = percent difference between SF and CR =
did not agree within 2%. When a sufficient level of agreement was [(CR - SF)/SF].
478 FAJARDO ET AL. AJR:152, March 1989
- x Screen film
#{149}
CR
a)
E3
z4
0.800 0.820 0.840 0.860 0.880 0.900 0.920 0.940 0.960 0.980 1.000
Area Under ROC Curve
Fig. 3.-Graph shows areas under individual readers receiver operating Pneumothorax (%)
characteristic (ROC) curves for detecting pneumothoraces on screen-film
and computed radiography (CR) images. Four readers who best detected Fig. 4.-Graph shows size distribution of pneumothoraces among 25
pneumothoraces on screen-film images did most poorly with computed study cases.
images.
than 5% disparity in specificity between screen-film and corn- TABLE 3: Size of Pneumothorax for Cases Missed by 50% or
puted radiography images. More of the Observers on Screen-Film or Computed Chest
Areas under the ROC curves for individual readers detec- Radiographs
tion of pneumothoraces ranged from 0.890 to 0.968 for Observers Who Missed the
American Journal of Roentgenology 1989.152:475-480.
screen-film images and 0.824 to 0.930 for computed images. Pneumothorax (%)
Size of
The three readers with the greatest areas under their screen- Pneumothorax on Screen-
film ROC curves and one other radiologist detected pneu- (%) Film on Computed
mothoraces much less well on computed radiography images Radiographs
Radiographs
(readers 2, 6, 7, and 8) (Fig. 3). As shown in Table 2, the 25.2 0 62.5
areas under the computed radiography ROC curves for these 16.7 12.5 87.5
radiologists were, on average, 9.3% less than the areas under 1.4 25.0 62.5
their screen-film curves. The other four radiologists performed 3.4 25.0 50.0
7.3 25.0 50.0
similarly with both techniques, having an average difference
8.0 50.0 0
in the areas under their screen-film and computed radiography 2.7 50.0 50.0
ROC curves of only 1 .2%. A one-tailed paired t test of the 5.5 50.0 12.5
areas under individual readers screen-film and computed 1.5 62.5 75.0
7.3 100.0 87.5
radiography ROC curves shows a significant difference be-
tween the two techniques (t = 2.489, df = 7, p < .003).
Our ad hoc trinomial distribution analysis indicates that, by
all four models tested (categorizing readers with differences frequency of detection or diagnostic certainty for either tech-
in performance with each technology of 3% or less as being
nology.
equivalent), the probability is less than .03 that the readers
could have performed as poorly as they did with computed
radiography, if computed radiography were truly equivalent
Discussion
to screen-film imaging for detection of pneumothoraces This study extends previous psychophysical investigations
(model 1 : p = .003; model 2: p = .01 9; model 3: p = .025; of the suitability of computed imaging for chest radiography.
model 4: p = .015). If we apply a more stringent criterion for The results of the ROC analyses indicate that radiologists are
categorizing a reader as having performed equally with both better able to discriminate pneumothoraces on conventional
technologies-with a difference in performance of either 1% screen-film radiographs than on computed radiographic im-
or 2%-the result is even more significant. if we make
Only ages. The number of false-positive diagnoses of pneumotho-
the criterion a difference in performance between the two rax made with each technique was small: only 1 1 of the 400
technologies as large as 6% (five of eight readers with no possible responses for screen film and 1 0 for computed
appreciable difference) is the probability just greater than .05 radiography. This is important because of the low clinical
of obtaining a result as unfavorable to computed radiography prevalence of pneumothorax and because of the morbidity
by chance (and then only for trinomial model 2). associated with improper treatment. However, the radiolo-
The distribution of pneumothorax size among the 25 cases gists had significantly more false-negative diagnoses on the
is shown in Figure 4. Sixty-eight percent of the pneumotho- computed images. Because of the morbidity associated with
races involved 10% or less of the total area of the involved delayed treatment for missed pneumothoraces, these data
lung. In 10 cases (five for screen film and eight for computed suggest that computed radiography may be inadequate for
radiography), 50% or more of the observers failed to detect depicting pneumothoraces in clinical practice.
the pneumothorax (Table 3). There was no statistically signif- Previous reports have focused on the poorer spatial reso-
icant correlation between the size of pneumothorax and either lution of computed vs screen-film radiography to explain
AJR:152, March 1989 DIGITAL VS CONVENTIONAL IMAGING OF PNEUMOTHORAX 479
differences in radiologists performances in detecting high- radiography for clinical chest diagnosis. Further psychophys-
frequency, low-contrast abnormalities such as pneumothora- ical researchis needed to determine what improvements are
ces and interstitial thickening [3-5]. These studies, which needed in physical factors of the computed radiography sys-
dealt with small numbers of these types of abnormalities, tem and/or radiologists interactions with computed images.
have provided conflicting results, variously indicating spatial
resolution requirements of 0.4-mm [3], 0.2-mm [5], and 0.1-
mm [4] pixel sizes. The biphasic nature of our results sug- Appendix
gests that influences other than spatial resolution may be As described in the analysis section, we tested the hypothesis that
important in explaining why radiologists detect pneumotho- the two methods did not differ in detectability of pneumothorax using
races less well on computed than on screen-film images. The the one-tailed paired t test. This test indicated that computed radiog-
radiologists who were among the most proficient in interpret- raphy was less effective (p < .003). However, this test assumes an
ing the screen-film images performed much more poorly with approximately normal distribution of differences in detectability and
is weighted in proportion to the size of differences in detectability
computed radiography images; the other four radiologists
among readers. Thus, in principle, the null hypothesis-that radiolo-
performances with and computed
screen-film radiography
gists detect pneumothoraces equally well with computed and screen-
differed little. Granted, it is possible to hypothesize that the
film radiography-could be inappropriately rejected using the t test
former group in some manner made better use than the latter because of several large differences in performance.
group of the greater spatial resolution that screen-film radiol- Indeed, our interpretation of the data suggests that a categorical
ogy offers. However, an alternative explanation suggested by approach, which does not weight differences among readers perform-
the dichotomy of performances is that subtle diagnostic clues ances according to magnitude, better reflects the results of the
are portrayed better on the screen film than on the computed experiment, where some readers did as well with either method and
images, clues that are better exploited
by the more proficient some interpreted computed radiographs less accurately. Using this
American Journal of Roentgenology 1989.152:475-480.
radiologists. Apparent differences between the two technol- categorical approach, we determined whether our result could have
ogies that might contribute to explaining our result include occurred by chance by calculating the total probability of the possible
outcomes that are as favorable to computed radiography as what we
computed radiographys lesser spatial resolution, greater im-
observed and the outcomes less favorable to computed radiography
age noise, and smaller image size, and radiologists relative
than what we observed. Our null hypothesis was that computed and
unfamiliarity with computed radiography. Indeed, several of screen-film technologies are equivalent for detecting pneumothora-
these factors may have been important in influencing our ces.
results. The effects of each of these, and perhaps other With the outcome we observed, this is easy to do. Even assuming
influences, bear further investigation. a difference of 1% or less as the criterion for doing about the same
As with any imaging study that uses selected cases and a with both methods, no reader did better with computed radiography.
small group of radiologist readers, questions will arise about Hence, we need to sum the probability of all the outcomes in which
the generalizability of our results. Our cases were not ran- no readerdetects pneumothoraces better with computed radiography
domly selected, nor can we be sure that the 25 pneumotho- and the probability of all the outcomes in which no fewer readers
than we observed do better with screen-film images. Outcomes for
races that composed the case material are representative of
which any readers do better with computed radiography, even if
all pneumothoraces encountered in clinical practice. Nonethe-
balancecf by more readers detecting pneumothoraces better with
less, these considerations are not germane to the purpose of screen-film images, are ones that are more favorable to computed
the study. Insofar as the selected cases portray pneumotho- radiography, because they would demonstrate the possibility that
races that, if discovered, would affect clinical care, the sample computed radiography could be used more effectively than screen-
is useful in determining whether computed radiography per- film images.
mits detection comparable with screen-film radiography of an Our null hypothesis (i.e., that the two technologies are equivalent)
important selection of pneumothoraces. The radiologist read- was that a reader taken at random had probability Pw of doing worse
ers were chosen primarily on the basis of their willingness to with computed radiography, probability Ps of doing about as well with
participate in the study. Although they were not randomly both technokgses, and probability p of doing better with computed
radiography, where Pb Pw. Such a distribution is represented by the
selected, because their number is a sizable fraction (47%) of
multinomial with three possible outcomes, the trinomial. In this ex-
our 1 7-member group, the readers are representative of our
perlment, none of eight readers performed better with computed
department. To the extent that radiologists in our department radiography and M of eight performed equally well with both methods
are like those in other groups, similar performances can be (the value of M depending on the level of difference in performance
expected in interpreting computed and screen-film chest im- accepted as equivalent).
The probability of getting results at least as
ages. unfavorable for computed radiography as those we obtained is the
In summary, our study indicates that some radiologists will sum of the probability of all the outcomes in whith no reader does
detect pneumothoraces less well on digital chest images than better with computed radiology and the probability of all the outcomes
on screen-film examinations. We chose to investigate the in which no fewer than M do better with screen film. It is given by the
following equation:
detectability of pneumothoraces because we think pneumo-
thorax is representative of other high-frequency, low-contrast
abnormalities, the adequate depiction of which is important
mO [m!(88-- m)!] (Pw)8m(Ps)m.
to chest diagnosis. Because an imaging system capable of
supplanting conventional chest radiography should fulfill the Thus, our trinomial test is ad hoc, developed specifically for the
need to visualize adequately all clinically important structures, results of this study. It does not represent a general test for the
there should be concern over fully implementing computed performance of any two radiographic systems.
480 FAJARDO ET AL. AJR:152, March 1989
REFERENCES 7. Fraser RG, Breatnach E, Barnes GT. Digital radiography of the chest:
dinical experience with a prototype unit. RadioGraphics 1985;5:415-428
1. Templeton AW, Dwyer SJ Ill. Johnson JA, et al. An on-line digital image 8. Merritt CRB, Tutton RH, Bell KA, et al. Clinical application of digital
management system. Radiology 1984;152:321-325 radiography: computed radiographic imaging. RadioGraphics 1985;5:
2. Fajardo LF, Hillman B,J, Hunter TB, Claypool HR, Westerman BR, Mockbee 397-414
B. Excretory urography using computed radiography. Radiology 9. Seeley GW, Roehng H, Nudelman S. Frost MM, Capp MP. Psychophysical
1987;162:345-351 evaluation of video systems compared to film. AppI Photogr Eng
3. Lams PM, Cocklin ML. Spatial resolution requirements for digital chest 1978;4: 160-1 65
radiographs: an ROC study of observer performance in selected cases. 10. Sonoda M, Takano M, Miyahara J, Kato H. Computed radiography utilizing
Radiology 1986;158: 11-19 scanning laser stimulated luminescence. Radiology 1983;148:833-838
4. MacMahon H, Vybomy CJ, Metz CE, Dci K, Sabeti V. Solomon SL. Digital 11. Metz CE. Basic principles of ROC analysis. Semin NucI Med 1978;8:
radiography of subtle pulmonary abnormalities: an ROC study of the effect 283-298
of pixel size on observer performance. Radiology 1986;158:21-26 12. Swets JA. ROC analysis applied to the evaluation of medical imaging
5. Goodman LR, Foley DW, Wilson CR, Rimm AA, Lawson TL. Digital and techniques. Invest Radiol 1979;14:109-121
conventional chest images: observer performance with film digital radiog- 13. Dorfman DD, Alf E. Maximum likelihood estimation of parameters of signal
raphy system. Radiology 1986;158:27-33 detection and determination of confidence intervals: rating method data. J
6. Chakraborty DP, Breatnach ES, Yester MV, Soto B, Barnes GT, Fraser Math Psychol 1969;6:487-497
RG. Digital and conventional chest imaging: a modified ROC study of 14. Hanley JA, McNeil BJ. The meaning and use of area under a receiver
observer performance using simulated nodules. Radiology 1986;158: operating characteristic (ROC) curve. Radiology 1982;143:29-36
35-39
American Journal of Roentgenology 1989.152:475-480.