| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Full paper |
1Department of Radiation Physics, Faculty of Health Sciences, Linköping University, SE 581 85 Linköping, Sweden, Departments of 2Radiation Physics and 6Diagnostic Radiology at Malmö, Lund University, Malmö University Hospital, SE 205 02 Malmö, Sweden, 3Joint Department of Physics, The Royal Marsden NHS Trust, Fulham Road, London SW3 6JJ, UK, and Departments of 4Radiation Physics and 5Radiology, Göteborg University, Sahlgrenska University Hospital, SE 413 45 Göteborg, Sweden
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The image quality required will vary with the radiological task. For a selected number of routine radiographic projections, the European Commission has proposed sets of image criteria [5] that may be used for clinical image quality assessment. The image criteria are expressed as the visibility of characteristic features of imaged anatomical structures and are based on the normal anatomy. They apply to adult patients ofstandard size for the type of examination beingconsidered. An underlying assumption and philosophyof these criteria is that if the normal anatomy is faithfully reproduced in the image, then the pathological lesions will also be visualized.
When the optimization of radiographic imaging systems is based on a study of physical parameters, it is important that the correlation between these physical parameters and clinical measures of image quality be established. However, in previous work on chest radiography using anthropomorphic test phantoms [6], the ability to predict clinical image quality based on physical parameters has been questioned. The authors studied 24 chest imaging systems and found no correlation between image quality assessed in a visual grading analysis study and system parameters such as the relative amount of scattered radiation in the image plane, beam quality (tube potential), sensitivity of the image receptor (speed class) and focal spot size. They did not evaluate the optical density nor the dynamic range of the image. They considered only single parameters at a time and not the combined effect of the parameters on the overall contrast and signal-to-noise ratio (SNR) of important details. However, a positive correlation was found between the number of low contrast details detected in the image of a contrast detail phantom and the best ranked systems. Thisis to be expected, since the detectability of small,low contrast details depends on how contrast, sharpness and noise combine to yield the SNR.
The inability to correlate individual system parameters with measures of image quality may be related to the multivariate nature of the problem and the difficulty of obtaining a controlled experimental situation when measurements are made with systems in several centres. To demonstrate a correlation, it is essential to look at the effects of system parameters in combination, to use appropriate physical measures of image quality and to obtain patient images in a controlled way, preferably at the same centre. The objective of the present work, therefore, was to search for correlations between physical image quality measures and the corresponding assessments of image quality of patient radiographs by expert radiologists.
The work has been performed as part of a European study of image quality in chest and lumbar spine imaging. It brings together separate work on the assessment of clinical image quality and the development of computer simulation models. 16 imaging alternatives for a posteroanterior (PA) chest examination and 4 imaging alternatives for an anteroposterior (AP) lumbar spine examination were considered. Assessment of patient images was based on the European image criteria [5] and the results for chest and lumbar spine imaging systems are reported elsewhere [7, 8]. The Monte Carlo computer simulation model [9] incorporated a voxel phantom to simulate the patient, with superimposed anatomical details for the calculation of contrast and SNR. The patient absorbed doses are needed for optimization and can be found in previous work [710].
| Materials and methods |
|---|
|
|
|---|
|
Measures of clinical image quality
The image criteria used to assess the quality of the patient images are listed in Table 2
. Clinical trials were performed and images obtained with the different imaging techniques were assessed by seven European radiologists. In the analysis of lumbar spine radiographs, the seven original image criteria [5] were used. For chest radiography, the original image criteria were modified prior to the clinical trial [7]. Criteria devoted primarily to positioning of the patient were omitted as fulfilment of these criteria is likely to depend on the skill and training of the radiographer and not on the imaging system itself. In the revised criteria, the parenchyma, mediastinum and costopleural junction were separated and details to be visualized for each region were given (C5CHC7CH). The criteria C1CHC4CH are the same as in the original criteria.
|
|
|
where Fi,c,o is the fulfilment of criterion (c) for image (i) and observer (o); I is the number of images assessed for each imaging system (I=15 for chest, I=10 for lumbar spine), C is the number of criteria (C=7) and O is the number of observers (O=7). If a criterion is fulfilled, Fi,c,o is 1, and if it is not Fi,c,o is 0. Since there were 16 chest and 4 lumbar spine imaging configurations, there were 240 chest and 40 lumbar spine images in total.
The second method of scoring was visual grading analysis. For this relative rating, each image was compared to a reference image. A patient image taken with 80 kV and 400 Lanex Regular Plus screenfilm system was used as the reference image in the lumbar spine AP examination [8]. In the chest PA examination, the selection of reference image was more complicated. Prior to the collection of images, a statistical analysis of the necessary number of volunteers was performed [7]. The study was designed as an "incomplete but balanced block trial". 120 volunteers were required to test four technical factors (each under two conditions) and each volunteer was examined with 2 of the 16 techniques mentioned above. The observers viewed the radiographs in pairs. In the evaluation, the volunteers were then used as their own reference. If the structure in the image is reproduced much worse than in the reference image, it is given the score -2. If the structure is reproduced worse, equally, better or much better than in the reference image, it is given the score -1, 0, +1 or +2, respectively.
For a given system, a visual grading analysis score (VGAS) was calculated as:
|
|
where Gi,s,o is the relative grading for a particular image (i), structure (s) and observer (o); S is the number of structures compared; and I and O are as described above.
In the clinical trial for chest [7], it was found that the modified criteria gave better discrimination between different techniques than the original criteria. The visual grading analysis was performed only on the modified criteria C5CHC7CH, whereas the original criteria C1CHC4CH were assessed using both ICS and VGAS. It was therefore interesting to investigate whether the modified criteria show a more significant correlation with physical image quality than the original criteria. This was only possible for VGAS, since ICS values were not available for the modified criteria [7].
Measures of physical image quality
A Monte Carlo computer model of the complete imaging system was used to assess physical image quality. The model is an extension of previous work [11, 12]. It models the patient using an anthropomorphic 3-dimensional, segmented male anatomy (voxel phantom) originally developed elsewhere [13]. Appropriate anatomical details (Table 3
) have been added to this phantom so that realistic estimates of the contrast and SNR of important details in the normal anatomy can be made. Estimates of the energy imparted per unit area to the image receptor at any point in the image plane were used to compute the optical density on the film by using the film's characteristic H&D curve. In this way it was possible to estimate the variations of the energy imparted to the screenfilm system by scattered and primary photons and hence to assess the effects of the limited dynamic range of the screenfilm system. The model takes specific account of the X-ray spectrum (anode material and angle, peak tube potential and ripple, and added filtration), antiscatter grid (strip frequency, lead strip width, grid ratio and material in interspaces and covers) or air gap, couch top or chest stand, and image receptor (cassette front, screenfilm system and H&D curve). The computer program has been validated [9, 14] against measurements on phantoms and patients.
|
To calculate the contrast or difference in optical density,
OD, beside and behind a particular detail, the effects of film gradient (
) and imaging system unsharpness were considered. The film gradient was obtained from measurements of the H&D curve using the ISO standard [15]. The effect of unsharpness on
OD was calculated byconsidering the modulation transfer function (MTF) of receptor (screen), geometric (focal spotsize and magnification) and motion unsharpness [16].
SNR was calculated in two steps. First, the SNRQ due to quantum noise (index Q) only was calculated using the fluence of photons at the screen and the single event size distribution of energy imparted to the screen [17]. The SNRQ is based on the energy imparted per unit area to image elements beside and behind the detail and was calculated using the methodology in reference [11] and reference [18]. The SNRQ overestimates the actual SNR. Multiplicative correction factors were applied to SNRQ2 to include the effects of additional noise from light emission from the screen and from film granularity. Methods from the literature [19] were used to derive these correction factors [16].
In addition to
OD and SNR, a measure of the dynamic range of the image data was computed. Dynamic range is important for the following reason: even though the object contrast may be large, the contrast on the film may be low owing to the low film contrast (gradient) in some parts of the image, and thus
OD will be reduced. Our measure of dynamic range was therefore defined as the percentage of pixels in the computed image having an OD such that the gradient
(OD) exceeds a pre-set value, in our case 0.75 or 1.25 (chest) and 2.25 (lumbar spine). This physical image quality measure is thus an indication of how much of the image is properly exposed, i.e. with a "reasonable" film contrast, and is subsequently referred to as the PEF (properly exposed fraction). The pre-set values of the gradients were selected so that the PEF was sensitive to changes in imaging conditions (Table 1
).
Statistical analysis
The software package Statistica® was used to compute the correlation coefficient r (the Pearson product-moment) between calculated values of
OD or SNR and ICS or VGAS. Calculated p-values were used to express the significance of the correlation (t-test). The correlation coefficient measures the magnitude, if any, of a linear causal relation. The null hypothesis is that there is no linear association between clinical and physical image quality. Correlations significant at p<0.10, p<0.05 and p<0.01 were identified. Correlations were sought between the physical image quality factors and the scores of the individual criteria as well as with the average scores when several or all criteria were used.
| Results and discussion |
|---|
|
|
|---|
OD for the detail in the retrocardiac area are all systems that use the lower maximum optical density in the lung region (ODmax=1.3). This shows the importance of not underexposing the chest film. Figures 1b and 1c
OD of the costophrenic angle area, and between C5aCH (ICS) and
OD of the central right lung, respectively. These correlations are less significant.
|
|
>1.25), and the contrast of calcifications in the right lung apex. This can be explained by considering the H&D curve. Both of these details are situated in areas where the OD is less than 1.0, hence on the toe of the H&D curve. The
OD is therefore much increased if the ODmax in the chest image is increased from 1.3 to 1.8. The
OD of details situated in areas where the OD is generally higher (OD>1.0), such as the central right lung and the left lung apex, does not show as significant a correlation with ICS and VGAS as details in regions with low optical density. A possible explanation for this is that the
OD of these details is already high enough and the criteria are therefore already fulfilled. For the same details, the
OD shows a more significant correlation with ICS and VGAS than the SNR. For example, there is no significant (p<0.05) correlation between ICS and SNR. However, there is a significant correlation between VGAS for criteria C5aCH and C5bCH and SNR, but this is not as significant as with
OD (p<0.01) for the same detail. This is an indication that clinical image quality is limited more by contrast than by noise in chest screenfilm radiography. The most significant correlation between clinical and physical image quality is found with criteria C5aCH and C5bCH (VGAS), whereas a poor correlation is found with C1CH (ICS). An explanation could be that the wording of C1CH is not as specific as the wording of C5aCH and C5bCH, and that it may be difficult to find a single physical measure that correlates to such a general criterion as C1CH.
No significant correlation was found between the physical parameters such as applied tube potential, screenfilm speed and scatterrejection technique on the one hand and ICS and VGAS on the other. However, a significant correlation was found between the maximum OD in the chest PA image (ODmax) and both ICS and VGAS. This indicates that the ODmax is the most important parameter of the four tested; the other three are of lesser importance. If the image is properly exposed (hence not underexposed, as with ODmax=1.3), the choice of screen speed, scatterrejection technique and tube potential is not critical, or at least will not affect the image quality enough to generate significantly different ICS and VGAS in the clinical trial [7]. Similar conclusions have also been found on the basis of the computational model alone [14].
Contrary to earlier work [6], this work was able to demonstrate that clinical image quality can be predicted, provided that three conditions are satisfied. This may prove useful, as optimization based on clinical image quality alone can be difficult and time consuming. The conditions are as follows. First, it is important to characterize the imaging system in sufficient detail for the model calculations to agree with measurements on the imaging system on an absolute scale [18]. Second, the effect of the different radiographic technique factors (Table 1
) must be acknowledged in combination and not used separately in attempts to correlate with clinical image quality. Finally, the effect of the different radiographic technique factors must be combined into appropriate measures of physical image quality (i.e. contrast and SNR) that correspond to the perception or visualization of relevant anatomical details, i.e. to specific diagnostic tasks.
Lumbar spine
Table 5
shows the correlation coefficients r between the clinical image quality measures ICS and VGAS for different combinations of image criteria and calculated physical image quality measures
OD and SNR of the anatomical details and our measure of dynamic range, PEF. Examples of the correlation between clinical and physical measures of image quality are given in Figures 2ac
at three levels of significance.
|
|
OD and SNR (transverse processes (L1TL5T) and trabecular details (L1DL5D)). Typically, a stronger correlation was found with C5LS than with C4LS. Also, a stronger correlation with clinical image quality was found for the physical image quality measures that use the trabecular structure detail than those that use the transverse processes. The visibility of the transverse processes is also influenced by the stomach content that may interfere with the perception of the processes.
The
OD and SNR of the L1DL5D trabecular details were the best predictors of clinical image quality amongst those tested. The percentage of the calculated image with a film gradient larger than 2.25 (PEF) was not as good as the
OD and SNR of particular details. This may indicate that, provided the spine is properly exposed, the surrounding soft tissue with significantly higher optical density, possibly overexposed, is not a problem.
| Conclusions |
|---|
|
|
|---|
OD) of blood vessels in regions with comparatively low optical densities, such as the retrocardiac area. No significant correlation, however, was found between the SNR of details and clinical image quality. To quantify the effect of dynamic range on image quality, a new quantity, the properly exposed fraction, was introduced. The PEF shows a significant correlation with clinical image quality in chest imaging and demonstrates the importance of proper film exposure.
For AP lumbar spine radiography, the
OD and SNR of trabecular details in the L1L5 vertebrae are the best predictors of clinical image quality, whereas the PEF is not as good.
The significant correlations found between clinical image quality and some physical image quality measures in this work are encouraging and show that, for the situations considered, the clinical image quality can be predicted provided the imaging conditions are known in detail and relevant measures of physical image quality are used.
| Acknowledgments |
|---|
| Footnotes |
|---|
Received for publication June 30, 2000. Revision received December 18, 2000. Accepted for publication January 24, 2001.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. J. Tapiovaara Review of relationships between physical measurements and user evaluation of image quality Radiat Prot Dosimetry, March 1, 2008; 129(1-3): 244 - 248. [Abstract] [Full Text] [PDF] |
||||
![]() |
M Bath and L G Mansson Visual grading characteristics (VGC) analysis: a non-parametric rank-invariant statistical method for image quality evaluation Br. J. Radiol., March 1, 2007; 80(951): 169 - 176. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Borjesson, M. Hakansson, M. Bath, S. Kheddache, S. Svensson, A. Tingberg, A. Grahn, M. Ruschin, B. Hemdal, S. Mattsson, et al. A software tool for increased efficiency in observer performance studies in radiology Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 45 - 52. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Bath, M. Hakansson, J. Hansson, and L. G. Mansson A conceptual optimisation strategy for radiography in a digital environment Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 230 - 235. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Geijer and J. Persliden Varied tube potential with constant effective dose at lumbar spine radiography using a flat-panel digital detector Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 240 - 245. [Abstract] [Full Text] [PDF] |
||||
![]() |
A Tingberg, C Herrmann, B Lanhede, A Almen, M Sandborg, G McVey, S Mattsson, W Panzer, J Besjakov, L G Mansson, et al. Influence of the characteristic curve on the clinical image quality of lumbar spine and chest radiographs Br. J. Radiol., March 1, 2004; 77(915): 204 - 215. [Abstract] [Full Text] [PDF] |
||||
![]() |
B Molander, H-G Grondahl, and A Ekestubbe Quality of film-based and digital panoramic radiography Dentomaxillofac. Radiol., January 1, 2004; 33(1): 32 - 36. [Abstract] [Full Text] [PDF] |
||||
![]() |
A Ekestubbe, H-G Grondahl, and B Molander Quality of digital pre-implant tomography: comparison of film-screen images with storage phosphor images at normal and low dose Dentomaxillofac. Radiol., September 1, 2003; 32(5): 322 - 326. [Abstract] [Full Text] [PDF] |
||||
![]() |
G McVey, M Sandborg, D R Dance, and G Alm Carlsson A study and optimization of lumbar spine X-ray imaging systems Br. J. Radiol., March 1, 2003; 76(903): 177 - 188. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| BJR | DMFR | IMAGING | ALL BIR JOURNALS |