BJR
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

British Journal of Radiology (2004) 77, 204-215
© 2004 British Institute of Radiology
doi: 10.1259/bjr/22642890

This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tingberg, A
Right arrow Articles by Zankl, M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tingberg, A
Right arrow Articles by Zankl, M

Full Paper

Influence of the characteristic curve on the clinical image quality of lumbar spine and chest radiographs

A Tingberg, PhD 1 C Herrmann, PhD 2 B Lanhede, MSc 3 A Almén, PhD 1 M Sandborg, PhD 4 G McVey, BSc, MSc 5 S Mattsson, PhD 1 W Panzer, MSc 2 J Besjakov, PhD, MD 6 L G Månsson, PhD 3 S Kheddache, PhD, MD 7 G Alm Carlsson, PhD 4 D R Dance, PhD 5 U Tylén, PhD, MD 7 and M Zankl, MSc 2

1 Department of Radiation Physics, Malmö University Hospital, SE-205 02 Malmö, Sweden, 2 GSF-National Research Centre for Environment and Health, D-857 64 Neuherberg, Germany, 3 Department of Radiation Physics, Sahlgrenska University Hospital, SE-413 45 Göteborg, Sweden, 4 Department of Radiation Physics, Faculty of Health Sciences, Linköping University, SE-581 85 Linköping, Sweden, 5 Joint Department of Physics, The Royal Marsden NHS Trust, Fulham Road, London SW3 6JJ, UK, 6 Department of Diagnostic Radiology, Malmö University Hospital, SE-205 02 Malmö, Sweden and 7 Department of Diagnostic Radiology, Sahlgrenska University Hospital, SE-413 45 Göteborg, Sweden


    Abstract
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 
The "European Guidelines on Quality Criteria for Diagnostic Radiographic Images" do not address the choice of the film characteristic (H&D) curve, which is an important parameter for the description of a radiographic screen–film system. The image contrast of clinical lumbar spine and chest radiographs was altered by digital image processing techniques, simulating images with different H&D curves, both steeper and flatter than the original. The manipulated images were printed on film for evaluation. Seven experienced radiologists evaluated the clinical image quality by analysing the fulfilment of the European Image Criteria (ICS) and by visual grading analysis (VGA) of in total 224 lumbar spine and 360 chest images. A parallel study of the effect of the H&D curve has also been made using a theoretical model. The contrast ({Delta}OD) of relevant anatomical details was calculated, using a Monte Carlo simulation-model of the complete imaging system including a 3D voxel phantom of a patient. Correlations between the calculated contrast and the radiologists' assessment by VGA were sought. The results of the radiologists' assessment show that the quality in selected regions of lumbar spine and chest images can be significantly improved by the use of films with a steeper H&D curve compared with the standard latitude film. Significant (p<0.05) correlations were found between the VGA results and the calculations of the contrast of transverse processes and trabecular details in the lumbar spine vertebrae, and with the contrast of blood vessels in the retrocardiac area of the chest.


    Introduction
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 
The characteristic curve (H&D curve) of a screen–film combination describes the relationship between the exposure and the optical density of the film [1], and is one of the most important parameters for the resulting image quality. The shape of the H&D curve determines the image contrast in terms of optical density. By selecting a film with a steep characteristic curve (high average gradient) there is a potential to increase the image contrast. This type of system is often used in mammography, where high contrast is needed to detect subtle attenuation differences. Due to the typical S-like shape of the H&D curve, high gradient films bear the risk of unwanted under or over exposure with a loss of important diagnostic information either in the "foot" (base and fog level) or in the saturation plateau of the H&D curve. In examinations requiring a wide dynamic range, a film with a low average gradient H&D curve is used to prevent areas of the film being over or under-exposed. This type of film (latitude film or "L" film) offers a compromise between latitude and image contrast. It has commonly been used in lumbar spine or chest radiography where the exposure varies considerably across the image.

For the evaluation of the relevant physical image quality parameters of medical screen–film systems there are well-established methodologies, which are described in detailed standardization documents, e.g. ISO report 9236-1 for measurement of the characteristic curve [1]. The relevance of the corresponding image quality parameters for the evaluation of patient images, however, must be proved in clinical studies. For such purposes, common experiments involve the detection by observers of artificial objects like disks or wires which should simulate typical pathology and which are hidden in the patient image [2, 3]. Typically, such studies use an approach based on receiver operating characteristic (ROC) analysis [47]. Such methods are well established. The mathematical background of ROC has been thoroughly investigated and the method is well documented in the literature. One of the main disadvantages is that, in a very strict sense, the validity of results obtained by a detection task is limited to the type of object that had to be detected. Furthermore, the images that should be used for the ROC analysis must contain a known signal (pathological structure) and many images must be used for statistical reasons.

An alternative approach for the evaluation of the diagnostic quality of patient images is the application of a catalogue of normal anatomical details, which should be visualized in a patient's image. The Commission of the European Communities has put strong effort into developing such a catalogue of so called image criteria for different radiographic examinations [8]. We have used the image criteria to evaluate the image quality of lumbar spine and chest images produced under various imaging conditions (such as different X-ray tube voltage, type of scatter reduction) [911]. However, the formulation of the image criteria sometimes turned out to be imprecise [911]. Consequently, the discrimination between different radiographic techniques was relatively weak in some cases. Therefore, in our work the use of image criteria has been supplemented by visual grading analysis (VGA) [12, 13]: images produced by different radiographic techniques were visually compared with a set of reference images for the structures mentioned in the image criteria. The advantages of these methods compared with ROC analysis include that ordinary images can be used since it is the visibility of the normal anatomy that is evaluated, and that much fewer images are needed. These new methods have been tested against an ROC related method (the free-response forced error experiment, FFE [14]) and a strong indication that a correlation exists between the new methods and the FFE method shown [13, 15, 16].

To investigate the influence of a variation of particular parameters, such as the characteristic curve, on the diagnostic quality, image sets have to be produced, which are different with respect to that parameter. To reduce the influence of patient-specific variations, the different image sets should be produced with the same group of patients. However, for reasons of radiation protection, the number of radiographs taken of the same patient must be strictly limited. Therefore, the current paper follows a different approach by digitizing existing radiographs, manipulating the digital images, and finally printing them on to photographic film.

An effective method of investigating the influence of a change of a particular image quality parameter, like the image contrast, is to simulate the imaging process with a Monte Carlo model of the imaging system. Several different imaging techniques can be evaluated in terms of clinical image quality at a relatively low cost since no radiologists are needed for the evaluation. Validation of the model is a prerequisite for using the model results. It is therefore necessary to verify that the model predictions are the same as that of a group of radiologists.

The current paper reports the results of a study of the influence of different characteristic curves on the diagnostic quality of radiographs of the lumbar spine and the chest. Existing sets of radiographs have been digitized, the contrast of the images altered and the images are printed onto film. The clinical image quality of the resulting images has been evaluated by a group of expert radiologists by judgement of the fulfilment of the image criteria and by visual grading analysis. The different imaging situations have also been modelled in a Monte Carlo computer program and descriptors of physical image quality derived. The results from the model have been compared with the results of the clinical evaluations performed by the radiologists.

The aims of this study were:


    Material and methods
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 
Production of images with different H&D curves
The basic patient images
This study makes use of a bank of clinical lumbar spine and chest radiographs previously collected for studies of image quality [911]. The images were collected together with detailed information about the patients and the physical and technical parameters concerning the exposure. The patients were selected to be in a limited range of sizes (weight, length and thickness). The selection procedure has been described earlier [911]. 32 lumbar spine images, divided into two radiographic technique groups, and 60 chest images, divided into four radiographic technique groups, were used for the current study (Table 1Go). The study described in [9] showed that the quality of lumbar spine images of speed class 400 could not be separated from images of speed class 600, thereby justifying the pooling of these two speed classes. For all these images, a standard latitude film (Kodak "L") was employed.


View this table:
[in this window]
[in a new window]
 
Table 1. Radiographic techniques used for the production of the images in this study and the number of images used for each technique. Six new H&D curves were used for the lumbar spine images and four new curves were used for the chest images

 
Film digitization and printing
It must be ensured that the digital scanning and printing system is capable of reproducing the original radiographs without effects detectable by the radiologists reading the images [17, 18]. The quality of both scanners and the laser imager were thoroughly controlled prior to this study, for example with respect to spatial resolution, density range and noise. A close, subjective inspection of the quality of the reproduced images was performed by asking a group of radiologists to compare the quality of the original film images with the reproduced copies of the same images. No difference in quality could be detected by the group. A further test of the constancy of the printing process was performed by having two copies of the same image and making comparisons of the image quality between the two (see visual grading analysis, below).

The lumbar spine images were digitized by means of a CCD flatbed scanner (Vexcel "Ultrascan 5000"; Vexcel Imaging GmbH, Vienna, Austria), and the chest radiographs by means of a drum-scanner (Linotype-Hell "TANGO"; Heidelberger Druckmaschinen, Heidelberg, Germany). The reason for using two different scanners was practical. The flatbed scanner, which was already available when the study was initiated, was not sufficient for scanning the larger chest radiographs at a high enough spatial resolution. Therefore the drum-scanner was obtained and used for digitizing the chest radiographs. All images used for the manipulations described later have a nominal spatial resolution of 40 µm and a dynamic resolution of 16 bits. The scanners were calibrated with respect to optical density by film step wedges produced by X-ray sensitometry of the screen–film systems as used for the original radiographs.

For image display, a medical laser imager (AGFA "LR 5200"; Agfa Gevaert, Munich, Germany) was employed. The nominal spatial resolution of the laser imager is 40 µm, and its dynamic resolution is 8 bits. The maximum optical density as well as the shape of the calibration curve can be adjusted separately. The same calibration curve was used for printing all images. This curve was a good compromise between high-density range and high-density resolution, covering a density range up to about density 3.3 ODU with a resolution of more than 200 grey levels up to density 2.1. This calibration curve allowed coverage of the density range of the screen–film systems used for the original lumbar spine and chest radiographs with a density resolution better than that of the human eye, especially in the diagnostically important range [8].

Simulation of different characteristic curves
The measurements of the H&D curves (characteristic curves) of the screen–film systems used for the original radiographs were performed according to ISO 9236 [1]. The screen–film systems used for lumbar spine (Kodak TMAT L/RA film and Kodak Regular Plus screen, sensitivity class 400; Eastman Kodak, Rochester, NY) were measured at an X-ray beam quality of 70 kV, the systems used for chest at 120 kV (Kodak TMAT L/RA film and Kodak Lanex 160 and 320, respectively). Additionally, the Regular plus system in combination with Kodak G-film was measured at 70 kV. For a moderate change of tube voltage, i.e. from 70 kV to 90 kV, or a change of screen (same family of screen but different speed class), no significant change of the shape of the H&D curves was measured for the lumbar spine systems. It was assumed that this result also holds for the systems used for chest radiography.

Based on these measurements, sets of H&D curves were simulated, which not only cover the range of characteristics of commercially available films but go beyond that range: the film used for the original radiographs, Kodak T-Mat L, is a typical latitude film (L-film) and common especially in chest radiography. Four systems with steeper film characteristics than L were simulated: "G" which is common for skeletal radiography, "M" which is between L and G, "UG" which is similar to the characteristics of a mammography film, and "UGP" which is even steeper than a mammography film. Additionally, three systems with flatter film-characteristics than L were simulated, labelled "IL", "IL2" and "A". The H&D curves for the lumbar spine and the chest systems are shown in Figure 1Go and Figure 2Go, respectively. The normalization of the simulated H&D curves has been chosen so that the curves all pass through the same point for either the lumbar spine or the chest film types, viz., the point where the two measured H&D curves (for the L and G films) intersect.



View larger version (19K):
[in this window]
[in a new window]
 
Figure 1. H&D curves of different screen–film systems for lumbar spine radiographs with simulated film characteristics. The characteristic curves of L and G film were measured according to ISO 9236 (at 70 kV [1]).

 


View larger version (17K):
[in this window]
[in a new window]
 
Figure 2. H&D curves of different screen–film systems for chest radiographs with simulated film characteristics. The dotted curve corresponds to measurements according to ISO 9236 (at 120 kV [1]).

 
Preparation of images with simulated H&D curves
Images with altered characteristic curves were prepared in two steps. First the original images were transferred from optical density images to air kerma images, by using the H&D curve of the original images and taking the calibration curve of the scanner into account. The kerma image so obtained is a distribution of the air kerma in the image plane. By applying a new H&D curve to the kerma image, taking account of the calibration curve of the laser imager, an optical density image is created with the characteristics of another film type. The new image will appear as if it had been produced with a totally different film than it actually was.

The application of different H&D curves does not only affect image contrast but also the density level of an image: only image regions with an optical density close to the crossing point of the H&D curves — e.g. for lumbar spine this is at an optical density of about 0.8 — will keep their optical density. Brighter or darker regions will be shifted to different density levels compared with the original image. Since the average optical density of the lumbar spine radiographs was about 1.25, the manipulated images would suffer strong density shifts especially if high contrast film characteristics are applied. In practice, the automatic exposure control would prevent films being too dark. Therefore, it was necessary to perform a density correction along the H&D curve, so that the manipulated images should have an average density close to that of the original radiograph. This density correction corresponds to a certain shift with respect to relative dose to the detector (and to the patient), which is described in Table 2Go. Such a change in dose and film would also have influenced the noise level of the images if it had been done with a real film. In this study, however, only changes in contrast are studied. No changes in noise level have been simulated.


View this table:
[in this window]
[in a new window]
 
Table 2. Average gradients for the H&D curves for the lumbar spine and chest images according to ISO 9236 [1], and for lumbar spine, shifts of relative dose to maintain an average optical density of about 1.25 ODU

 
As it can be seen from Table 2Go, the dose shifts are not insignificant. At an average density of 1.25, the use of a film with UGP characteristics could save about 20% of patient absorbed dose compared with the conventional L-film (assuming the H&D curves of the hypothetical UGP film and the L film cross at an optical density of about 0.8, see Figures 1 and 2GoGo). However, it is difficult to adjust the automatic exposure control to function properly with such a steep film, bearing in mind the consequence of unwanted under or over exposure. Vice versa, the use of the very flat "IL2" would require an increase of the average dose by about 14%. For the chest images, the situation was different because the average density of these images was close to the crossing section of the H&D curves, and therefore the application of different film characteristics had only a small effect on the average density of the image. Consequently, no density shift was necessary.

The final film images
Based on the group of 32 lumbar spine radiographs, 224 film images were produced, showing the effect of seven different H&D curves — IL2, IL, L, M, G, UG and UGP — on the same set of patient images. For the chest images, it was decided to concentrate on flat film characteristics A, IL2, IL and only one steeper film characteristic, G. Based on 60 original chest radiographs, 300 images with five different characteristic curves were produced.

Image evaluation
The European Quality Criteria define diagnostic requirements for normal, basic radiographs specifying anatomical image criteria and important image details [8]. They indicate criteria for the radiation dose to the patient, and they give examples of good radiographic techniques which fulfil both diagnostic and dose requirements. Based on this catalogue of image criteria, a set of anatomical structures for evaluating the images in the current study was selected (Table 3Go). Experiences from an earlier study were taken into account [911].


View this table:
[in this window]
[in a new window]
 
Table 3. The image criteria used in this study. The anatomical structures mentioned in the individual criteria were used for the visual grading analysis

 
In the above study [911], image quality was assessed by a group of seven expert radiologists, each having at least 15 years experience. The radiologists evaluated all images during 4 consecutive days. The images were evaluated on conventional viewing boxes without restrictions on viewing time or distance. The illumination in the viewing room was dim and kept at a constant level. All identification tags on the films were removed and the films were assigned a randomly generated code.

Discussions with the radiologists prior to this trial indicated that the observers tended to view different parts of the images resulting in relatively large interobserver variations. Therefore, all images were individually masked, showing only the areas and details to be observed. The masking forced the observers to view exactly the same areas of each image. In the lumbar spine images, a region around L3 was observed (Figure 3aGo). For the chest images, there were six areas to be observed (Figure 3bGo), and the anatomical structures demonstrated in each of these areas are given in Table 4Go.



View larger version (95K):
[in this window]
[in a new window]
 
Figure 3. Example of (a) lumbar spine and (b) chest radiograph without and with masking.

 

View this table:
[in this window]
[in a new window]
 
Table 4. Description of the sizes and positions of the windows used to mask the chest images

 
Quantified evaluation of the images was performed using two different methods:

Visual grading analysis
Visual grading analysis (VGA) is a method for evaluation of image quality, by visual comparison of one image or part of an image with a reference image [12, 13]. In this study we used the structures mentioned in a revised version of the European Quality Criteria [8] for the VGA (Table 3Go). The original images (L-film characteristics) were used as reference images, and the processed images were always compared with an image of the same patient. The visibility of a structure was graded on a five-level scale: clearly inferior to (-2), slightly inferior to (-1), equal to (0), slightly better than (+1) and clearly better than (+2) the structure in the reference image. A visual grading analysis score (VGAS) was determined for each radiographic technique. The VGAS is the ratio of the total grading given by all observers for all criteria and all images corresponding to the same H&D curve divided by the total number of observations: Go


where

Gi,s,o=Grading (-2, -1, 0, +1 or +2) for image i, structure s and observer o.

I=Number of images

S=Number of structures (Table 3Go)

O=Number of observers

VGAS for an individual structure may be obtained by omitting the sum over S and putting S=1 in the denominator.

The chest study included two digital copies of the original images; one served as the reference image and the other was included in the study. The purpose of this was to test the VGA methodology for systematic errors, and to test the constancy of the printing process.

The simplicity and the strong discriminating power of the VGA method makes it a good method for separating different image production techniques, e.g. different X-ray units, in the clinic, but the drawback is that the resulting score is relative to that of the reference image [12, 13]. If the reference image is not the same it is difficult to use the VGAS to compare two different techniques. This is the case in this study, i.e. different patients were imaged at different radiographic techniques, and therefore the following complementary method was also used.

Image criteria score
A revised version of the image criteria of the European Quality Criteria [8] was used for a test of fulfilment of criteria. A suggestion of a revision of the image criteria was proposed in accordance with the results of our previous studies [911]. The image criteria used are listed in Table 3Go. For each criterion, the observers had to decide whether a certain criterion was fulfilled in an image or not (yes/no). A decision of "Yes, the criterion is fulfilled", resulted in a score of 1, and a decision of "No, the criterion is not fulfilled" resulted in a score of 0. The image criteria score (ICS) is defined analogously to VGAS, as a fraction of fulfilled criteria, summing up the scores of all observers for all criteria and all images corresponding to the same H&D curve.Go


where

Fi,c,o=Fulfilment of criterion c for image i and observer o. Fi,c,o=1 if criterion c is fulfilled, otherwise Fi,c,o=0

I=Number of images

C=Number of criteria

O=Number of observers

ICS for an individual criterion may be obtained in the same manner as VGAS for an individual structure. The strength of this method is that the resulting scores are absolute so that images of different techniques can be compared even though the imaged object, i.e. the patient, is not the same. A particularly interesting question in this study was whether the use of a film with a steeper characteristic curve can compensate the poorer radiation contrast of the 90 kV technique for lumbar spine.

Intraobserver variation
To evaluate the intraobserver variation the observers read a number of images twice. The fraction of changed answers for both visual grading and fulfilment of criteria between the first and the second reading, e.g. a change from "Yes — criterion 1 is fulfilled" to "No — criterion 1 is not fulfilled", was used as a measure of intraobserver variation. At the end of the reading session each observer re-read 14 lumbar spine images and 20 chest images (the first batch of the reading session and one batch that was read halfway through the reading session. One batch of images consisted of 7 lumbar spine and 10 chest images).

Model simulations
A Monte Carlo model of the complete imaging system was used to calculate physical image quality descriptors. The model includes an anthropomorphic three-dimensional, segmented male anatomy (voxel phantom) to simulate the patient. Estimates of the energy imparted per unit area to the image receptor at points in the image plane were used to compute the optical density on the film by using the H&D curve. The model takes specific account of the X-ray spectrum (anode material and angle, peak tube voltage and ripple, and added filtration), anti-scatter grid (strip frequency, lead strip width, grid ratio and material in interspaces and covers) or air gap, couch-top or chest stand and image receptor (cassette front, screen–film system, and H&D curve). A detailed description is found in [19, 20]. Appropriate anatomical details have been added to this phantom so that realistic estimates of the contrast and signal–noise ratio (SNR) of the details can be made. Here, anatomical details relevant to the actual study were selected (Table 5Go). The contrast for each detail was calculated as the difference in optical density ({Delta}OD) due to the presence of the detail. The following radiographic techniques were simulated for lumbar spine: 70 kV, 400 screen and IL2, L, M or UGP (four techniques); and for chest: 102 kV or 141 kV, 160 screen and G, L, IL, IL2 or A (10 techniques). Other system parameters (filter, grid, etc.) are taken from the systems used in the original exposure. Correlations between the calculated {Delta}OD of the details and VGAS for all structures (Table 3Go) and for VGAS of criterion 5 (lumbar spine) and criterion 6 (chest) were tested for significance. The individual criteria were chosen because the anatomical details used in the model calculations are explicitly mentioned in these criteria. It is noted that no changes of the noise level were simulated in this work and consequently, no calculations of SNR of the details were performed for the simulated films. The study is limited to the effects of changing the film contrast at a constant noise level.


View this table:
[in this window]
[in a new window]
 
Table 5. Properties of the anatomical details for which the change in optical density was calculated in the lumbar spine and chest examinations

 
Statistical analysis
For the VGAS and the ICS, the statistical tests were performed with the analysis of variance (ANOVA) test followed by the Newman–Keuls test, with the hypothesis that the images produced with films of different H&D curves could not be separated with respect to image quality by methods described in this paper. A p-value less than 5% was considered to indicate a significant difference between two data points. The standard errors were calculated based on the mean values of the gradings (scores) over the observers and over the structures (criteria).

Correlations between the clinical image quality as evaluated by the radiologists and the calculated physical image quality descriptors were tested for significance with the Pearson product-moment test. A p-value of less than or equal to 5% was considered to indicate a significant correlation.


    Results
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 
Measurement of clinical image quality, VGAS and ICS
Lumbar spine
The results of the visual grading analysis study of the lumbar spine images showed that the images with a steeper H&D curve (higher gradient) than the original (L-film characteristics) had a significantly improved image quality (VGAS>0). Images with a flatter H&D curve than the original had a significantly decreased image quality (VGAS<0) (Figure 4Go). The films that had higher average gradient than the original (M, G, UG & UGP) could not be separated from each other in terms of image quality. The curve VGAS vs average gradient appears to reach a plateau at an average gradient of about 2.5. At even higher gradients the VGAS-values are expected to decrease (when the image will have too high a contrast) but more data points especially at average gradients higher than 3.5, would be needed to form a conclusion on this matter.



View larger version (9K):
[in this window]
[in a new window]
 
Figure 4. Lumbar spine: the visual grading analysis score (VGAS) vs the average gradient. One standard error is indicated. The horizontal lines indicate data points that cannot be separated from each other (the same VGAS). VGAS=0 means "equal to the reference image".

 
By using the image criteria score, ICS, a comparison between the image quality of the 70 kV images and the 90 kV images can be performed. With ICS, no differences could be found between the film types of one radiographic technique, i.e. images produced based on the same tube voltage, for the lumbar spine images (Figure 5Go). The 70 kV images of a particular H&D curve always had a significantly better image quality than the 90 kV images of the same H&D curve. The 70 kV images with L-film characteristics were significantly better than the 90 kV images even when the high gradient films were employed (Figure 5Go).



View larger version (8K):
[in this window]
[in a new window]
 
Figure 5. Lumbar spine: the image criteria score (ICS) vs the average gradient. One standard error is indicated. The horizontal times indicate data points that cannot be separated from each other (the same ICS). There was no significant difference in ICS value for the different data points.

 
Chest
The chest images (Figure 6Go) show the same trend as the lumbar spine images. The images with a higher gradient had a significantly higher VGAS than the original gradient (L-film characteristics), and the images with lower gradient had a significantly lower VGAS than the original. All film types of one radiographic technique, i.e. one combination of tube voltage and screen, are significantly different from each other. The choice of radiographic technique (tube voltage and speed) did not affect the VGAS; for one film type the radiographic techniques had the same VGAS. The VGAS for the copies of the reference images was equal to zero with a standard deviation of ±0.1. That is, the image quality of different copies of the same digital image was the same, indicating a constant quality especially of the film printing process.



View larger version (15K):
[in this window]
[in a new window]
 
Figure 6. Chest: the visual grading analysis score (VGAS) vs the average gradient. One standard error is indicated. All data points within one radiographic technique are significantly different from each other (the same VGAS). VGAS=0 means "equal to the reference image".

 
The image criteria score, ICS, can be used to compare images based on the four different radiographic techniques, used for the original chest images. The corresponding values are shown in Figure 7Go. Independent of tube voltage and speed, there is a clear decrease in ICS with H&D curve flattening from "G" to "A", which is statistically significant for the film characteristics "IL2" and "A". The low gradient film ("A") always had a significantly lower image quality than the others. The three film types with the highest average gradients, "IL", "L" and "G", always had the best image quality score.



View larger version (14K):
[in this window]
[in a new window]
 
Figure 7. Chest: the image criteria score (ICS) vs the average gradient. One standard error is indicated. The horizontal lines indicate data points that cannot be separated from each other (the same ICS).

 
Intraobserver variation
The average intraobserver variation was 25±11% (1 standard deviation (SD)) for the lumbar spine and 27±6% (1 SD) for the chest images, meaning that the radiologists changed their opinion on the visibility of a structure or criterion in about one out of four times, on the average. The variation is strongly dependant on the observer and the examination type.

Correlation between physical and clinical measures of image quality
For lumbar spine, significant correlations were found between the {Delta}OD of the transverse processes (L1T, L3T and L5T) and of the trabecular structures (L1D, L3D and L5D) on one hand and VGAS for structure 5 (Table 6Go), which specifically mentions the transverse processes. No significant correlations with ICS were found.


View this table:
[in this window]
[in a new window]
 
Table 6. Correlation between physical and clinical measures of image quality in the lumbar spine anteroposterior (70 kV, 400 speed) and chest posteroanterior (141 kV, 160 speed) examinations. The correlation coefficient, r, between the visual grading analysis score, VGAS (structure 5 and an average over all six structures C1LS–C6LS, cf. Table 3Go) on one hand and change in optical density ({Delta}OD) for the anatomical details on the other (cf. Table 5Go) are given for the lumbar spine examination. Corresponding data for chest criterion C6CH and for the average over all criteria C1CH–C7CH.The significance of the correlation is given by the number of asterisks (*) behind the correlation coefficient; ***: p<0.01; **: p<=0.05; *: p<0.10

 
For chest radiography, a significant correlation exists between the {Delta}OD of the blood vessels in the retrocardiac area and the VGAS for structure 6 (Table 6Go), which specifically mentions the retrocardiac vessels. This is also true for ICS. In addition, a significant correlation also exists between VGAS for all seven structures and the details in the costophrenic angles, retrocardiac area and in the left lung apex, this also being true for ICS except for the details in the costophrenic angles. Figure 8Go shows graphically the correlation for lumbar spine between {Delta}OD L3T and VGAS for structure 5 and for chest between {Delta}OD RCA and VGAS for structure 6.



View larger version (12K):
[in this window]
[in a new window]
 
Figure 8. The correlation between clinical and physical measures of image quality in the (a) lumbar spine and (b) chest examination. Correlation for lumbar spine between visual grading analysis score (VGAS) for structure 5 and change in optical density ({Delta}OD) L3T (r=0.95, p=0.050) and for chest for structure 6 and {Delta}OD RCA (r=0.98, p=0.005) are shown. The solid curves in (a) and (b) are the linear regression lines. {Delta}OD L3T denotes the optical density difference of a transverse process in the L3 vertebrae and {Delta}OD RCA denotes the optical density difference of a blood vessel in the retrocardiac area (RCA).

 

    Discussion
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 
The results of this study suggest that for both lumbar spine and for chest, films of higher average gradients are preferred over films with lower average gradients. Care must however be taken because the masking strongly reduces the viewing area, and a test prior to this study showed that the overall impression of the high gradient films was poor. There is, however, a potential to increase the perceived image quality locally by using different gradients depending on the area to be observed. In digital radiography this can easily be performed since many digital radiography units have a built-in ability to choose the relationship between the exposure and the density (or pixel value) produced by the exposure. In future versions of PACS workstations there may be different pre-set window and level settings for viewing different part of an image, e.g. one setting for viewing the lung fields, one for viewing the mediastinum etc.

Even though there is a lack of statistical power for lumbar spine, the two image quality descriptors used in this study, ICS and VGAS, appear to reach a plateau at average gradients above about 2.0 and 2.5, respectively. Beyond this plateau the ICS and VGAS values are expected to decrease when the contrast will be too high for diagnostic purposes, i.e. the image will tend to be too much "black and white". The plateau is reached at a lower average gradient with the ICS method than with VGAS. A probable explanation for this finding is the inherent properties of the two image quality evaluation methods. ICS reaches a saturation level, when all or almost all criteria are fulfilled, and the score cannot increase more than this level. As the average gradient increases, a certain point is reached when the ICS starts to decrease. The image quality is not good enough and fewer criteria than before are fulfilled. In the VGA method, however, the quality of one image is compared with the quality of a reference image. The reference images were produced with an L-film (mean average gradient 2.18). Thus the start and end points of plateaus detected with the VGA methodology will be relative to the average gradient of the reference images. If the reference images had had a different average gradient then the plateau would probably have had different start and end points. For the chest images, only one H&D curve with a higher average gradient than the reference H&D curve was produced. Therefore no plateau could be detected, but the same trend is expected to be found also for chest if the average gradient is increased sufficiently.

The use of a steeper film such as the G film in lumbar spine and chest radiography offers the possibility of decreasing the dose at a constant clinical image quality level compared with the standard L film (assuming that the increase of noise due to the dose reductions is so small that it could be ignored [18]). This would require the proper adjustment of the automatic exposure control of the X-ray system. Under such conditions dose savings could be achieved for lumbar spine by about 10% compared with the standard L-film at 70 kV. Conclusions about possible dose savings using the other (simulated) steep gradient films is not possible since the normalization of their H&D curves is hypothetical. Provided, however, that they cross the H&D curve of the L film as assumed in Figure 1Go, a 20% dose saving would be possible with the UGP film.

VGAS of the copies of the reference images (i.e. the L-films) could not be separated from the reference (Figure 6Go, VGAS=0) as was expected. Therefore we can conclude that the quality of the film printing was constant, and that there was no systematic error in the visual grading analysis study. If VGAS of the copies had not equalled zero, resulting in a non-fixed grading system, this would have introduced a source of uncertainty that would have been very hard to estimate.

According to the results for ICS, Figure 5Go, lumbar spine images taken at 90 kV are significantly worse than those taken at 70 kV independent of the film gradient. This can be interpreted in the following way. By switching from 70 kV to 90 kV, the contrast in the radiation field leaving the patient is decreased to such a degree that it cannot be restored by using a steeper film. This interpretation is supported by Monte Carlo calculations simulating the corresponding exposure parameters [21]. The radiation contrast of the studied anatomical details is reduced by 30% when the tube voltage increases from 70 kV to 90 kV according to the Monte Carlo model calculations [22]. The model calculations also show a reduction of the SNR of these details (by 30–40%) at 90 kV compared with 70 kV in the original images which may additionally contribute to the lower ICS at 90 kV. This result is not in accord with the kV interval recommended by the European Guidelines (75–90 kV) [8]. A lower kV will increase the radiation contrast, but it will also increase the entrance surface dose, which could, but will not necessarily lead to an increased effective dose to the patient. The optimum kV for a particular examination is not only dependent on the composition of the anatomical region to be imaged, but also on the type of detector used. This study suggests that for lumbar spine radiography with screen–film systems this kV may be lower than that indicated in the European Guidelines [8].

A comparison between the results of the visual grading analysis and the image criteria score method for lumbar spine (Figure 4Go and Figure 5Go) and for chest (Figure 6Go and Figure 7Go) shows the stronger discriminative power of VGA compared with ICS. For lumbar spine the different film types could not be separated with ICS whereas with VGA the higher gradient techniques were significantly better than the reference image (L-film) and the lower gradient techniques were significantly worse than the reference image. Thus ICS has less discriminatory power than VGA for evaluation of the effect of the shape of the characteristic curve on the clinical image quality of lumbar spine radiographs. In the chest case the results from the VGA showed that all film types were significantly different from each other whereas the results from the ICS method showed that some of the techniques could not be separated from each other.

The study of the intraobserver variance showed that, on average about one out of four readings, the score given by the radiologists was changed when an image was re-read. This corresponds well with our experiences from previous studies performed under similar conditions [911].

The significant correlation found between VGAS structure 5 and the model calculations of the contrast of the L1, L3 and L5 processes and trabecular structures in lumbar spine anteroposterior (AP) examination is encouraging and shows that the model can predict changes in clinical image quality. It is noted, however, that the radiologists' response (VGAS) saturates for films with the highest average gradients (Figure 4Go) which may be important as only linear correlations are sought here. The absence of correlation with ICS for lumbar spine can be explained by the fact that none of the tested films show any significant differences in terms of ICS (Figure 5Go). Contrary to the lumbar spine AP, the chest posteroanterior (PA) examination shows significant correlation between both VGAS and ICS on one hand and the calculated contrast of anatomical details on the other. The strongest correlation is found for anatomical details situated at a low optical density (OD<1.0) such as vessels behind the heart and the calcification in the right lung apex [23]. We believe that the detection of the small calcifications and trabecular structures will also depend on the noise and not only on the contrast, but this was not considered here, as the noise was not altered in the experiment.


    Conclusions
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 
The results of the study presented above indicate that the fulfilment of the image criteria, which were initially introduced as a practical tool for routine quality control in X-ray diagnosis, can be used as a quantitative descriptor of clinical image quality of real patient images. However, due to interobserver and intraobserver variations within the group of radiologists, the results do not always exactly mirror expected and well-known influences of radiographic technique parameters. This is mainly true in cases where the straightforward image criteria score is used and when the impact of the parameters on diagnostic quality is less pronounced. In such cases the application of the VGA method provides a sharper separation between techniques.

A statistically significant correlation exists between some of the physical image quality measures calculated by the Monte Carlo model and clinical image quality assessed by the radiologists. In lumbar spine AP radiography, significant correlations were found between calculations of the contrast of transverse processes and trabecular structures and experimentally determined VGAS. For PA chest radiography, the most significant correlation to VGAS was the contrast of blood vessels in the retrocardiac area. Hence the influence of the H&D curve can be predicted provided the imaging system is carefully modelled and relevant measures of physical image quality are used.


    Acknowledgments
 
We would like to thank the members of the European Panel of Expert Radiologists (Prof. M Laval-Jeantet, Dr M Maffessanti, Prof. J-O Oestmann and Prof. G Whitehouse) for evaluating the images and for many fruitful suggestions during the course of the study and Dr Francis Verdun at the Institute of Applied Radiation Physics of the University Hospital of Lausanne for measuring the characteristic curves for this study.


    Footnotes
 
This work has been supported by grants from the Commission of European Communities (FI4P CT950005), and the Swedish Foundation for Strategic Research (R98:006). Back

Received for publication September 30, 2002. Revision received July 21, 2003. Accepted for publication September 3, 2003.


    References
 Top
 Abstract
 Introduction
 Material and methods
 Results
 Discussion
 Conclusions
 References
 

  1. ISO. Photography — Sensitometry of screen-film systems for medical radiography — Part 1: Method for determination of sensitometric curve shape, speed and average gradient, Report 9236-1. Geneva, Switzerland: International Organization for Standardization, 1996.
  2. Goodenough DJ, Rossmann K, Lusted LB. Factors affecting the detectability of a simulated radiographic signal. Invest Radiol 1973;8:339–44.[CrossRef][Medline]
  3. Gray JE, Taylor KW, Hobbs BB. Detection accuracy in chest radiographs. AJR Am J Roentgenol 1978;131:247–53.[Abstract]
  4. Swets JA, Pickett RM. The evaluation of diagnostic systems: methods from signal detection theory. New York: Academic Press; 1982.
  5. Burgess AE, Ghandeharian H. Visual signal detection: II Signal location identification. J Opt Soc Am 1984;1:906–10.[Medline]
  6. Metz CE. ROC methodology in radiologic imaging. Invest Radiol 1986;21:720–33.[Medline]
  7. Hanley JA. Receiver operating characteristic (ROC) methodology: the state of the art. Crit Rev Diagn Imaging 1989;29:307–33.[Medline]
  8. European Commission. European guidelines on quality criteria for diagnostic radiographic images, EUR 16260 EN. Brussels: CEC, 1996.
  9. Almén A, Tingberg A, Mattsson S, Besjakov J, Kheddache S, Lanhede B, et al. The influence of different technique factors on image quality of lumbar spine radiographs as evaluated by established CEC image criteria. Br J Radiol 2000;73:1192–9.[Abstract]
  10. Lanhede B, Tingberg A, Månsson LG, Kheddache S, Widell M, Björneld L, et al. The influence of technique factors on image quality for chest radiography — application of the CEC image quality criteria. Radiat Prot Dosimetry 2000;90:203–6.[Abstract]
  11. Lanhede B, Båth M, Kheddache S, Sund P, Björneld L, Widell M, et al. The influence of different technique factors on image quality of chest radiographs as evaluated by modified CEC image quality criteria. Br J Radiol 2002;75:38–49.[Abstract/Free Full Text]
  12. Månsson LG. Evaluation of radiographic procedures. Investigations related to chest imaging. Thesis. Göteborg: Göteborg University; 1994.
  13. Tingberg A. Quantifying the quality of medical x-ray images. An evaluation based on normal anatomy for lumbar spine and chest radiography. Thesis. Malmö: Lund University; 2000.
  14. Chakraborty DP, Winter LH. Free-response methodology: alternate analysis and a new observer-performance experiment. Radiology 1990;174:873–81.[Abstract/Free Full Text]
  15. Sund P, Herrmann C, Tingberg A, Kheddache S, Månsson LG, Almén A, et al. Comparison of two methods for evaluating image quality of chest radiographs. In: Krupinski EA, editor. Medical imaging 2000: image perception and performance. Proceedings of SPIE 2000;3981:251–7.[CrossRef]
  16. Tingberg A, Herrmann C, Besjakov J, Rodenacker K, Almén A, Sund P, et al. Evaluation of lumbar spine images with added pathology. In: Krupinski EA, editor. Medical imaging 2000: image perception and performance. Proceedings of SPIE 2000;3981:34–42.[CrossRef]
  17. Flynn M, Muka E, Davies E, Whiting B, Watt C, Beute G, et al. Replication of diagnostic radiographs using a scanning/printing device. In: Kim Y, editor. Medical imaging IV: image capture and display. Proceedings of SPIE 1990;1232:88–96.[CrossRef]
  18. Buhr E, Herrmann C, Hoeschen D. Correlation between physical image quality parameters and visually perceptible image quality in X-ray diagnosis. J Photographic Sci 1993;41:90–1.
  19. Dance DR, McVey G, Sandborg M, Persliden J, Alm Carlsson G. Calibration and validation of a voxel phantom for use in the Monte Carlo modelling and optimisation of X-ray imaging systems. In: Boone JM, Dobbins JT, editors. Medical imaging 1999: physics of medical imaging. Proceedings of SPIE 1999;3659:548–59.[CrossRef]
  20. Sandborg M, McVey G, Dance DR, Persliden J, Alm Carlsson G. A voxel phantom based Monte Carlo computer program for optimisation of chest and lumbar spine x-ray imaging systems. Radiat Prot Dosimetry 2000;90:105–8.[Abstract]
  21. Sandborg M, McVey G, Dance DR, Alm Carlsson G, Verdun FR. Optimization of chest and lumbar spine radiography by Monte Carlo modeling of the patient and imaging system. In: Boone JM, Dobbins JT, editors. Medical imaging 1999: physics of medical imaging. Proceedings of SPIE 1999;3659:444–54.[CrossRef]
  22. McVey G, Sandborg M, Dance DR, Alm Carlsson G. A study and optimisation of lumbar spine x-ray imaging systems. Br J Radiol 2003;76:177–88.[Abstract/Free Full Text]
  23. Sandborg M, Tingberg A, Dance DR, Lanhede B, Almén A, McVey G, et al. Demonstration of correlations between clinical and physical image quality measures in chest and lumbar spine screen–film radiography. Br J Radiol 2001;74:520–8.[Abstract/Free Full Text]



This article has been cited by other articles:


Home page
Radiat Prot DosimetryHome page
M. J. Tapiovaara
Review of relationships between physical measurements and user evaluation of image quality
Radiat Prot Dosimetry, March 1, 2008; 129(1-3): 244 - 248.
[Abstract] [Full Text] [PDF]


Home page
Br. J. Radiol.Home page
M Bath and L G Mansson
Visual grading characteristics (VGC) analysis: a non-parametric rank-invariant statistical method for image quality evaluation
Br. J. Radiol., March 1, 2007; 80(951): 169 - 176.
[Abstract] [Full Text] [PDF]


Home page
Radiat Prot DosimetryHome page
S. Borjesson, M. Hakansson, M. Bath, S. Kheddache, S. Svensson, A. Tingberg, A. Grahn, M. Ruschin, B. Hemdal, S. Mattsson, et al.
A software tool for increased efficiency in observer performance studies in radiology
Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 45 - 52.
[Abstract] [Full Text] [PDF]


Home page
Radiat Prot DosimetryHome page
A. Tingberg, M. Bath, M. Hakansson, J. Medin, J. Besjakov, M. Sandborg, G. Alm-Carlsson, S. Mattsson, and L. G. Mansson
Evaluation of image quality of lumbar spine images: a comparison between FFE and VGA
Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 53 - 61.
[Abstract] [Full Text] [PDF]


Home page
Radiat Prot DosimetryHome page
A. Tingberg, F. Eriksson, J. Medin, J. Besjakov, M. Bath, M. Hakansson, M. Sandborg, A. Almen, B. Lanhede, G. Alm-Carlsson, et al.
Inter-observer variation in masked and unmasked images for quality evaluation of clinical radiographs
Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 62 - 68.
[Abstract] [Full Text] [PDF]


Home page
Radiat Prot DosimetryHome page
A. Tingberg and D. Sjostrom
Optimisation of image plate radiography with respect to tube voltage
Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 286 - 293.
[Abstract] [Full Text] [PDF]


Home page
Radiat Prot DosimetryHome page
B. M. Moores, S. Mattsson, L. G. Mansson, W. Panzer, D. Regulla, D. Dance, G. Alm Carlsson, F. R. Verdun, E. Buhr, and C. Hoeschen
RADIUS--closing the circle on the assessment of imaging performance
Radiat Prot Dosimetry, May 17, 2005; 114(1-3): 450 - 457.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tingberg, A
Right arrow Articles by Zankl, M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tingberg, A
Right arrow Articles by Zankl, M


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
BJR DMFR IMAGING  ALL BIR JOURNALS