BJR
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS

British Journal of Radiology (2004) 77, 312-314
© 2004 British Institute of Radiology
doi: 10.1259/bjr/84593467

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Pijnappel, R M
Right arrow Articles by Mali, W P T. M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pijnappel, R M
Right arrow Articles by Mali, W P T. M

Full Paper

Reproducibility of mammographic classifications for non-palpable suspect lesions with microcalcifications

R M Pijnappel, MD, PhD 1 P H M Peeters, MD, PhD 2 J H C L Hendriks, MD, PhD 3 and W P Th M Mali, MD, PhD 4

1 Department of Radiology, Martini Hospital, Locatie van Swieten, PO Box 30033, 9700 RM Groningen, 2 Julius Centre for General Practice and Patient Oriented Research, Utrecht University Medical School, PO Box 85500, 3508 GA Utrecht, 3 National Expert and Training Centre for Breast Cancer Screening, University Medical Centre Nijmegen, PO Box 9101, 6500 HB Nijmegen and 4 Department of Radiology, University Medical Centre Utrecht, PO Box 85500, 3508 GA Utrecht, The Netherlands


    Abstract
 Top
 Abstract
 Introduction
 Patients and methods
 Results
 Discussion
 References
 
Observer variability in interpretation of mammograms is a well-known problem, especially for microcalcifications. The classification of the mammographic findings depends upon this interpretation. We performed an intraobserver study to evaluate a breast imaging reporting and data system (BI-RADS) based method for description and classification of non-palpable lesions with microcalcifications. A set of 100 non-palpable mammographic lesions mainly consisting of microcalcifications was described and classified on two occasions, by two radiologists at an interval of 6 months. The intraobserver variability was evaluated with kappa statistics. The overall agreement for the classification was moderate (kappa 0.54). The lowest kappa values were observed for the categories "probably benign (BI-RADS 3, kappa 0.59)" and "suspicious abnormality (BI-RADS 4, kappa 0.44)". The clinical management (follow-up or biopsy) of non-palpable lesions consisting of microcalcifications depending upon radiological classification in the groups BI-RADS 3 (follow-up) and BI-RADS 4 (biopsy) is therefore debatable.


    Introduction
 Top
 Abstract
 Introduction
 Patients and methods
 Results
 Discussion
 References
 
The classification of non-palpable lesions consisting of microcalcifications on mammography in order to describe the radiologist's level of cancer suspicion is complex. This classification determines the final assessment of the lesion, histological biopsy or mammographic follow-up. One of the limitations of classifying mammographic lesions is the observer variability in interpretation of mammographic features [1, 2]. In order to produce a more standardized scheme for describing mammographic lesions, the American College of Radiology recommends the Breast Imaging Reporting and Data System (BI-RADS) [3]. This system includes terms for describing calcifications (morphology and distribution) and describes the radiologist's level of suspicion about the mammographic abnormality (Table 1Go). Several studies have evaluated this system [4, 5]. When BI-RADS is used, substantial interobserver and intraobserver variability in mammographic interpretation is noted [6]. Another limitation of the BI-RADS lexicon is that it does not make explicit recommendations regarding the assignment of final assessment categories for the different mammographic features [5].


View this table:
[in this window]
[in a new window]
 
Table 1. BI-RADS categories

 
We set out a study to evaluate the intraobserver variability of a BI-RADS based system for description and interpretation of non-palpable mammographic lesion mainly consisting of microcalcifications and analysed the clinical impact in terms of advising biopsy or follow-up.


    Patients and methods
 Top
 Abstract
 Introduction
 Patients and methods
 Results
 Discussion
 References
 
For this study we used data from the COBRA-study (core biopsy after radiological localization) [7]. This study evaluated the accuracy of stereotactic-guided large core needle biopsy for impalpable lesions classified as non-benign. The study was conducted between April 1997 and February 2000. A total of 973 patients with 1029 non-palpable lesions classified as BI-RADS 3 to 5 from 19 Dutch hospitals were included. After informed consent, all patients were first offered a stereotactic-guided 14-gauge core needle biopsy. Whenever needle biopsy yielded a malignancy, therapeutic surgery was performed. In those cases where no malignancy was found on core biopsy, the patients underwent a needle localized breast biopsy. The final diagnosis was based upon the pathologic result of the surgical specimen (needle localized breast biopsy or mastectomy).

All impalpable lesions, mainly consisting of microcalcifications, in the COBRA study were identified (n=533) and copies of the original mammography were reviewed by two radiologists (RP, JH). Both are specialists in breast imaging. Lesions were described and classified in consensus. The microcalcifications could be classified according to a total of six morphology categories (linear, coarse granular, fine granular countable, very fine granular non-countable, "cottonwool" and coarse non-granular (benign impression)) [8]. All the different types of microcalcifications present in a lesion were classified. The morphology and distribution of the microcalcifications were studied and lesions were classified into 3 groups (probably benign, suspicious and highly suggestive of malignancy) according to the BI-RADS lexicon 3 to 5. After a 6 month period a random sample of 100 out of these 533 mammograms were re-examined and classified again by the same team. We calculated the intraobserver variability and evaluated the description and interpretation with kappa statistics. We calculated the kappa values by comparing the results of the first and second reading session. Since one mammographic lesion may contain different morphological types of microcalcifications, we divided all morphological types into two groups, i.e. linear calcifications vs the rest; or cottonwool vs the rest, etc. By doing this we were able to calculate kappa values for each category. Perfect agreement is indicated by a kappa value of 1.0, whereas a kappa value of 0 indicates no agreement at all. Kappa is a percentage of agreement adjusted for chance fluctuations. Prior reports have suggested the following level of agreement between observers for the indicated kappa values: {els]le;0.20, slight agreement; 0.21–0.40, fair agreement; 0.41–0.60, moderate agreement; 0.61–0.80, substantial agreement; and 0.81–1.00, almost perfect agreement between observers [4, 9]. Furthermore we studied the inconsistencies in the BI-RADS classification and correlated these cases with the final pathologic assessment.


    Results
 Top
 Abstract
 Introduction
 Patients and methods
 Results
 Discussion
 References
 
38 of the 100 lesions were malignant (in situ or invasive). The kappa values for calcification morphology and distribution are shown in Table 2Go. The agreement was slight for very fine granular non-countable microcalcifications (kappa=0.40). There was moderate agreement for description of the microcalcifications categories: linear, coarse granular, fine granular countable and coarse non-granular (kappa values 0.55, 0.58, 0.43 and 0.58, respectively). Only for cottonwool calcifications was the agreement substantial (kappa=0.63). The overall agreement for the distribution of microcalcifications was moderate (kappa 0.49).


View this table:
[in this window]
[in a new window]
 
Table 2. Intraobserver variability in description of lesions with microcalcifications

 
Results of intraobserver variability in radiological classifications are shown in Table 3Go. For the BI-RADS categories 3 (kappa=0.59) and 4 (kappa=0.44) the agreement was moderate. For BI-RADS category 5 the agreement was substantial (kappa=0.73).


View this table:
[in this window]
[in a new window]
 
Table 3. Intraobserver variability of radiological classification

 
Table 4Go shows the final histological results according to first and second classification of BI-RADS categories. During the first classification 44 lesions were classified as a BI-RADS 3, and 6 turned out to be malignant.14 of these lesions were classified as BI-RADS 4 after the second classification. Two of these 14 lesions were malignant. 41 lesions were classified as a BI-RADS 4, 21 turned out to be malignant. Seven of these lesions were classified as BI-RADS 3 and 5 as BI-RADS 5 after the second session. Of these 7 and 5 differently classified lesions, 3 and 4 were malignant, respectively. 15 lesions were classified as BI-RADS 5 during the first classification, and 11 turned out to be malignant. Four of these lesions were classified as BI-RADS 4 after the second classification. Two of these 4 were malignant.


View this table:
[in this window]
[in a new window]
 
Table 4. Inconsistencies of BI-RADS classification

 

    Discussion
 Top
 Abstract
 Introduction
 Patients and methods
 Results
 Discussion
 References
 
In general, there was only moderate agreement in describing microcalcifications according to morphology and distribution and thus in classifying the lesions radiologically according to BI-RADS. These results are in concordance with the literature. Previous studies showed slight to moderate overall intraobserver and interobserver agreement for the description of microcalcifications morphology (kappa varied from 0.36 to 0.57) [4, 6].

In our study the group of fine granular calcifications shows the least agreement. This might be due to dividing the whole group of fine granular calcifications into two subgroups (fine granular countable and fine granular non-countable). Dividing granular calcifications into subgroups appears to be of little value. Our result for describing the distribution of microcalcifications (kappa 0.47) is similar to a previous study on this subject [6].

Inconsistency was greatest for BI-RADS category 3 (probably benign) and 4 (suspicious). All radiologists involved with mammography know the limitations of interpretation and classification. For the referring physician the difference between BI-RADS 3 (follow-up) or BI-RADS 4 (biopsy procedure) is important. The moderate agreement in radiological classification for the categories 3 and 4 show that in cases where lesions with microcalcifications are non-benign but not highly suspicious of malignancy it is difficult to clearly categorize them. In the COBRA-study, as in most of the hospitals in the Netherlands, all non-benign lesions (including BI-RADS 3) underwent histological confirmation. Outside this study and strictly following the BI-RADS lexicon, lesions categorized as BI-RADS 3 might not have had a biopsy procedure but would have been evaluated by follow-up. If the BI-RADS guidelines for the management of lesions categorized as BI-RADS 3 were followed in our study the diagnosis of 6/38 (16%) malignancies would have been delayed following the first reading session, and 7/38 (18%) in the second reading session. Of these cases 2 appeared to be invasive carcinoma (one invasive ductal carcinoma and one invasive lobular carcinoma), 4 DCIS grade I and 5 DCIS grade II. This might have resulted in a worse prognosis for these patients. These results support the Dutch and the Britsh strategy of obtaining a pathological diagnosis for lesions categorized as BI-RADS 3. We suggest that non-palpable lesions with microcalcifications categorized as BI-RADS 3 (probably benign) should undergo a biopsy procedure until a more reliable system for description and classification of microcalcifications is available. Indirect support for this strategy comes from another study that showed that malignancy was found in 24.8% of lesions where microcalcifications were stable and considered indeterminate or suspicious [10].

This study has several limitations. First, all lesions were classified as non-benign by a radiologist before entering this study; therefore in this study only selected cases were used. Second, the two observers in this study work closely together and one observer (JH) was responsible for the training of the other one (RP). Therefore our results may vary somewhat from results for radiologists at other institutions. Third, all descriptions and classifications were made in consensus, there was no individual observer description. Fourth, all mammograms were copies, introducing loss of image quality. Nevertheless, overall we found only moderate agreement.

In conclusion, the agreement in classification of non-palpable lesions with microcalcifications categories as BI-RADS 3 and BI-RADS 4 is moderate. Therefore we think that for non-palpable lesions with microcalcifications classified as non-benign (higher as BI-RADS 2) it is advisable to advocate a biopsy procedure. Trying to classify those lesions in the groups probably benign (BI-RADS 3) or suspicious abnormality (BI-RADS 4) has limited clinical value.

Received for publication May 19, 2003. Revision received October 13, 2003. Accepted for publication December 16, 2003.


    References
 Top
 Abstract
 Introduction
 Patients and methods
 Results
 Discussion
 References
 

  1. Elmore JG, Wells CK, Lee CH, Howard DH, Feinstein AR. Variability in radiologists' interpretations of mammograms. N Engl J Med 1994;331:1493–9.[Abstract/Free Full Text]
  2. Kopans DB. Accuracy of mammographic interpretation. N Engl J Med 1994;331:1521–2.[Free Full Text]
  3. American College of Radiology. Breast imaging reporting and data system (BI-RADS), 2nd edn. Reston, VA: American College of Radiology, 1995.
  4. Baker JA, Kornguth PJ, Floyd CE. Breast imaging reporting and data system standardized mammography lexicon: observer variability in lesion description. AJR Am J Roentgenol 1996;166:773–8.[Abstract/Free Full Text]
  5. Liberman L, Abramson AF, Squires FB, Glassman JR, Morris EA, Dershaw DD. The breast imaging reporting and data system: positive predictive value of mammographic features and final assessment categories. AJR Am J Roentgenol 1998;171:35–40.[Abstract/Free Full Text]
  6. Berg WA, Campassi C, Langenberg P, Sexton MJ. Breast imaging reporting and data system: inter- and intraobserver variability in feature analysis and final assessment. AJR Am J Roentgenol 2000;174:1769–77.[Abstract/Free Full Text]
  7. Verkooijen HM. Diagnostic accuracy of stereotactic large-core needle biopsy for nonpalpable breast disease: results of a multicenter prospective study with 95% surgical confirmation. Int J Cancer 2002;99:853–9.[CrossRef][Medline]
  8. Tabar L, Dean P. Teaching atlas of mammography (2nd edn). Stuttgart: Georg Thieme Verlag, 1985.
  9. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977;33:159–74.[CrossRef][Medline]
  10. Lev-Toaff AS, Feig SA, Saitas VL, Finkel GC, Schwartz GF. Stability of malignant breast microcalcifications. Radiology 1994;192:153–6.[Abstract/Free Full Text]




This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Pijnappel, R M
Right arrow Articles by Mali, W P T. M
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pijnappel, R M
Right arrow Articles by Mali, W P T. M


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
BJR DMFR IMAGING  ALL BIR JOURNALS