Limitations of mammography in the diagnosis of breast diseases compared with ultrasonography: a single-center retrospective analysis of 274 cases
European Journal of Medical Research volume 20, Article number: 49 (2015)
The aim of this study is to compare X-ray mammography (MG) and ultrasonography (US) in the diagnosis of breast diseases in Chinese women.
We retrospectively analyzed X-ray mammograms of 274 patients with US and surgical/pathological results of breast diseases diagnosed at The Second Affiliated Hospital of Anhui Medical University (Hefei, China) between March 2011 and November 2014. The MG and US data were compared to surgical records using the results from post-surgical pathological examinations as the gold standard.
The overall sensitivity, specificity, accuracy, false-positive, false-negative, positive predictive value, and negative predictive value for the detection of breast cancer were 88.5%, 57.9%, 73.7%, 42.1%, 11.5%, 69.2%, and 82.5%, respectively, for MG and 95.9%, 66.7%, 81.8%, 33.3%, 4.1%, 75.5%, and 93.8%, respectively, for US. Of the 274 cases, lesion size by MG agreed with surgery in 133 (48.5%) patients compared with 216 (78.8%) by US (P < 0.01). Lesion location by MG agreed with surgery in 146 (53.3%) patients compared with 257 (93.8%) by US (P < 0.01). These values were then stratified according to age, menstrual status, breast density, and breast volume, and the agreement rates of MG with surgery were lower than that of US (all P < 0.01), except when the lesion size was >5 cm (P > 0.05).
US was better than MG in the preoperative evaluation of breast diseases of Chinese women. These results suggest that US could be more useful for detecting breast lesions in China, especially for younger women with dense breasts.
Breast diseases, both benign and malignant, affect many women worldwide. To enhance early detection, women are encouraged to undergo routine screening by mammography (MG) . Breast density represents the proportion of different tissue types within a woman’s breast. Specifically, breast and connective tissues are denser than fat, and this difference is apparent by MG. When breast density is high (that is, when there is a greater amount of breast and connective tissues compared with fat), mammograms are more difficult to interpret because a lesion may be shadowed by the dense tissues. Moreover, research has shown that women with high breast density are at increased risk of developing breast cancer . Breast density varies by race, and many Chinese women have dense- or intermediate mixed-type breast density . Thus, MG may fail to accurately identify tumors within this population. In some countries, doctors have begun to implement alternative methods for women with dense breasts. Such measures include the use of ultrasonography (US) and magnetic resonance imaging (MRI) [4,5]. MRI is a useful tool to assess breast diseases and has been shown to have a higher sensitivity than MG [4,5]. However, MRI is expensive and waiting lists are often long, limiting its use in underdeveloped areas of China. In contrast, US might be more accurate than MG and is cheaper than MRI for the preoperative evaluation of breast diseases in women [4,5].
Therefore, the present study aimed to retrospectively analyze MG and US of 274 patients with surgical pathology-confirmed breast diseases in the diagnosis of breast diseases in Chinese women, to compare the diagnosis value of MG and US, and to establish an optimal modality of breast diseases in underdeveloped areas of China.
The results of the present study could identify the limitations of MG in the diagnosis of breast diseases in Chinese women, especially in those with high-density and relatively small breasts.
Two hundred seventy-four consecutive female patients diagnosed with breast diseases and who underwent surgery at The Second Affiliated Hospital of Anhui Medical University (Hefei, China) from March 2011 through November 2014 were included in the present study. Inclusion criteria were as follows: 1) presence of a breast lesion on imagery; 2) the lesion underwent surgery; 3) underwent preoperative MG and US before; 4) and lesion was confirmed by postoperative pathology. Women were excluded if they had undergone only MG or US. This retrospective study was approved by the Institutional Review Board of The Second Affiliated Hospital of Anhui Medical University. The need for individual consent was waived by the committee because of the retrospective nature of the study.
MG and US assessment
MG and US were both performed 2 weeks before surgery. Mediolateral oblique and craniocaudal digital MG of the breast were performed using a molybdenum-rhodium target full-field digital MG system (Senographe 2000D, General Electric, Pittsburgh, PA, USA). If required, additional MG views were obtained. An automatic exposure factor was used, and adequate pressure was applied on the breast. All MG examinations were read by two radiologists who were blinded to the patient’s identity and medical background. The imaging interpretation was based on the American College of Radiology (ACR) BI-RADS (Breast Imaging Reporting and Data System) lexicon . Breast lesions were classified into six categories according to the lesion margin and calcification status: BI-RADS 0 = unsatisfactory MG, and additional imaging evaluations are needed; BI-RADS 1 = negative, no abnormality on MG; BI-RADS 2 = benign findings, presence of definite benign lesions without any signs of malignancy; BI-RADS 3 = probably benign lesions, including uncalcified lump with negative palpation and clear boundary and focal, asymmetric, clustering, round or dot-like calcifications, and a follow-up in a short time frame is suggested; BI-RADS 4 = suspicious abnormality without typical signs of malignancy, including palpable, solid lumps with some clear margins, palpable complex cysts, palpable abscess, solid mass with irregular shape and infiltrating margin, and newly emerging clustered, tiny, polygonal calcifications, and biopsy should be considered; BI-RADS category 5 = highly suggestive of malignancy and appropriate actions should be taken. The total breast density was classified into ACR levels 1 to 4 : level 1, almost entirely fatty; level 2, scattered fibroglandular densities; level 3, heterogeneously dense; and level 4, extremely dense. In the present study, density levels 1 to 2 were defined as low density, and levels 3 to 4 were defined as high density. The volume of the breast was measured using the formula proposed by Kalbhen et al. [7,8]: breast volume = π/4 × (W × H × C), where W is the breast width, H is the breast height, and C is the compression thickness in craniocaudal MG.
US examination was performed using a color Doppler US device (PHLIPS iu22, Philips, Best, The Netherlands) with a probe frequency of 10 to 18 Hz. All US examinations were performed with the patient in the supine position for the medial parts of the breast and in the contralateral posterior oblique position with arms raised for the lateral parts of the breast. The US examinations were performed by board-certified radiographers classified by the ACR BI-RADS US standard.
The location and size of the lesions detected by MG and US were recorded. Lesion location was classified as located in the upper outer quadrant of the breast, the lower outer quadrant, the upper inner quadrant, the lower inner quadrant, the breast areola region, or the axillary tail region. Lesion size was classified as ≤2.0 cm, 2.1 to 5.0 cm, or >5.0 cm.
All included patients underwent surgery. The location and size of the lesions were recorded during the surgery according to the same standard as MG and US. Pathology results were collected.
Data were collected including BI-RADS category, microcalcifications, menstrual status, histopathology, lesion size, breast density, and breast volume. For the purpose of the present study, BI-RADS MG and US categories 1, 2, and 3 were considered as negative, and categories 4 and 5 were considered as positive.
SPSS 16.0 (SPSS Inc., Chicago, IL, USA) was used for statistical analysis. The breast cancer sensitivity, specificity, accuracy, false-positive, false-negative, positive predictive value, and negative predictive value were calculated. Histopathological examination was considered as the gold standard. A true negative was defined as negative benign lesion by histopathology. A true positive was defined as positive evidence of malignancy on histopathology. BI-RADS categories of 0 were excluded from sensitivity, specificity, accuracy, false-positive, false-negative, positive predictive value, and negative predictive value analysis but were kept for the analysis of the location agreement. Lesion size and location were compared between imaging modalities and surgery.
Characteristics of the patients
Of the 274 patients, 132 were with pathologically proven malignancy and 142 were benign. Among these patients, 185 (67.5%) were premenopausal and 89 (32.5%) were postmenopausal. Patients aged from 24 to 80 years, with 129 (47.1%) being ≤45 years old and 145 (52.9%) being >45 years old. The clinical data are shown in Table 1.
Comparison between MG and US assessment
As shown in Table 1, 41 (15.0%) cases were classified as ACR level 1; 92 (33.6%) were level 2; 127 (46.3%) were level 3; and 14 (5.1%) were level 4. The average breast volume of the 274 cases was 419 ± 149 ml (range 91 to 1,130 ml), among whom 120 (43.8%) were ≤400 ml, 142 (51.8%) were 400 to 800 ml, and 12 (4.4%) were >800 ml.
Of the 274 cases, MG BI-RADS category was 0 in 38 (13.9%) cases, category 1 in 30 (10.9%), category 2 in 7 (2.6%), category 3 in 43 (15.7%), category 4 in 113 (41.2%), and category 5 in 43 (15.7%). US BI-RADS category was 1 in 4 (1.5%) cases, category 2 in 32 (11.7%), category 3 in 67 (24.5%), category 4 in 150 (54.7%), and category 5 in 21 (7.7%) (Table 2).
Comparison of the diagnostic accuracy between MG and US
The overall sensitivity, specificity, accuracy, false-positive, false-negative, positive predictive value, and negative predictive value for the detection of breast cancer were 88.5%, 57.9%, 73.7%, 42.1%, 11.5%, 69.2%, and 82.5%, respectively, for MG and 95.9%, 66.7%, 81.8%, 33.3%, 4.1%, 75.5%, and 93.8%, respectively, for US. The overall values of US were higher than that of MG. These values were then stratified according to age, menstrual status, breast density, and breast volume (Table 3). Subgroups analyses presented in Table 3 also suggest that sensitivity and accuracy were lower with MG than with US in women ≤45 years old (sensitivity: 73.7% vs. 89.5%, accuracy: 65.4% vs. 76.0%), premenopausal (sensitivity: 81.0% vs. 91.9%, accuracy: 64.9% vs. 79.7%), or with high breast density (sensitivity: 63.2% vs. 92.3%, accuracy: 71.7% vs. 79.7%). We excluded 38 (14%) patients who were classified as BI-RADS category 0 in MG because the imaging findings were unsatisfactory to detect the lesions. Among them, 7 cases were invasive ductal carcinoma (IDC). Thirty (10.9%) cases with MG BI-RADS category 1 underwent surgery because of the presence of an obvious mass by US or clinical examination. Among these 30 patients, histopathology results revealed the presence of 9 cancers and 21 benign lesions. Figure 1 shows a typical example of the missed diagnosis of breast disease by MG.
Comparison of agreement rate with surgery between MG and US
Of the 274 cases, lesion size by MG agreed with surgery in 131 (47.8%) patients compared with 217 (79.2%) by US. Of the 274 cases, lesion location by MG agreed with surgery in 128 (46.7%) patients compared with 250 (91.2%) by US. These values were then stratified according to age, menstrual status, breast density, and breast volume (Table 4), and the agreement rates of MG with surgery were lower than that of US (all P < 0.01), except when the lesion size was >5 cm (P > 0.05) (Table 4). As shown in Figures 2 and 3, MG often failed to identify the size and location of the lesion due to dense glands and overlapping structures.
The chi-square test was used for agreement rates for lesion size and location between MG and US. P values <0.05 were considered as significant.
The aim of the present study was to compare X-ray MG and US in the diagnosis of breast diseases in Chinese women. Results showed that the overall sensitivity, specificity, accuracy, false-positive, false-negative. positive predictive value, and negative predictive value were significantly higher with US than with MG. Subgroups analyses suggested that sensitivity and accuracy were lower with MG than with US in women ≤45 years old, premenopausal, or with high breast density. Compared with the surgical data, the agreement rates for lesion size and location of MG were lower than that of US (all P < 0.01), except when the lesion size was >5 cm (P > 0.05). These results suggest that US could be a better breast imaging modality for Chinese women.
Assessment of breast diseases with imaging modalities such as MG and US provides a mean for lesion detection and diagnosis. In western countries, MG is the primary breast cancer screening tool and has demonstrated evidences of reduction of breast cancer mortality [9-11]. However, compared with women from western countries, Chinese women have their unique characteristics such as high breast density and small breast volume that influence the sensitivity and accuracy of MG in detecting breast diseases . Breast density is negatively associated with MG sensitivity , as well as with mortality from breast cancer . Indeed, the intrinsic limitations of MG result in failure to detect 10% to 15% of breast cancers, and MG sensitivity is reduced particularly in women with dense breast tissue , as shown in the present study. These data suggest that MG might not be an optimal choice for detecting breast lesions in Chinese women [15,16], which is supported by a study performed in American women with dense breasts [17,18].
In the present study, all patients were from the Anhui Province, which is an undeveloped province in the middle of China, and most of these patients had dense breast tissue and small breast volume. Of the 274 cases, 38 (13.9%) were classified as BI-RADS category 0, meaning that an important proportion of women undergoing MG could not be satisfactorily assessed, which is supported by previous studies [19,20]. In addition, 30 (10.9%) patients assessed as being BI-RADS category 1 by MG had a palpable mass by clinical examination or had an obvious mass by US, prompting surgery. Among these 30 patients, nine were diagnosed with cancer. Therefore, these results suggest that even MG BI-RADS category 1 was not accurate enough and may miss some malignant lesions.
In the present study, MG had significantly lower sensitivity, specificity, accuracy, false-positive, false-negative, positive predictive value, and negative predictive value than US. Stratified analysis showed that young age, premenopausal, and high breast density decreased the diagnostic accuracy of MG. Indeed, dense breast tissues interfere with the interpretation of MG [5,20-23].
In addition, MG could not exactly determine the size and location of the breast lesion in many cases. This method only achieved a low agreement rate with surgery for detecting the lesion size and location. A potential reason is that the surrounding tissues and the lesions have similar X-ray attenuation, covering the shape and size of the mass. Therefore, some small cancers may be missed, and some benign lesions may be subjected to an unnecessary surgery. Nevertheless, MG showed good sensitivity for large palpable lesions, but these lesions would undergo surgery anyway.
Compared with MG, dense breast tissues are hyperechoic on US and most lesions are hypoechoic [24,25]. Therefore, because US are not affected by high density breast tissues, breast US has a higher sensitivity for detecting breast cancers in women with dense breast tissue [26-28]. Therefore, since Chinese women often have dense breasts, US should be more effective, accurate, and useful as the breast imaging tools. In addition, women are not exposed to radiations.
In the present study, the results strongly suggest that US was significantly better than MG for detecting breast diseases. There was no BI-RADS category 0 case reported by US. In young women and women with dense breasts, US appears superior to MG as an effective diagnostic tool in the evaluation of breast diseases. US had a significantly greater diagnostic accuracy than MG. Finally, US had a high agreement rate with surgery and it could be used to determine the exact size and location of the breast lesions. Therefore, US could be a better screening modality than MG in Chinese women. In addition, it is much cheaper than other modalities such as MRI, making it the modality of choice for areas with a poor economic status. MRI’s sensitivity to invasive cancers is nearly 100% [29-31], and that it is not influenced by age or gland density degree [31,32]. However, MRI is not the best imaging modality to assess microcalcifications detected on MG since MRI is based on changes in the spin of hydrogen protons and that microcalcifications contain few of these . In addition, MRI machines are expensive, as well as the examinations per se. Nevertheless, US should be compared with new modalities such as breast tomosynthesis [34,35]. In some centers, MG and US could be used together to maximize the detection of breast cancer . In young asymptomatic high-risk women (<50 years old), digital MG could be used as the primary screening modality, and US could be performed if necessary . These results could be generalized to all women with dense breasts, not only Chinese ones.
The present study is not without limitations. In addition to its retrospective nature, the sample size was small and was from a single center. Multicenter studies should be performed to confirm these results.
In conclusion, US was better than MG in the preoperative evaluation of breast diseases of Chinese women. These results suggest that US could be more useful for detecting breast lesions in China.
magnetic resonance imaging
Brem RF, Rapelyea JA, Zisman G, Hoffmeister JW, Desimio MP. Evaluation of breast cancer with a computer-aided detection system by mammographic appearance and histopathology. Cancer. 2005;104:931–5.
Boyd NF, Guo H, Martin LJ, Sun L, Stone J, Fishell E, et al. Mammographic density and the risk and detection of breast cancer. N Engl J Med. 2007;356:227–36.
Maskarinec G, Meng L, Ursin G. Ethnic differences in mammographic densities. Int J Epidemiol. 2001;30:959–65.
Pisano ED, Gatsonis C, Hendrick E, Yaffe M, Baum JK, Acharyya S, et al. Diagnostic performance of digital versus film mammography for breast-cancer screening. N Engl J Med. 2005;353:1773–83.
Shao H, Li B, Zhang X, Xiong Z, Liu Y, Tang G. Comparison of the diagnostic efficiency for breast cancer in Chinese women using mammography, ultrasound, MRI, and different combinations of these imaging modalities. J Xray Sci Technol. 2013;21:283–92.
Bassett LW, Berg WA. Breast Imaging Reporting and Data System, BI-RADS: mammography. 4th ed. Reston: American College of Radiology; 2003.
Kalbhen CL, Kezdi-Rogus PC. Changes in breast compressibility with age: implications for stereotactic biopsy. Can Assoc Radiol J. 1999;50:93–7.
Kayar R, Civelek S, Cobanoglu M, Gungor O, Catal H, Emiroglu M. Five methods of breast volume measurement: a comparative study of measurements of specimen volume in 30 mastectomy cases. Breast Cancer (Auckl). 2011;5:43–52.
Miller AB, To T, Baines CJ, Wall C. The Canadian National Breast Screening Study: update on breast cancer mortality. J Natl Cancer Inst Monogr. 1997;1997:37–41.
Miller AB, Wall C, Baines CJ, Sun P, To T, Narod SA. Twenty five year follow-up for breast cancer incidence and mortality of the Canadian National Breast Screening Study: randomised screening trial. BMJ. 2014;348:g366.
Roberts MM, Alexander FE, Anderson TJ, Chetty U, Donnan PT, Forrest P, et al. Edinburgh trial of screening for breast cancer: mortality at seven years. Lancet. 1990;335:241–6.
Liu J, Liu PF, Li JN, Qing C, Ji Y, Hao XS, et al. Analysis of mammographic breast density in a group of screening chinese women and breast cancer patients. Asian Pac J Cancer Prev. 2014;15:6411–4.
Kerlikowske K, Grady D, Barclay J, Sickles EA, Ernster V. Effect of age, breast density, and family history on the sensitivity of first screening mammography. JAMA. 1996;276:33–8.
Gierach GL, Ichikawa L, Kerlikowske K, Brinton LA, Farhat GN, Vacek PM, et al. Relationship between mammographic density and breast cancer death in the Breast Cancer Surveillance Consortium. J Natl Cancer Inst. 2012;104:1218–27.
Suzuki A, Ishida T, Ohuchi N. Controversies in breast cancer screening for women aged 40–49 years. Jpn J Clin Oncol. 2014;44:613–8.
Wang FL, Chen F, Yin H, Xu N, Wu XX, Ma JJ, et al. Effects of age, breast density and volume on breast cancer diagnosis: a retrospective comparison of sensitivity of mammography and ultrasonography in China’s rural areas. Asian Pac J Cancer Prev. 2013;14:2277–82.
Hooley RJ, Greenberg KL, Stackhouse RM, Geisel JL, Butler RS, Philpotts LE. Screening US in patients with mammographically dense breasts: initial experience with Connecticut Public Act 09-41. Radiology. 2012;265:59–69.
Weigert J, Steenbergen S. The connecticut experiment: the role of ultrasound in the screening of women with dense breasts. Breast J. 2012;18:517–22.
Tsai HW, Twu NF, Ko CC, Yen MS, Yang MJ, Chao KC, et al. Compliance with screening mammography and breast sonography of young Asian women. Eur J Obstet Gynecol Reprod Biol. 2011;157:89–93.
Zanello PA, Robim AF, Oliveira TM, Elias Junior J, Andrade JM, Monteiro CR, et al. Breast ultrasound diagnostic performance and outcomes for mass lesions using Breast Imaging Reporting and Data System category 0 mammogram. Clinics (Sao Paulo). 2011;66:443–8.
Cherel P, Hagay C, Benaim B, De Maulmont C, Engerand S, Langer A, et al. Mammographic evaluation of dense breasts: techniques and limits. J Radiol. 2008;89:1156–68.
Devolli-Disha E, Manxhuka-Kerliu S, Ymeri H, Kutllovci A. Comparative accuracy of mammography and ultrasound in women with breast symptoms according to age and breast density. Bosn J Basic Med Sci. 2009;9:131–6.
Liberman L, Menell JH. Breast imaging reporting and data system (BI-RADS). Radiol Clin North Am. 2002;40:409–30.
Crystal P, Strano SD, Shcharynski S, Koretz MJ. Using sonography to screen women with mammographically dense breasts. AJR Am J Roentgenol. 2003;181:177–82.
Zhi H, Ou B, Luo BM, Feng X, Wen YL, Yang HY. Comparison of ultrasound elastography, mammography, and sonography in the diagnosis of solid breast lesions. J Ultrasound Med. 2007;26:807–15.
Jiang H, Bu X, Sun Y, Xu K, Zheng X. Ultrasonography is more accurate than mammography in preoperative evaluations of palpable breast tumors in Chinese women. Breast J. 2013;19:677–9.
Leong LC, Gogna A, Pant R, Ng FC, Sim LS. Supplementary breast ultrasound screening in Asian women with negative but dense mammograms-a pilot study. Ann Acad Med Singapore. 2012;41:432–9.
Masroor I, Ahmed MN, Pasha S. To evaluate the role of sonography as an adjunct to mammography in women with dense breasts. J Pak Med Assoc. 2009;59:298–301.
Orel SG. High-resolution MR, imaging for the detection, diagnosis, and staging of breast cancer. Radiographics. 1998;18:903–12.
Kelcz F, Santyr G. Gadolinium-enhanced breast MRI. Crit Rev Diagn Imaging. 1995;36:287–338.
Van Goethem M, Tjalma W, Schelfout K, Verslegers I, Biltjes I, Parizel P. Magnetic resonance imaging in breast cancer. Eur J Surg Oncol. 2006;32:901–10.
Lehman CD, Isaacs C, Schnall MD, Pisano ED, Ascher SM, Weatherall PT, et al. Cancer yield of mammography, MR, and US in high-risk women: prospective multi-institution breast cancer screening study. Radiology. 2007;244:381–8.
Bazzocchi M, Zuiani C, Panizza P, Del Frate C, Soldano F, Isola M, et al. Contrast-enhanced breast MRI in patients with suspicious microcalcifications on mammography: results of a multicenter trial. AJR Am J Roentgenol. 2006;186:1723–32.
Gur D. Tomosynthesis-based imaging of the breast. Acad Radiol. 2011;18:1203–4.
Zuley ML, Bandos AI, Ganott MA, Sumkin JH, Kelly AE, Catullo VJ, et al. Digital breast tomosynthesis versus supplemental diagnostic mammographic views for evaluation of noncalcified breast lesions. Radiology. 2013;266:89–95.
Chae EY, Kim HH, Cha JH, Shin HJ, Kim H. Evaluation of screening whole-breast sonography as a supplemental tool in conjunction with mammography in women with dense breasts. J Ultrasound Med. 2013;32:1573–8.
Fletcher SW, Black W, Harris R, Rimer BK, Shapiro S. Report of the International Workshop on Screening for Breast Cancer. J Natl Cancer Inst. 1993;85:1644–56.
This study was supported by the Anhui Provincial Natural Science Research Project (#KJ2011Z188). We wish to thank Tang Tong, MD (Department of Breast Surgery of The Second Affiliated Hospital of Anhui Medical University) for the help in data acquisition, Peng Mei, MD (Department of Ultrasonography of The Second Affiliated Hospital of Anhui Medical University) for the help in data analysis and interpretation, and Wu Qiang, PhD (Department of Pathology of The Second Affiliated Hospital of Anhui Medical University) for the help in data acquisition.
The authors declare that they have no competing interests.
HZ carried out the studies, participated in the sequence alignment, and drafted the manuscript; LWZ participated in the design of the study and performed the statistical analysis; XPG conceived of the study, participated in its design and coordination, and helped to draft the manuscript; and SHZ participated in the sequence alignment. All authors read and approved the final manuscript.
Hong Zhao and Liwei Zou contributed equally to this work.
About this article
Cite this article
Zhao, H., Zou, L., Geng, X. et al. Limitations of mammography in the diagnosis of breast diseases compared with ultrasonography: a single-center retrospective analysis of 274 cases. Eur J Med Res 20, 49 (2015). https://doi.org/10.1186/s40001-015-0140-6