Skip to main content

AcornHRD: an HRD algorithm highly associated with anthracycline-based neoadjuvant chemotherapy in breast cancer in China

Abstract

Purpose

Our study aimed to develop and validate a homologous recombination deficiency (HRD) scoring algorithm in the Chinese breast cancer population.

Methods and materials

Ninety-six in-house breast cancer (BC) samples and 6 HRD-positive standard cells were analyzed by whole-genome sequencing (WGS). Besides, 122 BCs from the TCGA database were down-sampled to ~ 1X WGS. We constructed an algorithm named AcornHRD for HRD score calculated based on WGS at low coverage as input data to estimate large-scale copy number alteration (LCNA) events on the genome. A clinical cohort of 50 BCs (15 cases carrying BRCA mutation) was used to assess the association between HRD status and anthracyclines-based neoadjuvant treatment outcomes.

Results

A 100-kb window was defined as the optimal size using 41 in-house cases and the TCGA dataset. HRD score high threshold was determined as HRD score ≥ 10 using 55 in-house BCs with BRCA mutation to achieve a 95% BRCA-positive agreement rate. Furthermore, the HRD status agreement rate of AcornHRD is 100%, while the ShallowHRD is 60% in standard cells. BRCA mutation was significantly associated with a high HRD score evaluated by AcornHRD and ShallowHRD (p = 0.008 and p = 0.003, respectively) in the TCGA dataset. However, AcornHRD showed a higher positive agreement rate than did the ShallowHRD algorithm (70% vs 60%). In addition, the BRCA-positive agreement rate of AcornHRD was superior to that of ShallowHRD (87% vs 13%) in the clinical cohort. Importantly, the high HRD score assessed by AcornHRD was significantly correlated with a residual cancer burden score of 0 or 1 (RCB0/1). Besides, the HRD-positive group was more likely to respond to anthracycline-based chemotherapy than the HRD-negative group (pCR [OR = 9.5, 95% CI 1.11–81.5, p = 0.040] and RCB0/1 [OR = 10.29, 95% CI 2.02–52.36, p = 0.005]).

Conclusion

Using the AcornHRD algorithm evaluation, our analysis demonstrated the high performance of the LCNA genomic signature for HRD detection in breast cancers.

Introduction

Breast cancer susceptibility genes BRCA1 and BRCA2 are involved in homologous recombination (HR) and play a pivotal role in the repair of DNA double-strand breaks [1]. Cancers with loss of HR function due to the inactivation of BRCA1/2 and other HRR genes are known to be sensitive to platinum and poly (adenosine diphosphate-ribose) polymerase (PARP) inhibitors [2,3,4]. Germline BRCA mutations account for 5.3% of all breast cancers [5], and Turner [6] showed that homologous recombination deficiency (HRD) has a prevalence of approximately 18% in breast cancer. Thus, HRD testing would allow more precise treatment recommendations and provide benefits to populations receiving platinum and PARP inhibitors (PARPi). Moreover, the conclusion has been confirmed in multiple clinical trials of ovarian cancer. Patients with BRCA wild-type but positive HRD have an equal benefit from PARPi compared with BRCA mutations, based on the results of the PRIMA study and the PAOLA-1 study [7, 8]. While commercial HRD detection methods are available abroad, there is no uniform standard in China so far.

In current practice, anthracycline-based regimens and the sequential administration of taxanes are the most commonly used chemotherapy regimens in neoadjuvant and adjuvant settings. Prior studies have shown that platinum chemotherapy agents are active in the treatment of breast cancer with a germline BRCA mutation and/or HRD [9,10,11]. In the neoadjuvant setting, a single-arm prospective study using cisplatin monotherapy reported a pathologic complete response (pCR) rate of 61% among BRCA1-mutated breast cancer patients [12]. Moreover, the GeparSixto trial demonstrated that the pCR rates were 33.9% and 63.5% in the paclitaxel plus liposomal doxorubicin (PM) group and PM plus carboplatin group, respectively, among HRD breast cancer patients [11]. Conversely, the INFORM trial results showed that anthracycline-based regimens are also effective in HER2-negative BRCA-mutated breast cancer. The pCR rate was 18% and 26% in the single-agent cisplatin group and doxorubicin-cyclophosphamide group, respectively, which yielded a risk ratio (RR) of 0.70 (90% CI 0.39–1.2) [13]. Moreover, it was recently reported that breast cancers with high HRD scores are more sensitive to anthracycline in the neoadjuvant setting [14, 15].

This study aimed to develop an HRD scoring algorithm based on the Chinese population and compare its performance with the Shallow algorithm in different cohorts. To validate the accuracy of this HRD scoring algorithm, we evaluated the correlation between HRD scores with BRCA mutations and pCR for anthracycline-based neoadjuvant chemotherapy (NAC).

Methods and materials

DNA extractions, library preparation, and sequencing

Five HRD-positive and 1 HRD-negative standards (Cat No. CBP90023) stored at -20℃ from Nanjing Cobioer Biosciences CO., LTD prepared for genome-wide DNA extraction. Tumor tissue was collected from 41 in-house samples (cohort I) and 55 in-house samples with BRCA mutations (cohort II) from 85 breast cancer patients, and leukocytes were collected from 50 baseline healthy control samples. All genomic DNA (gDNA) was extracted using the Genomic DNA Extraction Kit (Item No. DP304). According to the quantitative results of the QUIBT tool, 200 ng gDNA was used for library construction. Subsequently, 200 ng gDNA for each sample was transferred to a 50-μL Covaris tube and segmented to the main peak of 300–350 bp using the Covaris M220 instrument. Next, segmented DNA was end-repaired, A-tailed, and ligated with custom adapters in reaction pooling. The ligation product was amplified (6 cycles) and purified using AmpureXP beads (Agencourt/Beckman Coulter). After purification, the library was quantified using a Qubit 4 fluorimeter and the Qubit dsDNA HS Assay Kit (ThermoFisher). Finally, library fragment quality control was performed using Agilent 2100 Bioanalyzer and Agilent 2100 DNA 1000 Kit. Each library was programmed to generate ~ 3.5 Gb bases.

Filter and variant calling

The FASTP tool [16] was applied for FASTQ file quality control to remove reads with the adaptor, low-quality bases. High-quality reads were aligned into the human genome (hg19) with Burrows-Wheeler Aligner [17]. Duplicate reads generated by PCR were marked using Picard (broadinstitute.github.io/picard/). Moreover, local realignment around known InDels and base quality were recalibrated, and duplicate reads were subsequently removed using the Sentieon tool [18]. Finally, base alternatives and InDels detected by Sentieon were annotated using Annovar [19]. A series of 122 aligned bam files (Supplementary Table 1) downloaded from the TCGA breast cancer database (www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga) were down-sampled to ~ 1X whole-genome sequencing (WGS) with SamBamba software [20]. Subsequently, all bam files were also processed with the above pipelines. BRCA-positive status implies that any mutation was detected in either BRCA1 or BRCA2 for each sample. Identified mutation criteria in BRCA were as follows: (1) mutation information was collected from the TCGA database for somatic mutation; and (2) for germline mutation identified by the in-house analysis pipeline, its status was annotated as likely pathogenic or pathogenic by either InterVar [21] or ClinVar [22], and the number of supporting allele reads was greater than 3.

Workflow of the algorithm for HRD evaluation

We developed an internal algorithm for assessing the HRD status of cancer patients by detecting copy number variations (CNV) based on low-coverage WGS, termed AcornHRD. HRD score was predicted based on the count of large-scale copy number alteration (LCNA) events, and the methodology was similar to the LST (large-scale state transition) in SNP arrays. LCNA events were assessed by coverage of sliding windows in the genome. Here, windows (specific-sized regions) are continuous segments of the genome used to detect LCNA; the coverage refers to the number of reads in each specific position or window in WGS data. The detailed algorithm description can be divided into two parts. Part one was to calculate the CNV ratio in the window unit along the genome as follows:

  1. i

    The coverage from WGS data were counted and normalized according to the library size and GC content with LOWESS [23] statistics to calculate a \(Rati{o}^{gc-correction}\) in each window.

  2. ii

    \(Rati{o}_{j}^{M}\) calculated the median of the GC-corrected ratio for j-th window in the baseline samples. It is defined for a given window j as:

    $${\text{Ratio}}_{\text{j}}^{\text{M}}= {\text{Median}}\left({\text{Ratio}}_{\text{j}1}^{\text{gc}-\text{correction}}, {\text{Ratio}}_{\text{j}2}^{\text{gc}-\text{correction}}, \dots , {\text{Ratio}}_{\text{jn}}^{\text{gc}-\text{correction}}\right)$$

    *n is the number of samples in the baseline.

  3. iii

    \(Rati{o}_{j}^{gc-correction}\) represented the GC-corrected ratio for the j-th window of the test sample. The CNV ratio quantifies the relative abundance of a window compared to baseline samples. It is defined for a given window j as:

    $$\text{CNV Ratio}=\frac{Rati{o}_{j}^{gc-correction}}{Rati{o}_{j}^{M}}.$$

The CNV ratio result file was used as input data to estimate the HRD status for each sample. Part two was to detect the HRD status as follows: firstly, to minimize the impacts from highly complex genomic regions (such as centromere regions, telomere regions, and highly repetitive regions) and sex chromosomes, overlap windows were removed. Subsequently, CNV ratios in each window were processed with log2 fold change. Next, windows were merged into large segments with chromosome arm information and processed CNV ratio by the circular binary segmentation (CBS) method with the R procedure (bioconductor.org/packages/release/bioc/html/DNAcopy.html). Finally, the above segment larger than 10 Mb was defined as an LCNA event [24,25,26]. We calculated the HRD score, defined as the count of LCNA, which is determined by the coverage of 100-kb windows. Tumors with HRD scores ≥ 10 were classified as exhibiting HRD scores high (refer to the Results section for further details).

Validation by clinical breast cancer samples

We retrospectively reviewed the medical records of 1449 patients with primary breast cancer who visited the Zhejiang Cancer Hospital from February 2008 to October 2020 and completed a 98-gene panel genetic screening. Fifty patients who received anthracycline-based NAC and underwent subsequent surgery (mastectomy or breast-conserving surgery) were included in the statistical analysis (see Fig. 1 and Supplementary Table 2 for details). All of them received NAC with epirubicin (75 mg/m2) and cyclophosphamide (600 mg/m2), followed by docetaxel (80–100 mg/m2) every 3 weeks for 8 cycles. All biopsied tumor samples for histological and HRD examination were obtained from patients before NAC and kept by fixed-formalin paraffin-embedded (FFPE). The study was reviewed and approved by the Ethical Committee of Zhejiang Cancer Hospital and was performed in accordance with the Declaration of Helsinki.

Fig. 1
figure 1

Patient selection criteria. Some patients met multiple exclusion criteria. BC: breast cancer, MBC: metastatic breast cancer, EBC: early breast cancer, A: anthracycline

Statistical analyses

The primary endpoint was RCB 0/1 with a secondary endpoint of pCR. Fisher’s exact or Chi-squared test was used to test association with binary response and association of clinical variables and HRD score. We used the odds ratio (OR) as a measure of the strength of association between two variables and calculated a 95% confidence interval. A P-value less than 0.05 was considered statistically significant. All statistical analyses were performed using Python software version 3.

Results

Development of the AcornHRD algorithm

AcornHRD was based on the results of the CNV ratio as input data to estimate LCNA events on the genome. A fitness window size appears particularly important for samples of low coverage. To address the question of optimal window size, we adopted up to ten different window sizes (40 kb, 80 kb, 100 kb, 150 kb, 200 kb, 300 kb, 500 kb, 800 kb, 1 Mb, and 1.4 Mb) to estimate the number of LCNA in each sample from cohort I (Supplementary Table 3). The results showed that the 100-kb window size covered most of the samples (31, 75.6%), followed by 500-kb window sizes (73.2%) (Fig. 2A). Furthermore, similar results were observed in the TCGA cohort, with the 100-kb window size still covering the largest number of samples (Fig. 2B). Additionally, this window size maintains a balance between capturing sufficient genomic information and minimizing noise, ensuring accurate LCNA identification. In summary, our findings reveal that although some alternative window sizes captured comparable percentages of samples, the 100-kb window size consistently exhibited the highest stability and resolution across different datasets.

Fig. 2
figure 2

Mode frequency across ten distinct window sizes: For each sample, the mode of LCNA count is classified as 1, while other values are designated as 0. Subsequently, the frequency of mode samples is computed for each window. The horizontal axis signifies ten distinct window sizes, and the vertical axis denotes the sample count. Panel A shows the mode frequency in ten different window sizes for 41 breast cancer samples sourced from our in-house breast cancer cohort. Panel B displays the same for 122 breast cancer samples obtained from the TCGA database

Establishing a threshold for the HRD score

HRD, another tumor biomarker, is being used in guiding therapy in an increasing number of studies [27,28,29,30]. Both deleterious mutations and promoter methylation of BRCA1/2 could cause HRD and genomic instability. Furthermore, BRCA mutations are known to be strongly associated with HRD. The HRD threshold was selected to have a high sensitivity for detecting HR deficiency in breast cancer. We defined the BRCA-positive agreement rate as the proportion of the number of samples with a high HRD score in the BRCA-positive samples; the BRCA-negative agreement rate was the same as that mentioned above. To obtain a BRCA-positive agreement rate of at least 95%, the threshold was set at the 5th percentile of the HRD scores in cohort II (Supplementary Table 4) of known BRCA-positive tumors. The HRD scores of patients with BRCA mutation were assessed in the test panel with a 100-kb window size and 50-kb step size. Of 55 patients, 53 (96.4%) were identified as having a high HRD score due to a score greater than or equal to 10 (Supplementary Table 5). For a 95% confidence detection rate, the score of 10 was defined as the cut-off threshold value.

Comparing HRD status with AcornHRD and ShallowHRD algorithms in the standard sample

For a more comprehensive evaluation of the AcornHRD algorithm, ShallowHRD software [31] was added to the following comparative analysis. As described in the method section, six standard cells were sequenced with whole genomic DNA. Compared to the HRD status of 6 standard samples (5 HRD positive and 1 HRD negative), the HRD status agreement rate of AcornHRD is 100%, while the agreement rate of ShallowHRD is only 60% (Table 1).

Table 1 HRD status of six standards assessed by AcornHRD and ShallowHRD

Correlation between BRCA mutations and HRD status with AcornHRD and ShallowHRD algorithms in the TCGA cohort

Mutations in the BRCA are strongly associated with HRD positivity [32, 33]; thus, we applied the BRCA-positive agreement rate to assess and compare the accuracy of the two HRD assessment methods in a large TCGA cohort (2 samples without somatic mutation information were filtered out) [31]. The mutations of BRCA genes were confirmed in tumor sequencing reads by in-house calling variation pipeline (more details presented in the Methods). Of the 120 patients, 20 (16.7%) harbored BRCA mutations (Supplementary Table 9). The results of AcornHRD (Table 2) and ShallowHRD (Table 3) both showed that BRCA mutation is significantly correlated with a high HRD score (p = 0.008 and p = 0.003, respectively). However, the BRCA-positive agreement rate of AcornHRD was higher than that of the ShallowHRD algorithm, which was 70% (14/20) and 60% (12/20), respectively (Tables 2 and 3).

Table 2 The HRD status according to BRCA mutations by AcornHRD in TCGA cohort
Table 3 The HRD status according to BRCA mutations by ShallowHRD in TCGA cohort

Association of HRD scores with responses to NAC by AcornHRD and ShallowHRD algorithms

Of the clinical cohort, 15 BRCA-positive and 35 BRCA-negative patients were considered to evaluate HRD status (Supplementary Table 6). The HRD score result showed that the BRCA-positive agreement rate of AcornHRD is far superior to that of ShallowHRD, whereas the BRCA-negative agreement rate is not as good (Supplementary Table 7 and Supplementary Table 8). In addition, the AcornHRD evaluation performed better than did that of the ShallowHRD score; specifically, the high HRD score was significantly associated with residual tumor burden 0/1 (p = 0.020 for AcornHRD and p = 0.182 for ShallowHRD) (Table 4). In summary, AcornHRD was more stable in the application performance across three different cohorts of WGS data, which is superior to the published algorithm.

Table 4 Association of HRD status with RCB from clinical cohort between AcornHRD and ShallowHRD

Clinicopathologic characteristics of high HRD score tumors

Among the 50 patients in the clinical cohorts who received anthracycline-based neoadjuvant therapy, 28 had high HRD scores, and 22 had low HRD scores. A high HRD score was significantly correlated with BRCA mutations (see Table 5 and Supplementary Table 10 for details). The breast cancer samples selected for the clinical study were all HER-2 negative, including 24 TNBC samples and 26 ER and/or PR-positive samples. A high HRD score significantly correlated with TNBC and high Ki-67 expression (Table 5). High HRD scores showed a trend toward correlation with ER-negative and PR-negative status (Table 5).

Table 5 Patient characteristics and HRD score from clinical cohort

Correlation between HRD status and NAC efficacy

In this study, HRD positivity includes either a high HRD score or a BRCA mutation, whereas HRD negativity included a low HRD score and no BRCA mutation. Of the 50 patients, 30 were identified as HRD-positive. Moreover, pCR and residual tumor burden (RCB 0/1) were both important indicators for tumor efficacy evaluation, of which pCR (RCB 0) was the main evaluation indicator.

Patients with HRD positivity were more likely to respond to standard NAC containing anthracyclines than HRD-negative patients, as indicated by a pCR (RCB 0) outcome (OR = 9.5, 95% CI 1.11–81.5, p = 0.040) (Table 6). Similar results were observed for the combined endpoint of RCB 0/1. In addition, patients with HRD positivity were more likely to achieve RCB 0/1 compared to non-deficient patients (OR = 10.29, 95% CI 2.02–52.36, p = 0.005) (Table 7). These results are applicable to a cohort of 35 patients without germline BRCA mutations. Patients with HRD-positive status showed a higher tendency towards an RCB 0/1 response than did HRD-negative patients (OR = 6.0, 95% CI 1.00–35.91, p = 0.050) (Table 7).

Table 6 Association of BRCA mutation and HRD status with pCR (RCB 0) from clinical cohort
Table 7 Association of BRCA mutation and HRD status with RCB from clinical cohort

Discussion

Genomic scar analysis is a very important HRD detection method. When non-homologous end joining (NHEJ) repair is initiated, it leaves “genomic scars”, which are traces of damage repair in the genome [34]. Cells with HRD cannot repair DNA double-strand breaks as effectively as cells with HR pathways. As a result, they exhibit genomic scarring, which refers to quantifiable genomic alterations that can be used to reverse the cell's HRD status. There are three main types of genomic scars caused by HRD: loss of heterozygosity (LOH), telomeric allelic imbalance (TAI), and large-scale state transitions (LST) [33]. To date, the Food and Drugs Administration has approved two products for clinical testing of HRD, namely Myriad myChoice ® CDx (myriad-oncology.com/mychoice-cdx) and FoundationFocus ™ CDx BRCA LOH (www.accessdata.fda.gov/cdrh_docs/pdf16/p160018c.pdf). Both products use the detection of BRCA gene mutations combined with the genomic scar to assess HRD status. The former contains the BRCA genes coding region and 54,091 single nucleotide polymorphisms (SNPs) population. The Genomic Instability Score (GIS) was obtained by comprehensively calculating three indicators: LOH, TAI, and LST, while GIS ≥ 42 was considered positive for genomic instability status [29, 35]. The latter calculated the proportion of fragments with LOH in this genome by covering 3500 SNPs in 324 genes on 22 chromosomes, and LOH accounted for ≥ 16%, that is, “high LOH [36]”. The above two commercial kits lack large-sample prospective clinical study data applied to the Chinese population; thus, studies promoting the development and clinical validation of these kits in China are warranted.

It has been confirmed that LST is feasible and has unique advantages for assessing HRD [24, 25, 33, 37]. LST is referred to the number of chromosomal breaks between flanking regions of at least 10 Mb [24, 38]. Moreover, it has been reported that the LST genomic signature accurately identified tumors with HRD and displayed excellent performance in a TNBC cohort, reaching almost 100% in sensitivity and specificity for HRD detection, where HRD was defined as BRCA inactivation [24, 25]. Furthermore, LST had better HRD evaluation performance in low-coverage sequencing than did LOH [26].

The ShallowHRD is a software tool based on mining copy number alterations profile from TCGA breast cancer that displays a high performance for HRD detection in breast cancers in low coverage genomic data [31]. Fundamental to evaluating the HRD status is the robust determination of copy number data, which can be obtained using either SNP arrays, whole exome sequencing (WES), or WGS. Comparing CNV derived from SNP array, WES, and WGS has revealed that WGS yields a more uniform distribution of quality parameters, such as genotype quality and coverage [39,40,41]. Moreover, studies indicated an excellent agreement (93.75%) between the original and down-sampled WGS-derived HR classification status [26]. WGS at low coverage robustly detects CNV, even in FFPE samples and liquid biopsies [42], at low cost and with easy-storable data outputs.

LCNA identified with Shallow whole-genome sequencing is increasingly popular in many diagnosis institutions. However, low-coverage sequencing also brings some challenges. Since the ShallowHRD data are based on Western cases, it is unclear whether it is applicable to Chinese patients [31]. In the initial use of ShallowHRD, it was found that its performance of BRCA-positive agreement rate in detecting HRD was poor. Furthermore, in the context of lower coverage genomes, the ability to accurately characterize somatic variations, including single-nucleotide variants (SNVs), breakpoints, and CNVs, is compromised, particularly in tumors with low cellularity or sequencing data exhibiting significant GC bias. Moreover, the uniformity of sequencing should be high; otherwise, it will be accompanied by serious noise pollution. The low-coverage sequencing has to be balanced with the sensitivity and uniformity for robustly calling somatic mutations.

To address the questions, we developed an algorithm named AcornHRD, which detects LCNA events based on a low-depth detection algorithm of ~ 1 × WGS reads. Compared with similar software ShallowHRD, AcornHRD achieved a good capacity for HRD detection and improved the obvious disadvantage of ShallowHRD of low BRCA-positive agreement rate in the standard and Chinese breast cancer cohort. Moreover, patients with high HRD score tumors evaluated by AcornHRD were significantly (p = 0.020) more likely to obtain RCB0/1 than those with low HRD score tumors; however, the same was not statistically significant by ShallowHRD. In summary, the performance of AcornHRD in evaluating HRD is superior to that of ShallowHRD. However, it is worth mentioning that different HRD assessment methods and their algorithms are not equivalent, and AcornHRD needs to be further compared with the two FDA-approved products.

Further, we investigated the relationship between the high HRD score and clinicopathological features of breast cancer. A high HRD score demonstrated a significant correlation with BRCA mutations, high Ki-67 expression, and a tendency towards ER negativity and PR negativity. Thus, the phenotype of high HRD score tumors is considered to be biologically aggressive. High HRD score was significantly more prevalent in the triple-negative breast cancer (TNBC) subtype than in the other three subtypes, which is consistent with previously reported results [25, 33, 43,44,45].

It has been shown that anthracycline-based regimens are effective in HER2-negative BRCA-mutated breast cancer [11, 13, 46,47,48]. Telli et al. [14] reported that a high HRD TNBC identified by next-generation sequencing was more sensitive to anthracyclines in the neoadjuvant setting. Recently, it has been reported that HRD tumors are more likely to benefit from anthracyclines, and HRD scores may be a clinically useful marker of chemosensitivity based on subtypes [45, 49]. In contrast, Imanishi et al. [43] reported the opposite result. We conducted a retrospective analysis to examine the relationship between HRD score and the response to neoadjuvant anthracycline therapy in HER2-negative breast cancer patients. The findings revealed a significant association between HRD score and RCB 0/I as well as pCR in the overall population cohort (n = 50). Similar outcomes were observed in the subset of patients without germline BRCA mutations (n = 35).

Previous studies have shown that response to neoadjuvant platinum-based therapy (pCR and RCB0/1) is significantly associated with HRD status in TNBC [11, 29, 50], suggesting the clinical utility of HRD scoring in selecting breast tumors that are more likely to respond to platinum-based regimens. Conversely, the GeparOLA study 50 demonstrated that neoadjuvant therapy with Olaparib resulted in a higher rate of pCR compared to a carboplatin-based regimen (55.1% vs. 48.6%) in patients with HER2-negative or TNBC and HRD. Consistent results were obtained in younger (< 40 years) and HR-positive patients (76.2% vs 45.5%, 52.6% vs 20.0%, respectively), suggesting that HRD may have a good application in predicting the efficacy of PARPi for neoadjuvant treatment.

This study has several limitations. These include a small sample size used for validation, which may limit generalizability. Additionally, the relationship between more HRR genes and HRD status was not discussed. While AcornHRD method demonstrates promising performance in HRD detection, further validation against established authoritative assays or algorithms is warranted. Future studies will focus on optimizing performance, validating in larger cohorts, analyzing more HRR genetic variants and improving clinical applicability. Despite these limitations, our AcornHRD method represents an advancement in HRD detection and offers valuable insights for improving patient care and treatment outcomes.

Conclusions

In conclusion, we have devised AcornHRD, an HRD score algorithm that surpasses clinical variables or BRCA1/2 mutation status in effectively identifying tumors with a higher probability of responding to anthracycline-based neoadjuvant therapy in Chinese breast cancer patients. Moreover, AcornHRD holds potential for use in clinical applications and translational research, including patient screening for clinical trials and guiding the use of DNA-damaging drugs.

Availability of data and materials

Research data are stored in an institutional repository and will be shared upon request to the corresponding author.

References

  1. Severson TM, Peeters J, Majewski I, et al. Brca1-like signature in triple negative breast cancer: molecular and clinical characterization reveals subgroups with therapeutic potential. Mol Oncol. 2015;9:1528–38.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Yip CH, Newman LA. American society of clinical oncology, American society for radiation oncology, and society of surgical oncology guideline for management of hereditary breast cancer. JAMA Surg. 2021;156:284–5.

    Article  PubMed  Google Scholar 

  3. Tung NM, Boughey JC, Pierce LJ, et al. Management of hereditary breast cancer: American society of clinical oncology, American society for radiation oncology, and society of surgical oncology guideline. J Clin Oncol. 2020;38:2080–106.

    Article  CAS  PubMed  Google Scholar 

  4. Robson M, Im SA, Senkus E, et al. Olaparib for metastatic breast cancer in patients with a germline BRCA mutation. N Engl J Med. 2017;377:523–33.

    Article  CAS  PubMed  Google Scholar 

  5. Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490:61–70.

    Article  Google Scholar 

  6. Turner NC. Signatures of DNA-repair deficiencies in breast cancer. N Engl J Med. 2017;377:2490–2.

    Article  PubMed  Google Scholar 

  7. Gonzalez-Martin A, Pothuri B, Vergote I, et al. Niraparib in patients with newly diagnosed advanced ovarian cancer. N Engl J Med. 2019;381:2391–402.

    Article  CAS  PubMed  Google Scholar 

  8. Ray-Coquard I, Pautier P, Pignata S, et al. Olaparib plus bevacizumab as first-line maintenance in ovarian cancer. N Engl J Med. 2019;381:2416–28.

    Article  CAS  PubMed  Google Scholar 

  9. Silver DP, Richardson AL, Eklund AC, et al. Efficacy of neoadjuvant cisplatin in triple-negative breast cancer. J Clin Oncol. 2010;28:1145–53.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Tutt A, Tovey H, Cheang MCU, et al. Carboplatin in brca1/2-mutated and triple-negative breast cancer brcaness subgroups: the TNT trial. Nat Med. 2018;24:628–37.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Loibl S, Weber KE, Timms KM, et al. Survival analysis of carboplatin added to an anthracycline/taxane-based neoadjuvant chemotherapy and HRD score as predictor of response-final results from geparsixto. Ann Oncol. 2018;29:2341–7.

    Article  CAS  PubMed  Google Scholar 

  12. Byrski T, Huzarski T, Dent R, et al. Pathologic complete response to neoadjuvant cisplatin in brca1-positive breast cancer patients. Breast Cancer Res Treat. 2014;147:401–5.

    Article  CAS  PubMed  Google Scholar 

  13. Tung N, Arun B, Hacker MR, et al. Tbcrc 031: Randomized phase ii study of neoadjuvant cisplatin versus doxorubicin-cyclophosphamide in germline BRCA carriers with her2-negative breast cancer (the inform trial). J Clin Oncol. 2020;38:1539–48.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Telli ML, Hellyer J, Audeh W, et al. Homologous recombination deficiency (HRD) status predicts response to standard neoadjuvant chemotherapy in patients with triple-negative or brca1/2 mutation-associated breast cancer. Breast Cancer Res Treat. 2018;168:625–30.

    Article  CAS  PubMed  Google Scholar 

  15. Davies H, Glodzik D, Morganella S, et al. Hrdetect is a predictor of brca1 and brca2 deficiency based on mutational signatures. Nat Med. 2017;23:517–25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Chen S, Zhou Y, Chen Y, et al. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–90.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Li H, Handsaker B, Wysoker A, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Freed D, Aldana R, Weber JA, et al. The sentieon genomics tools - a fast and accurate solution to variant calling from next-generation sequence data. bioRxiv. 2017. https://doi.org/10.1101/115717.

    Article  Google Scholar 

  19. Wang K, Li M, Hakonarson H. Annovar: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Tarasov A, Vilella AJ, Cuppen E, et al. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31:2032–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Li Q, Wang K. Intervar: clinical interpretation of genetic variants by the 2015 ACMG-AMP guidelines. Am J Hum Genet. 2017;100:267–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Landrum MJ, Lee JM, Benson M, et al. Clinvar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 2016;44:D862–8.

    Article  CAS  PubMed  Google Scholar 

  23. Schoen R, Land KC. A general algorithm for estimating a Markov-generated increment-decrement life table with applications to marital-status patterns. J Am Stat Assoc. 1979;74:761–76.

    Article  CAS  PubMed  Google Scholar 

  24. Popova T, Manie E, Rieunier G, et al. Ploidy and large-scale genomic instability consistently identify basal-like breast carcinomas with brca1/2 inactivation. Cancer Res. 2012;72:5454–62.

    Article  CAS  PubMed  Google Scholar 

  25. Manie E, Popova T, Battistella A, et al. Genomic hallmarks of homologous recombination deficiency in invasive breast carcinomas. Int J Cancer. 2016;138:891–900.

    Article  CAS  PubMed  Google Scholar 

  26. de Luca XM, Newell F, Kazakoff SH, et al. Using whole-genome sequencing data to derive the homologous recombination deficiency scores. NPJ Breast Cancer. 2020;6:33.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Frey MK, Pothuri B. Homologous recombination deficiency (HRD) testing in ovarian cancer clinical practice: a review of the literature. Gynecol Oncol Res Pract. 2017;4:4.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Lotan TL, Kaur HB, Salles DC, et al. Homologous recombination deficiency (HRD) score in germline brca2- versus ATM-altered prostate cancer. Modern Pathol. 2021;34:1185–93.

    Article  CAS  Google Scholar 

  29. Telli ML, Timms KM, Reid J, et al. Homologous recombination deficiency (HRD) score predicts response to platinum-containing neoadjuvant chemotherapy in patients with triple-negative breast cancer. Clin Cancer Res. 2016;22:3764–73.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Zhao EY, Shen Y, Pleasance E, et al. Homologous recombination deficiency and platinum-based therapy outcomes in advanced breast cancer. Clin Cancer Res. 2017;23:7521–30.

    Article  CAS  PubMed  Google Scholar 

  31. Eeckhoutte A, Houy A, Manie E, et al. Shallowhrd: detection of homologous recombination deficiency from shallow whole genome sequencing. Bioinformatics. 2020;36:3888–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Moschetta M, George A, Kaye SB, et al. BRCA somatic mutations and epigenetic BRCA modifications in serous ovarian cancer. Ann Oncol. 2016;27:1449–55.

    Article  CAS  PubMed  Google Scholar 

  33. Timms KM, Abkevich V, Hughes E, et al. Association of BRCA1/2 defects with genomic scores predictive of DNA damage repair deficiency among breast cancer subtypes. Breast Cancer Res. 2014;16:475.

    Article  PubMed  PubMed Central  Google Scholar 

  34. McLornan DP, List A, Mufti GJ. Applying synthetic lethality for the selective targeting of cancer. N Engl J Med. 2014;371:1725–35.

    Article  CAS  PubMed  Google Scholar 

  35. Jenner ZB, Sood AK, Coleman RL. Evaluation of rucaparib and companion diagnostics in the parp inhibitor landscape for recurrent ovarian cancer therapy. Future Oncol. 2016;12:1439–56.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Coleman RL, Oza AM, Lorusso D, et al. Rucaparib maintenance treatment for recurrent ovarian carcinoma after response to platinum therapy (ariel3): a randomised, double-blind, placebo-controlled, phase 3 trial. Lancet. 2017;390:1949–61.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Isakoff SJ, Mayer EL, He L, et al. TBCRC009: a multicenter phase ii clinical trial of platinum monotherapy with biomarker assessment in metastatic triple-negative breast cancer. J Clin Oncol. 2015;33:1902–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Natrajan R, Mackay A, Lambros MB, et al. A whole-genome massively parallel sequencing analysis of brca1 mutant oestrogen receptor-negative and -positive breast cancers. J Pathol. 2012;227:29–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Smolander J, Khan S, Singaravelu K, et al. Evaluation of tools for identifying large copy number variations from ultra-low-coverage whole-genome sequencing data. BMC Genomics. 2021;22:357.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Guan Y, Wang X, Guan K, et al. Copy number variation of urine exfoliated cells by low-coverage whole genome sequencing for diagnosis of prostate adenocarcinoma: a prospective cohort study. BMC Med Genomics. 2022;15:104.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Belkadi A, Bolze A, Itan Y, et al. Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants. Proc Natl Acad Sci U S A. 2015;112:5473–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Van Roy N, Van Der Linden M, Menten B, et al. Shallow whole genome sequencing on circulating cell-free DNA allows reliable noninvasive copy-number profiling in neuroblastoma patients. Clin Cancer Res. 2017;23:6305–14.

    Article  PubMed  Google Scholar 

  43. Imanishi S, Naoi Y, Shimazu K, et al. Clinicopathological analysis of homologous recombination-deficient breast cancers with special reference to response to neoadjuvant paclitaxel followed by FEC. Breast Cancer Res Treat. 2019;174:627–37.

    Article  CAS  PubMed  Google Scholar 

  44. Marquard AM, Eklund AC, Joshi T, et al. Pan-cancer analysis of genomic scar signatures associated with homologous recombination deficiency suggests novel indications for existing cancer drugs. Biomark Res. 2015;3:9.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Kim SJ, Sota Y, Naoi Y, et al. Determining homologous recombination deficiency scores with whole exome sequencing and their association with responses to neoadjuvant chemotherapy in breast cancer. Transl Oncol. 2021;14:100986.

    Article  CAS  PubMed  Google Scholar 

  46. Wang C, Zhang J, Wang Y, et al. Prevalence of brca1 mutations and responses to neoadjuvant chemotherapy among brca1 carriers and non-carriers with triple-negative breast cancer. Ann Oncol. 2015;26:523–8.

    Article  CAS  PubMed  Google Scholar 

  47. Hahnen E, Lederer B, Hauke J, et al. Germline mutation status, pathological complete response, and disease-free survival in triple-negative breast cancer: secondary analysis of the geparsixto randomized clinical trial. JAMA Oncol. 2017;3:1378–85.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Pohl-Rescigno E, Hauke J, Loibl S, et al. Association of germline variant status with therapy response in high-risk early-stage breast cancer: a secondary analysis of the geparocto randomized clinical trial. JAMA Oncol. 2020;6:744–8.

    Article  PubMed  Google Scholar 

  49. Sharma P, Barlow WE, Godwin AK, et al. Impact of homologous recombination deficiency biomarkers on outcomes in patients with triple-negative breast cancer treated with adjuvant doxorubicin and cyclophosphamide (SWOG S9313). Ann Oncol. 2018;29:654–60.

    Article  CAS  PubMed  Google Scholar 

  50. Kaklamani VG, Jeruss JS, Hughes E, et al. Phase ii neoadjuvant clinical trial of carboplatin and eribulin in women with triple negative early-stage breast cancer (NCT01372579). Breast Cancer Res Treat. 2015;151:629–38.

    Article  CAS  PubMed  Google Scholar 

Download references

Funding

The study was funded by the Key Research-Development Program of Zhejiang Province (Grant number: 2019C04001), the Natural Science Foundation of Zhejiang Province (Grant number: LY21H160005), the Medical and Health Research Project of Zhejiang Province (Grant number: 2021KY566), and the fourth round of Ningbo Key Medical Discipline (Grant number: 2022-F03).

Author information

Authors and Affiliations

Authors

Contributions

Jia-Ni Pan: Conceptualization, Software, Formal Analysis, Writing - Original Draft; Pu-Chun Li: Data Curation, Methodology, Writing - Original Draft; Meng Wang: Software, Methodology, Validation; Ming-Wei Li: Visualization, Conceptualization, Writing - Original Draft; Xiao-Wen Ding: Investigation, Funding Acquisition; Tao Zhou: Formal Analysis, Software; Hui-Na Wang: Project Administration; Yun-Kai Wang: Validation, Investigation; Li-Bin Chen: Software, Visualization; Rong Wang: Resources, Supervision; Wei-Wu Ye: Validation, Resources; Wei-Zhu Wu: Funding Acquisition, Supervision; Feng Lou: Formal analysis, Validation, Software; Xiao-Jia Wang: Visualization, Resources, Supervision; Wen-Ming Cao: Conceptualization, Funding Acquisition, Supervision, Writing - Review & Editing.

Corresponding authors

Correspondence to Feng Lou, Xiao-Jia Wang or Wen-Ming Cao.

Ethics declarations

Ethics approval and consent to participate

The study was reviewed and approved by the Ethical Committee of Zhejiang Cancer Hospital and was performed in accordance with the Declaration of Helsinki.

Competing interests

The authors have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pan, JN., Li, PC., Wang, M. et al. AcornHRD: an HRD algorithm highly associated with anthracycline-based neoadjuvant chemotherapy in breast cancer in China. Eur J Med Res 29, 366 (2024). https://doi.org/10.1186/s40001-024-01936-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40001-024-01936-y