Skip to main content

Specific mortality in patients with diffuse large B-cell lymphoma: a retrospective analysis based on the surveillance, epidemiology, and end results database

Abstract

Background

The full potential of competing risk modeling approaches in the context of diffuse large B-cell lymphoma (DLBCL) patients has yet to be fully harnessed. This study aims to address this gap by developing a sophisticated competing risk model specifically designed to predict specific mortality in DLBCL patients.

Methods

We extracted DLBCL patients’ data from the SEER (Surveillance, Epidemiology, and End Results) database. To identify relevant variables, we conducted a two-step screening process using univariate and multivariate Fine and Gray regression analyses. Subsequently, a nomogram was constructed based on the results. The model’s consistency index (C-index) was calculated to assess its performance. Additionally, calibration curves and receiver operator characteristic (ROC) curves were generated to validate the model’s effectiveness.

Results

This study enrolled a total of 24,402 patients. The feature selection analysis identified 13 variables that were statistically significant and therefore included in the model. The model validation results demonstrated that the area under the receiver operating characteristic (ROC) curve (AUC) for predicting 6-month, 1-year, and 3-year DLBCL-specific mortality was 0.748, 0.718, and 0.698, respectively, in the training cohort. In the validation cohort, the AUC values were 0.747, 0.721, and 0.697. The calibration curves indicated good consistency between the training and validation cohorts.

Conclusion

The most significant predictor of DLBCL-specific mortality is the age of the patient, followed by the Ann Arbor stage and the administration of chemotherapy. This predictive model has the potential to facilitate the identification of high-risk DLBCL patients by clinicians, ultimately leading to improved prognosis.

Introduction

DLBCL, known as diffuse large B-cell lymphoma, is a highly heterogeneous disease and is the most common type of non-Hodgkin’s lymphoma, accounting for approximately 30–40% of all lymphoma cases [1]. While there have been significant advancements in the diagnosis and treatment of DLBCL in recent years, it is disheartening to note that 40–50% of patients with DLBCL still remain incurable [2]. For patients who experience a relapse or have refractory DLBCL, the prognosis is generally poor [3]. Hence, it becomes imperative to identify highly specific and sensitive prognostic markers that can effectively identify high-risk patients, thereby enabling improved treatment decisions and ultimately enhancing patient survival.

Several studies have examined prognostic factors in patients with DLBCL [2, 4,5,6,7]. However, many of these studies have relied on the conventional Cox proportional hazards model [7, 8]. It is important to note that competing mortality events frequently arise during the analysis of survival data. Yet, the traditional Cox regression often fails to consider the occurrence of these competing mortality events, leading to potential misjudgment of patient prognosis, irrespective of the independence between such events. If a patient dies from causes other than DLBCL, and the Cox regression fails to account for these competing mortality events, it introduces bias into the analysis results. The Fine and Gray model enables us to analyze data while taking into account competing risks. Similar to the Cox model, the Fine and Gray model utilizes a risk set function, but it also incorporates the concept of competition between different types of events. This model estimates the probability of each event by comparing the event-specific risk set function with the overall risk set function, while accounting for the impact of other event types. Competing risk models, specifically the Fine and Gray proportional hazards model, demonstrate excellent capability in addressing the correlation between cancer outcomes and competing events, ultimately leading to a remarkable enhancement in the accuracy of prognostic analysis [9]. Despite its potential, this methodology remains largely underutilized. Leveraging the SEER database, a comprehensive and extensive multi-center database with credible data sources, this study aims to establish a competing risk model based on DLBCL patients. The objective is to investigate the factors that influence cause-specific mortality in DLBCL patients.

Methods

Study cohort

We extracted data from the SEER database [Incidence—SEER Research Plus Data, 17 Registries, Nov 2021 Sub (2000–2019)] using SEER Stat (Version 8.4.1) software. The data pertain to patients diagnosed with DLBCL between 2000 and 2015. To ensure data quality, patients with less than 1 month of follow-up and those with one or more missing variables were excluded from the analysis. The collected data encompassed demographic information such as sex, race, age, marital status, median household income, and place of residence. It also included tumor characteristics such as site, primary site, presence of B symptoms, number of malignant tumors, and whether it was the first primary tumor. Additionally, the data recorded the Ann Arbor Stage, surgical and chemoradiotherapy information including surgery, radiation, chemotherapy, the sequence of systemic therapy and surgery, and treatment timing. Furthermore, the cause of death and follow-up information were documented. The diagnosis of diffuse large B-cell lymphoma (DLBCL) was made based on the criteria outlined in the International Classification of Diseases for Oncology, 3rd Edition (ICD-O-3). The staging of lymphoma was determined using the Ann Arbor stage system (AASS). Regarding the analysis of continuous variables, the subjects were categorized into different groups based on their treatment timing (the interval between the diagnosis and initiation of treatment): more than 1 month and 1 month or less. The subjects were also divided into 5 age groups: 0–19 years, 20–39 years, 40–59 years, 60–79 years, and 80–100 years. Furthermore, based on the median annual household income, the subjects were divided into 3 groups: less than $50,000, $50,000–$74,999, and greater than $75,000.

Statistical analysis

The study cohort was divided into a training cohort and a validation cohort in a ratio of 7:3. The purpose of this division was to use the training cohort to train the model and the validation cohort to test the model. All patient features were divided into the training and validation cohorts, and the balance of the data was assessed by comparing the differences between the two groups. Categorical variables were presented as frequencies and percentages (25%), and chi-square tests were used to compare the differences between the two groups. Normally distributed continuous variables were displayed as means and standard deviations [Mean (S.E.)], and t-tests were used to compare the differences. Non-normally distributed continuous variables were presented as medians and quartiles (median [IQR]), and rank sum tests were used to compare the differences. In the competing risk model, the outcome event of interest was death from DLBCL, and death from other causes was treated as a competing event. Variables were screened in two steps using univariate and multivariate Fine and Gray regression analyses. Variables that were statistically significant in the univariate analysis were included in the multivariate analysis. The variables that remained statistically significant in the multivariate analysis were used to construct a competing risk model and develop a corresponding nomogram. The model’s C-index was calculated, and its predictions were compared with the observed actual values. Calibration curves and ROC curves were plotted to assess the consistency and accuracy of the model. All statistical analyses were performed using R 4.2.1 (https://www.r-project.org/). The Fine and Gray regression analysis and competing risk modeling were conducted using the riskRegression (2021.10.10) software package. The pmsampsize (1.1.3) package was used to calculate the sample size and plot ROC and calibration curves, while the rms package was used for nomogram plotting.

Results

Patient features

A total of 117,171 patients diagnosed with diffuse large B-cell lymphoma (DLBCL) were identified from the dataset titled “Incidence—SEER Research Plus Data, 17 Registries, Nov 2021 Sub (2000–2019)” that was submitted to the SEER database in 2021. Patients who had less than 1 month of follow-up (N = 9758), patients without Ann Arbor Stage data (N = 29,974), and patients with one or more missing variables (N = 53,037) were excluded from the study (Fig. 1). Eventually, a total of 24,402 patients were included in this study. Among them, 6459 died from DLBCL and 4076 died from other causes. The median survival time for patients in the entire study cohort was 58 months (IQR: [16.00, 83.00]). The majority of patients were between the ages of 60 and 79 years (48.0%), and there was a higher proportion of men compared to women (57.1%). The characteristics of patients in both the training cohort and the validation cohort are described in Table 1. There were no statistically significant differences in each variable between the two cohorts (P > 0.05), indicating a balanced distribution of data.

Fig. 1
figure 1

Patient selection flowchart

Table 1 Description of features of all patients

Feature selection

The selection of features was carried out using univariate and multivariate Fine and Gray regression analyses. In the univariate analyses, only variables that showed statistical significance were included in the multivariate analyses. Similarly, in the multivariate analyses, only variables that showed statistical significance were included in the final model. The univariate analysis revealed that 14 variables were found to be statistically significant and were, therefore, considered potential risk factors for cause-specific mortality in DLBCL patients. These variables included race, tumor site (extranodal or nodal), primary site, Ann Arbor stage, whether surgery was performed, whether radiation therapy was administered, whether chemotherapy was administered, sequence of systemic therapy and surgery, treatment timing, presence of B symptoms, whether it was the first primary tumor, age, marital status, and median annual household income. Upon conducting a multivariate analysis with the above variables included in the adjusted model, it was revealed that 13 variables remained statistically significant and were identified as independent risk factors for cause-specific mortality in DLBCL patients (Additional file 1: Table S1). These variables included race, tumor site (extranodal or nodal), Ann Arbor stage, whether surgery was performed, whether radiation therapy was administered, whether chemotherapy was administered, sequence of systemic therapy and surgery, treatment timing, presence of B symptoms, whether it was the first primary tumor, age, marital status, and median annual household income. These 13 variables were further included in the competing risk model (Table 2). Furthermore, to effectively compare the disparities between Fine and Gray regression and Cox regression, we incorporated the aforementioned variables into the multivariate Cox regression analysis. The findings revealed that age, Ann Arbor Stage, b symptoms, absence of chemotherapy, absence of radiation, absence of surgery, the sequence of systemic therapy and surgery, and treatment timing exerted a more prominent influence on the risk of all-cause mortality (in the Cox proportional risk model) compared to the risk of DLBCL-specific mortality (in the Competing Risk Model) (refer to Table 3).

Table 2 The competing risk model
Table 3 Comparison of multivariate competing risk analysis and multivariate Cox regression analysis

Model development and validation

The competing risk model incorporated 13 independent risk factors, achieving a C-statistic of 0.709 (± 0.002). To facilitate the application of this model, a corresponding nomogram, as shown in Fig. 2, was constructed. The points assigned to each individual variable were determined based on the patients’ classification, and the sum of these points yielded the Total Points. By matching the Total Points with the corresponding predictor, the cause-specific survival probability of patients could be estimated. The performance of the model was further evaluated through ROC curve analysis. The area under the curve (AUC) for 6-month, 1-year, and 3-year mortality in DLBCL patients was 0.748 (95% confidence interval [CI] [0.736, 0.759]), 0.718 (95% CI [0.708, 0.728]), and 0.698 (95% CI [0.689, 0.707]) in the training cohort, respectively. In the validation cohort, the AUC values were 0.747 (95% CI [0.729, 0.765]), 0.721 (95% CI [0.706, 0.737]), and 0.697 (95% CI [0.683, 0.711]), as depicted in Fig. 3. Moreover, the calibration curve analysis demonstrated that the model’s predicted results aligned closely with the actual values, as illustrated in Fig. 4. This further confirms the reliability and accuracy of the model in predicting outcomes.

Fig. 2
figure 2

The nomogram of the competing risk model

Fig. 3
figure 3

a The results of ROC curve analysis in the training cohort. b The results of ROC curve analysis in the validation cohort

Fig. 4
figure 4

a The results of calibration curve analysis in the training cohort. b The results of calibration curve analysis in the validation cohort

Discussion

We have devised a competing risk model in this research to forecast cause-specific mortality among DLBCL patients, which is then represented by a graphical nomogram. The model demonstrated favorable predictive accuracy and can offer reliable prognostic insights. This, in turn, may enhance clinicians’ comprehension of DLBCL and facilitate the provision of targeted clinical assistance to individuals at high risk.

The results of the feature selection demonstrated that there are 13 variables that serve as independent predictors of cause-specific mortality in DLBCL patients. These variables include race, tumor site (extranodal or nodal), Ann Arbor stage, surgery, radiation therapy, chemotherapy, sequence of systemic therapy and surgery, treatment timing, B Symptoms, whether it was the first primary tumor, age, marital status, and median annual household income. According to the results obtained from the nomogram, patient age was identified as the most accurate predictor, followed by Ann Arbor stage and chemotherapy. With regard to treatment, our study revealed that the absence of surgery, radiation therapy, chemotherapy, and systemic therapy was associated with a poorer prognosis for patients. It is widely acknowledged that chemotherapy is the primary treatment for DLBCL, and its efficacy has been supported by numerous studies [10, 11]. Radiation therapy is often used in conjunction with chemotherapy and has been shown to improve clinical symptoms in relapsed or refractory DLBCL patients following chemotherapy [12]. For the majority of lymphoma patients, chemotherapeutic agents are deemed more effective, thus surgical treatment is generally not recommended [13]. In fact, one study has shown that surgical treatment for lymphoma does not improve patient prognosis [14]. Nevertheless, there are certain specific cases where surgical intervention is necessary. For example, patients with primary gastrointestinal lymphoma may present with intestinal obstruction or splenomegaly alongside symptoms of compression [15]. In terms of demographic information, our findings suggest that Asians have a significantly higher mortality rate among DLBCL patients as compared to whites, and divorced patients exhibit a higher mortality rate than married patients. Furthermore, age [16], Ann Arbor stage [17], and B symptoms [18] have all been identified as predictors for cause-specific mortality in DLBCL patients, which is consistent with our study’s findings.

We utilized both the Fine and Gray model and the Cox proportional risk model to evaluate the influence of various variables on the outcome. To determine the impact of each variable, we computed the hazard ratio (HR). Table 3 illustrates the disparities between the variables in the two models. Considering the independent risk factors, we observed that age, Ann Arbor Stage, presence of b symptoms, absence of chemotherapy, absence of radiation, absence of surgery, and the sequence of systemic therapy and surgery significantly affected the risk of all-cause mortality in comparison to DLBCL-specific mortality. Regarding the independent protective factors, we found that the presence of a first primary tumor, marital status, median household income, and timing of treatment exerted a more pronounced influence on the risk of all-cause mortality compared to the risk of DLBCL-specific mortality. However, the effect of race (White) on the risk of all-cause mortality was relatively smaller.

Several studies have been conducted to assess the prognosis of patients with DLBCL using the SEER database. One particular study focused on the risk of developing second primary malignancies in DLBCL patients and revealed that the oral cavity and pharynx were the most vulnerable regions for malignant tumor development [19]. Other studies, encompassing diverse populations with DLBCL, investigated the prognosis of patients [20,21,22,23]. However, it is worth noting that the majority of these studies relied on the conventional Cox proportional risk model. In contrast, our study adopts a competing risk model, which takes into account both DLBCL-specific mortality events and the influence of competing events on the analysis outcomes. Most prognostic studies commonly utilize the traditional Kaplan–Meier method and Cox regression model to analyze survival patterns and identify significant prognostic indicators [24]. Nevertheless, real-world medical studies often involve the occurrence of multiple competing outcome events rather than a single event. Consequently, it becomes imperative to employ a competing risk model to mitigate the bias resulting from the presence of these competing risk events [25, 26]. The Competitive Risk Model, also known as the Fine and Gray model, was proposed by Fine and Gray in 1999 to address proportional risk situations in which competing risks are present. Unlike traditional survival models, this model focuses on modeling the subdistribution hazard function instead of the risk function for survival time. The subdistribution hazard function calculates the conditional risk of a specific event occurring before a certain point in time, considering the occurrence of competing events. This model is particularly useful when the endpoint event of a study, such as disease recurrence, can be “competed for” by other types of events, such as patient death from other causes. In such cases, traditional survival analysis methods may not provide accurate results. Using competing risk models, researchers can obtain more precise risk estimates and evaluate and compare the risk of specific events while accounting for the influence of other risk events.

The nomogram, a visual representation of models [27], has been widely recognized for its ability to depict complex relationships. Numerous studies have shown that machine learning models, such as random forests, neural networks, and support vector machines, can effectively capture nonlinear patterns in the data, thereby enhancing their predictive power [28,29,30,31]. However, one drawback of these models is their “black box” nature, which limits our understanding of the underlying computational process and the importance of each feature. In contrast, the nomogram offers a simple and intuitive graphical interface that allows for the quantification of the risk associated with each feature, making it particularly valuable for clinical applications [32]. The SEER database, maintained by the National Cancer Institute (NCI) [33], is a comprehensive and diverse collection of cancer incidence and survival data for specific populations in the United States. It serves as a valuable resource for researchers and healthcare professionals in understanding and analyzing cancer trends. With its large sample size and inclusion of multiple centers and racial backgrounds, the SEER database ensures that statistical findings derived from it are generally representative and reliable. This database offers detailed information on various aspects of cancer cases, such as patient demographics, cancer type, onset time, treatment approaches, and follow-up outcomes. Researchers can utilize this wealth of data to gain insights into the impact of cancer and develop effective strategies for diagnosis, treatment, and prevention.

In summary, an extensive dataset was utilized to develop a competing risk model for the prediction of cause-specific mortality in DLBCL patients. The model was effectively visualized as a nomogram and displayed favorable predictive performance, offering valuable information. However, it is crucial to acknowledge certain limitations within this study. Firstly, although the model exhibited satisfactory performance within both the training and validation cohorts, external validation remains necessary and is planned for the subsequent phase of our research. Secondly, due to constraints imposed by public databases, certain variables of interest were regrettably excluded from this investigation, including the specific chemotherapy agents administered to the patients. Furthermore, the lack of clarity in the categorization of certain variables within the database hinders the interpretation of their clinical significance. One such instance is the subcategory labeled as “Other”. Additionally, the potential impact of small subcategorical sample sizes on the model’s performance should be taken into consideration. However, it should be noted that the large sample sizes in this study mitigated this concern.

Conclusion

Based on the SEER database, we have successfully developed a competing risk model for predicting the specific prognosis of DLBCL patients. The model has shown excellent performance in terms of its predictive accuracy. Among the various predictors evaluated, patient age emerges as the most crucial independent factor associated with DLBCL-specific mortality. Moreover, Ann Arbor stage and chemotherapy also demonstrate significant importance in predicting the prognosis. The clinical implications of our model are noteworthy as it aids clinicians in promptly identifying high-risk DLBCL patients. Consequently, this would facilitate the implementation of targeted clinical interventions and ultimately lead to improved patient outcomes.

Availability of data and materials

On reasonable request, the corresponding author will provide the information supporting the study’s conclusions.

References

  1. Arber DA, Orazi A, Hasserjian R, Thiele J, Borowitz MJ, Le Beau MM, Bloomfield CD, Cazzola M, Vardiman JW. The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia. Blood. 2016;127(20):2391–405.

    Article  CAS  PubMed  Google Scholar 

  2. Liang XJ, Song XY, Wu JL, Liu D, Lin BY, Zhou HS, Wang L. Advances in multi-omics study of prognostic biomarkers of diffuse large B-cell lymphoma. Int J Biol Sci. 2022;18(4):1313–27.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Zheng W, Lin Q, Issah MA, Liao Z, Shen J. Identification of PLA2G7 as a novel biomarker of diffuse large B cell lymphoma. BMC Cancer. 2021;21(1):927.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Zhu YH, Meng WJ, He LH, Jia YS, Tong ZS. Prognosis analysis of primary breast diffuse large B cell lymphoma. Zhonghua zhong liu za zhi [Chin J Oncol]. 2019;41(3):235–40.

    CAS  PubMed  Google Scholar 

  5. Gao F, Wang ZF, Tian L, Dong F, Wang J, Jing HM, Ke XY. A prognostic model of gastrointestinal diffuse large B cell lymphoma. Med Sci Monit Int Med J Exp Clin Res. 2021;27: e929898.

    Google Scholar 

  6. Huo YJ, Xu PP, Fu D, Yi HM, Huang YH, Wang L, Wang N, Ji MM, Liu QX, Shi Q, et al. Molecular heterogeneity of CD30+ diffuse large B-cell lymphoma with prognostic significance and therapeutic implication. Blood Cancer J. 2022;12(3):48.

    Article  PubMed  PubMed Central  Google Scholar 

  7. He J, Chen Z, Xue Q, Sun P, Wang Y, Zhu C, Shi W. Identification of molecular subtypes and a novel prognostic model of diffuse large B-cell lymphoma based on a metabolism-associated gene signature. J Transl Med. 2022;20(1):186.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Xu H, Li Y, Jiang Y, Wang J, Sun H, Wu W, Lv Y, Liu S, Zhai Y, Tian L, et al. A novel defined super-enhancer associated gene signature to predict prognosis in patients with diffuse large B-cell lymphoma. Front Genet. 2022;13: 827840.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Austin PC, Lee DS, Fine JP. Introduction to the analysis of survival data in the presence of competing risks. Circulation. 2016;133(6):601–9.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Mondello P, Mian M. Frontline treatment of diffuse large B-cell lymphoma: beyond R-CHOP. Hematol Oncol. 2019;37(4):333–44.

    Article  PubMed  Google Scholar 

  11. He MY, Kridel R. Treatment resistance in diffuse large B-cell lymphoma. Leukemia. 2021;35(8):2151–65.

    Article  PubMed  Google Scholar 

  12. Ng AK, Yahalom J, Goda JS, Constine LS, Pinnix CC, Kelsey CR, Hoppe B, Oguchi M, Suh CO, Wirth A, et al. Role of radiation therapy in patients with relapsed/refractory diffuse large B-cell lymphoma: guidelines from the international lymphoma radiation oncology group. Int J Radiat Oncol Biol Phys. 2018;100(3):652–69.

    Article  PubMed  Google Scholar 

  13. Ollila TA, Olszewski AJ. Extranodal diffuse large B cell lymphoma: molecular features, prognosis, and risk of central nervous system recurrence. Curr Treat Options Oncol. 2018;19(8):38.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Ayub A, Santana-Rodríguez N, Raad W, Bhora FY. Primary appendiceal lymphoma: clinical characteristics and outcomes of 116 patients. J Surg Res. 2017;207:174–80.

    Article  PubMed  Google Scholar 

  15. Abbott S, Nikolousis E, Badger I. Intestinal lymphoma—a review of the management of emergency presentations to the general surgeon. Int J Colorectal Dis. 2015;30(2):151–7.

    Article  CAS  PubMed  Google Scholar 

  16. Zhao P, Zhu L, Li L, Zhou S, Qiu L, Qian Z, Xu W, Zhang H. A modified prognostic model in patients with diffuse large B-cell lymphoma treated with immunochemotherapy. Oncol Lett. 2021;21(3):218.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Shen Z, Zhang S, Jiao Y, Shi Y, Zhang H, Wang F, Wang L, Zhu T, Miao Y, Sang W, et al. LASSO model better predicted the prognosis of DLBCL than random forest model: a retrospective multicenter analysis of HHLWG. J Oncol. 2022;2022:1618272.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Yin T, Qi L, Zhou Y, Kong F, Wang S, Yu M, Li F. CD5+ diffuse large B-cell lymphoma has heterogeneous clinical features and poor prognosis: a single-center retrospective study in China. J Int Med Res. 2022;50(9):3000605221110075.

    Article  CAS  PubMed  Google Scholar 

  19. Jiang S, Zhen H, Jiang H. Second primary malignancy in diffuse large B-cell lymphoma patients: a SEER database analysis. Curr Probl Cancer. 2020;44(1): 100502.

    Article  PubMed  Google Scholar 

  20. Yang S, Chang W, Zhang B, Shang P. What factors are associated with the prognosis of primary testicular diffuse large B-cell lymphoma? A study based on the SEER database. J Cancer Res Clin Oncol. 2023;149(12):10269–78.

    Article  CAS  PubMed  Google Scholar 

  21. Du Y, Wang Y, Li Q, Chang X, Zhang H, Xiao M, Xing S. Risk and outcome of acute myeloid leukaemia among survivors of primary diffuse large B-cell lymphoma: a retrospective observational study based on SEER database. BMJ Open. 2022;12(9): e061699.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Kuczmarski TM, Tramontano AC, Mozessohn L, LaCasce AS, Roemer L, Abel GA, Odejide OO. Mental health disorders and survival among older patients with diffuse large B-cell lymphoma in the USA: a population-based study. Lancet Haematol. 2023;10(7):e530–8.

    Article  CAS  PubMed  Google Scholar 

  23. Liu P-P, Xia Y, Bi X-W, Wang Y, Sun P, Yang H, Li Z-M, Jiang W-Q. Trends in survival of patients with primary gastric diffuse large B-cell lymphoma: an analysis of 7051 cases in the SEER database. Dis Markers. 2018. https://doi.org/10.1155/2018/7473935.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Shao W, Wang T, Huang Z, Han Z, Zhang J, Huang K. Weakly supervised deep ordinal cox model for survival prediction from whole-slide pathological images. IEEE Trans Med Imaging. 2021;40(12):3739–47.

    Article  PubMed  Google Scholar 

  25. Southern DA, Faris PD, Brant R, Galbraith PD, Norris CM, Knudtson ML, Ghali WA. Kaplan–Meier methods yielded misleading results in competing risk scenarios. J Clin Epidemiol. 2006;59(10):1110–4.

    Article  PubMed  Google Scholar 

  26. Johnstone PAS, Spiess PE, Giuliano AR. New directions in penile cancer. Lancet Oncol. 2019;20(1):16–7.

    Article  PubMed  Google Scholar 

  27. Yang Y, Wang Y, Deng H, Tan C, Li Q, He Z, Wei W, Zhou E, Liu Q, Liu J. Development and validation of nomograms predicting survival in Chinese patients with triple negative breast cancer. BMC Cancer. 2019;19(1):541.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Moll M, Qiao D, Regan EA, Hunninghake GM, Make BJ, Tal-Singer R, McGeachie MJ, Castaldi PJ, SanJoseEstepar R, Washko GR, et al. Machine learning and prediction of all-cause mortality in COPD. Chest. 2020;158(3):952–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Poirion OB, Jing Z, Chaudhary K, Huang S, Garmire LX. DeepProg: an ensemble of deep-learning and machine-learning models for prognosis prediction using multi-omics data. Genome Med. 2021;13(1):112.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Ambale-Venkatesh B, Yang X, Wu CO, Liu K, Hundley WG, McClelland R, Gomes AS, Folsom AR, Shea S, Guallar E, et al. Cardiovascular event prediction by machine learning: the multi-ethnic study of atherosclerosis. Circ Res. 2017;121(9):1092–101.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Khera R, Haimovich J, Hurley NC, McNamara R, Spertus JA, Desai N, Rumsfeld JS, Masoudi FA, Huang C, Normand SL, et al. Use of Machine learning models to predict death after acute myocardial infarction. JAMA Cardiol. 2021;6(6):633–41.

    Article  PubMed  Google Scholar 

  32. Balachandran VP, Gonen M, Smith JJ, DeMatteo RP. Nomograms in oncology: more than meets the eye. Lancet Oncol. 2015;16(4):e173-180.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Doll KM, Rademaker A, Sosa JA. Practical guide to surgical data sets: surveillance, epidemiology, and end results (SEER) database. JAMA Surg. 2018;153(6):588–9.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

The authors declare that they did not receive any funding from any source.

Author information

Authors and Affiliations

Authors

Contributions

Conception and design: Hui Xu and Guo Ji were responsible for the conception and design of the study. Jun Li and Hui Xu provided administrative support. Provision of study materials or patients: Rong Yan was responsible for providing the study materials or patients. Collection and assembly of data: Hui Xu, Rong Yan, and Chunmei Ye collected and assembled the data. Data analysis and interpretation: Hui Xu and Rong Yan conducted the data analysis and interpretation. Manuscript writing: all authors contributed to writing the manuscript. Final approval of manuscript: all authors gave their final approval of the manuscript.

Corresponding author

Correspondence to Guo Ji.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors have no conflicts of interest to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

The results of univariate and multivariate analysis in competing risk model.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, H., Yan, R., Ye, C. et al. Specific mortality in patients with diffuse large B-cell lymphoma: a retrospective analysis based on the surveillance, epidemiology, and end results database. Eur J Med Res 29, 241 (2024). https://doi.org/10.1186/s40001-024-01833-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40001-024-01833-4

Keywords