2,061
Views
1
CrossRef citations to date
0
Altmetric
Research Paper

Construction and validation of nomograms for predicting the prognosis of grade 3 endometrial endometrioid adenocarcinoma cancers: a SEER-based study

, , &
Pages 1752-1765 | Received 09 Feb 2021, Accepted 21 Apr 2021, Published online: 11 May 2021

ABSTRACT

Most cases of endometrial adenocarcinoma (EAC) are diagnosed early and have a good prognosis; however, grade 3 (G3) EACs have poor outcomes. We retrospectively analyzed the data of 11,519 patients with G3 EACs registered between 2004 and 2015 in the Surveillance, Epidemiology, and End Results Program database and constructed a nomogram to guide clinicians in decision-making and accurate prediction of the prognosis. The caret package was used to divide samples into a training set and a validation set. Univariate and multivariate Cox regression analyses were performed, and a nomogram was constructed. A calibration curve was plotted, and a decision curve analysis was performed to verify the accuracy and clinical utility in both cohorts. The Cox regression analysis revealed that age, race, tumor size, number of lymph nodes resected, International Federation of Gynecology and Obstetrics stage, tumor/node stage, and adjuvant therapy were the prognostic factors for G3 EAC, and these were included in the nomogram. The area under the curve values of the training cohort for 1-, 3-, and 5-year were 0.832, 0.798, and 0.784, respectively for the overall survival (OS) group, and 0.858, 0.812, and 0.799, respectively for the cancer specific survival (CSS) group. A nomogram was constructed to predict the survival rate of patients with G3 EACs more accurately. The predictive nomogram will help clinicians manage patients with G3 EACs more effectively in terms of clinical prognosis.

Graphical abstract

Introduction

Endometrial carcinomas (ECs) are some of the most common malignant tumors of the female reproductive system. ECs had an estimated incidence of 65,620 new cases and 12,590 deaths in 2020 in the United States [Citation1]. The main clinical presentation of early EC is abnormal vaginal bleeding. While patients with ECs that are diagnosed early have good prognoses, with a 5-year overall survival (OS) of about 90%, the prognoses of patients with advanced or high-grade ECs remain poor [Citation2]. High-grade ECs account for 10–20% of all ECs and 40% of all mortality due to ECs. The pathological types of high-grade EC mainly include grade 3 (G3) endometrial adenocarcinoma (EAC), dedifferentiated carcinoma, undifferentiated carcinoma, clear cell carcinoma, serous carcinoma, mixed adenocarcinoma, and carcinosarcoma [Citation3]. The factors suggested by the International Federation of Gynecology and Obstetrics (FIGO) staging system are currently followed to determine the prognosis of patients with ECs; however, their performance in predicting the individual survival risk is poor due to the low accuracy and omission of independent risk factors, such as age, for the patients’ survival outcomes [Citation4–6]. Therefore, an individualized clinical prediction model for patients with G3 EACs is necessary.

In this study, the clinical characteristics and prognostic factors of patients with G3 EACs were obtained from the Surveillance, Epidemiology, and End Results (SEER) Program database from 2004 to 2015. We aimed to construct G3 EAC nomogram models based on the SEER data and predict the survival to meet the current clinical requirements. Our hypothesis is that a predictive nomogram will help clinicians manage patients with G3 EACs more effectively in terms of accurate clinical prognosis, which is very important to guide decision-making and predicting the prognoses of these patients.

Materials and methods

Data sources

Data were downloaded from the SEER Program database, and the patients were from the SEER population-based cancer registry (2004–2015 data set). The demographic and clinical characteristics of the patients were defined as follows: year of diagnosis (2004–2007, 2008–2011, or 2012–2015), race (white, black, or other), marital status (married, unmarried, or other), number of lymph nodes resected (1–10, >10, or none), FIGO stage (I, II, III, or IV), primary tumor stage (T1, T2, T3, or T4), regional lymph node stage (N0 or N1), distant metastases (M0 or M1), and neoadjuvant therapy (chemotherapy alone, radiotherapy alone, combination therapy, or neither therapy).

Inclusion and exclusion criteria

EAC samples were included according to the following criteria: (1) site recode International Classification of Diseases for Oncology, 3rd edition/WHO 2008 (C540-C543, C548-C549); (2) history code (8380–8383); (3) behavior code (3); (4) G3 (grade distinction); and (5) survival time >30 days. Samples were excluded if the following criteria were noted: (1) missing values and unknown clinical/tissue characteristics; and (2) ECs occurring in addition to other primary malignant tumors.

Statistical analysis

R 3.6.1 was used for statistical analysis. X-tile software is a new bioinformatics tool suitable for biomarker evaluation and result-based cut point optimization(Yale University, New Haven, USA). That is, different values are used as cutoff values to group for statistical testing. The result with the smallest p-value of the test result can be considered as the best cutoff value. Based on the results of X-tile software, we found that in OS group, when age was classified into 3 subgroups: <64 years, 64–77 years, and >77 years, the prognosis is the most significant among different subgroups (Supplementary Figure 1 abc). When Tumor size was classified as <48 mm, and ≥ 48 mm (Supplementary Figure 1 def), there is a significant prognostic difference between the two groups. Similarly. In CSS group, the best cutoff value for age is: <59 years, 59–69 years, and >69 years (Supplementary Figure1 ghi). The best cutoff value for Tumor size is: <60 mm, and ≥ 60 mm (Supplementary Figure1 jkl). Univariate Cox regression was used to determine the risk factors related to OS and CSS. Patients with a P-value <0.05 in the univariate Cox regression analysis were included in the multivariate Cox regression analysis (variable screening method: bidirectional) to determine the independent prognostic factors and establish nomograms for predicting OS or CSS based on independent factors. Univariate and multivariate analyses were performed using the survival package, receiver operating characteristic (ROC) curves were plotted with the survival ROC package. Nomogram was drawn based on the nomogram function in the rms package.

Results

Flowchart of the analysis

In order to facilitate the understanding of the research, a methodology flowchart is provided in . A total of 11,519 eligible patients with G3 EACs between 2004 and 2015 were enrolled from the SEER Program database. Patients were divided into a training set (OS: n = 3721; CSS: n = 3192) and a validation set (OS: n = 2480; CSS: n = 2126). There were no statistically significant differences in demographic or clinical characteristics between the two groups. list the demographic data and tumor characteristics of the patients in the OS and CSS groups, respectively. Subsequently, univariate and multivariate Cox regression analyses were performed to construct a nomogram and predict the 1-, 3-, and 5-year survival of the patients. The calibration plot and decision curve analysis (DCA) curve were used to evaluate the accuracy and clinical applicability of the model.

Table 1. Clinical information of OS cohort

Table 2. Clinical information of CSS cohort

Figure 1. Flow chart of analysis

Figure 1. Flow chart of analysis

Independent prognostic factors for G3 EAC

In the OS training set, univariate and multivariate Cox regression analysis of all the variables revealed that age, race, tumor size, number of lymph nodes resected, FIGO stage, tumor/node (T/N) stage, and adjuvant therapy were independent prognostic factors (P < 0.05, ). In the CSS training set, univariate and multivariate Cox analyses were also performed for all variables (P < 0.05, ). The results showed that age, race, tumor size, number of lymph nodes resected, FIGO stage, T/N stage, and adjuvant therapy were independent prognostic factors of CSS. The results were consistent with those of OS. To visualize these trends, we created a forest plot of the OS (Supplementary Figure 2) and CSS groups (Supplementary Figure 3).

Table 3. Univariate and multivariate cox analysis in OS

Table 4. Univariate and multivariate cox analysis in CSS

In addition, we plotted the survival curve of each significant variable in the univariate Cox regression analysis of the OS data set. We found that all the variables were related to prognosis except the age at diagnosis (). The survival rate of patients treated with postoperative radiotherapy alone was significantly better than those treated with postoperative chemotherapy alone, and the survival benefit of patients treated with combined postoperative chemoradiotherapy was better than that of patients treated with postoperative chemotherapy ()). The survival rate of the patients was not only affected by age, race, tumor size, number of lymph nodes resected, FIGO stage, and T/N stage (), but also by marital status. The survival rate of married patients was better than that of unmarried patients ()). The prognoses of black patients with G3 EACs were generally worse than those of white patients ()). Consistent results were observed in the CSS univariate survival curve ().

Figure 2. OS survival curves in patients with G3 EACs

Figure 2. OS survival curves in patients with G3 EACs

Figure 3. CSS survival curves in patients with G3 EACs

Figure 3. CSS survival curves in patients with G3 EACs

Construction of nomograms related to OS and CSS

We constructed nomogram models using the clinical characteristics of age, race, tumor size, number of lymph nodes resected, FIGO stage, T/N stage, and adjuvant therapy to predict OS and CSS based on the multivariate Cox regression analysis results. In the nomograms, the length of the line corresponds to the influence of the different variables, and the different values of the variables correspond to outcomes. The OS nomogram showed that FIGO stage had the maximum impact on prognosis, followed by age, number of lymph nodes resected, and N stage ()). The CSS nomogram showed that age and the FIGO stage had a considerable influence on CSS, followed by the number of lymph nodes resected, T stage, and the N stage ()). Each number/category of these variables was assigned a score on a points scale. After the total score was calculated and it was located on the total points scale, a straight line drawn to the 1-, 3-, and 5-year survival probability scale showed the estimated OS or CSS at each time point.

Figure 4. (a): The OS nomogram model. (b): The CSS nomogram model

Figure 4. (a): The OS nomogram model. (b): The CSS nomogram model

Clinical evaluation of the nomogram models

To evaluate the accuracy of the nomograms, we drew a calibration curve on each nomogram. The calibration diagrams showed consistency between the predicted values (x-axis) and observed values (y-axis) at 1, 3, and 5 years for OS and CSS, in both the training ()) and validation sets ()), indicating that the model had high accuracy.

Figure 5. (a): OS calibration diagram in the training set. (b): CSS calibration diagram in the training set. (c): OS calibration diagram in the validation set. (d): CSS calibration diagram in the validation set

Figure 5. (a): OS calibration diagram in the training set. (b): CSS calibration diagram in the training set. (c): OS calibration diagram in the validation set. (d): CSS calibration diagram in the validation set

Clinical utility evaluation of the nomograms

A DCA is a simple method to evaluate clinical prediction models, diagnostic tests, and molecular markers. To demonstrate the advantages of the nomogram models in the training set, we compared the 1-, 3-, and 5-year ROC curves of single variables (age, FIGO stage, marital status, tumor size, T/N stage, and adjuvant therapy) (). The results showed that the 1-, 3-, and 5-year area under the curve (AUC) values of the OS nomogram model were the higher 0.832, 0.798, and 0.784, respectively). The 1-, 3-, and 5-year AUC values of the CSS nomogram model were 0.858, 0.812, and 0.799, respectively (). The net benefits from each nomogram were higher than those from a single clinical variable in the 1-, 3-, and 5-year DCA curves (). These results demonstrated the good clinical utility of the nomograms in predicting the 1-, 3-, and 5-year survival of patients with G3 EACs, in turn confirming their positive net benefits and wide practical probability thresholds. Consistent results were obtained from the validation set ().

Figure 6. Clinical utility of the nomogram models in the training set. (a–c): ROC curve of each model at 1, 3, and 5 years (OS). (d–f): DCA curve of each model at 1, 3, and 5 years (OS)

Figure 6. Clinical utility of the nomogram models in the training set. (a–c): ROC curve of each model at 1, 3, and 5 years (OS). (d–f): DCA curve of each model at 1, 3, and 5 years (OS)

Figure 7. (a–c): ROC curve of each model at 1, 3, and 5 years (CSS). (d–f): DCA curve of each model at 1, 3, and 5 years (CSS)

Figure 7. (a–c): ROC curve of each model at 1, 3, and 5 years (CSS). (d–f): DCA curve of each model at 1, 3, and 5 years (CSS)

Figure 8. Clinical utility of the nomogram models in the validation set. (a–c): ROC curve of each model at 1, 3, and 5 years (OS). (d–f): DCA curve of each model at 1, 3, and 5 years (OS)

Figure 8. Clinical utility of the nomogram models in the validation set. (a–c): ROC curve of each model at 1, 3, and 5 years (OS). (d–f): DCA curve of each model at 1, 3, and 5 years (OS)

Figure 9. (a–c): ROC curve of each model at 1, 3, and 5 years (CSS). (d–f): DCA curve of each model at 1, 3, and 5 years (CSS)

Figure 9. (a–c): ROC curve of each model at 1, 3, and 5 years (CSS). (d–f): DCA curve of each model at 1, 3, and 5 years (CSS)

Discussion

With the continuous progress of medical diagnosis and treatment, the survival outcome of EC has been significantly improved, and almost 75% of ECs can be diagnosed in the early stages (FIGO stages I or II) [Citation7,Citation8]; however, some patients with ECs still have poor prognoses. The prognoses of patients with ECs are closely related to independent risk factors such as stage, grade, histological type, lymph node metastasis [Citation9]. However, these pathological parameters are still inadequate to predict the rates of survival and recurrence in patients with EC. In recent times, there have been many studies about the accurate prediction of the prognoses of patients with EC. An analysis of the data of 63,729 patients with ECs in the SEER Program database from 1988 to 2015 showed that the CSS nomogram was constructed using age, race, histological grade, clinical stage, tumor size, and the OS nomogram was constructed using histological grade, clinical stage, tumor size, and race, with C-indices of 0.859 and 0.782, respectively [Citation10]. However, the survival outcomes of patients with ECs of different stages, grades, and histological types are considerably different due to their high heterogeneity and different molecular characteristics.

Type I EACs account for about 65% of EACs and are associated with favorable prognoses. Type II mainly includes serous carcinoma, clear cell carcinoma, and carcinosarcoma, which are associated with low incidence, high grade of malignancy, and poor prognosis. The Bokhman dichotomy has certain limitations in predicting the prognoses of patients with ECs. In 2020, the inclusion of The Cancer Genome Atlas (TCGA) molecular typing in the clinical practice guidelines for EC diagnosis and treatment was recommended by the National Comprehensive Cancer Network and European Society of Gynecological Oncology(ESGO)/European Society for Radiotherapy and Oncology (ESTRO)/European Society for Pathology [Citation11]. Although most tumors with high copy numbers and p53 abnormalities are usually serous ECs, quite a few of them are G3 EACs [Citation12,Citation13]. In 2016, the European Society for Medical Oncology-ESGO-ESTRO guidelines defined the high-risk of recurrence group of EC as (I) endometrial carcinoma (type 1), FIGO stage IB, grade 3 tumor (T1/G3 EAC); (II) non-endometrioid carcinoma (type 2); and (III) advanced endometrial cancer, regardless of the pathological type [Citation14]. The prognoses of patients in the high-risk recurrence group are poor, and the risk of recurrence and metastasis is high [Citation15,Citation16]. Therefore, it is important to evaluate the high-risk recurrence of EC to improve the overall survival rate of the patients. At present, there are still some controversies regarding the G3 EAC pathogenesis, prognosis, and treatment.

The C-indices of the training and validation sets in a previous study were 0.814 and 0.837, respectively, and the AUC was 0.7. These were derived by analyzing the prognostic risk factors of 1172 patients with low-grade endometrial carcinosarcoma, indicating that the constructed nomogram had good predictive ability [Citation17]. Another study showed that the mortality of patients with G3 EACs was 45% lower than that of patients with endometrial carcinosarcoma [Citation18]. The 5-year OS of patients with G3 stage I, II, and III ECs were 77.5%, 62.7%, and 49.6%, respectively, indicating poorer prognoses than those of G1 and G2 ECs, while being slightly better than those of endometrial papillary serous carcinoma and clear cell carcinoma [Citation19]. Therefore, some scholars have proposed that G3 EACs should be classified as type II. However, there has been no large-scale retrospective study about G3 EACs.

In this study, we retrospectively analyzed the data of 11,519 patients with G3 EACs registered in the SEER Program database between 2004 and 2015. The univariate Cox regression analysis showed that age, race, tumor size, number of lymph nodes resected, FIGO stage, T/N stage, and adjuvant therapy were independent prognostic factors in terms of either OS or CSS. Previous studies have identified some of these variables as being associated with the survival of patients with ECs [Citation20–22]. These clinical features were used to construct nomograms to predict the prognoses. The accuracy and clinical utility of the nomograms were tested by calibration plots and DCA, respectively. The DCA is a new method to evaluate diagnostic tests and prediction models. Our study had good clinical application value. For OS, the 1-, 3-, and 5-year AUC values of the training set were 0.832, 0.798, and 0.784, respectively. For CSS, the 1-, 3-, and 5-year AUC values were 0.858, 0.812, and 0.799, respectively. This study was the first large-scale retrospective study on G3 EACs. This is the first time that nomograms and websites for patients with G3 ECs based on SEER data have been created. Compared with previous nomograms, our research provides clinicians a more convenient and accurate prediction method. The variables required for our nomograms are common and easy to obtain in clinical practice, making them more cost-effective than other prediction methods that use TCGA molecular typing or biomarkers [Citation23].

However, our study has some limitations. First, the SEER database still lacks some clinical information that is significant for the prognosis, such as invasion of the lymphovascular space. Additionally, there is no molecular profile information, which is likely to be the future trend for precision cancer therapies. Second, the nomograms were based on data retrospectively obtained from the SEER database. A robust nomogram needs to be verified externally in multi-center clinical trials and prospective studies. In the future, we plan to explore the possibility of including more predictors to further improve the performance of the nomograms.

Conclusions

Our prognostic nomograms will provide a new method to accurately predict the survival of individual patients with G3 EACs.

Highlights

  1. Marital status affects the survival of patients with G3 EAC.

  2. FIGO stage has a great influence on the prognosis of G3 EAC patients.

  3. The nomogram has good clinical applicability in predicting the survival of G3 EAC patients

Supplemental material

Supplemental Material

Download ()

Data availability statement

Some or all data, or code generated or used during the study are available from the corresponding author by request.

https://seer.cancer.gov/

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed here

Additional information

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

  • Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020;70(1):7–30.
  • Tirosh I, Izar B, Prakadan SM, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science. 2016;352(6282):189–196.
  • Cancer Genome Atlas Research N, Albert Einstein College of M, Analytical Biological S, et al. Integrated genomic and molecular characterization of cervical cancer. Nature. 2017;543(7645):378–384.
  • Wang C, Yang C, Wang W, et al. A prognostic nomogram for cervical cancer after surgery from SEER database. J Cancer. 2018;9(21):3923–3928.
  • Wan G, Gao F, Chen J, et al. Nomogram prediction of individual prognosis of patients with hepatocellular carcinoma. BMC Cancer. 2017;17(1):91.
  • Balachandran VP, Gonen M, Smith JJ, et al. Nomograms in oncology: more than meets the eye. Lancet Oncol. 2015;16(4):e173–e80.
  • Raglan O, Kalliala I, Markozannes G, et al. Risk factors for endometrial cancer: an umbrella review of the literature. Int J Cancer. 2019;145(7):1719–1730.
  • Anderson C, Bae-Jump VL, Broaddus RR, et al. Long-term patterns of excess mortality among endometrial cancer survivors. Cancer Epidemiol Biomarkers Prev. 2021. DOI:10.1158/1055-9965.EPI-20-1631
  • Uccella S, Falcone F, Greggi S, et al. Survival in clinical stage I endometrial cancer with single vs. multiple positive pelvic nodes: results of a multi-institutional Italian study. J Gynecol Oncol. 2018;29(6):e100.
  • Zhu L, Sun X, Bai W. Nomograms for predicting cancer-specific and overall survival among patients with endometrial carcinoma: a SEER based study. Front Oncol. 2020;10:269.
  • Concin N, Matias-Guiu X, Vergote I, et al. ESGO/ESTRO/ESP guidelines for the management of patients with endometrial carcinoma. Radiother Oncol. 2021;154:327–353.
  • Bosse T, Nout RA, McAlpine JN, et al. Molecular classification of grade 3 endometrioid endometrial cancers identifies distinct prognostic subgroups. Am J Surg Pathol. 2018;42(5):561–568.
  • Bell DW, Ellenson LH. Molecular genetics of endometrial carcinoma. Annu Rev Pathol. 2019;14(1):339–367.
  • Colombo N, Creutzberg C, Amant F, et al. ESMO-ESGO-ESTRO consensus conference on endometrial cancer: diagnosis, treatment and follow-up. Ann Oncol. 2016 27;(1)16–41.
  • Colombo N, Creutzberg C, Amant F, et al. ESMO-ESGO-ESTRO consensus conference on endometrial cancer: diagnosis, treatment and follow-up. Radiother Oncol. 2015;117(3):559–581.
  • Chen JL, Huang YS, Huang CY, et al. Impact of adjuvant radiotherapy on the survival of women with optimally resected stage III endometrial cancer in the era of modern radiotherapy: a retrospective study. Radiat Oncol. 2020;15(1):72.
  • Wu J, Zhang H, Li L, et al. A nomogram for predicting overall survival in patients with low-grade endometrial stromal sarcoma: a population-based analysis. Cancer Commun (Lond). 2020;40(7):301–312.
  • Bansal N, Herzog TJ, Seshan VE, et al. Uterine carcinosarcomas and grade 3 endometrioid cancers: evidence for distinct tumor behavior. Obstet Gynecol. 2008;112(1):64–70.
  • McGunigal M, Liu J, Kalir T, et al. Survival differences among uterine papillary serous, clear cell and grade 3 endometrioid adenocarcinoma endometrial cancers: a national cancer database analysis. Int J Gynecol Cancer. 2017;27(1):85–92.
  • Wakayama A, Kudaka W, Matsumoto H, et al. Lymphatic vessel involvement is predictive for lymph node metastasis and an important prognostic factor in endometrial cancer. Int J Clin Oncol. 2018;23(3):532–538.
  • Helpman L, Perri T, Lavee N, et al. Impact of adjuvant treatment on outcome in high-risk early-stage endometrial cancer: a retrospective three-center study. Int J Gynecol Cancer. 2019;29(1):133–139.
  • Chapman BV, Swanick CW, Ning MS, et al. Adjuvant combined-modality therapy for stage IIIC endometrioid and non-endometrioid endometrial cancer. Gynecol Oncol. 2019;154(1):22–28.
  • Coll-de La Rubia E, Martinez-Garcia E, Dittmar G, et al. Prognostic biomarkers in endometrial cancer: a systematic review and meta-analysis. J Clin Med. 2020;9(6):1900.