Development and validation of survival prediction tools in early and late onset colorectal cancer patients
Abstract This study aims to develop online calculators using machine learning models to predict survival probabilities for early- and late-onset colorectal cancer (EOCRC and LOCRC) over a 1- to 8-year period. We extracted data on 117,965 CRC patients from the published database spanning 2010 to 2021...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-04-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-95385-0 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849699456836960256 |
|---|---|
| author | Wanling Li Jinshan Liu Yuntong Lan Dongling Yu Bingqiang Zhang |
| author_facet | Wanling Li Jinshan Liu Yuntong Lan Dongling Yu Bingqiang Zhang |
| author_sort | Wanling Li |
| collection | DOAJ |
| description | Abstract This study aims to develop online calculators using machine learning models to predict survival probabilities for early- and late-onset colorectal cancer (EOCRC and LOCRC) over a 1- to 8-year period. We extracted data on 117,965 CRC patients from the published database spanning 2010 to 2021, divided into training and internal testing datasets. The data of 200 CRC patients from Chongqing Hospital of Jiangsu Province Hospital was used as the external testing dataset. We conducted univariate and multivariate regression analyses on the training dataset to identify key survival factors and develop predictive machine learning models. The models were evaluated using internal and external testing datasets based on AUC, accuracy, precision, recall, and F1 score. Web-based calculators were subsequently developed to predict survival curves for EOCRC and LOCRC patients under different treatment strategies. In the multivariate Cox regression analysis, 16 and 18 variables were independently significant survival factors for EOCRC and LOCRC, respectively. In the EOCRC group, the machine learning models achieved AUC values of 0.880 and 0.804 in the internal and external testing cohorts. For the LOCRC group, the machine learning models exhibited AUC values of 0.857 and 0.823 in the internal and external testing cohorts. The online calculators, powered by trained machine learning models, are accessible at https://eocrc-surv.streamlit.app/ and https://locrc-surv.streamlit.app/ . These tools estimate survival probabilities for EOCRC and LOCRC patients under various treatment strategies and display the corresponding survival curves post-treatment over the 1- to 8-year period. This study successfully developed online calculators using machine learning algorithms to predict 1- to 8-year survival probabilities for EOCRC and LOCRC patients under various treatment strategies. |
| format | Article |
| id | doaj-art-d1afc5626f0f420d91a5c2e39b672c84 |
| institution | DOAJ |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-04-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-d1afc5626f0f420d91a5c2e39b672c842025-08-20T03:18:34ZengNature PortfolioScientific Reports2045-23222025-04-0115111310.1038/s41598-025-95385-0Development and validation of survival prediction tools in early and late onset colorectal cancer patientsWanling Li0Jinshan Liu1Yuntong Lan2Dongling Yu3Bingqiang Zhang4Department of Gastroenterology, University-Town Hospital of Chongqing Medical UniversityDepartment of Gastrointestinal Surgery, Chongqing Hospital of Jiangsu Province Hospital, The People’s Hospital of Qijiang DistrictDepartment of Gastroenterology, Chongqing Hospital of Jiangsu Province Hospital, The People’s Hospital of Qijiang DistrictDepartment of Gastrointestinal Surgery, Chongqing Hospital of Jiangsu Province Hospital, The People’s Hospital of Qijiang DistrictDepartment of Gastroenterology, University-Town Hospital of Chongqing Medical UniversityAbstract This study aims to develop online calculators using machine learning models to predict survival probabilities for early- and late-onset colorectal cancer (EOCRC and LOCRC) over a 1- to 8-year period. We extracted data on 117,965 CRC patients from the published database spanning 2010 to 2021, divided into training and internal testing datasets. The data of 200 CRC patients from Chongqing Hospital of Jiangsu Province Hospital was used as the external testing dataset. We conducted univariate and multivariate regression analyses on the training dataset to identify key survival factors and develop predictive machine learning models. The models were evaluated using internal and external testing datasets based on AUC, accuracy, precision, recall, and F1 score. Web-based calculators were subsequently developed to predict survival curves for EOCRC and LOCRC patients under different treatment strategies. In the multivariate Cox regression analysis, 16 and 18 variables were independently significant survival factors for EOCRC and LOCRC, respectively. In the EOCRC group, the machine learning models achieved AUC values of 0.880 and 0.804 in the internal and external testing cohorts. For the LOCRC group, the machine learning models exhibited AUC values of 0.857 and 0.823 in the internal and external testing cohorts. The online calculators, powered by trained machine learning models, are accessible at https://eocrc-surv.streamlit.app/ and https://locrc-surv.streamlit.app/ . These tools estimate survival probabilities for EOCRC and LOCRC patients under various treatment strategies and display the corresponding survival curves post-treatment over the 1- to 8-year period. This study successfully developed online calculators using machine learning algorithms to predict 1- to 8-year survival probabilities for EOCRC and LOCRC patients under various treatment strategies.https://doi.org/10.1038/s41598-025-95385-0Colorectal cancerMachine learningOnline calculatorsSurvival |
| spellingShingle | Wanling Li Jinshan Liu Yuntong Lan Dongling Yu Bingqiang Zhang Development and validation of survival prediction tools in early and late onset colorectal cancer patients Scientific Reports Colorectal cancer Machine learning Online calculators Survival |
| title | Development and validation of survival prediction tools in early and late onset colorectal cancer patients |
| title_full | Development and validation of survival prediction tools in early and late onset colorectal cancer patients |
| title_fullStr | Development and validation of survival prediction tools in early and late onset colorectal cancer patients |
| title_full_unstemmed | Development and validation of survival prediction tools in early and late onset colorectal cancer patients |
| title_short | Development and validation of survival prediction tools in early and late onset colorectal cancer patients |
| title_sort | development and validation of survival prediction tools in early and late onset colorectal cancer patients |
| topic | Colorectal cancer Machine learning Online calculators Survival |
| url | https://doi.org/10.1038/s41598-025-95385-0 |
| work_keys_str_mv | AT wanlingli developmentandvalidationofsurvivalpredictiontoolsinearlyandlateonsetcolorectalcancerpatients AT jinshanliu developmentandvalidationofsurvivalpredictiontoolsinearlyandlateonsetcolorectalcancerpatients AT yuntonglan developmentandvalidationofsurvivalpredictiontoolsinearlyandlateonsetcolorectalcancerpatients AT donglingyu developmentandvalidationofsurvivalpredictiontoolsinearlyandlateonsetcolorectalcancerpatients AT bingqiangzhang developmentandvalidationofsurvivalpredictiontoolsinearlyandlateonsetcolorectalcancerpatients |