Using Machine Learning Approaches on Dynamic Patient-Reported Outcomes to Cluster Cancer Treatment-Related Symptoms
| Main Authors: | , , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | MDPI AG, 2025-06-01 |
| Series: | Current Oncology |
| Online Access: | https://www.mdpi.com/1718-7729/32/6/334 |
| Summary: | In patients undergoing systemic treatment for cancer, symptom tracking via electronic patient-reported outcomes (ePROs) has been used to optimize communication and monitoring, to facilitate the early detection of adverse effects, and to compare the side effects of similar drugs. We aimed to examine whether patterns in ePROs, without any additional clinician data input, are predictive of the underlying cancer type and reflect tumor- and treatment-associated symptom clusters (SCs). The data were derived from a total of 226 patients who self-reported the presence and severity (according to the Common Terminology Criteria for Adverse Events (CTCAE)) of more than 90 available symptoms via the medidux™ app (versions 2.0 and 3.2, developed by mobile Health AG, Zurich, Switzerland). Among these, 172 had breast cancer as the primary tumor, 19 had lung, 16 had gut, 12 had blood–lymph, and 7 had prostate cancer. For this secondary analysis, a subgroup of 25 patients with breast cancer was randomly selected to reduce the risk of overfitting. The symptoms were aggregated by counting the days on which a particular symptom was reported, resulting in a symptom vector for each patient. A logistic regression model was trained to predict the tumor type from the symptom vectors, and the symptoms with coefficients above 0.1 were graphically displayed. The machine learning model was not able to recognize any of the patients with prostate or blood–lymph cancer, likely because these cancer types were barely represented in the dataset. The Area Under the Curve (AUC) values for the three remaining cancer types were breast cancer: 0.74 (95% CI [0.624, 0.848]); gut cancer: 0.78 (95% CI [0.659, 0.893]); and lung cancer: 0.63 (95% CI [0.495, 0.771]). Despite the small datasets, the models for breast and gut cancer demonstrated a fair predictive performance (AUC > 0.7). The generalizability of the findings is limited, especially due to the heterogeneity of the dataset. This line of research could be especially interesting for monitoring individual treatment trajectories. Deviations in the electronic patient-reported symptoms from the treatment-associated symptom patterns could dynamically indicate treatment non-adherence or lower treatment efficacy, without clinician input or additional costs. Similar analyses on larger patient cohorts are needed to validate these preliminary findings and to identify specific and robust treatment profiles. |
|---|---|
| ISSN: | 1198-0052 1718-7729 |
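
The aggregation step described in the summary (counting the days on which each symptom was reported to obtain one symptom vector per patient) can be illustrated with a minimal sketch. It is not the authors' implementation: the entry layout `(patient_id, report_date, symptom)`, the example values, and the function name `symptom_vectors` are all hypothetical, and it assumes that several reports of the same symptom on one day count as a single day.

```python
from collections import defaultdict
from datetime import date

# Hypothetical ePRO log entries: (patient_id, report_date, symptom).
# Field layout and values are illustrative, not taken from the study data.
entries = [
    ("p1", date(2024, 1, 1), "fatigue"),
    ("p1", date(2024, 1, 1), "fatigue"),  # second report on the same day
    ("p1", date(2024, 1, 2), "fatigue"),
    ("p1", date(2024, 1, 2), "nausea"),
    ("p2", date(2024, 1, 1), "cough"),
]


def symptom_vectors(entries):
    """Return (sorted symptom list, {patient_id: day-count vector})."""
    # Map (patient, symptom) to the set of distinct days it was reported,
    # so repeated reports on one day are counted once (an assumption).
    days = defaultdict(set)
    for patient, day, symptom in entries:
        days[(patient, symptom)].add(day)

    symptoms = sorted({s for _, s in days})
    patients = sorted({p for p, _ in days})

    # One vector per patient, one position per symptom, zero if never reported.
    vectors = {
        p: [len(days.get((p, s), ())) for s in symptoms]
        for p in patients
    }
    return symptoms, vectors


symptoms, vectors = symptom_vectors(entries)
print(symptoms)
print(vectors)
```

Vectors of this shape could then serve as input features for a classifier such as the logistic regression model mentioned in the summary; that step is not shown here.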