Using Machine Learning Approaches on Dynamic Patient-Reported Outcomes to Cluster Cancer Treatment-Related Symptoms

In patients undergoing systemic treatment for cancer, symptom tracking via electronic patient-reported outcomes (ePROs) has been used to optimize communication and monitoring, and facilitate the early detection of adverse effects and to compare the side effects of similar drugs. We aimed to examine...

Full description

Saved in:
Bibliographic Details
Main Authors: Nora Asper, Hans Friedrich Witschel, Louise von Stockar, Emanuele Laurenzi, Hans Christian Kolberg, Marcus Vetter, Sven Roth, Gerd Kullak-Ublick, Andreas Trojan
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Current Oncology
Subjects:
Online Access:https://www.mdpi.com/1718-7729/32/6/334
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In patients undergoing systemic treatment for cancer, symptom tracking via electronic patient-reported outcomes (ePROs) has been used to optimize communication and monitoring, and facilitate the early detection of adverse effects and to compare the side effects of similar drugs. We aimed to examine whether the patterns in electronic patient-reported outcomes, without any additional clinician data input, are predictive of the underlying cancer type and reflect tumor- and treatment-associated symptom clusters (SCs). The data were derived from a total of 226 patients who self-reported on the presence and severity (according to the Common Terminology Criteria for Adverse Events (CTCAEs)) of more than 90 available symptoms via the medidux<sup>TM</sup> app (versions 2.0 and 3.2, developed by mobile Health AG based in Zurich, Switzerland). Among these, 172 had breast cancer as the primary tumor, 19 had lung, 16 had gut, 12 had blood–lymph, and 7 had prostate cancer. For this secondary analysis, a subgroup of 25 patients with breast cancer were randomly selected to reduce the risk of overfitting. The symptoms were aggregated by counting the days on which a particular symptom was reported, resulting in a symptom vector for each patient. A logistic regression model was trained to predict the type of the respective tumor from the symptom vectors, and the symptoms with coefficients above (0.1) were graphically displayed. The machine learning model was not able to recognize any of the patients with prostate and blood–lymph cancer, likely as these cancer types were barely represented in the dataset. The Area Under the Curve (AUC) values for the three remaining cancer types were breast cancer: 0.74 (95% CI [0.624, 0.848]); gut cancer: 0.78 (95% CI [0.659, 0.893]); and lung cancer: 0.63 (95% CI [0.495, 0.771]). Despite the small datasets, for the breast and gut cancers, the respective models demonstrated a fair predictive performance (AUC > 0.7). The generalization of the findings are limited especially due to the heterogeneity of the dataset. This line of research could be especially interesting to monitor individual treatment trajectories. Deviations in the electronic patient-reported symptoms from the treatment-associated symptom patterns could dynamically indicate treatment non-adherence or lower treatment efficacy, without clinician input or additional costs. Similar analyses on larger patient cohorts are needed to validate these preliminary findings and to identify specific and robust treatment profiles.
ISSN:1198-0052
1718-7729