GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction
IntroductionAn analysis of the prognostic ability of the large language model (LLM) Generative Pre-trained Transformer (GPT) to predict short-term survival and functional outcomes in patients with malignant middle cerebral artery (MCA) infarction following decompressive hemicraniectomy.MethodsThis r...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Frontiers Media S.A.
2025-07-01
|
| Series: | Frontiers in Neurology |
| Subjects: | |
| Online Access: | https://www.frontiersin.org/articles/10.3389/fneur.2025.1603536/full |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849316456468054016 |
|---|---|
| author | Sebastian Lehmann Martin Vychopen Erdem Güresir Johannes Wach |
| author_facet | Sebastian Lehmann Martin Vychopen Erdem Güresir Johannes Wach |
| author_sort | Sebastian Lehmann |
| collection | DOAJ |
| description | IntroductionAn analysis of the prognostic ability of the large language model (LLM) Generative Pre-trained Transformer (GPT) to predict short-term survival and functional outcomes in patients with malignant middle cerebral artery (MCA) infarction following decompressive hemicraniectomy.MethodsThis retrospective study included 100 patients with malignant MCA infarction who underwent decompressive craniectomy (DC). GPT-4 and GPT-4 Omni were used to predict patient outcomes based on 20 patient-specific factors. Each version of GPT was tested with and without context enrichment (CE). CE versions were provided with the current AHA/ASA 2019 guidelines and meta-analyses of RCTs to inform decision-making. The real-life outcome of the patients, measured by the modified Rankin Scale (mRS), served as a reference. The following endpoints were evaluated: survival during inpatient stay, achievement of a functional status of mRS 0–4 at discharge, and at 3-, 6-, and 12-months post-discharge. We analyzed the prognostic prediction of GPT by calculating the area under the curve (AUC) and determining the optimal cutoff using the Youden index for divergent prediction outcomes. After dichotomization according to the cutoff set, a chi-squared test (two-sided) was performed.ResultsGPT-4 and GPT-4 Omni demonstrated the ability to estimate survival during in-hospital stay. In both versions, the CE GPT outperformed the non-CE versions. GPT-4 Omni (CE) achieved an AUC of 0.67 (95% CI: 0.54–0.79; p = 0.002), while GPT-4 (CE) reached an AUC of 0.70 (95% CI: 0.57–0.82; p = 0.018). GPT-4 also achieved statistical significance even without CE (AUC of 0.66; 95% CI: 0.53–0.78; p = 0.018). In contrast, the non-CE version of GPT-4 Omni did not reach significance in predicting the survival of hospitalization (AUC of 0.60; 95% CI: 0.48–0.73; p = 0.07). For questions regarding the functional outcome of patients, neither version of GPT was able to make a sufficient prognostic prediction. However, when provided with the pre-stroke mRS, GPT-4 Omni was able to predict the mRS at discharge (p = 0.01; Pearson's correlation coefficient = 0.696).ConclusionThe study shows the already existing high potential of AI in predicting short-term outcomes. It also shows the existing limitations for the evaluation of more complex questions, such as functional outcomes. |
| format | Article |
| id | doaj-art-134ea7a632e8476f9953d54095ec61dd |
| institution | Kabale University |
| issn | 1664-2295 |
| language | English |
| publishDate | 2025-07-01 |
| publisher | Frontiers Media S.A. |
| record_format | Article |
| series | Frontiers in Neurology |
| spelling | doaj-art-134ea7a632e8476f9953d54095ec61dd2025-08-20T03:51:44ZengFrontiers Media S.A.Frontiers in Neurology1664-22952025-07-011610.3389/fneur.2025.16035361603536GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarctionSebastian LehmannMartin VychopenErdem GüresirJohannes WachIntroductionAn analysis of the prognostic ability of the large language model (LLM) Generative Pre-trained Transformer (GPT) to predict short-term survival and functional outcomes in patients with malignant middle cerebral artery (MCA) infarction following decompressive hemicraniectomy.MethodsThis retrospective study included 100 patients with malignant MCA infarction who underwent decompressive craniectomy (DC). GPT-4 and GPT-4 Omni were used to predict patient outcomes based on 20 patient-specific factors. Each version of GPT was tested with and without context enrichment (CE). CE versions were provided with the current AHA/ASA 2019 guidelines and meta-analyses of RCTs to inform decision-making. The real-life outcome of the patients, measured by the modified Rankin Scale (mRS), served as a reference. The following endpoints were evaluated: survival during inpatient stay, achievement of a functional status of mRS 0–4 at discharge, and at 3-, 6-, and 12-months post-discharge. We analyzed the prognostic prediction of GPT by calculating the area under the curve (AUC) and determining the optimal cutoff using the Youden index for divergent prediction outcomes. After dichotomization according to the cutoff set, a chi-squared test (two-sided) was performed.ResultsGPT-4 and GPT-4 Omni demonstrated the ability to estimate survival during in-hospital stay. In both versions, the CE GPT outperformed the non-CE versions. GPT-4 Omni (CE) achieved an AUC of 0.67 (95% CI: 0.54–0.79; p = 0.002), while GPT-4 (CE) reached an AUC of 0.70 (95% CI: 0.57–0.82; p = 0.018). GPT-4 also achieved statistical significance even without CE (AUC of 0.66; 95% CI: 0.53–0.78; p = 0.018). In contrast, the non-CE version of GPT-4 Omni did not reach significance in predicting the survival of hospitalization (AUC of 0.60; 95% CI: 0.48–0.73; p = 0.07). For questions regarding the functional outcome of patients, neither version of GPT was able to make a sufficient prognostic prediction. However, when provided with the pre-stroke mRS, GPT-4 Omni was able to predict the mRS at discharge (p = 0.01; Pearson's correlation coefficient = 0.696).ConclusionThe study shows the already existing high potential of AI in predicting short-term outcomes. It also shows the existing limitations for the evaluation of more complex questions, such as functional outcomes.https://www.frontiersin.org/articles/10.3389/fneur.2025.1603536/fulldecompressive hemicraniectomymiddle cerebral artery infarctionartificial intelligenceGPTsurvivalfunctional outcome |
| spellingShingle | Sebastian Lehmann Martin Vychopen Erdem Güresir Johannes Wach GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction Frontiers in Neurology decompressive hemicraniectomy middle cerebral artery infarction artificial intelligence GPT survival functional outcome |
| title | GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction |
| title_full | GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction |
| title_fullStr | GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction |
| title_full_unstemmed | GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction |
| title_short | GPT-based prediction of short-term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction |
| title_sort | gpt based prediction of short term survival following decompressive hemicraniectomy in malignant middle cerebral artery infarction |
| topic | decompressive hemicraniectomy middle cerebral artery infarction artificial intelligence GPT survival functional outcome |
| url | https://www.frontiersin.org/articles/10.3389/fneur.2025.1603536/full |
| work_keys_str_mv | AT sebastianlehmann gptbasedpredictionofshorttermsurvivalfollowingdecompressivehemicraniectomyinmalignantmiddlecerebralarteryinfarction AT martinvychopen gptbasedpredictionofshorttermsurvivalfollowingdecompressivehemicraniectomyinmalignantmiddlecerebralarteryinfarction AT erdemguresir gptbasedpredictionofshorttermsurvivalfollowingdecompressivehemicraniectomyinmalignantmiddlecerebralarteryinfarction AT johanneswach gptbasedpredictionofshorttermsurvivalfollowingdecompressivehemicraniectomyinmalignantmiddlecerebralarteryinfarction |