Risk-based evaluation of machine learning-based classification methods used for medical devices

Abstract Background In the future, more medical devices will be based on machine learning (ML) methods. In general, the consideration of risks is a crucial aspect for evaluating medical devices. Accordingly, risks and their associated costs should be taken into account when assessing the performance...

Full description

Saved in:
Bibliographic Details
Main Authors: Martin Haimerl, Christoph Reich
Format: Article
Language:English
Published: BMC 2025-03-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-025-02909-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850040383196626944
author Martin Haimerl
Christoph Reich
author_facet Martin Haimerl
Christoph Reich
author_sort Martin Haimerl
collection DOAJ
description Abstract Background In the future, more medical devices will be based on machine learning (ML) methods. In general, the consideration of risks is a crucial aspect for evaluating medical devices. Accordingly, risks and their associated costs should be taken into account when assessing the performance of ML-based medical devices. This paper addresses the following three research questions towards a risk-based evaluation with a focus on ML-based classification models. Methods First, we analyzed how often risk-based metrics are currently utilized in the context of ML-based classification models. This was performed using a literature research based on a sample of recent scientific publications. Second, we introduce an approach for evaluating such models where expected risks and associated costs are integrated into the corresponding performance metrics. Additionally, we analyze the impact of different risk ratios on the resulting overall performance. Third, we elaborate how such risk-based approaches relate to regulatory requirements in the field of medical devices. A set of use case scenarios were utilized to demonstrate necessities and practical implications, in this regard. Results First, it was shown that currently most scientific publications do not include risk-based approaches for measuring performance. Second, it was demonstrated that risk-based considerations have a substantial impact on the outcome. The relative increase of the resulting overall risks can go up to 196% when the ratio between different types of risks (false negatives vs. false positives) changes by a factor of 10.0. Third, we elaborated that risk-based considerations need to be included into the assessment of ML-based medical devices, according to the relevant EU regulations and standards. In particular, this applies when a substantial impact on the clinical outcome / in terms of the risk-benefit relationship occurs. Conclusion In summary, we demonstrated the necessity of a risk-based approach for the evaluation of medical devices which include ML-based classification methods. We showed that currently many scientific papers in this area do not include risk considerations. We developed basic steps towards a risk-based assessment of ML-based classifiers and elaborated consequences that could occur, when these steps are neglected. And, we demonstrated the consistency of our approach with current regulatory requirements in the EU.
format Article
id doaj-art-1478d10a367546fea22b9853bd2a1ebe
institution DOAJ
issn 1472-6947
language English
publishDate 2025-03-01
publisher BMC
record_format Article
series BMC Medical Informatics and Decision Making
spelling doaj-art-1478d10a367546fea22b9853bd2a1ebe2025-08-20T02:56:06ZengBMCBMC Medical Informatics and Decision Making1472-69472025-03-0125112810.1186/s12911-025-02909-9Risk-based evaluation of machine learning-based classification methods used for medical devicesMartin Haimerl0Christoph Reich1Furtwangen University of Applied SciencesFurtwangen University of Applied SciencesAbstract Background In the future, more medical devices will be based on machine learning (ML) methods. In general, the consideration of risks is a crucial aspect for evaluating medical devices. Accordingly, risks and their associated costs should be taken into account when assessing the performance of ML-based medical devices. This paper addresses the following three research questions towards a risk-based evaluation with a focus on ML-based classification models. Methods First, we analyzed how often risk-based metrics are currently utilized in the context of ML-based classification models. This was performed using a literature research based on a sample of recent scientific publications. Second, we introduce an approach for evaluating such models where expected risks and associated costs are integrated into the corresponding performance metrics. Additionally, we analyze the impact of different risk ratios on the resulting overall performance. Third, we elaborate how such risk-based approaches relate to regulatory requirements in the field of medical devices. A set of use case scenarios were utilized to demonstrate necessities and practical implications, in this regard. Results First, it was shown that currently most scientific publications do not include risk-based approaches for measuring performance. Second, it was demonstrated that risk-based considerations have a substantial impact on the outcome. The relative increase of the resulting overall risks can go up to 196% when the ratio between different types of risks (false negatives vs. false positives) changes by a factor of 10.0. Third, we elaborated that risk-based considerations need to be included into the assessment of ML-based medical devices, according to the relevant EU regulations and standards. In particular, this applies when a substantial impact on the clinical outcome / in terms of the risk-benefit relationship occurs. Conclusion In summary, we demonstrated the necessity of a risk-based approach for the evaluation of medical devices which include ML-based classification methods. We showed that currently many scientific papers in this area do not include risk considerations. We developed basic steps towards a risk-based assessment of ML-based classifiers and elaborated consequences that could occur, when these steps are neglected. And, we demonstrated the consistency of our approach with current regulatory requirements in the EU.https://doi.org/10.1186/s12911-025-02909-9ClassificationRisk managementRisk-based metricsDecision theoryMedical devices
spellingShingle Martin Haimerl
Christoph Reich
Risk-based evaluation of machine learning-based classification methods used for medical devices
BMC Medical Informatics and Decision Making
Classification
Risk management
Risk-based metrics
Decision theory
Medical devices
title Risk-based evaluation of machine learning-based classification methods used for medical devices
title_full Risk-based evaluation of machine learning-based classification methods used for medical devices
title_fullStr Risk-based evaluation of machine learning-based classification methods used for medical devices
title_full_unstemmed Risk-based evaluation of machine learning-based classification methods used for medical devices
title_short Risk-based evaluation of machine learning-based classification methods used for medical devices
title_sort risk based evaluation of machine learning based classification methods used for medical devices
topic Classification
Risk management
Risk-based metrics
Decision theory
Medical devices
url https://doi.org/10.1186/s12911-025-02909-9
work_keys_str_mv AT martinhaimerl riskbasedevaluationofmachinelearningbasedclassificationmethodsusedformedicaldevices
AT christophreich riskbasedevaluationofmachinelearningbasedclassificationmethodsusedformedicaldevices