Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study

With the growing number of patients living with chronic conditions, there is an increasing need for efficient systems that can automatically capture and convert medical device readings into digital data, particularly in home-based care settings. However, most home-based medical devices are closed sy...

Full description

Saved in:

Bibliographic Details
Main Authors:	Pedro Ferreira, Pedro Lobo, Filipa Reis, João L. Vilaça, Pedro Morais
Format:	Article
Language:	English
Published:	MDPI AG 2025-05-01
Series:	Applied Sciences
Subjects:	artificial intelligence display digitization object detection single shot multibox detector (SSD) you only look once (YOLO)
Online Access:	https://www.mdpi.com/2076-3417/15/10/5436
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850258118545506304
author	Pedro Ferreira Pedro Lobo Filipa Reis João L. Vilaça Pedro Morais
author_facet	Pedro Ferreira Pedro Lobo Filipa Reis João L. Vilaça Pedro Morais
author_sort	Pedro Ferreira
collection	DOAJ
description	With the growing number of patients living with chronic conditions, there is an increasing need for efficient systems that can automatically capture and convert medical device readings into digital data, particularly in home-based care settings. However, most home-based medical devices are closed systems that do not support straightforward automatic data export and often require complex connections to access or transmit patient information. Since most of these devices display clinical information on a screen, this research explores how a standard smartphone camera, combined with artificial intelligence, can be used to automatically extract the displayed data in a simple and non-intrusive way. In particular, this study provides a comparative analysis of several You Only Look Once (YOLO) and Single Shot MultiBox Detector (SSD) models to evaluate their effectiveness in detecting and recognizing the readings on medical device displays. In addition to these comparisons, we also explore a hybrid approach that combines the YOLOv8l model for object detection with a Convolutional Neural Network (CNN) for classification. Several iterations of the aforementioned models were tested, using image resolutions of 320 × 320 and 640 × 640. The performance was assessed using metrics such as precision, recall, mean average precision at 0.5 Intersection over Union (mAP@50), and frames per second (FPS). The results show that YOLOv8l (640) achieved the highest mAP@50 of 0.979, but at a lower inference speed (13.20 FPS), while YOLOv8n (320) offered the fastest inference (129.79 FPS) with a reduction in mean average precision (0.786). Combining YOLOv8l with a CNN classifier resulted in a slight reduction in overall accuracy (0.96) when compared to the standalone model (0.98). While the results are promising, the study acknowledges certain limitations, including dataset-specific biases, controlled acquisition settings, and challenges in adapting to real-world scenarios. Nevertheless, the comparative analysis offers valuable insights into the trade-off between inference time and accuracy, helping guide the selection of the most suitable model based on the specific demands of the intended scanning application.
format	Article
id	doaj-art-b196d25a17e741d3a07c4c01dde4107a
institution	OA Journals
issn	2076-3417
language	English
publishDate	2025-05-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj-art-b196d25a17e741d3a07c4c01dde4107a2025-08-20T01:56:14ZengMDPI AGApplied Sciences2076-34172025-05-011510543610.3390/app15105436Digitization of Medical Device Displays Using Deep Learning Models: A Comparative StudyPedro Ferreira0Pedro Lobo1Filipa Reis2João L. Vilaça3Pedro Morais42Ai, School of Technology, IPCA, 4750-810 Barcelos, Portugal2Ai, School of Technology, IPCA, 4750-810 Barcelos, PortugalGLINTT Life, 2710-693 Sintra, Portugal2Ai, School of Technology, IPCA, 4750-810 Barcelos, Portugal2Ai, School of Technology, IPCA, 4750-810 Barcelos, PortugalWith the growing number of patients living with chronic conditions, there is an increasing need for efficient systems that can automatically capture and convert medical device readings into digital data, particularly in home-based care settings. However, most home-based medical devices are closed systems that do not support straightforward automatic data export and often require complex connections to access or transmit patient information. Since most of these devices display clinical information on a screen, this research explores how a standard smartphone camera, combined with artificial intelligence, can be used to automatically extract the displayed data in a simple and non-intrusive way. In particular, this study provides a comparative analysis of several You Only Look Once (YOLO) and Single Shot MultiBox Detector (SSD) models to evaluate their effectiveness in detecting and recognizing the readings on medical device displays. In addition to these comparisons, we also explore a hybrid approach that combines the YOLOv8l model for object detection with a Convolutional Neural Network (CNN) for classification. Several iterations of the aforementioned models were tested, using image resolutions of 320 × 320 and 640 × 640. The performance was assessed using metrics such as precision, recall, mean average precision at 0.5 Intersection over Union (mAP@50), and frames per second (FPS). The results show that YOLOv8l (640) achieved the highest mAP@50 of 0.979, but at a lower inference speed (13.20 FPS), while YOLOv8n (320) offered the fastest inference (129.79 FPS) with a reduction in mean average precision (0.786). Combining YOLOv8l with a CNN classifier resulted in a slight reduction in overall accuracy (0.96) when compared to the standalone model (0.98). While the results are promising, the study acknowledges certain limitations, including dataset-specific biases, controlled acquisition settings, and challenges in adapting to real-world scenarios. Nevertheless, the comparative analysis offers valuable insights into the trade-off between inference time and accuracy, helping guide the selection of the most suitable model based on the specific demands of the intended scanning application.https://www.mdpi.com/2076-3417/15/10/5436artificial intelligencedisplay digitizationobject detectionsingle shot multibox detector (SSD)you only look once (YOLO)
spellingShingle	Pedro Ferreira Pedro Lobo Filipa Reis João L. Vilaça Pedro Morais Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study Applied Sciences artificial intelligence display digitization object detection single shot multibox detector (SSD) you only look once (YOLO)
title	Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study
title_full	Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study
title_fullStr	Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study
title_full_unstemmed	Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study
title_short	Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study
title_sort	digitization of medical device displays using deep learning models a comparative study
topic	artificial intelligence display digitization object detection single shot multibox detector (SSD) you only look once (YOLO)
url	https://www.mdpi.com/2076-3417/15/10/5436
work_keys_str_mv	AT pedroferreira digitizationofmedicaldevicedisplaysusingdeeplearningmodelsacomparativestudy AT pedrolobo digitizationofmedicaldevicedisplaysusingdeeplearningmodelsacomparativestudy AT filipareis digitizationofmedicaldevicedisplaysusingdeeplearningmodelsacomparativestudy AT joaolvilaca digitizationofmedicaldevicedisplaysusingdeeplearningmodelsacomparativestudy AT pedromorais digitizationofmedicaldevicedisplaysusingdeeplearningmodelsacomparativestudy

Digitization of Medical Device Displays Using Deep Learning Models: A Comparative Study

Similar Items