Ensemble Learning for Three-dimensional Medical Image Segmentation of Organ at Risk in Brachytherapy Using Double U-Net, Bi-directional ConvLSTM U-Net, and Transformer Network

Aim: This article presents a novel approach to automating the segmentation of organs at risk (OARs) for high-dose-rate brachytherapy patients using three deep learning models combined with ensemble learning techniques. It aims to improve the accuracy and efficiency of segmentation. Materials and Methods...

Bibliographic Details
Main Authors: Soniya Pal, Raj Pal Singh, Anuj Kumar
Format: Article
Language: English
Published: Wolters Kluwer Medknow Publications 2024-12-01
Series: Journal of Medical Physics
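Subjects: brachytherapy, image segmentation, lstm u-net, medical image segmentation, organ at risk, transformer, u-net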
Online Access: https://journals.lww.com/10.4103/jmp.jmp_160_24
author Soniya Pal
Raj Pal Singh
Anuj Kumar
collection DOAJ
description Aim: This article presents a novel approach to automating the segmentation of organs at risk (OARs) for high-dose-rate brachytherapy patients using three deep learning models combined with ensemble learning techniques. It aims to improve the accuracy and efficiency of segmentation. Materials and Methods: The dataset comprised computed tomography (CT) scans of 60 patients obtained from our own institutional image bank and 10 patients from another institute, all in Digital Imaging and Communications in Medicine format. Experienced radiation oncologists manually segmented four OARs for each scan. Each scan was preprocessed, and three models, Double U-Net (DUN), Bi-directional ConvLSTM U-Net (BCUN), and Transformer Network (TN), were trained on CT volumes reduced to 240 × 240 × 128 because of memory limitations. Ensemble learning techniques were employed to enhance accuracy and segmentation metrics. Testing and validation were conducted on 12 patients from our institute (OID) and 10 patients from another institute (DID). Results: For the DID test dataset, the ensemble combining TN and BCUN (TN + BCUN) achieved an average Dice similarity coefficient (DSC) ranging from 0.992 to 0.998, and the ensemble combining DUN and BCUN (DUN + BCUN) achieved an average DSC ranging from 0.990 to 0.993, reflecting high segmentation accuracy. The 95% Hausdorff distance (HD) ranged from 0.9 to 1.2 mm for TN + BCUN and from 1.1 to 1.4 mm for DUN + BCUN, demonstrating precise segmentation boundaries. Conclusion: The proposed method leverages the strengths of each network architecture. The DUN setup excels in sequential processing, the BCUN captures spatiotemporal dependencies, and transformer networks provide a robust understanding of global context. This combination enables efficient and accurate segmentation, surpassing human expert performance in both time and accuracy.
format Article
id doaj-art-f41176d5dab64f70920b87d7cf8e4599
institution Kabale University
issn 0971-6203
1998-3913
language English
publishDate 2024-12-01
publisher Wolters Kluwer Medknow Publications
record_format Article
series Journal of Medical Physics
spelling doaj-art-f41176d5dab64f70920b87d7cf8e4599; 2025-01-07T07:19:03Z; eng; Wolters Kluwer Medknow Publications; Journal of Medical Physics; 0971-6203; 1998-3913; 2024-12-01; volume 49, issue 4, pages 574-582; 10.4103/jmp.jmp_160_24
title Ensemble Learning for Three-dimensional Medical Image Segmentation of Organ at Risk in Brachytherapy Using Double U-Net, Bi-directional ConvLSTM U-Net, and Transformer Network
topic brachytherapy
image segmentation
lstm u-net
medical image segmentation
organ at risk
transformer
u-net
url https://journals.lww.com/10.4103/jmp.jmp_160_24
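
The abstract reports ensemble results as a Dice similarity coefficient (DSC) and a 95% Hausdorff distance (HD). The sketch below shows one way such an evaluation could be set up; it is not the authors' implementation. Several things are assumptions: the record does not say which ensemble rule was used, so unweighted averaging of per-voxel probabilities (soft voting) is swapped in; the function names (`soft_vote`, `dice`, `surface_points`, `hd95`) are hypothetical; the surface is defined by 6-connectivity; the HD variant shown takes the larger of the two directed 95th percentiles, which is one of several conventions in use; and the 8 × 8 × 8 volume is synthetic, not study data.

```python
import numpy as np


def soft_vote(prob_maps):
    """Average class probabilities across models, then pick the top class per voxel.

    prob_maps: list of arrays shaped (classes, D, H, W), one per model.
    Assumption: unweighted averaging; the source does not name the ensemble rule.
    """
    mean_prob = np.mean(np.stack(prob_maps, axis=0), axis=0)
    return np.argmax(mean_prob, axis=0)


def dice(pred, truth):
    """Dice similarity coefficient 2|A and B| / (|A| + |B|) for two binary masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    denom = pred.sum() + truth.sum()
    if denom == 0:
        return 1.0  # both masks empty: counted as agreement by convention here
    return 2.0 * np.logical_and(pred, truth).sum() / denom


def surface_points(mask, spacing):
    """Coordinates (in mm) of mask voxels with at least one background 6-neighbour."""
    mask = mask.astype(bool)
    padded = np.pad(mask, 1, constant_values=False)
    core = (slice(1, -1),) * mask.ndim
    interior = np.ones_like(mask)
    for axis in range(mask.ndim):
        for shift in (-1, 1):
            interior &= np.roll(padded, shift, axis=axis)[core]
    return np.argwhere(mask & ~interior) * np.asarray(spacing, dtype=float)


def hd95(pred, truth, spacing=(1.0, 1.0, 1.0)):
    """95th-percentile Hausdorff distance between two mask surfaces.

    Brute-force pairwise distances: fine for a toy volume, too memory-hungry
    for full-resolution CT.
    """
    a = surface_points(pred, spacing)
    b = surface_points(truth, spacing)
    if len(a) == 0 or len(b) == 0:
        return float("nan")
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return max(np.percentile(d.min(axis=1), 95), np.percentile(d.min(axis=0), 95))


def two_class_probs(fg_mask, p_in, p_out):
    """Build a (2, D, H, W) probability map from a foreground mask (toy helper)."""
    fg = np.where(fg_mask, p_in, p_out)
    return np.stack([1.0 - fg, fg], axis=0)


# Synthetic reference: a 4 x 4 x 4 cube inside an 8 x 8 x 8 volume.
truth = np.zeros((8, 8, 8), dtype=bool)
truth[2:6, 2:6, 2:6] = True

# Hypothetical model A: confident and well aligned with the reference.
mask_a = truth.copy()
probs_a = two_class_probs(mask_a, p_in=0.9, p_out=0.1)

# Hypothetical model B: less confident and shifted by one voxel along the first axis.
mask_b = np.zeros_like(truth)
mask_b[3:7, 2:6, 2:6] = True
probs_b = two_class_probs(mask_b, p_in=0.6, p_out=0.2)

ensemble = soft_vote([probs_a, probs_b])

print("DSC  model B :", dice(mask_b, truth))
print("DSC  ensemble:", dice(ensemble == 1, truth))
print("HD95 model B :", hd95(mask_b, truth), "mm")
print("HD95 ensemble:", hd95(ensemble == 1, truth), "mm")
```

In this toy case the averaged probabilities follow the more confident model where the two disagree, which is the kind of behaviour soft voting is meant to give. For a multi-organ task such as the four OARs mentioned in the abstract, the same metrics would typically be computed per label (`ensemble == k`) with the real voxel spacing passed as `spacing`.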