OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection

IntroductionIn the medical AI field, there is a significant gap between advances in AI technology and the challenge of applying locally trained models to diverse patient populations. This is mainly due to the limited availability of labeled medical image data, driven by privacy concerns. To address...

Full description

Saved in:

Bibliographic Details
Main Authors:	Fatema-E Jannat, Sina Gholami, Minhaj Nur Alam, Hamed Tabkhi
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2025-07-01
Series:	Frontiers in Big Data
Subjects:	self-supervised transformer deep learning SwinV2 autoencoder OCT
Online Access:	https://www.frontiersin.org/articles/10.3389/fdata.2025.1609124/full
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849730678497738752
author	Fatema-E Jannat Sina Gholami Minhaj Nur Alam Hamed Tabkhi
author_facet	Fatema-E Jannat Sina Gholami Minhaj Nur Alam Hamed Tabkhi
author_sort	Fatema-E Jannat
collection	DOAJ
description	IntroductionIn the medical AI field, there is a significant gap between advances in AI technology and the challenge of applying locally trained models to diverse patient populations. This is mainly due to the limited availability of labeled medical image data, driven by privacy concerns. To address this, we have developed a self-supervised machine learning framework for detecting eye diseases from optical coherence tomography (OCT) images, aiming to achieve generalized learning while minimizing the need for large labeled datasets.MethodsOur framework, OCT-SelfNet, effectively addresses the challenge of data scarcity by integrating diverse datasets from multiple sources, ensuring a comprehensive representation of eye diseases. By employing a robust two-phase training strategy self-supervised pre-training with unlabeled data followed by a supervised training stage, we utilized the power of a masked autoencoder built on the SwinV2 backbone.ResultsExtensive experiments were conducted across three datasets with varying encoder backbones, assessing scenarios including the absence of self-supervised pre-training, the absence of data fusion, low data availability, and unseen data to evaluate the efficacy of our methodology. OCT-SelfNet outperformed the baseline model (ResNet-50, ViT) in most cases. Additionally, when tested for cross-dataset generalization, OCT-SelfNet surpassed the performance of the baseline model, further demonstrating its strong generalization ability. An ablation study revealed significant improvements attributable to self-supervised pre-training and data fusion methodologies.DiscussionOur findings suggest that the OCT-SelfNet framework is highly promising for real-world clinical deployment in detecting eye diseases from OCT images. This demonstrates the effectiveness of our two-phase training approach and the use of a masked autoencoder based on the SwinV2 backbone. Our work bridges the gap between basic research and clinical application, which significantly enhances the framework's domain adaptation and generalization capabilities in detecting eye diseases.
format	Article
id	doaj-art-cd58deb892bf4c0dacdb52aa3593d32c
institution	DOAJ
issn	2624-909X
language	English
publishDate	2025-07-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Big Data
spelling	doaj-art-cd58deb892bf4c0dacdb52aa3593d32c2025-08-20T03:08:47ZengFrontiers Media S.A.Frontiers in Big Data2624-909X2025-07-01810.3389/fdata.2025.16091241609124OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detectionFatema-E JannatSina GholamiMinhaj Nur AlamHamed TabkhiIntroductionIn the medical AI field, there is a significant gap between advances in AI technology and the challenge of applying locally trained models to diverse patient populations. This is mainly due to the limited availability of labeled medical image data, driven by privacy concerns. To address this, we have developed a self-supervised machine learning framework for detecting eye diseases from optical coherence tomography (OCT) images, aiming to achieve generalized learning while minimizing the need for large labeled datasets.MethodsOur framework, OCT-SelfNet, effectively addresses the challenge of data scarcity by integrating diverse datasets from multiple sources, ensuring a comprehensive representation of eye diseases. By employing a robust two-phase training strategy self-supervised pre-training with unlabeled data followed by a supervised training stage, we utilized the power of a masked autoencoder built on the SwinV2 backbone.ResultsExtensive experiments were conducted across three datasets with varying encoder backbones, assessing scenarios including the absence of self-supervised pre-training, the absence of data fusion, low data availability, and unseen data to evaluate the efficacy of our methodology. OCT-SelfNet outperformed the baseline model (ResNet-50, ViT) in most cases. Additionally, when tested for cross-dataset generalization, OCT-SelfNet surpassed the performance of the baseline model, further demonstrating its strong generalization ability. An ablation study revealed significant improvements attributable to self-supervised pre-training and data fusion methodologies.DiscussionOur findings suggest that the OCT-SelfNet framework is highly promising for real-world clinical deployment in detecting eye diseases from OCT images. This demonstrates the effectiveness of our two-phase training approach and the use of a masked autoencoder based on the SwinV2 backbone. Our work bridges the gap between basic research and clinical application, which significantly enhances the framework's domain adaptation and generalization capabilities in detecting eye diseases.https://www.frontiersin.org/articles/10.3389/fdata.2025.1609124/fullself-supervisedtransformerdeep learningSwinV2autoencoderOCT
spellingShingle	Fatema-E Jannat Sina Gholami Minhaj Nur Alam Hamed Tabkhi OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection Frontiers in Big Data self-supervised transformer deep learning SwinV2 autoencoder OCT
title	OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection
title_full	OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection
title_fullStr	OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection
title_full_unstemmed	OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection
title_short	OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection
title_sort	oct selfnet a self supervised framework with multi source datasets for generalized retinal disease detection
topic	self-supervised transformer deep learning SwinV2 autoencoder OCT
url	https://www.frontiersin.org/articles/10.3389/fdata.2025.1609124/full
work_keys_str_mv	AT fatemaejannat octselfnetaselfsupervisedframeworkwithmultisourcedatasetsforgeneralizedretinaldiseasedetection AT sinagholami octselfnetaselfsupervisedframeworkwithmultisourcedatasetsforgeneralizedretinaldiseasedetection AT minhajnuralam octselfnetaselfsupervisedframeworkwithmultisourcedatasetsforgeneralizedretinaldiseasedetection AT hamedtabkhi octselfnetaselfsupervisedframeworkwithmultisourcedatasetsforgeneralizedretinaldiseasedetection

OCT-SelfNet: a self-supervised framework with multi-source datasets for generalized retinal disease detection

Similar Items