Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy

This study is aimed at evaluating a deep transfer learning-based model for identifying diabetic retinopathy (DR) that was trained using a dataset with high variability and predominant type 2 diabetes (T2D) and comparing model performance with that in patients with type 1 diabetes (T1D). The Kaggle d...

Full description

Saved in:
Bibliographic Details
Main Authors: Jui-En Lo, Eugene Yu-Chuan Kang, Yun-Nung Chen, Yi-Ting Hsieh, Nan-Kai Wang, Ta-Ching Chen, Kuan-Jen Chen, Wei-Chi Wu, Yih-Shiou Hwang, Fu-Sung Lo, Chi-Chun Lai
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Journal of Diabetes Research
Online Access:http://dx.doi.org/10.1155/2021/2751695
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832566212273373184
author Jui-En Lo
Eugene Yu-Chuan Kang
Yun-Nung Chen
Yi-Ting Hsieh
Nan-Kai Wang
Ta-Ching Chen
Kuan-Jen Chen
Wei-Chi Wu
Yih-Shiou Hwang
Fu-Sung Lo
Chi-Chun Lai
author_facet Jui-En Lo
Eugene Yu-Chuan Kang
Yun-Nung Chen
Yi-Ting Hsieh
Nan-Kai Wang
Ta-Ching Chen
Kuan-Jen Chen
Wei-Chi Wu
Yih-Shiou Hwang
Fu-Sung Lo
Chi-Chun Lai
author_sort Jui-En Lo
collection DOAJ
description This study is aimed at evaluating a deep transfer learning-based model for identifying diabetic retinopathy (DR) that was trained using a dataset with high variability and predominant type 2 diabetes (T2D) and comparing model performance with that in patients with type 1 diabetes (T1D). The Kaggle dataset, which is a publicly available dataset, was divided into training and testing Kaggle datasets. In the comparison dataset, we collected retinal fundus images of T1D patients at Chang Gung Memorial Hospital in Taiwan from 2013 to 2020, and the images were divided into training and testing T1D datasets. The model was developed using 4 different convolutional neural networks (Inception-V3, DenseNet-121, VGG1, and Xception). The model performance in predicting DR was evaluated using testing images from each dataset, and area under the curve (AUC), sensitivity, and specificity were calculated. The model trained using the Kaggle dataset had an average (range) AUC of 0.74 (0.03) and 0.87 (0.01) in the testing Kaggle and T1D datasets, respectively. The model trained using the T1D dataset had an AUC of 0.88 (0.03), which decreased to 0.57 (0.02) in the testing Kaggle dataset. Heatmaps showed that the model focused on retinal hemorrhage, vessels, and exudation to predict DR. In wrong prediction images, artifacts and low-image quality affected model performance. The model developed with the high variability and T2D predominant dataset could be applied to T1D patients. Dataset homogeneity could affect the performance, trainability, and generalization of the model.
format Article
id doaj-art-6dbf1ab63a6b4d289165e185d9f863d1
institution Kabale University
issn 2314-6753
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Journal of Diabetes Research
spelling doaj-art-6dbf1ab63a6b4d289165e185d9f863d12025-02-03T01:04:46ZengWileyJournal of Diabetes Research2314-67532021-01-01202110.1155/2021/2751695Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic RetinopathyJui-En Lo0Eugene Yu-Chuan Kang1Yun-Nung Chen2Yi-Ting Hsieh3Nan-Kai Wang4Ta-Ching Chen5Kuan-Jen Chen6Wei-Chi Wu7Yih-Shiou Hwang8Fu-Sung Lo9Chi-Chun Lai10School of MedicineDepartment of OphthalmologyDepartment of Computer Science and Information Engineering National Taiwan UniversityDepartment of OphthalmologyDepartment of OphthalmologyDepartment of OphthalmologyDepartment of OphthalmologyDepartment of OphthalmologyDepartment of OphthalmologyDivision of Pediatric Endocrinology and GeneticsDepartment of OphthalmologyThis study is aimed at evaluating a deep transfer learning-based model for identifying diabetic retinopathy (DR) that was trained using a dataset with high variability and predominant type 2 diabetes (T2D) and comparing model performance with that in patients with type 1 diabetes (T1D). The Kaggle dataset, which is a publicly available dataset, was divided into training and testing Kaggle datasets. In the comparison dataset, we collected retinal fundus images of T1D patients at Chang Gung Memorial Hospital in Taiwan from 2013 to 2020, and the images were divided into training and testing T1D datasets. The model was developed using 4 different convolutional neural networks (Inception-V3, DenseNet-121, VGG1, and Xception). The model performance in predicting DR was evaluated using testing images from each dataset, and area under the curve (AUC), sensitivity, and specificity were calculated. The model trained using the Kaggle dataset had an average (range) AUC of 0.74 (0.03) and 0.87 (0.01) in the testing Kaggle and T1D datasets, respectively. The model trained using the T1D dataset had an AUC of 0.88 (0.03), which decreased to 0.57 (0.02) in the testing Kaggle dataset. Heatmaps showed that the model focused on retinal hemorrhage, vessels, and exudation to predict DR. In wrong prediction images, artifacts and low-image quality affected model performance. The model developed with the high variability and T2D predominant dataset could be applied to T1D patients. Dataset homogeneity could affect the performance, trainability, and generalization of the model.http://dx.doi.org/10.1155/2021/2751695
spellingShingle Jui-En Lo
Eugene Yu-Chuan Kang
Yun-Nung Chen
Yi-Ting Hsieh
Nan-Kai Wang
Ta-Ching Chen
Kuan-Jen Chen
Wei-Chi Wu
Yih-Shiou Hwang
Fu-Sung Lo
Chi-Chun Lai
Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy
Journal of Diabetes Research
title Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy
title_full Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy
title_fullStr Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy
title_full_unstemmed Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy
title_short Data Homogeneity Effect in Deep Learning-Based Prediction of Type 1 Diabetic Retinopathy
title_sort data homogeneity effect in deep learning based prediction of type 1 diabetic retinopathy
url http://dx.doi.org/10.1155/2021/2751695
work_keys_str_mv AT juienlo datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT eugeneyuchuankang datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT yunnungchen datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT yitinghsieh datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT nankaiwang datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT tachingchen datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT kuanjenchen datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT weichiwu datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT yihshiouhwang datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT fusunglo datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy
AT chichunlai datahomogeneityeffectindeeplearningbasedpredictionoftype1diabeticretinopathy