Deep learning model for detecting high-grade dysplasia in colorectal adenomas

Objective: Early detection and removal of suspicious polyps during routine colonoscopies play an important role in reducing the risk of colorectal cancer. Patient management and follow-up are determined by the type of polyps removed and the degree of dysplasia present on histological evaluation. Whe...

Full description

Saved in:
Bibliographic Details
Main Authors: Eric Steimetz, Zeliha Celen Simsek, Asmita Saha, Rong Xia, Raavi Gupta
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:Journal of Pathology Informatics
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2153353925000264
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Objective: Early detection and removal of suspicious polyps during routine colonoscopies play an important role in reducing the risk of colorectal cancer. Patient management and follow-up are determined by the type of polyps removed and the degree of dysplasia present on histological evaluation. Whereas discerning between a benign polyp and a dysplastic one is a trivial task, distinguishing between tubular adenomas with low-grade dysplasia (LGD) and high-grade dysplasia (HGD) is a challenging task. In this study, we trained a deep learning model to distinguish between colorectal adenomas with LGD and HGD. Design: We retrieved 259 slides of adenomatous polyps taken between January 2011 and October 2024. Slides with HGD were reviewed by a subspecialty-trained GI pathologist. After excluding discordant and duplicate cases, 200 slides remained: 71 (35.5%) with HGD and 129 (64.5%) with LGD. The slides were divided into training (160 slides, 80%) and test (40 slides, 20%) sets. After patch generation and stain normalization, a ResNet34 model (pre-trained on ImageNet) was trained using 5-fold cross-validation. Slide classification was determined by aggregating patch-level predictions. Results: The model's slide-level prediction accuracy was 95.0%, correctly classifying all but 2 out of 40 slides. The model achieved an area under the receiver operating characteristic curve score of 0.981 and an F1 score of 0.923. Conclusions: This study demonstrates that deep learning models can accurately distinguish between colonic adenomas with LGD and HGD. Training on a larger dataset could increase the accuracy and generalizability of the model and should be a focus of further studies.
ISSN:2153-3539