Fastest Prostate Cancer Prediction on Whole-Mount Pathological Slides Using Self-Reform Multilayer Transformer

In the clinical diagnosis of prostate cancer (PCa), the pathological image is regarded as the &#x201C;golden standard&#x201D;. Pathological images are characterized by their immense size with giga-pixel. Moreover, pixels in each whole-mount slide of PCa are increased by approximately <inl...

Full description

Saved in:
Bibliographic Details
Main Authors: Linjie Xu, Xiaotian Yu, Haoming Luo, Jili Wang, Wenjie Liang, Jing Zhang, Zunlei Feng, Cheng Lu, Xiuming Zhang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10949171/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In the clinical diagnosis of prostate cancer (PCa), the pathological image is regarded as the &#x201C;golden standard&#x201D;. Pathological images are characterized by their immense size with giga-pixel. Moreover, pixels in each whole-mount slide of PCa are increased by approximately <inline-formula> <tex-math notation="LaTeX">$4\sim 9$ </tex-math></inline-formula> times, therefore containing more information for diagnosis. Therefore, rapid feature extraction of large prostate slices can effectively improve the diagnostic efficiency of prostate pathological images. This article introduces the self-reform multilayer transformer (SMT) to enhance acceleration of model training and inference in prostate pathological images. The SMT workflow involves a stepwise focus among three layers, progressing from macro-scale to micro-scale. A gradient-based strategy is proposed to efficiently focus on crucial regions, and a backward rethinking strategy is also proposed to enhance the precision of former layers. This iterative process of forward and backward training improves classification and focusing performance. Extensive experiments demonstrate that SMT achieves up to <inline-formula> <tex-math notation="LaTeX">$2.71\times $ </tex-math></inline-formula> faster training and <inline-formula> <tex-math notation="LaTeX">$775\times $ </tex-math></inline-formula> faster inference compared to traditional patch-cropping methods, while maintaining comparable accuracy and requiring less storage compared to existing SOTA methods. The current approach of cropping each slide into patches results in extensive time and storage consumption. The SMT of this paper applies forward focusing and backward rethinking to achieve fast and accurate diagnosis of PCa. Even for samples with microcarcinoma, SMT can still focus on cancer region precisely, which has great significance for avoiding missed diagnosis in clinic. This paper indicates that most regions of pathological slides are inessential or repetitive for diagnosis. The strategy of SMT can provide a new perspective for future task on pathological images.
ISSN:2169-3536