Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning

Abstract Background In breast cancer, several gene expression assays have been developed to provide a more personalised treatment. This study focuses on the prediction of two molecular proliferation signatures: an 11-gene proliferation score and the MKI67 proliferation marker gene. The aim was to as...

Full description

Saved in:
Bibliographic Details
Main Authors: Andreas Ekholm, Yinxi Wang, Johan Vallon-Christersson, Constance Boissin, Mattias Rantalainen
Format: Article
Language:English
Published: BMC 2024-12-01
Series:BMC Cancer
Subjects:
Online Access:https://doi.org/10.1186/s12885-024-13248-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850255859356008448
author Andreas Ekholm
Yinxi Wang
Johan Vallon-Christersson
Constance Boissin
Mattias Rantalainen
author_facet Andreas Ekholm
Yinxi Wang
Johan Vallon-Christersson
Constance Boissin
Mattias Rantalainen
author_sort Andreas Ekholm
collection DOAJ
description Abstract Background In breast cancer, several gene expression assays have been developed to provide a more personalised treatment. This study focuses on the prediction of two molecular proliferation signatures: an 11-gene proliferation score and the MKI67 proliferation marker gene. The aim was to assess whether these could be predicted from digital whole slide images (WSIs) using deep learning models. Methods WSIs and RNA-sequencing data from 819 invasive breast cancer patients were included for training, and models were evaluated on an internal test set of 172 cases as well as on 997 cases from a fully independent external test set. Two deep Convolutional Neural Network (CNN) models were optimised using WSIs and gene expression readouts from RNA-sequencing data of either the proliferation signature or the proliferation marker, and assessed using Spearman correlation (r). Prognostic performance was assessed through Cox proportional hazard modelling, estimating hazard ratios (HR). Results Optimised CNNs successfully predicted the proliferation score and proliferation marker on the unseen internal test set (ρ = 0.691(p < 0.001) with R2 = 0.438, and ρ = 0.564 (p < 0.001) with R2 = 0.251 respectively) and on the external test set (ρ = 0.502 (p < 0.001) with R2 = 0.319, and ρ = 0.403 (p < 0.001) with R2 = 0.222 respectively). Patients with a high proliferation score or marker were significantly associated with a higher risk of recurrence or death in the external test set (HR = 1.65 (95% CI: 1.05–2.61) and HR = 1.84 (95% CI: 1.17–2.89), respectively). Conclusions The results from this study suggest that gene expression levels of proliferation scores can be predicted directly from breast cancer morphology in WSIs using CNNs and that the predictions provide prognostic information that could be used in research as well as in the clinical setting.
format Article
id doaj-art-4bc232c6cce1401ebf02083c99e6d507
institution OA Journals
issn 1471-2407
language English
publishDate 2024-12-01
publisher BMC
record_format Article
series BMC Cancer
spelling doaj-art-4bc232c6cce1401ebf02083c99e6d5072025-08-20T01:56:46ZengBMCBMC Cancer1471-24072024-12-0124111310.1186/s12885-024-13248-9Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learningAndreas Ekholm0Yinxi Wang1Johan Vallon-Christersson2Constance Boissin3Mattias Rantalainen4Department of Medical Epidemiology and Biostatistics, Karolinska InstitutetDepartment of Medical Epidemiology and Biostatistics, Karolinska InstitutetDivision of Oncology, Department of Clinical Sciences Lund, Lund UniversityDepartment of Medical Epidemiology and Biostatistics, Karolinska InstitutetDepartment of Medical Epidemiology and Biostatistics, Karolinska InstitutetAbstract Background In breast cancer, several gene expression assays have been developed to provide a more personalised treatment. This study focuses on the prediction of two molecular proliferation signatures: an 11-gene proliferation score and the MKI67 proliferation marker gene. The aim was to assess whether these could be predicted from digital whole slide images (WSIs) using deep learning models. Methods WSIs and RNA-sequencing data from 819 invasive breast cancer patients were included for training, and models were evaluated on an internal test set of 172 cases as well as on 997 cases from a fully independent external test set. Two deep Convolutional Neural Network (CNN) models were optimised using WSIs and gene expression readouts from RNA-sequencing data of either the proliferation signature or the proliferation marker, and assessed using Spearman correlation (r). Prognostic performance was assessed through Cox proportional hazard modelling, estimating hazard ratios (HR). Results Optimised CNNs successfully predicted the proliferation score and proliferation marker on the unseen internal test set (ρ = 0.691(p < 0.001) with R2 = 0.438, and ρ = 0.564 (p < 0.001) with R2 = 0.251 respectively) and on the external test set (ρ = 0.502 (p < 0.001) with R2 = 0.319, and ρ = 0.403 (p < 0.001) with R2 = 0.222 respectively). Patients with a high proliferation score or marker were significantly associated with a higher risk of recurrence or death in the external test set (HR = 1.65 (95% CI: 1.05–2.61) and HR = 1.84 (95% CI: 1.17–2.89), respectively). Conclusions The results from this study suggest that gene expression levels of proliferation scores can be predicted directly from breast cancer morphology in WSIs using CNNs and that the predictions provide prognostic information that could be used in research as well as in the clinical setting.https://doi.org/10.1186/s12885-024-13248-9Artificial intelligenceComputational pathologyProliferationGene expressionBreast cancer
spellingShingle Andreas Ekholm
Yinxi Wang
Johan Vallon-Christersson
Constance Boissin
Mattias Rantalainen
Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning
BMC Cancer
Artificial intelligence
Computational pathology
Proliferation
Gene expression
Breast cancer
title Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning
title_full Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning
title_fullStr Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning
title_full_unstemmed Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning
title_short Prediction of gene expression-based breast cancer proliferation scores from histopathology whole slide images using deep learning
title_sort prediction of gene expression based breast cancer proliferation scores from histopathology whole slide images using deep learning
topic Artificial intelligence
Computational pathology
Proliferation
Gene expression
Breast cancer
url https://doi.org/10.1186/s12885-024-13248-9
work_keys_str_mv AT andreasekholm predictionofgeneexpressionbasedbreastcancerproliferationscoresfromhistopathologywholeslideimagesusingdeeplearning
AT yinxiwang predictionofgeneexpressionbasedbreastcancerproliferationscoresfromhistopathologywholeslideimagesusingdeeplearning
AT johanvallonchristersson predictionofgeneexpressionbasedbreastcancerproliferationscoresfromhistopathologywholeslideimagesusingdeeplearning
AT constanceboissin predictionofgeneexpressionbasedbreastcancerproliferationscoresfromhistopathologywholeslideimagesusingdeeplearning
AT mattiasrantalainen predictionofgeneexpressionbasedbreastcancerproliferationscoresfromhistopathologywholeslideimagesusingdeeplearning