Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development

ABSTRACT Background MicroRNAs (miRNAs) are small regulatory molecules that play a critical role in various biological processes by regulating gene expression. They have emerged as crucial players in cancer development, including breast cancer. However, individual research studies may be subject to s...

Full description

Saved in:
Bibliographic Details
Main Authors: Hong Ye, Yuyu Wu, Richard Tran, Jie Wang
Format: Article
Language:English
Published: Wiley 2025-07-01
Series:Cancer Medicine
Subjects:
Online Access:https://doi.org/10.1002/cam4.70986
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849413342353948672
author Hong Ye
Yuyu Wu
Richard Tran
Jie Wang
author_facet Hong Ye
Yuyu Wu
Richard Tran
Jie Wang
author_sort Hong Ye
collection DOAJ
description ABSTRACT Background MicroRNAs (miRNAs) are small regulatory molecules that play a critical role in various biological processes by regulating gene expression. They have emerged as crucial players in cancer development, including breast cancer. However, individual research studies may be subject to specific biases. Methods To gain a more comprehensive understanding of miRNA involvement in breast cancer, we employed a large‐scale analysis of miRNA studies retrieved from PubMed. Our approach involved tokenizing abstracts to identify key biomedical entities (e.g., miRNA, gene, disease) and constructing miRNA‐cancer co‐occurrence networks using bioinformatic analysis. Results This analysis revealed miR‐21 as the most frequently studied miRNA in breast cancer research, with a significant difference compared to other miRNAs. Network analysis identified SMAD3, PIK3R1, STAT3, and TP53 as key regulators potentially affecting pathways like TGF‐β signaling and p53 signaling. Additionally, our analysis suggests that genes associated with miR‐21 are often downregulated in tumors and exhibit a positive correlation with T cell infiltration, particularly CD8+ T cells, potentially indicating a favorable prognosis. Conclusion Our findings highlight miR‐21 as a central regulatory hub and potential biomarker in breast cancer. While informative, the results are derived from literature‐based data and may be influenced by text‐mining limitations, underscoring the need for experimental validation.
format Article
id doaj-art-5a87963b76e84a52a2eeae6869dfa20f
institution Kabale University
issn 2045-7634
language English
publishDate 2025-07-01
publisher Wiley
record_format Article
series Cancer Medicine
spelling doaj-art-5a87963b76e84a52a2eeae6869dfa20f2025-08-20T03:34:09ZengWileyCancer Medicine2045-76342025-07-011413n/an/a10.1002/cam4.70986Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer DevelopmentHong Ye0Yuyu Wu1Richard Tran2Jie Wang3Department of Neurology Xiangshan Hospital of TCM Medical and Health Group Ningbo City Zhejiang Province ChinaDepartment of Acupuncture Xiangshan Hospital of TCM Medical and Health Group Ningbo City Zhejiang Province ChinaMasters Program in Computer Science University of Chicago Chicago Illinois USAApplied Data Science Program Syracuse University Syracuse New York USAABSTRACT Background MicroRNAs (miRNAs) are small regulatory molecules that play a critical role in various biological processes by regulating gene expression. They have emerged as crucial players in cancer development, including breast cancer. However, individual research studies may be subject to specific biases. Methods To gain a more comprehensive understanding of miRNA involvement in breast cancer, we employed a large‐scale analysis of miRNA studies retrieved from PubMed. Our approach involved tokenizing abstracts to identify key biomedical entities (e.g., miRNA, gene, disease) and constructing miRNA‐cancer co‐occurrence networks using bioinformatic analysis. Results This analysis revealed miR‐21 as the most frequently studied miRNA in breast cancer research, with a significant difference compared to other miRNAs. Network analysis identified SMAD3, PIK3R1, STAT3, and TP53 as key regulators potentially affecting pathways like TGF‐β signaling and p53 signaling. Additionally, our analysis suggests that genes associated with miR‐21 are often downregulated in tumors and exhibit a positive correlation with T cell infiltration, particularly CD8+ T cells, potentially indicating a favorable prognosis. Conclusion Our findings highlight miR‐21 as a central regulatory hub and potential biomarker in breast cancer. While informative, the results are derived from literature‐based data and may be influenced by text‐mining limitations, underscoring the need for experimental validation.https://doi.org/10.1002/cam4.70986apoptosisbreast cancermiR‐21miRNAnatural language processingtext mining
spellingShingle Hong Ye
Yuyu Wu
Richard Tran
Jie Wang
Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development
Cancer Medicine
apoptosis
breast cancer
miR‐21
miRNA
natural language processing
text mining
title Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development
title_full Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development
title_fullStr Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development
title_full_unstemmed Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development
title_short Text Mining Strategy Identifies Gene Networks Under Control of miR‐21 in Breast Cancer Development
title_sort text mining strategy identifies gene networks under control of mir 21 in breast cancer development
topic apoptosis
breast cancer
miR‐21
miRNA
natural language processing
text mining
url https://doi.org/10.1002/cam4.70986
work_keys_str_mv AT hongye textminingstrategyidentifiesgenenetworksundercontrolofmir21inbreastcancerdevelopment
AT yuyuwu textminingstrategyidentifiesgenenetworksundercontrolofmir21inbreastcancerdevelopment
AT richardtran textminingstrategyidentifiesgenenetworksundercontrolofmir21inbreastcancerdevelopment
AT jiewang textminingstrategyidentifiesgenenetworksundercontrolofmir21inbreastcancerdevelopment