Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.

Real-world data (RWD) in the medical field, such as electronic health records (EHRs) and medication orders, are receiving increasing attention from researchers and practitioners. While structured data have played a vital role thus far, unstructured data represented by text (e.g., discharge summaries...

Full description

Saved in:
Bibliographic Details
Main Authors: Shuntaro Yada, Tomohiro Nishiyama, Shoko Wakamiya, Yoshimasa Kawazoe, Shungo Imai, Satoko Hori, Eiji Aramaki
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0310432
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850028715407310848
author Shuntaro Yada
Tomohiro Nishiyama
Shoko Wakamiya
Yoshimasa Kawazoe
Shungo Imai
Satoko Hori
Eiji Aramaki
author_facet Shuntaro Yada
Tomohiro Nishiyama
Shoko Wakamiya
Yoshimasa Kawazoe
Shungo Imai
Satoko Hori
Eiji Aramaki
author_sort Shuntaro Yada
collection DOAJ
description Real-world data (RWD) in the medical field, such as electronic health records (EHRs) and medication orders, are receiving increasing attention from researchers and practitioners. While structured data have played a vital role thus far, unstructured data represented by text (e.g., discharge summaries) are not effectively utilized because of the difficulty in extracting medical information. We evaluated the information gained by supplementing structured data with clinical concepts extracted from unstructured text by leveraging natural language processing techniques. Using a machine learning-based pretrained named entity recognition tool, we extracted disease and medication names from real discharge summaries in a Japanese hospital and linked them to medical concepts using medical term dictionaries. By comparing the diseases and medications mentioned in the text with medical codes in tabular diagnosis records, we found that: (1) the text data contained richer information on patient symptoms than tabular diagnosis records, whereas the medication-order table stored more injection data than text. In addition, (2) extractable information regarding specific diseases showed surprisingly small intersections among text, diagnosis records, and medication orders. Text data can thus be a useful supplement for RWD mining, which is further demonstrated by (3) our practical application system for drug safety evaluation, which exhaustively visualizes suspicious adverse drug effects caused by the simultaneous use of anticancer drug pairs. We conclude that proper use of textual information extraction can lead to better outcomes in medical RWD mining.
format Article
id doaj-art-413a4eed82054181b6e14f9eeb05005a
institution DOAJ
issn 1932-6203
language English
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-413a4eed82054181b6e14f9eeb05005a2025-08-20T02:59:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-01199e031043210.1371/journal.pone.0310432Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.Shuntaro YadaTomohiro NishiyamaShoko WakamiyaYoshimasa KawazoeShungo ImaiSatoko HoriEiji AramakiReal-world data (RWD) in the medical field, such as electronic health records (EHRs) and medication orders, are receiving increasing attention from researchers and practitioners. While structured data have played a vital role thus far, unstructured data represented by text (e.g., discharge summaries) are not effectively utilized because of the difficulty in extracting medical information. We evaluated the information gained by supplementing structured data with clinical concepts extracted from unstructured text by leveraging natural language processing techniques. Using a machine learning-based pretrained named entity recognition tool, we extracted disease and medication names from real discharge summaries in a Japanese hospital and linked them to medical concepts using medical term dictionaries. By comparing the diseases and medications mentioned in the text with medical codes in tabular diagnosis records, we found that: (1) the text data contained richer information on patient symptoms than tabular diagnosis records, whereas the medication-order table stored more injection data than text. In addition, (2) extractable information regarding specific diseases showed surprisingly small intersections among text, diagnosis records, and medication orders. Text data can thus be a useful supplement for RWD mining, which is further demonstrated by (3) our practical application system for drug safety evaluation, which exhaustively visualizes suspicious adverse drug effects caused by the simultaneous use of anticancer drug pairs. We conclude that proper use of textual information extraction can lead to better outcomes in medical RWD mining.https://doi.org/10.1371/journal.pone.0310432
spellingShingle Shuntaro Yada
Tomohiro Nishiyama
Shoko Wakamiya
Yoshimasa Kawazoe
Shungo Imai
Satoko Hori
Eiji Aramaki
Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
PLoS ONE
title Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
title_full Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
title_fullStr Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
title_full_unstemmed Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
title_short Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
title_sort utility analysis and demonstration of real world clinical texts a case study on japanese cancer related ehrs
url https://doi.org/10.1371/journal.pone.0310432
work_keys_str_mv AT shuntaroyada utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs
AT tomohironishiyama utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs
AT shokowakamiya utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs
AT yoshimasakawazoe utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs
AT shungoimai utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs
AT satokohori utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs
AT eijiaramaki utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs