Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.
Real-world data (RWD) in the medical field, such as electronic health records (EHRs) and medication orders, are receiving increasing attention from researchers and practitioners. While structured data have played a vital role thus far, unstructured data represented by text (e.g., discharge summaries...
Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Public Library of Science (PLoS)
2024-01-01
|
| Series: | PLoS ONE |
| Online Access: | https://doi.org/10.1371/journal.pone.0310432 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850028715407310848 |
|---|---|
| author | Shuntaro Yada Tomohiro Nishiyama Shoko Wakamiya Yoshimasa Kawazoe Shungo Imai Satoko Hori Eiji Aramaki |
| author_facet | Shuntaro Yada Tomohiro Nishiyama Shoko Wakamiya Yoshimasa Kawazoe Shungo Imai Satoko Hori Eiji Aramaki |
| author_sort | Shuntaro Yada |
| collection | DOAJ |
| description | Real-world data (RWD) in the medical field, such as electronic health records (EHRs) and medication orders, are receiving increasing attention from researchers and practitioners. While structured data have played a vital role thus far, unstructured data represented by text (e.g., discharge summaries) are not effectively utilized because of the difficulty in extracting medical information. We evaluated the information gained by supplementing structured data with clinical concepts extracted from unstructured text by leveraging natural language processing techniques. Using a machine learning-based pretrained named entity recognition tool, we extracted disease and medication names from real discharge summaries in a Japanese hospital and linked them to medical concepts using medical term dictionaries. By comparing the diseases and medications mentioned in the text with medical codes in tabular diagnosis records, we found that: (1) the text data contained richer information on patient symptoms than tabular diagnosis records, whereas the medication-order table stored more injection data than text. In addition, (2) extractable information regarding specific diseases showed surprisingly small intersections among text, diagnosis records, and medication orders. Text data can thus be a useful supplement for RWD mining, which is further demonstrated by (3) our practical application system for drug safety evaluation, which exhaustively visualizes suspicious adverse drug effects caused by the simultaneous use of anticancer drug pairs. We conclude that proper use of textual information extraction can lead to better outcomes in medical RWD mining. |
| format | Article |
| id | doaj-art-413a4eed82054181b6e14f9eeb05005a |
| institution | DOAJ |
| issn | 1932-6203 |
| language | English |
| publishDate | 2024-01-01 |
| publisher | Public Library of Science (PLoS) |
| record_format | Article |
| series | PLoS ONE |
| spelling | doaj-art-413a4eed82054181b6e14f9eeb05005a2025-08-20T02:59:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-01199e031043210.1371/journal.pone.0310432Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs.Shuntaro YadaTomohiro NishiyamaShoko WakamiyaYoshimasa KawazoeShungo ImaiSatoko HoriEiji AramakiReal-world data (RWD) in the medical field, such as electronic health records (EHRs) and medication orders, are receiving increasing attention from researchers and practitioners. While structured data have played a vital role thus far, unstructured data represented by text (e.g., discharge summaries) are not effectively utilized because of the difficulty in extracting medical information. We evaluated the information gained by supplementing structured data with clinical concepts extracted from unstructured text by leveraging natural language processing techniques. Using a machine learning-based pretrained named entity recognition tool, we extracted disease and medication names from real discharge summaries in a Japanese hospital and linked them to medical concepts using medical term dictionaries. By comparing the diseases and medications mentioned in the text with medical codes in tabular diagnosis records, we found that: (1) the text data contained richer information on patient symptoms than tabular diagnosis records, whereas the medication-order table stored more injection data than text. In addition, (2) extractable information regarding specific diseases showed surprisingly small intersections among text, diagnosis records, and medication orders. Text data can thus be a useful supplement for RWD mining, which is further demonstrated by (3) our practical application system for drug safety evaluation, which exhaustively visualizes suspicious adverse drug effects caused by the simultaneous use of anticancer drug pairs. We conclude that proper use of textual information extraction can lead to better outcomes in medical RWD mining.https://doi.org/10.1371/journal.pone.0310432 |
| spellingShingle | Shuntaro Yada Tomohiro Nishiyama Shoko Wakamiya Yoshimasa Kawazoe Shungo Imai Satoko Hori Eiji Aramaki Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs. PLoS ONE |
| title | Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs. |
| title_full | Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs. |
| title_fullStr | Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs. |
| title_full_unstemmed | Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs. |
| title_short | Utility analysis and demonstration of real-world clinical texts: A case study on Japanese cancer-related EHRs. |
| title_sort | utility analysis and demonstration of real world clinical texts a case study on japanese cancer related ehrs |
| url | https://doi.org/10.1371/journal.pone.0310432 |
| work_keys_str_mv | AT shuntaroyada utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs AT tomohironishiyama utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs AT shokowakamiya utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs AT yoshimasakawazoe utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs AT shungoimai utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs AT satokohori utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs AT eijiaramaki utilityanalysisanddemonstrationofrealworldclinicaltextsacasestudyonjapanesecancerrelatedehrs |