Examining HPO by organ and system to facilitate practical use by clinicians

Abstract The Human Phenotype Ontology (HPO) is widely used for annotating clinical text data, and sufficient annotation is crucial for the effective utilization of clinical texts. It was known that the use of LLMs can successfully extract symptoms and findings, but cannot annotate them with the HPO....

Full description

Saved in:
Bibliographic Details
Main Authors: Eisuke Dohi, Terue Takatsuki, Yuka Tateisi, Toyofumi Fujiwara, Yasunori Yamamoto
Format: Article
Language:English
Published: BioMed Central 2024-11-01
Series:Genomics & Informatics
Subjects:
Online Access:https://doi.org/10.1186/s44342-024-00024-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832546413694681088
author Eisuke Dohi
Terue Takatsuki
Yuka Tateisi
Toyofumi Fujiwara
Yasunori Yamamoto
author_facet Eisuke Dohi
Terue Takatsuki
Yuka Tateisi
Toyofumi Fujiwara
Yasunori Yamamoto
author_sort Eisuke Dohi
collection DOAJ
description Abstract The Human Phenotype Ontology (HPO) is widely used for annotating clinical text data, and sufficient annotation is crucial for the effective utilization of clinical texts. It was known that the use of LLMs can successfully extract symptoms and findings, but cannot annotate them with the HPO. We hypothesized that one of the potential issue for this is the lack of appropriate terms in the HPO. Therefore, during the Biomedical Linked Annotation Hackathon 8 (BLAH8), we attempted the following two tasks in order to grasp the overall picture of HPO. (1) Extract all HPO terms for each of the 23 HPO subclasses (defined as categories) directly under the HPO "Phenotypic abnormality" and then (2) search for major attributes in each of 23 categories. We employed LLM for these two tasks related to examining HPO and, at the same time, found that LLM didn't work well without ingenuity for tasks that lacked sentences and context. A manual search for terms within each category revealed that the HPO contains a mix of terms with four major attributes: (1) Disease Name, (2) Condition, (3) Test Data, and (4) Symptoms and Findings. Manual curation showed that the ratio of symptoms and findings varied from 0 to 93.1% across categories. For clinicians, who are end-users of medical terminology including HPO, it is difficult to understand ontologies. However, for good quality ontology is also important for good-quality data, and a clinician’s help is essential. It is also important to make the overall picture and limitations of ontologies easy to understand in order to bring out the explanatory power of LLMs and artificial intelligence.
format Article
id doaj-art-9e28ffb457994ed29a36089cf55092b5
institution Kabale University
issn 2234-0742
language English
publishDate 2024-11-01
publisher BioMed Central
record_format Article
series Genomics & Informatics
spelling doaj-art-9e28ffb457994ed29a36089cf55092b52025-02-03T06:59:20ZengBioMed CentralGenomics & Informatics2234-07422024-11-012211510.1186/s44342-024-00024-1Examining HPO by organ and system to facilitate practical use by cliniciansEisuke Dohi0Terue Takatsuki1Yuka Tateisi2Toyofumi Fujiwara3Yasunori Yamamoto4National Center of Neurology and Psychiatry, National Institute of NeuroscienceDatabase Center for Life Science, ROIS-DSOffice of NBDC Program, JSTDatabase Center for Life Science, ROIS-DSDatabase Center for Life Science, ROIS-DSAbstract The Human Phenotype Ontology (HPO) is widely used for annotating clinical text data, and sufficient annotation is crucial for the effective utilization of clinical texts. It was known that the use of LLMs can successfully extract symptoms and findings, but cannot annotate them with the HPO. We hypothesized that one of the potential issue for this is the lack of appropriate terms in the HPO. Therefore, during the Biomedical Linked Annotation Hackathon 8 (BLAH8), we attempted the following two tasks in order to grasp the overall picture of HPO. (1) Extract all HPO terms for each of the 23 HPO subclasses (defined as categories) directly under the HPO "Phenotypic abnormality" and then (2) search for major attributes in each of 23 categories. We employed LLM for these two tasks related to examining HPO and, at the same time, found that LLM didn't work well without ingenuity for tasks that lacked sentences and context. A manual search for terms within each category revealed that the HPO contains a mix of terms with four major attributes: (1) Disease Name, (2) Condition, (3) Test Data, and (4) Symptoms and Findings. Manual curation showed that the ratio of symptoms and findings varied from 0 to 93.1% across categories. For clinicians, who are end-users of medical terminology including HPO, it is difficult to understand ontologies. However, for good quality ontology is also important for good-quality data, and a clinician’s help is essential. It is also important to make the overall picture and limitations of ontologies easy to understand in order to bring out the explanatory power of LLMs and artificial intelligence.https://doi.org/10.1186/s44342-024-00024-1Large Language ModelBiological ontologiesAnnotationPhenotype
spellingShingle Eisuke Dohi
Terue Takatsuki
Yuka Tateisi
Toyofumi Fujiwara
Yasunori Yamamoto
Examining HPO by organ and system to facilitate practical use by clinicians
Genomics & Informatics
Large Language Model
Biological ontologies
Annotation
Phenotype
title Examining HPO by organ and system to facilitate practical use by clinicians
title_full Examining HPO by organ and system to facilitate practical use by clinicians
title_fullStr Examining HPO by organ and system to facilitate practical use by clinicians
title_full_unstemmed Examining HPO by organ and system to facilitate practical use by clinicians
title_short Examining HPO by organ and system to facilitate practical use by clinicians
title_sort examining hpo by organ and system to facilitate practical use by clinicians
topic Large Language Model
Biological ontologies
Annotation
Phenotype
url https://doi.org/10.1186/s44342-024-00024-1
work_keys_str_mv AT eisukedohi examininghpobyorganandsystemtofacilitatepracticalusebyclinicians
AT teruetakatsuki examininghpobyorganandsystemtofacilitatepracticalusebyclinicians
AT yukatateisi examininghpobyorganandsystemtofacilitatepracticalusebyclinicians
AT toyofumifujiwara examininghpobyorganandsystemtofacilitatepracticalusebyclinicians
AT yasunoriyamamoto examininghpobyorganandsystemtofacilitatepracticalusebyclinicians