Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.

Various plant attributes, such as growing environment, growth cycle, and ecological distribution, can provide support to fields like agricultural production and biodiversity. This information is widely dispersed in texts. Manual extraction of this information is highly inefficient due to a fact that...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhihao Zong, Hongtao Shan, Gaoyu Zhang, George Xianzhi Yuan, Shuyi Zhang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0327186
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849704286099865600
author Zhihao Zong
Hongtao Shan
Gaoyu Zhang
George Xianzhi Yuan
Shuyi Zhang
author_facet Zhihao Zong
Hongtao Shan
Gaoyu Zhang
George Xianzhi Yuan
Shuyi Zhang
author_sort Zhihao Zong
collection DOAJ
description Various plant attributes, such as growing environment, growth cycle, and ecological distribution, can provide support to fields like agricultural production and biodiversity. This information is widely dispersed in texts. Manual extraction of this information is highly inefficient due to a fact that it not only takes considerable time but also increases the likelihood of overlooking relevant details. To convert textual data into structured information, we extract relational triples in the form of (subject, relation, object), where the subject represents the names of plants, the object represents the plant attributes, and the relation represents the classification of plant attributes. To reduce complexity, we employ a joint extraction of entities and relations based on a tagging scheme. The task is broken down into three parts. Firstly, a matrix is used to simultaneously match plant entities and plant attributes. Then, the predefined categories of plant attributes are classified. Finally, the categories of plant attributes are matched with entity-attribute pairs. The tagging-based method typically utilizes parameter sharing to facilitate interaction between different tasks, but it can also lead to issues such as error amplification and instability in parameter updates. Thus, we adopt improved techniques at different stages to enhance the performance of our model. This includes adjustment to the word embedding layer of BERT and optimization in relation prediction. The modification of the word embedding layer is intended to better integrate contextual information during text representation and reduce the interference of erroneous information. The relation prediction part mainly involves multi-level information fusion of textual information, thereby making corrections and highlighting important information. We name the three-stage method as "Bwdgv". Compared to the currently advanced PRGC model, the F1-score of the proposed method has an improvement of 1.4%. With the help of extracted triples, we can construct knowledge graphs and other tasks to better apply various plant attributes.
format Article
id doaj-art-894c48d22d3a4b27bb521180c02ae741
institution DOAJ
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-894c48d22d3a4b27bb521180c02ae7412025-08-20T03:16:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01207e032718610.1371/journal.pone.0327186Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.Zhihao ZongHongtao ShanGaoyu ZhangGeorge Xianzhi YuanShuyi ZhangVarious plant attributes, such as growing environment, growth cycle, and ecological distribution, can provide support to fields like agricultural production and biodiversity. This information is widely dispersed in texts. Manual extraction of this information is highly inefficient due to a fact that it not only takes considerable time but also increases the likelihood of overlooking relevant details. To convert textual data into structured information, we extract relational triples in the form of (subject, relation, object), where the subject represents the names of plants, the object represents the plant attributes, and the relation represents the classification of plant attributes. To reduce complexity, we employ a joint extraction of entities and relations based on a tagging scheme. The task is broken down into three parts. Firstly, a matrix is used to simultaneously match plant entities and plant attributes. Then, the predefined categories of plant attributes are classified. Finally, the categories of plant attributes are matched with entity-attribute pairs. The tagging-based method typically utilizes parameter sharing to facilitate interaction between different tasks, but it can also lead to issues such as error amplification and instability in parameter updates. Thus, we adopt improved techniques at different stages to enhance the performance of our model. This includes adjustment to the word embedding layer of BERT and optimization in relation prediction. The modification of the word embedding layer is intended to better integrate contextual information during text representation and reduce the interference of erroneous information. The relation prediction part mainly involves multi-level information fusion of textual information, thereby making corrections and highlighting important information. We name the three-stage method as "Bwdgv". Compared to the currently advanced PRGC model, the F1-score of the proposed method has an improvement of 1.4%. With the help of extracted triples, we can construct knowledge graphs and other tasks to better apply various plant attributes.https://doi.org/10.1371/journal.pone.0327186
spellingShingle Zhihao Zong
Hongtao Shan
Gaoyu Zhang
George Xianzhi Yuan
Shuyi Zhang
Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.
PLoS ONE
title Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.
title_full Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.
title_fullStr Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.
title_full_unstemmed Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.
title_short Plant attribute extraction: An enhancing three-stage deep learning model for relational triple extraction.
title_sort plant attribute extraction an enhancing three stage deep learning model for relational triple extraction
url https://doi.org/10.1371/journal.pone.0327186
work_keys_str_mv AT zhihaozong plantattributeextractionanenhancingthreestagedeeplearningmodelforrelationaltripleextraction
AT hongtaoshan plantattributeextractionanenhancingthreestagedeeplearningmodelforrelationaltripleextraction
AT gaoyuzhang plantattributeextractionanenhancingthreestagedeeplearningmodelforrelationaltripleextraction
AT georgexianzhiyuan plantattributeextractionanenhancingthreestagedeeplearningmodelforrelationaltripleextraction
AT shuyizhang plantattributeextractionanenhancingthreestagedeeplearningmodelforrelationaltripleextraction