Combining the Pre-Trained Model Roberta with a Two-Layer Bidirectional Long- and Short-Term Memory Network and a Multi-Head Attention Mechanism for a Rice Phenomics Entity Classification Study

At a time when global food security is under pressure, phenomics research on rice, a major food crop, has become increasingly important. In-depth analysis of rice phenotypic characteristics is key to promoting the genetic improvement of rice and sustainable agricultural...

Bibliographic Details
Main Authors: Dayu Xu, Xinyu Zhu, Xuyao Zhang, Fang Xia
Format: Article
Language: English
Published: MDPI AG 2025-04-01
Series: AgriEngineering
Subjects: rice phenomics; deep learning models; Roberta; BiLSTM; entity classification
Online Access: https://www.mdpi.com/2624-7402/7/4/94
author Dayu Xu
Xinyu Zhu
Xuyao Zhang
Fang Xia
collection DOAJ
description At a time when global food security is under pressure, phenomics research on rice, a major food crop, has become increasingly important. In-depth analysis of rice phenotypic characteristics is key to promoting the genetic improvement of rice and sustainable agricultural development. However, accurately identifying and classifying entities in the large volume of rice phenotypic data is a challenging task. In this study, a deep learning model based on Roberta, a two-layer BiLSTM, and multi-head attention (Roberta-two-layer BiLSTM-MHA) was constructed for rice phenomics entity classification. First, the pre-trained Roberta model performs deep feature extraction on the rice phenotype text data to capture the underlying semantic information in the text. Next, a two-layer bidirectional long- and short-term memory network (BiLSTM) models the contextual information and captures long-term dependencies in the text sequences. Finally, a multi-head attention mechanism enables the model to focus adaptively on key features at different levels, which improves the classification accuracy for complex phenotypic information. The experimental results show that the model performs well on several evaluation metrics, with accuracy, recall, and F1-scores of 89.56%, 86.40%, and 87.90%, respectively. This work provides an efficient and precise entity classification tool for rice phenomics research and offers a method that can be adapted to other crop phenomics analyses, which is expected to promote technological innovation in crop genetic breeding and agricultural production.
format Article
id doaj-art-8b06396e5e294017ad321b5c4a364ebb
institution DOAJ
issn 2624-7402
language English
publishDate 2025-04-01
publisher MDPI AG
record_format Article
series AgriEngineering
spelling doaj-art-8b06396e5e294017ad321b5c4a364ebb
2025-08-20T03:14:16Z
eng
MDPI AG
AgriEngineering
ISSN 2624-7402
2025-04-01
Volume 7, Issue 4, Article 94
DOI 10.3390/agriengineering7040094
Dayu Xu (College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, China)
Xinyu Zhu (College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, China)
Xuyao Zhang (College of Economics and Management, Zhejiang A&F University, Hangzhou 311300, China)
Fang Xia (College of Economics and Management, Zhejiang A&F University, Hangzhou 311300, China)
title Combining the Pre-Trained Model Roberta with a Two-Layer Bidirectional Long- and Short-Term Memory Network and a Multi-Head Attention Mechanism for a Rice Phenomics Entity Classification Study
topic rice phenomics
deep learning models
Roberta
BiLSTM
entity classification
url https://www.mdpi.com/2624-7402/7/4/94
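
The description reports accuracy, recall, and F1-score for the entity classifier. As a minimal sketch of how such figures can be derived from gold and predicted entity labels, the Python below computes accuracy together with macro-averaged recall and F1. The record does not state how the authors averaged their scores, so macro averaging is an assumption here, and the function name `classification_report` and the example labels ("Trait", "Gene", "Environment") are hypothetical rather than taken from the paper.

```python
from collections import Counter


def classification_report(gold, pred):
    """Return accuracy, macro-averaged recall, and macro-averaged F1.

    Macro averaging is an assumption; the record does not say which
    averaging scheme the authors used.
    """
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and equally long")

    labels = sorted(set(gold) | set(pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1          # correct prediction for class g
        else:
            fp[p] += 1          # p was predicted but is wrong
            fn[g] += 1          # g was the true class but was missed

    recalls, f1s = [], []
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        recalls.append(recall)
        f1s.append(f1)

    return {
        "accuracy": sum(tp.values()) / len(gold),
        "recall": sum(recalls) / len(labels),
        "f1": sum(f1s) / len(labels),
    }


# Hypothetical entity labels, for illustration only.
gold = ["Trait", "Trait", "Gene", "Gene", "Environment", "Trait"]
pred = ["Trait", "Gene", "Gene", "Gene", "Environment", "Trait"]
report = classification_report(gold, pred)
print(report)
```

Micro or weighted averaging would give different recall and F1 values on imbalanced entity classes, so the averaging scheme should be checked against the full article before comparing with the reported numbers.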