Named Entity Recognition in Aviation Products Domain Based on BERT

Bibliographic Details
Main Authors: Mingye Yang, Bernadin Namoano, Maryam Farsi, John Ahmet Erkoyuncu
Format: Article
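Language: English
Published: IEEE 2024-01-01
Series: IEEE Access
Subjects: Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM)
Online Access: https://ieeexplore.ieee.org/document/10795123/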
_version_ 1850250127290138624
author Mingye Yang
Bernadin Namoano
Maryam Farsi
John Ahmet Erkoyuncu
author_sort Mingye Yang
collection DOAJ
description The aviation product manufacturing industry is undergoing a profound shift towards intelligent systems, and building a knowledge graph for the aviation field has become central to achieving cognitive intelligence. Named entity recognition (NER) is a key step in knowledge graph construction and one of the main tasks of knowledge extraction. Because aviation product text is highly specialised and its contextual information spans long distances, existing models often perform poorly at entity extraction. This paper proposes an NER method tailored to the aviation product field (BBC-Ap) that combines a domain-specific ontology with deep learning to improve the accuracy and efficiency of entity extraction from complex technical documents. The method first establishes an ontology model of aviation products and annotates the relevant text data to form a training dataset for the NER model. It then adopts a multi-level model structure based on BERT: BERT generates word vector representations, a bidirectional long short-term memory network (BiLSTM) serves as the encoder to extract semantic features, and a conditional random field (CRF) serves as the decoder to produce the optimal label assignment. In experiments on the constructed aviation product dataset, the model achieved a precision of 91.74%, a recall of 92.46%, and an F1 score of 92.1%; compared with the baseline models, the F1 score improved by 0.9% to 1.5%. The model also performs well on standard datasets such as CoNLLpp, with a precision of 92.87%, a recall of 92.54%, and an F1 score of 92.70%. Finally, the model was used to construct a knowledge graph in Neo4j reflecting the relationships between aviation products, further demonstrating the effectiveness and practicality of the method.
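The reported F1 scores can be checked against the reported precision and recall, since F1 is conventionally the harmonic mean of the two. The sketch below is only an arithmetic consistency check and is not taken from the paper: the function name f1_score and the dictionary layout are illustrative assumptions, and the inputs are the percentages quoted in the description.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (result is in the same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall, in percent, as quoted in the description above.
# The dictionary keys are illustrative labels, not identifiers from the paper.
reported = {
    "aviation product dataset": (91.74, 92.46),
    "CoNLLpp": (92.87, 92.54),
}

for name, (p, r) in reported.items():
    print(f"{name}: F1 = {f1_score(p, r):.2f}%")
# prints:
# aviation product dataset: F1 = 92.10%
# CoNLLpp: F1 = 92.70%
```

Both values agree with the F1 scores stated in the description (92.1% and 92.70%) to the precision given there.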
format Article
id doaj-art-58fbd636874e47b8bc0177f221398e6d
institution OA Journals
issn 2169-3536
language English
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-58fbd636874e47b8bc0177f221398e6d
2025-08-20T01:58:19Z
eng
IEEE
IEEE Access
2169-3536
2024-01-01
volume 12
pages 189710-189721
DOI 10.1109/ACCESS.2024.3516390
10795123
Named Entity Recognition in Aviation Products Domain Based on BERT
Mingye Yang, https://orcid.org/0009-0001-0496-8778, Centre for Digital Engineering and Manufacturing, Cranfield University, Cranfield, U.K.
Bernadin Namoano, https://orcid.org/0000-0002-6159-5250, Centre for Digital Engineering and Manufacturing, Cranfield University, Cranfield, U.K.
Maryam Farsi, https://orcid.org/0000-0003-4549-0499, Centre for Digital Engineering and Manufacturing, Cranfield University, Cranfield, U.K.
John Ahmet Erkoyuncu, https://orcid.org/0000-0002-8046-9911, Centre for Digital Engineering and Manufacturing, Cranfield University, Cranfield, U.K.
https://ieeexplore.ieee.org/document/10795123/
title Named Entity Recognition in Aviation Products Domain Based on BERT
topic Aviation
named entity recognition (NER)
knowledge graph
bidirectional encoder representations from transformers (BERT)
bidirectional long short-term memory network (Bi-LSTM)
url https://ieeexplore.ieee.org/document/10795123/