Named Entity Recognition in Aviation Products Domain Based on BERT
The aviation products’ manufacturing industry is undergoing a profound transformation towards intelligence, among which the construction of a knowledge graph specifically for the aviation field has become the core link in achieving cognitive intelligence. In the process of knowledge graph...
| Main Authors: | Mingye Yang, Bernadin Namoano, Maryam Farsi, John Ahmet Erkoyuncu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE 2024-01-01 |
| Series: | IEEE Access |
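| Subjects: | Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM) |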
| Online Access: | https://ieeexplore.ieee.org/document/10795123/ |
| _version_ | 1850250127290138624 |
|---|---|
| author | Mingye Yang; Bernadin Namoano; Maryam Farsi; John Ahmet Erkoyuncu |
| author_facet | Mingye Yang; Bernadin Namoano; Maryam Farsi; John Ahmet Erkoyuncu |
| author_sort | Mingye Yang |
| collection | DOAJ |
| description | The aviation products’ manufacturing industry is undergoing a profound transformation towards intelligence, and the construction of a knowledge graph specifically for the aviation field has become the core link in achieving cognitive intelligence. In the process of knowledge graph construction, named entity recognition (NER) is a key step and one of the main tasks of knowledge extraction. Given the high degree of specialisation of aviation product text data and the wide span of contextual information, existing models often perform poorly in entity extraction. This paper proposes a new NER method specifically tailored for the aviation product field (BBC-Ap), which leverages domain-specific ontologies and deep learning algorithms to enhance the accuracy and efficiency of entity extraction from complex technical documents. The first step of this method is to establish an ontology model of aviation products and annotate the relevant text data to form a dataset for training the named entity model. Next, it adopts a multi-level model structure based on BERT, in which BERT is used to generate word vector representations, a bidirectional long short-term memory network (BiLSTM) is used as an encoder to extract semantic features, and a conditional random field (CRF) is used as a decoder to achieve optimal label assignment. In experiments on the constructed aviation product dataset, the model achieved a Precision of 91.74%, a Recall of 92.46%, and an F1 score of 92.1%. Compared with other baseline models, the F1 score is improved by 0.9% to 1.5%. The model also performs well on standard datasets such as CoNLLpp, with a Precision of 92.87%, a Recall of 92.54%, and an F1 score of 92.70%. Finally, the model was used to construct a knowledge graph reflecting the relationships between aviation products in Neo4j, further demonstrating the effectiveness and practicality of the method. |
| format | Article |
| id | doaj-art-58fbd636874e47b8bc0177f221398e6d |
| institution | OA Journals |
| issn | 2169-3536 |
| language | English |
| publishDate | 2024-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Access |
| spelling | doaj-art-58fbd636874e47b8bc0177f221398e6d; 2025-08-20T01:58:19Z; eng; IEEE; IEEE Access; 2169-3536; 2024-01-01; 12; 189710; 189721; 10.1109/ACCESS.2024.3516390; 10795123; Named Entity Recognition in Aviation Products Domain Based on BERT; Mingye Yang [0] https://orcid.org/0009-0001-0496-8778; Bernadin Namoano [1] https://orcid.org/0000-0002-6159-5250; Maryam Farsi [2] https://orcid.org/0000-0003-4549-0499; John Ahmet Erkoyuncu [3] https://orcid.org/0000-0002-8046-9911; Centre for Digital Engineering and Manufacturing, Cranfield University, Cranfield, U.K. (listed once per author); The aviation products’ manufacturing industry is undergoing a profound transformation towards intelligence, and the construction of a knowledge graph specifically for the aviation field has become the core link in achieving cognitive intelligence. In the process of knowledge graph construction, named entity recognition (NER) is a key step and one of the main tasks of knowledge extraction. Given the high degree of specialisation of aviation product text data and the wide span of contextual information, existing models often perform poorly in entity extraction. This paper proposes a new NER method specifically tailored for the aviation product field (BBC-Ap), which leverages domain-specific ontologies and deep learning algorithms to enhance the accuracy and efficiency of entity extraction from complex technical documents. The first step of this method is to establish an ontology model of aviation products and annotate the relevant text data to form a dataset for training the named entity model. Next, it adopts a multi-level model structure based on BERT, in which BERT is used to generate word vector representations, a bidirectional long short-term memory network (BiLSTM) is used as an encoder to extract semantic features, and a conditional random field (CRF) is used as a decoder to achieve optimal label assignment. In experiments on the constructed aviation product dataset, the model achieved a Precision of 91.74%, a Recall of 92.46%, and an F1 score of 92.1%. Compared with other baseline models, the F1 score is improved by 0.9% to 1.5%. The model also performs well on standard datasets such as CoNLLpp, with a Precision of 92.87%, a Recall of 92.54%, and an F1 score of 92.70%. Finally, the model was used to construct a knowledge graph reflecting the relationships between aviation products in Neo4j, further demonstrating the effectiveness and practicality of the method.; https://ieeexplore.ieee.org/document/10795123/; Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM) |
| spellingShingle | Mingye Yang; Bernadin Namoano; Maryam Farsi; John Ahmet Erkoyuncu; Named Entity Recognition in Aviation Products Domain Based on BERT; IEEE Access; Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM) |
| title | Named Entity Recognition in Aviation Products Domain Based on BERT |
| title_full | Named Entity Recognition in Aviation Products Domain Based on BERT |
| title_fullStr | Named Entity Recognition in Aviation Products Domain Based on BERT |
| title_full_unstemmed | Named Entity Recognition in Aviation Products Domain Based on BERT |
| title_short | Named Entity Recognition in Aviation Products Domain Based on BERT |
| title_sort | named entity recognition in aviation products domain based on bert |
| topic | Aviation; named entity recognition (NER); knowledge graph; bidirectional encoder representations from transformers (BERT); bidirectional long short-term memory network (Bi-LSTM) |
| url | https://ieeexplore.ieee.org/document/10795123/ |
| work_keys_str_mv | AT mingyeyang namedentityrecognitioninaviationproductsdomainbasedonbert AT bernadinnamoano namedentityrecognitioninaviationproductsdomainbasedonbert AT maryamfarsi namedentityrecognitioninaviationproductsdomainbasedonbert AT johnahmeterkoyuncu namedentityrecognitioninaviationproductsdomainbasedonbert |
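
The description field reports Precision, Recall, and F1 for two datasets. As a quick consistency check, the sketch below recomputes F1 as the harmonic mean of the quoted Precision and Recall. It assumes the reported F1 is derived from the same aggregate Precision and Recall figures; the record does not state the averaging scheme, and the function name `f1_score` is illustrative rather than taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures quoted in the description field, all in percent:
# (precision, recall, reported F1)
reported = {
    "aviation product dataset": (91.74, 92.46, 92.1),
    "CoNLLpp": (92.87, 92.54, 92.70),
}

for name, (p, r, f1_reported) in reported.items():
    f1 = f1_score(p, r)
    print(f"{name}: computed F1 = {f1:.2f}%, reported F1 = {f1_reported}%")
```

If the computed and reported values agree to the quoted precision, the three figures for each dataset are at least mutually consistent under this assumption.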