A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms

Due to the complexity and technicality of named entity recognition (NER) in the power grid field, existing methods are ineffective at identifying specialized terms in power grid operation record texts. Therefore, this paper proposes a Chinese power violation description entity recognition model base...

Full description

Saved in:
Bibliographic Details
Main Authors: Lingwen Meng, Yulin Wang, Yuanjun Huang, Dingli Ma, Xinshan Zhu, Shumei Zhang
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Energies
Subjects:
Online Access:https://www.mdpi.com/1996-1073/18/2/401
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832588560273768448
author Lingwen Meng
Yulin Wang
Yuanjun Huang
Dingli Ma
Xinshan Zhu
Shumei Zhang
author_facet Lingwen Meng
Yulin Wang
Yuanjun Huang
Dingli Ma
Xinshan Zhu
Shumei Zhang
author_sort Lingwen Meng
collection DOAJ
description Due to the complexity and technicality of named entity recognition (NER) in the power grid field, existing methods are ineffective at identifying specialized terms in power grid operation record texts. Therefore, this paper proposes a Chinese power violation description entity recognition model based on word-character fusion and multi-head attention mechanisms. The model first utilizes a collected power grid domain corpus to train a Word2Vec model, which produces static word vector representations. These static word vectors are then integrated with the dynamic character vector features of the input text generated by the BERT model, thereby mitigating the impact of segmentation errors on the NER model and enhancing the model’s ability to identify entity boundaries. The combined vectors are subsequently input into a BiGRU model for learning contextual features. The output from the BiGRU layer is then passed to an attention mechanism layer to obtain enhanced semantic features, which highlight key semantics and improve the model’s contextual understanding ability. Finally, the CRF layer decodes the output to generate the globally optimal label sequence with the highest probability. Experimental results on the constructed power grid field operation violation description dataset demonstrate that the proposed NER model outperforms the traditional BERT-BiLSTM-CRF model, with an average improvement of 1.58% in precision, recall, and F1-score. This demonstrates the effectiveness of the model design and further enhances the accuracy of entity recognition in the power grid domain.
format Article
id doaj-art-7adb8aa40259411591bd3de887ec8268
institution Kabale University
issn 1996-1073
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Energies
spelling doaj-art-7adb8aa40259411591bd3de887ec82682025-01-24T13:31:21ZengMDPI AGEnergies1996-10732025-01-0118240110.3390/en18020401A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention MechanismsLingwen Meng0Yulin Wang1Yuanjun Huang2Dingli Ma3Xinshan Zhu4Shumei Zhang5Electric Power Research Institute of Guizhou Power Grid Co., Ltd., Guiyang 550002, ChinaSchool of Electrical and Information Engineering, Tianjin University, Tianjin 300072, ChinaSouthern Power Grid Digital Grid Group Co., Ltd., Guizhou Branch, Guiyang 550002, ChinaSouthern Power Grid Digital Grid Group Co., Ltd., Guizhou Branch, Guiyang 550002, ChinaSchool of Electrical and Information Engineering, Tianjin University, Tianjin 300072, ChinaSchool of Electrical and Information Engineering, Tianjin University, Tianjin 300072, ChinaDue to the complexity and technicality of named entity recognition (NER) in the power grid field, existing methods are ineffective at identifying specialized terms in power grid operation record texts. Therefore, this paper proposes a Chinese power violation description entity recognition model based on word-character fusion and multi-head attention mechanisms. The model first utilizes a collected power grid domain corpus to train a Word2Vec model, which produces static word vector representations. These static word vectors are then integrated with the dynamic character vector features of the input text generated by the BERT model, thereby mitigating the impact of segmentation errors on the NER model and enhancing the model’s ability to identify entity boundaries. The combined vectors are subsequently input into a BiGRU model for learning contextual features. The output from the BiGRU layer is then passed to an attention mechanism layer to obtain enhanced semantic features, which highlight key semantics and improve the model’s contextual understanding ability. Finally, the CRF layer decodes the output to generate the globally optimal label sequence with the highest probability. Experimental results on the constructed power grid field operation violation description dataset demonstrate that the proposed NER model outperforms the traditional BERT-BiLSTM-CRF model, with an average improvement of 1.58% in precision, recall, and F1-score. This demonstrates the effectiveness of the model design and further enhances the accuracy of entity recognition in the power grid domain.https://www.mdpi.com/1996-1073/18/2/401named entity recognition (NER)attention mechanismelectric grid on-site operation violation descriptionword-character fusion
spellingShingle Lingwen Meng
Yulin Wang
Yuanjun Huang
Dingli Ma
Xinshan Zhu
Shumei Zhang
A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
Energies
named entity recognition (NER)
attention mechanism
electric grid on-site operation violation description
word-character fusion
title A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
title_full A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
title_fullStr A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
title_full_unstemmed A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
title_short A Named Entity Recognition Model for Chinese Electricity Violation Descriptions Based on Word-Character Fusion and Multi-Head Attention Mechanisms
title_sort named entity recognition model for chinese electricity violation descriptions based on word character fusion and multi head attention mechanisms
topic named entity recognition (NER)
attention mechanism
electric grid on-site operation violation description
word-character fusion
url https://www.mdpi.com/1996-1073/18/2/401
work_keys_str_mv AT lingwenmeng anamedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT yulinwang anamedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT yuanjunhuang anamedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT dinglima anamedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT xinshanzhu anamedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT shumeizhang anamedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT lingwenmeng namedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT yulinwang namedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT yuanjunhuang namedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT dinglima namedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT xinshanzhu namedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms
AT shumeizhang namedentityrecognitionmodelforchineseelectricityviolationdescriptionsbasedonwordcharacterfusionandmultiheadattentionmechanisms