A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion

Presently, the State Grid Corporation of China has accumulated a large amount of maintenance records for power primary equipment. Unfortunately, most of these records are unstructured data which lead to difficultly analyze and utilize them. The emergence of natural language processing technology and...

Full description

Saved in:
Bibliographic Details
Main Authors: Lanfei He, Xuefei Zhang, Zhiwei Li, Peng Xiao, Ziming Wei, Xu Cheng, Shaocheng Qu
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2022/8114217
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849395324558245888
author Lanfei He
Xuefei Zhang
Zhiwei Li
Peng Xiao
Ziming Wei
Xu Cheng
Shaocheng Qu
author_facet Lanfei He
Xuefei Zhang
Zhiwei Li
Peng Xiao
Ziming Wei
Xu Cheng
Shaocheng Qu
author_sort Lanfei He
collection DOAJ
description Presently, the State Grid Corporation of China has accumulated a large amount of maintenance records for power primary equipment. Unfortunately, most of these records are unstructured data which lead to difficultly analyze and utilize them. The emergence of natural language processing technology and deep learning methods provide a solution for unstructured text data. This paper proposes a progressive multitype feature fusion model to recognize Chinese named entity of unstructured maintenance records for power primary equipment. Firstly, the textual characteristics and word separation difficulties of maintenance records are analyzed, then 7 main entity categories of power technical terms from unstructured maintenance records are chosen, and 3452 maintenance records are labeled by these categories, which is so called EPE-MR training dataset. Secondly, the standard test reports, standard maintenance, and fault analysis reports for three types of power primary equipment (namely, main transformer, circuit breaker, and isolating switch) are employed as corpus to train character embedding in order to obtain certain words representation ability of maintenance records. After that, progressive multilevel radicals feature extraction module is designed to get detailed and fine semantic information in a hierarchical manner. Further, radicals feature representation and character embedding are concatenated and sent to BiLSTM module to extract contextual information in order to improve Chinese entity recognition ability. Moreover, CRF is introduced to handle the dependencies among prediction labels and to output the optimal prediction sequence, which can easily obtain structured data of maintenance records. Finally, comparative experiments on public MSRA dataset, China People’s Daily corpus, and EPE-MR dataset are implemented, respectively, which show the effectiveness of the proposed method.
format Article
id doaj-art-ab168cfc236f47c4820582c8c7aabb8d
institution Kabale University
issn 1099-0526
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-ab168cfc236f47c4820582c8c7aabb8d2025-08-20T03:39:40ZengWileyComplexity1099-05262022-01-01202210.1155/2022/8114217A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature FusionLanfei He0Xuefei Zhang1Zhiwei Li2Peng Xiao3Ziming Wei4Xu Cheng5Shaocheng Qu6Economic and Technical Research Institute of Hubei Electric Power Company of State GridEconomic and Technical Research Institute of Hubei Electric Power Company of State GridEconomic and Technical Research Institute of Hubei Electric Power Company of State GridDepartment of Electronics and Information EngineeringDepartment of Electronics and Information EngineeringWuhan Esmorning S&T Co., Ltd.Department of Electronics and Information EngineeringPresently, the State Grid Corporation of China has accumulated a large amount of maintenance records for power primary equipment. Unfortunately, most of these records are unstructured data which lead to difficultly analyze and utilize them. The emergence of natural language processing technology and deep learning methods provide a solution for unstructured text data. This paper proposes a progressive multitype feature fusion model to recognize Chinese named entity of unstructured maintenance records for power primary equipment. Firstly, the textual characteristics and word separation difficulties of maintenance records are analyzed, then 7 main entity categories of power technical terms from unstructured maintenance records are chosen, and 3452 maintenance records are labeled by these categories, which is so called EPE-MR training dataset. Secondly, the standard test reports, standard maintenance, and fault analysis reports for three types of power primary equipment (namely, main transformer, circuit breaker, and isolating switch) are employed as corpus to train character embedding in order to obtain certain words representation ability of maintenance records. After that, progressive multilevel radicals feature extraction module is designed to get detailed and fine semantic information in a hierarchical manner. Further, radicals feature representation and character embedding are concatenated and sent to BiLSTM module to extract contextual information in order to improve Chinese entity recognition ability. Moreover, CRF is introduced to handle the dependencies among prediction labels and to output the optimal prediction sequence, which can easily obtain structured data of maintenance records. Finally, comparative experiments on public MSRA dataset, China People’s Daily corpus, and EPE-MR dataset are implemented, respectively, which show the effectiveness of the proposed method.http://dx.doi.org/10.1155/2022/8114217
spellingShingle Lanfei He
Xuefei Zhang
Zhiwei Li
Peng Xiao
Ziming Wei
Xu Cheng
Shaocheng Qu
A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
Complexity
title A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
title_full A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
title_fullStr A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
title_full_unstemmed A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
title_short A Chinese Named Entity Recognition Model of Maintenance Records for Power Primary Equipment Based on Progressive Multitype Feature Fusion
title_sort chinese named entity recognition model of maintenance records for power primary equipment based on progressive multitype feature fusion
url http://dx.doi.org/10.1155/2022/8114217
work_keys_str_mv AT lanfeihe achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT xuefeizhang achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT zhiweili achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT pengxiao achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT zimingwei achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT xucheng achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT shaochengqu achinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT lanfeihe chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT xuefeizhang chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT zhiweili chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT pengxiao chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT zimingwei chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT xucheng chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion
AT shaochengqu chinesenamedentityrecognitionmodelofmaintenancerecordsforpowerprimaryequipmentbasedonprogressivemultitypefeaturefusion