Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning

To address the issue of efficiently reusing the massive amount of unstructured knowledge generated during the handling of track circuit equipment faults and to automate the construction of knowledge graphs in the railway maintenance domain, it is crucial to leverage knowledge extraction techniques t...

Full description

Saved in:
Bibliographic Details
Main Authors: Yanrui Chen, Guangwu Chen, Peng Li
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/24/22/7128
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850228031256264704
author Yanrui Chen
Guangwu Chen
Peng Li
author_facet Yanrui Chen
Guangwu Chen
Peng Li
author_sort Yanrui Chen
collection DOAJ
description To address the issue of efficiently reusing the massive amount of unstructured knowledge generated during the handling of track circuit equipment faults and to automate the construction of knowledge graphs in the railway maintenance domain, it is crucial to leverage knowledge extraction techniques to efficiently extract relational triplets from fault maintenance text data. Given the current lag in joint extraction technology within the railway domain and the inefficiency in resource utilization, this paper proposes a joint extraction model for track circuit entities and relations, integrating Global Pointer and tensor learning. Taking into account the associative characteristics of semantic relations, the nesting of domain-specific terms in the railway sector, and semantic diversity, this research views the relation extraction task as a tensor learning process and the entity recognition task as a span-based Global Pointer search process. First, a multi-layer dilate gated convolutional neural network with residual connections is used to extract key features and fuse the weighted information from the 12 different semantic layers of the RoBERTa-wwm-ext model, fully exploiting the performance of each encoding layer. Next, the Tucker decomposition method is utilized to capture the semantic correlations between relations, and an Efficient Global Pointer is employed to globally predict the start and end positions of subject and object entities, incorporating relative position information through rotary position embedding (RoPE). Finally, comparative experiments with existing mainstream joint extraction models were conducted, and the proposed model’s excellent performance was validated on the English public datasets NYT and WebNLG, the Chinese public dataset DuIE, and a private track circuit dataset. The <i>F</i>1 scores on the NYT, WebNLG, and DuIE public datasets reached 92.1%, 92.7%, and 78.2%, respectively.
format Article
id doaj-art-b9cd38f44ad74f4490b1932a3ee24cc5
institution OA Journals
issn 1424-8220
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-b9cd38f44ad74f4490b1932a3ee24cc52025-08-20T02:04:40ZengMDPI AGSensors1424-82202024-11-012422712810.3390/s24227128Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor LearningYanrui Chen0Guangwu Chen1Peng Li2School of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, ChinaSchool of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, ChinaSchool of Automation and Electrical Engineering, Lanzhou Jiaotong University, Lanzhou 730070, ChinaTo address the issue of efficiently reusing the massive amount of unstructured knowledge generated during the handling of track circuit equipment faults and to automate the construction of knowledge graphs in the railway maintenance domain, it is crucial to leverage knowledge extraction techniques to efficiently extract relational triplets from fault maintenance text data. Given the current lag in joint extraction technology within the railway domain and the inefficiency in resource utilization, this paper proposes a joint extraction model for track circuit entities and relations, integrating Global Pointer and tensor learning. Taking into account the associative characteristics of semantic relations, the nesting of domain-specific terms in the railway sector, and semantic diversity, this research views the relation extraction task as a tensor learning process and the entity recognition task as a span-based Global Pointer search process. First, a multi-layer dilate gated convolutional neural network with residual connections is used to extract key features and fuse the weighted information from the 12 different semantic layers of the RoBERTa-wwm-ext model, fully exploiting the performance of each encoding layer. Next, the Tucker decomposition method is utilized to capture the semantic correlations between relations, and an Efficient Global Pointer is employed to globally predict the start and end positions of subject and object entities, incorporating relative position information through rotary position embedding (RoPE). Finally, comparative experiments with existing mainstream joint extraction models were conducted, and the proposed model’s excellent performance was validated on the English public datasets NYT and WebNLG, the Chinese public dataset DuIE, and a private track circuit dataset. The <i>F</i>1 scores on the NYT, WebNLG, and DuIE public datasets reached 92.1%, 92.7%, and 78.2%, respectively.https://www.mdpi.com/1424-8220/24/22/7128joint entity and relation extractiontrack circuittucker decompositiontensor learningefficient global pointer
spellingShingle Yanrui Chen
Guangwu Chen
Peng Li
Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning
Sensors
joint entity and relation extraction
track circuit
tucker decomposition
tensor learning
efficient global pointer
title Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning
title_full Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning
title_fullStr Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning
title_full_unstemmed Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning
title_short Research on a Joint Extraction Method of Track Circuit Entities and Relations Integrating Global Pointer and Tensor Learning
title_sort research on a joint extraction method of track circuit entities and relations integrating global pointer and tensor learning
topic joint entity and relation extraction
track circuit
tucker decomposition
tensor learning
efficient global pointer
url https://www.mdpi.com/1424-8220/24/22/7128
work_keys_str_mv AT yanruichen researchonajointextractionmethodoftrackcircuitentitiesandrelationsintegratingglobalpointerandtensorlearning
AT guangwuchen researchonajointextractionmethodoftrackcircuitentitiesandrelationsintegratingglobalpointerandtensorlearning
AT pengli researchonajointextractionmethodoftrackcircuitentitiesandrelationsintegratingglobalpointerandtensorlearning