A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM

In order to improve disambiguation accuracy of biomedical abbreviations, a semi-supervised abbreviation disambiguation method based on asymmetric convolutional neural networks and bidirectional long short term memory networks is proposed. Abbreviation is viewed as center. Morphology information, par...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG Chun-xiang, PANG Shu-yang, GAO Xue-yao
Format: Article
Language:zho
Published: Harbin University of Science and Technology Publications 2022-10-01
Series:Journal of Harbin University of Science and Technology
Subjects:
Online Access:https://hlgxb.hrbust.edu.cn/#/digest?ArticleID=2135
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849417278912724992
author ZHANG Chun-xiang
PANG Shu-yang
GAO Xue-yao
author_facet ZHANG Chun-xiang
PANG Shu-yang
GAO Xue-yao
author_sort ZHANG Chun-xiang
collection DOAJ
description In order to improve disambiguation accuracy of biomedical abbreviations, a semi-supervised abbreviation disambiguation method based on asymmetric convolutional neural networks and bidirectional long short term memory networks is proposed. Abbreviation is viewed as center. Morphology information, part of speech and semantic information from four adjacent lexical units are extracted as disambiguation features. Training corpus is extended by using Xgboost algorithm and LightGBM algorithm, and then expanded training corpus is input into this model. Asymmetric convolutional neural networks (ACNN) and bidirectional long short-term memory (Bi-LSTM) networks are utilized to extract features. Softmax function is applied to semantic classification. MSH corpus is adopted to optimize this model and test its disambiguation performance. Experimental results show that the proposed model can effectively improve disambiguation accuracy of abbreviations by using only a small amount of annotated corpus.
format Article
id doaj-art-b8f0b6b3e7dd4af4992c8f77ad643391
institution Kabale University
issn 1007-2683
language zho
publishDate 2022-10-01
publisher Harbin University of Science and Technology Publications
record_format Article
series Journal of Harbin University of Science and Technology
spelling doaj-art-b8f0b6b3e7dd4af4992c8f77ad6433912025-08-20T03:32:54ZzhoHarbin University of Science and Technology PublicationsJournal of Harbin University of Science and Technology1007-26832022-10-012705303710.15938/j.jhust.2022.05.005A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM ZHANG Chun-xiang0PANG Shu-yang1GAO Xue-yao2School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, ChinaSchool of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, ChinaSchool of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, ChinaIn order to improve disambiguation accuracy of biomedical abbreviations, a semi-supervised abbreviation disambiguation method based on asymmetric convolutional neural networks and bidirectional long short term memory networks is proposed. Abbreviation is viewed as center. Morphology information, part of speech and semantic information from four adjacent lexical units are extracted as disambiguation features. Training corpus is extended by using Xgboost algorithm and LightGBM algorithm, and then expanded training corpus is input into this model. Asymmetric convolutional neural networks (ACNN) and bidirectional long short-term memory (Bi-LSTM) networks are utilized to extract features. Softmax function is applied to semantic classification. MSH corpus is adopted to optimize this model and test its disambiguation performance. Experimental results show that the proposed model can effectively improve disambiguation accuracy of abbreviations by using only a small amount of annotated corpus.https://hlgxb.hrbust.edu.cn/#/digest?ArticleID=2135abbreviationxgboostlightgbmdisambiguation featureasymmetric convolution neural networkbidirectional long short-term memory
spellingShingle ZHANG Chun-xiang
PANG Shu-yang
GAO Xue-yao
A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM
Journal of Harbin University of Science and Technology
abbreviation
xgboost
lightgbm
disambiguation feature
asymmetric convolution neural network
bidirectional long short-term memory
title A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM
title_full A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM
title_fullStr A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM
title_full_unstemmed A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM
title_short A Semi-Supervised Abbreviation Disambiguation Method Based on ACNN and Bi-LSTM
title_sort semi supervised abbreviation disambiguation method based on acnn and bi lstm
topic abbreviation
xgboost
lightgbm
disambiguation feature
asymmetric convolution neural network
bidirectional long short-term memory
url https://hlgxb.hrbust.edu.cn/#/digest?ArticleID=2135
work_keys_str_mv AT zhangchunxiang asemisupervisedabbreviationdisambiguationmethodbasedonacnnandbilstm
AT pangshuyang asemisupervisedabbreviationdisambiguationmethodbasedonacnnandbilstm
AT gaoxueyao asemisupervisedabbreviationdisambiguationmethodbasedonacnnandbilstm
AT zhangchunxiang semisupervisedabbreviationdisambiguationmethodbasedonacnnandbilstm
AT pangshuyang semisupervisedabbreviationdisambiguationmethodbasedonacnnandbilstm
AT gaoxueyao semisupervisedabbreviationdisambiguationmethodbasedonacnnandbilstm