Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation

In natural language, the phenomenon of polysemy is widespread, which makes it very difficult for machines to process natural language. Word sense disambiguation is a key issue in the field of natural language processing. This paper introduces the more common statistical learning methods used in the...

Full description

Saved in:
Bibliographic Details
Main Authors: Lei Wang, Qun Ai
Format: Article
Language:English
Published: Wiley 2020-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2020/7278085
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849684289436778496
author Lei Wang
Qun Ai
author_facet Lei Wang
Qun Ai
author_sort Lei Wang
collection DOAJ
description In natural language, the phenomenon of polysemy is widespread, which makes it very difficult for machines to process natural language. Word sense disambiguation is a key issue in the field of natural language processing. This paper introduces the more common statistical learning methods used in the field of word sense disambiguation. Using the naive Bayesian machine learning method and the feature vector set extracted and constructed by the Dice coefficient method, a semantic word disambiguation model based on semantics is realized. The results of comparative experiments show that the proposed method is better compared with known systems. This paper proposes a method for disambiguation of word segmentation in professional fields based on unsupervised learning. This method does not rely on professional domain knowledge and training corpus and only uses the frequency, mutual information, and boundary entropy information of the string in the test corpus to solve the problem of word segmentation ambiguity. The experimental results show that these three evaluation standards can solve the problem of word segmentation ambiguity in professional fields and improve the effect of word segmentation. Among them, the segmentation result using mutual information is the best, and the performance is stable.
format Article
id doaj-art-ba332084599c461ebd5b323b0f5a3e9f
institution DOAJ
issn 1076-2787
1099-0526
language English
publishDate 2020-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-ba332084599c461ebd5b323b0f5a3e9f2025-08-20T03:23:30ZengWileyComplexity1076-27871099-05262020-01-01202010.1155/2020/72780857278085Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine TranslationLei Wang0Qun Ai1Department of Foreign Language, Jilin Business and Technology College, Jilin, Changchun 130000, ChinaDepartment of Basic Education, Jilin University, Jilin, Changchun 130000, ChinaIn natural language, the phenomenon of polysemy is widespread, which makes it very difficult for machines to process natural language. Word sense disambiguation is a key issue in the field of natural language processing. This paper introduces the more common statistical learning methods used in the field of word sense disambiguation. Using the naive Bayesian machine learning method and the feature vector set extracted and constructed by the Dice coefficient method, a semantic word disambiguation model based on semantics is realized. The results of comparative experiments show that the proposed method is better compared with known systems. This paper proposes a method for disambiguation of word segmentation in professional fields based on unsupervised learning. This method does not rely on professional domain knowledge and training corpus and only uses the frequency, mutual information, and boundary entropy information of the string in the test corpus to solve the problem of word segmentation ambiguity. The experimental results show that these three evaluation standards can solve the problem of word segmentation ambiguity in professional fields and improve the effect of word segmentation. Among them, the segmentation result using mutual information is the best, and the performance is stable.http://dx.doi.org/10.1155/2020/7278085
spellingShingle Lei Wang
Qun Ai
Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation
Complexity
title Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation
title_full Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation
title_fullStr Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation
title_full_unstemmed Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation
title_short Numerical Simulation of Ambiguity Resolution in Multiple Information Streams Based on Network Machine Translation
title_sort numerical simulation of ambiguity resolution in multiple information streams based on network machine translation
url http://dx.doi.org/10.1155/2020/7278085
work_keys_str_mv AT leiwang numericalsimulationofambiguityresolutioninmultipleinformationstreamsbasedonnetworkmachinetranslation
AT qunai numericalsimulationofambiguityresolutioninmultipleinformationstreamsbasedonnetworkmachinetranslation