Deep learning Chinese input method with incremental vocabulary selection

The core task of an input method is to convert the keystroke sequences typed by users into Chinese character sequences. Input methods applying deep learning methods have advantages in learning long-range dependencies and solving data sparsity problems. However, the existing methods still have two shor...

Bibliographic Details
Main Authors: Huajian REN, Xiulan HAO, Wenjing XU
Format: Article
Language: zho
Published: Beijing Xintong Media Co., Ltd 2022-12-01
Series: Dianxin kexue
Online Access: http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2022294/
collection DOAJ
description The core task of an input method is to convert the keystroke sequences typed by users into Chinese character sequences. Input methods applying deep learning methods have advantages in learning long-range dependencies and solving data sparsity problems. However, the existing methods still have two shortcomings: the separate pinyin segmentation step in conversion leads to error propagation, and the models are too complex to meet the real-time performance demands of an input method. A deep-learning input method model incorporating incremental vocabulary selection was proposed to address these shortcomings. Various softmax optimization methods were compared. Experiments on People’s Daily data and Chinese Wikipedia data show that the model improves conversion accuracy by 15% compared with the current state-of-the-art model, and that the incremental vocabulary selection method makes the model 130 times faster without losing conversion accuracy.
id doaj-art-2bfc64033ab242fb828af8bb47c405a6
institution Kabale University
issn 1000-0801
topic Chinese input method
long short-term memory
vocabulary selection