LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL

The article discusses some current issues of interpreting out-of-vocabulary words by modern machine translation systems (MT systems) in the context of changing forms and ways of maintaining an automatic dictionary. It provides a critical outline of the typology of MT systems and strategies for their...

Full description

Saved in:
Bibliographic Details
Main Authors: Larisa N. Beliaeva, Olga N. Kamshilova
Format: Article
Language:English
Published: Volgograd State University 2024-11-01
Series:Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
Subjects:
Online Access:https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2836-beliaeva-l-n-kamshilova-o-n-lexicographic-problems-of-machine-translation-systems-on-the-way-from-literal-to-neural
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841545713346936832
author Larisa N. Beliaeva
Olga N. Kamshilova
author_facet Larisa N. Beliaeva
Olga N. Kamshilova
author_sort Larisa N. Beliaeva
collection DOAJ
description The article discusses some current issues of interpreting out-of-vocabulary words by modern machine translation systems (MT systems) in the context of changing forms and ways of maintaining an automatic dictionary. It provides a critical outline of the typology of MT systems and strategies for their development. It describes the impact of fast developing software and technologies on these strategies and analyzes the changes they bring into the forms of dictionary support. The research shows that the linguistic support and the structure of automatic dictionaries, whatever the MT system is, are fundamentally important for ensuring the quality of translation. Despite all the success of neural MT (NMT) systems, their automatically updated vocabulary databases do not record words characterized by terminological specificity and low frequency in the special texts and text corpora on which the system is trained. Analysis of translations performed by two popular NMT systems – Google Translate and Yandex Translate – has proven that they fail to process and unify the translation of words that are not entered in the system dictionaries, a task used to be solved easily by users of all types of MT systems with the help of automatic dictionaries. With statistic-based automatic dictionaries it remains a pressing problem and requires a special approach when editing MP results.
format Article
id doaj-art-05cc4abc1eee4db48ac9abd798840613
institution Kabale University
issn 1998-9911
2409-1979
language English
publishDate 2024-11-01
publisher Volgograd State University
record_format Article
series Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
spelling doaj-art-05cc4abc1eee4db48ac9abd7988406132025-01-11T16:01:03ZengVolgograd State UniversityVestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie1998-99112409-19792024-11-0123561910.15688/jvolsu2.2024.5.1LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURALLarisa N. Beliaeva0https://orcid.org/0000-0002-8622-4595Olga N. Kamshilova1https://orcid.org/0000-0002-1488-2206Herzen State Pedagogical University of Russia, Saint Petersburg, RussiaHerzen State Pedagogical University of Russia, Saint Petersburg, Russia; Saint Petersburg University of Management Technologies and Economics, Saint Petersburg, RussiaThe article discusses some current issues of interpreting out-of-vocabulary words by modern machine translation systems (MT systems) in the context of changing forms and ways of maintaining an automatic dictionary. It provides a critical outline of the typology of MT systems and strategies for their development. It describes the impact of fast developing software and technologies on these strategies and analyzes the changes they bring into the forms of dictionary support. The research shows that the linguistic support and the structure of automatic dictionaries, whatever the MT system is, are fundamentally important for ensuring the quality of translation. Despite all the success of neural MT (NMT) systems, their automatically updated vocabulary databases do not record words characterized by terminological specificity and low frequency in the special texts and text corpora on which the system is trained. Analysis of translations performed by two popular NMT systems – Google Translate and Yandex Translate – has proven that they fail to process and unify the translation of words that are not entered in the system dictionaries, a task used to be solved easily by users of all types of MT systems with the help of automatic dictionaries. With statistic-based automatic dictionaries it remains a pressing problem and requires a special approach when editing MP results.https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2836-beliaeva-l-n-kamshilova-o-n-lexicographic-problems-of-machine-translation-systems-on-the-way-from-literal-to-neuralmachine translationmachine translation strategytypology of machine translation systemsautomatic dictionaryout-of-vocabulary wordslinguistic support
spellingShingle Larisa N. Beliaeva
Olga N. Kamshilova
LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie
machine translation
machine translation strategy
typology of machine translation systems
automatic dictionary
out-of-vocabulary words
linguistic support
title LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
title_full LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
title_fullStr LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
title_full_unstemmed LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
title_short LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
title_sort lexicographic problems of machine translation systems on the way from literal to neural
topic machine translation
machine translation strategy
typology of machine translation systems
automatic dictionary
out-of-vocabulary words
linguistic support
url https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2836-beliaeva-l-n-kamshilova-o-n-lexicographic-problems-of-machine-translation-systems-on-the-way-from-literal-to-neural
work_keys_str_mv AT larisanbeliaeva lexicographicproblemsofmachinetranslationsystemsonthewayfromliteraltoneural
AT olgankamshilova lexicographicproblemsofmachinetranslationsystemsonthewayfromliteraltoneural