LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL
The article discusses some current issues of interpreting out-of-vocabulary words by modern machine translation systems (MT systems) in the context of changing forms and ways of maintaining an automatic dictionary. It provides a critical outline of the typology of MT systems and strategies for their...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Volgograd State University
2024-11-01
|
Series: | Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie |
Subjects: | |
Online Access: | https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2836-beliaeva-l-n-kamshilova-o-n-lexicographic-problems-of-machine-translation-systems-on-the-way-from-literal-to-neural |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841545713346936832 |
---|---|
author | Larisa N. Beliaeva Olga N. Kamshilova |
author_facet | Larisa N. Beliaeva Olga N. Kamshilova |
author_sort | Larisa N. Beliaeva |
collection | DOAJ |
description | The article discusses some current issues of interpreting out-of-vocabulary words by modern machine translation systems (MT systems) in the context of changing forms and ways of maintaining an automatic dictionary. It provides a critical outline of the typology of MT systems and strategies for their development. It describes the impact of fast developing software and technologies on these strategies and analyzes the changes they bring into the forms of dictionary support. The research shows that the linguistic support and the structure of automatic dictionaries, whatever the MT system is, are fundamentally important for ensuring the quality of
translation. Despite all the success of neural MT (NMT) systems, their automatically updated vocabulary databases do not record words characterized by terminological specificity and low frequency in the special texts and text corpora on which the system is trained. Analysis of translations performed by two popular NMT systems – Google
Translate and Yandex Translate – has proven that they fail to process and unify the translation of words that are not entered in the system dictionaries, a task used to be solved easily by users of all types of MT systems with the help of automatic dictionaries. With statistic-based automatic dictionaries it remains a pressing problem and requires a special approach when editing MP results. |
format | Article |
id | doaj-art-05cc4abc1eee4db48ac9abd798840613 |
institution | Kabale University |
issn | 1998-9911 2409-1979 |
language | English |
publishDate | 2024-11-01 |
publisher | Volgograd State University |
record_format | Article |
series | Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie |
spelling | doaj-art-05cc4abc1eee4db48ac9abd7988406132025-01-11T16:01:03ZengVolgograd State UniversityVestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie1998-99112409-19792024-11-0123561910.15688/jvolsu2.2024.5.1LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURALLarisa N. Beliaeva0https://orcid.org/0000-0002-8622-4595Olga N. Kamshilova1https://orcid.org/0000-0002-1488-2206Herzen State Pedagogical University of Russia, Saint Petersburg, RussiaHerzen State Pedagogical University of Russia, Saint Petersburg, Russia; Saint Petersburg University of Management Technologies and Economics, Saint Petersburg, RussiaThe article discusses some current issues of interpreting out-of-vocabulary words by modern machine translation systems (MT systems) in the context of changing forms and ways of maintaining an automatic dictionary. It provides a critical outline of the typology of MT systems and strategies for their development. It describes the impact of fast developing software and technologies on these strategies and analyzes the changes they bring into the forms of dictionary support. The research shows that the linguistic support and the structure of automatic dictionaries, whatever the MT system is, are fundamentally important for ensuring the quality of translation. Despite all the success of neural MT (NMT) systems, their automatically updated vocabulary databases do not record words characterized by terminological specificity and low frequency in the special texts and text corpora on which the system is trained. Analysis of translations performed by two popular NMT systems – Google Translate and Yandex Translate – has proven that they fail to process and unify the translation of words that are not entered in the system dictionaries, a task used to be solved easily by users of all types of MT systems with the help of automatic dictionaries. With statistic-based automatic dictionaries it remains a pressing problem and requires a special approach when editing MP results.https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2836-beliaeva-l-n-kamshilova-o-n-lexicographic-problems-of-machine-translation-systems-on-the-way-from-literal-to-neuralmachine translationmachine translation strategytypology of machine translation systemsautomatic dictionaryout-of-vocabulary wordslinguistic support |
spellingShingle | Larisa N. Beliaeva Olga N. Kamshilova LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL Vestnik Volgogradskogo Gosudarstvennogo Universiteta. Seriâ 2. Âzykoznanie machine translation machine translation strategy typology of machine translation systems automatic dictionary out-of-vocabulary words linguistic support |
title | LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL |
title_full | LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL |
title_fullStr | LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL |
title_full_unstemmed | LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL |
title_short | LEXICOGRAPHIC PROBLEMS OF MACHINE TRANSLATION SYSTEMS ON THE WAY FROM LITERAL TO NEURAL |
title_sort | lexicographic problems of machine translation systems on the way from literal to neural |
topic | machine translation machine translation strategy typology of machine translation systems automatic dictionary out-of-vocabulary words linguistic support |
url | https://l.jvolsu.com/index.php/en/archive-en/928-science-journal-of-volsu-linguistics-2024-vol-23-no-5/artificial-intelligence-potential-in-natural-language-processing-and-machine-translation/2836-beliaeva-l-n-kamshilova-o-n-lexicographic-problems-of-machine-translation-systems-on-the-way-from-literal-to-neural |
work_keys_str_mv | AT larisanbeliaeva lexicographicproblemsofmachinetranslationsystemsonthewayfromliteraltoneural AT olgankamshilova lexicographicproblemsofmachinetranslationsystemsonthewayfromliteraltoneural |