Classification of Russian Texts by Genres Based on Modern Embeddings and Rhythm
The article investigates modern vector text models for solving the problem of genre classification of Russian-language texts. Models include ELMo embeddings, BERT language model with pre-training and a complex of numerical rhythm features based on lexico-grammatical features. The experiments were ca...
Saved in:
| Main Author: | Ksenia Vladimirovna Lagutina |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Yaroslavl State University
2022-12-01
|
| Series: | Моделирование и анализ информационных систем |
| Subjects: | |
| Online Access: | https://www.mais-journal.ru/jour/article/view/1750 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Text Classification by Genre Based on Rhythm Features
by: Ksenia Vladimirovna Lagutina, et al.
Published: (2021-10-01) -
Comparison of Style Features for the Authorship Verification of Literary Texts
by: Ksenia Vladimirovna Lagutina
Published: (2021-10-01) -
Automated Search and Analysis of the Stylometric Features that Describe the Style of the Prose 19th-21st Centuries
by: Ksenia V. Lagutina, et al.
Published: (2020-09-01) -
Automated Search of Rhythm Figures in a Literary Text for Comparative Analysis of Originals and Translations Based on the Material of the English and Russian Languages
by: Nadezhda Stanislavovna Lagutina, et al.
Published: (2019-09-01) -
Text classification by CEFR levels using machine learning methods and BERT language model
by: Nadezhda S. Lagutina, et al.
Published: (2023-09-01)