Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text

The purpose of research. The purpose of the study is to develop and test an approach to training digital content compilers in creating alternative text that accurately describes the original image, using a neural network to generate reference images reconstructed from the text. The lack of textual d...

Full description

Saved in:
Bibliographic Details
Main Authors: Yekaterina A. Kosova, Kirill I. Redkokosh, Pavel O. Mikheyev
Format: Article
Language:English
Published: Plekhanov Russian University of Economics 2024-03-01
Series:Открытое образование (Москва)
Subjects:
Online Access:https://openedu.rea.ru/jour/article/view/1008
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849690989471465472
author Yekaterina A. Kosova
Kirill I. Redkokosh
Pavel O. Mikheyev
author_facet Yekaterina A. Kosova
Kirill I. Redkokosh
Pavel O. Mikheyev
author_sort Yekaterina A. Kosova
collection DOAJ
description The purpose of research. The purpose of the study is to develop and test an approach to training digital content compilers in creating alternative text that accurately describes the original image, using a neural network to generate reference images reconstructed from the text. The lack of textual descriptions of visual content in a web resource limits digital accessibility, especially for users with visual disorders. To ensure accessibility, each informative image should be accompanied by the alternative text. Text alternatives generated by means of automated tools are known to be lower in quality to human-generated descriptions. Therefore, a digital content compiler must be able to develop the alternative text for images. It has been suggested that a neural network for generating images from text descriptions can act as a tool for checking the relevance of the developed text alternatives.Materials and methods. The study was carried out in April-May 2023. 17 undergraduate students studied the requirements for developing text alternatives, completed initial text descriptions for three proposed photographs, and then corrected the text using the Kandinsky 2.1 neural network according to the algorithm: generating an image from the description; visual comparison of the resulting image with the original; returning to editing the description or ending the process. Based on the initial and final descriptions, the researchers reconstructed the images using the same neural network. Further work consisted of assessing the quality of all text descriptions and the similarity of all generated images to the original ones. The results of the study (text descriptions; expert evaluations; links to generated images) were published as a data set in the Mendeley Data repository. The t-test, Pearson correlation and multivariate regression were used to analyze the data (at the specified significance level p = 0,05).Results. It was found that the quality scores of the initial and final text descriptions were not significantly different (p > 0,05), and also there were no significant differences for the length of the text (p > 0,05). At the same time, the similarity of the generated images and original photographs after students used the neural network has increased considerably (p < 0,05). Therefore, training in the neural network contributed to improving the quality (similarity to the original) of images generated from modified text descriptions, without losing the descriptions’ quality. It was also shown that the quality of the final text alternatives was higher the larger their size within the allotted limit, the better and shorter the initial descriptions (p < 0,05). Thus, concise and accurate alternative descriptions for images after training students in a neural network can be converted into equally high-quality text alternatives, the relevance of which is increased by adding plot details to the description.Conclusion. Neural networks generating images can be applied as a software tool to encourage potential content authors to create more accurate and complete alternative text while keeping it concise. It seems important to continue the research by extending it to other types of images and using a variety of neural networks.
format Article
id doaj-art-6be058a600f84c74b4c44ae11c4d2b07
institution DOAJ
issn 1818-4243
2079-5939
language English
publishDate 2024-03-01
publisher Plekhanov Russian University of Economics
record_format Article
series Открытое образование (Москва)
spelling doaj-art-6be058a600f84c74b4c44ae11c4d2b072025-08-20T03:21:10ZengPlekhanov Russian University of EconomicsОткрытое образование (Москва)1818-42432079-59392024-03-0128192010.21686/1818-4243-2024-1-9-20684Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative TextYekaterina A. Kosova0Kirill I. Redkokosh1Pavel O. Mikheyev2V.I. Vernadsky Crimean Federal UniversityV.I. Vernadsky Crimean Federal UniversityV.I. Vernadsky Crimean Federal UniversityThe purpose of research. The purpose of the study is to develop and test an approach to training digital content compilers in creating alternative text that accurately describes the original image, using a neural network to generate reference images reconstructed from the text. The lack of textual descriptions of visual content in a web resource limits digital accessibility, especially for users with visual disorders. To ensure accessibility, each informative image should be accompanied by the alternative text. Text alternatives generated by means of automated tools are known to be lower in quality to human-generated descriptions. Therefore, a digital content compiler must be able to develop the alternative text for images. It has been suggested that a neural network for generating images from text descriptions can act as a tool for checking the relevance of the developed text alternatives.Materials and methods. The study was carried out in April-May 2023. 17 undergraduate students studied the requirements for developing text alternatives, completed initial text descriptions for three proposed photographs, and then corrected the text using the Kandinsky 2.1 neural network according to the algorithm: generating an image from the description; visual comparison of the resulting image with the original; returning to editing the description or ending the process. Based on the initial and final descriptions, the researchers reconstructed the images using the same neural network. Further work consisted of assessing the quality of all text descriptions and the similarity of all generated images to the original ones. The results of the study (text descriptions; expert evaluations; links to generated images) were published as a data set in the Mendeley Data repository. The t-test, Pearson correlation and multivariate regression were used to analyze the data (at the specified significance level p = 0,05).Results. It was found that the quality scores of the initial and final text descriptions were not significantly different (p > 0,05), and also there were no significant differences for the length of the text (p > 0,05). At the same time, the similarity of the generated images and original photographs after students used the neural network has increased considerably (p < 0,05). Therefore, training in the neural network contributed to improving the quality (similarity to the original) of images generated from modified text descriptions, without losing the descriptions’ quality. It was also shown that the quality of the final text alternatives was higher the larger their size within the allotted limit, the better and shorter the initial descriptions (p < 0,05). Thus, concise and accurate alternative descriptions for images after training students in a neural network can be converted into equally high-quality text alternatives, the relevance of which is increased by adding plot details to the description.Conclusion. Neural networks generating images can be applied as a software tool to encourage potential content authors to create more accurate and complete alternative text while keeping it concise. It seems important to continue the research by extending it to other types of images and using a variety of neural networks.https://openedu.rea.ru/jour/article/view/1008digital accessibilityalternative textneural networkse-learningdigital competencies
spellingShingle Yekaterina A. Kosova
Kirill I. Redkokosh
Pavel O. Mikheyev
Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text
Открытое образование (Москва)
digital accessibility
alternative text
neural networks
e-learning
digital competencies
title Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text
title_full Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text
title_fullStr Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text
title_full_unstemmed Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text
title_short Using A Neural Network to Generate Images When Teaching Students to Develop an Alternative Text
title_sort using a neural network to generate images when teaching students to develop an alternative text
topic digital accessibility
alternative text
neural networks
e-learning
digital competencies
url https://openedu.rea.ru/jour/article/view/1008
work_keys_str_mv AT yekaterinaakosova usinganeuralnetworktogenerateimageswhenteachingstudentstodevelopanalternativetext
AT kirilliredkokosh usinganeuralnetworktogenerateimageswhenteachingstudentstodevelopanalternativetext
AT pavelomikheyev usinganeuralnetworktogenerateimageswhenteachingstudentstodevelopanalternativetext