A perspective on gender bias in generated text data

Text generation by artificial intelligence became available to a broader public, latterly. This technology is based on machine learning and language models that need to be trained with input data. Many studies have focused on the distinction of human-written text. vs. generated texts but recent stud...

Full description

Saved in:
Bibliographic Details
Main Author: Thomas Hupperich
Format: Article
Language:English
Published: Frontiers Media S.A. 2024-12-01
Series:Frontiers in Human Dynamics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fhumd.2024.1495270/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850243124009369600
author Thomas Hupperich
author_facet Thomas Hupperich
author_sort Thomas Hupperich
collection DOAJ
description Text generation by artificial intelligence became available to a broader public, latterly. This technology is based on machine learning and language models that need to be trained with input data. Many studies have focused on the distinction of human-written text. vs. generated texts but recent studies show that the underlying language models might be prone to reproduce gender bias in their output and, consequently, reinforcing gender roles and imbalances. In this paper, we give a perspective on this topic, considering both the generated text data itself and the machine learning models used for language generation. We present a case study of gender bias in generated text data and review recent literature addressing language models. Our results indicate that researching gender bias in the context of text generation faces significant challenges and that future work needs to overcome a lack of definitions as well as a lack of transparency.
format Article
id doaj-art-c334f400f94a487ea8f82702aa3dcd24
institution OA Journals
issn 2673-2726
language English
publishDate 2024-12-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Human Dynamics
spelling doaj-art-c334f400f94a487ea8f82702aa3dcd242025-08-20T02:00:06ZengFrontiers Media S.A.Frontiers in Human Dynamics2673-27262024-12-01610.3389/fhumd.2024.14952701495270A perspective on gender bias in generated text dataThomas HupperichText generation by artificial intelligence became available to a broader public, latterly. This technology is based on machine learning and language models that need to be trained with input data. Many studies have focused on the distinction of human-written text. vs. generated texts but recent studies show that the underlying language models might be prone to reproduce gender bias in their output and, consequently, reinforcing gender roles and imbalances. In this paper, we give a perspective on this topic, considering both the generated text data itself and the machine learning models used for language generation. We present a case study of gender bias in generated text data and review recent literature addressing language models. Our results indicate that researching gender bias in the context of text generation faces significant challenges and that future work needs to overcome a lack of definitions as well as a lack of transparency.https://www.frontiersin.org/articles/10.3389/fhumd.2024.1495270/fullgender biasgenerative artificial intelligencetext generationlanguage modelsmachine learning
spellingShingle Thomas Hupperich
A perspective on gender bias in generated text data
Frontiers in Human Dynamics
gender bias
generative artificial intelligence
text generation
language models
machine learning
title A perspective on gender bias in generated text data
title_full A perspective on gender bias in generated text data
title_fullStr A perspective on gender bias in generated text data
title_full_unstemmed A perspective on gender bias in generated text data
title_short A perspective on gender bias in generated text data
title_sort perspective on gender bias in generated text data
topic gender bias
generative artificial intelligence
text generation
language models
machine learning
url https://www.frontiersin.org/articles/10.3389/fhumd.2024.1495270/full
work_keys_str_mv AT thomashupperich aperspectiveongenderbiasingeneratedtextdata
AT thomashupperich perspectiveongenderbiasingeneratedtextdata