A curated crowdsourced dataset of Luganda and Swahili speech for text-to-speech synthesisMendeley Data

A curated crowdsourced dataset of Luganda and Swahili speech for text-to-speech synthesisMendeley Data

This data article describes a curated, crowdsourced speech dataset in Luganda and Kiswahili, created to support text-to-speech (TTS) development in low-resource settings. The dataset is derived from Mozilla’s Common Voice corpus and includes only validated utterances from female speakers. A multi-st...

Full description

Saved in:

Bibliographic Details
Main Authors:	Andrew Katumba, Sulaiman Kagumire, Joyce Nakatumba-Nabende, John Quinn, Sudi Murindanyi
Format:	Article
Language:	English
Published:	Elsevier 2025-10-01
Series:	Data in Brief
Subjects:	Speech dataset Text-to-speech Low-resource languages Luganda Kiswahili
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352340925006390
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Building Text‐to‐Speech Models for Low‐Resourced Languages From Crowdsourced Data
by: Andrew Katumba, et al.
Published: (2025-04-01)

Final vowel shortening in Luganda
by: Larry M. Hyman, et al.
Published: (1990-04-01)

Using casual speech phonology in synthetic speech
by: Linda SHOCKEY
Published: (2014-04-01)

NADIEM MAKARIM’S FIRST SPEECH AS THE MINISTER OF INDONESIA EDUCATION AND CULTURE: SPEECH ACT ANALYSIS
by: Intan Siti Nugraha, et al.
Published: (2022-04-01)

Lombard Effect in Polish Speech and its Comparison in English Speech
by: Piotr KLECZKOWSKI, et al.
Published: (2017-11-01)

Semantic component of speech development of older preschoolers in the process of speech education
by: Yuri A. Kochetkov, et al.
Published: (2025-02-01)

Perception of vocoded speech in domestic dogs
by: Amritha Mallikarjun, et al.
Published: (2024-04-01)

ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages
by: Luan Thanh Nguyen, et al.
Published: (2025-01-01)

The Influence of the Semantic Material on the Assessment of Speech Reception Threshold
by: Magdalena KRENZ, et al.
Published: (2015-01-01)

THE ONTOGENESIS OF SPEECH DEVELOPMENT
by: T. E. Braudo, et al.
Published: (2017-04-01)

Quality assessment of synthetic speech
by: Stefan Brachmański, et al.
Published: (2025-07-01)

Pedagogy of live speech
by: Magdalena Ostolska
Published: (2024-06-01)

Multi‐stage attention network for monaural speech enhancement
by: Kunpeng Wang, et al.
Published: (2023-03-01)

Assessing costa rican children speech recognition by humans and machines
by: Maribel Morales-Rodríguez, et al.
Published: (2022-11-01)

Speech Emotion Recognition: Humans vs Machines
by: S. Werner, et al.
Published: (2019-12-01)

Automation of subjective measurements of speech intelligibility in analogue telecommunication channels
by: Stefan BRACHMAŃSKI
Published: (2008-01-01)

SPEECH DISORDERS AND DIFFICULTIES WITH READING AND WRITING
by: Beata Wołosiuk
Published: (2019-07-01)

The role and use of speech gestures in discourse
by: Nick CAMPBELL
Published: (2014-04-01)

Development of speech material for an Armenian speech recognition threshold test
by: Sona Sargsyan, et al.
Published: (2021-09-01)

Automatic understanding of acoustic speech signal pathology
by: Wiesław WSZOŁEK
Published: (2014-04-01)

Selected methods of pathological speech signal analysis
by: W. WSZOŁEK
Published: (2014-04-01)

Speech Delay Assistive Device for Speech-to-Text Transcription Based on Machine Learning
by: Maria Kristina C. Rodriguez, et al.
Published: (2025-05-01)

Study of effects of surgical treatment in the larynx area on the speech signal
by: Wiesław WSZOŁEK, et al.
Published: (2008-01-01)

Acoustic analysis of esophageal speech in patients after total laryngectomy
by: Wiesław Wszołek, et al.
Published: (2014-01-01)

Improving Speech Recognition Rate through Analysis Parameters
by: Eringis Deividas, et al.
Published: (2014-05-01)

Ways of Transferring the Internal Speech of Characters: Psycholinguistic Projection
by: Людмила Шитик, et al.
Published: (2020-04-01)

Objective Measure for Assessment of Speech Quality in Rooms
by: Stefan BRACHMAŃSKI
Published: (2008-12-01)

The Study of Spontaneity and Preparedness of Oral Speech in Forensic Linguistics: To the Formulation of the Problem
by: T. V. Berdnikova
Published: (2024-01-01)

Gender and Speech Dısfluency Productıon: a Psycholınguıstıc Analysıs on Turkısh Speakers
by: Ayşe Altıparmak, et al.
Published: (2018-10-01)

Segmentation of speech on phonetic elements for systems of speech information protection
by: Y. N. Seitkulov, et al.
Published: (2019-07-01)

Assessment of the Speech Material Usability for Forensic Speaker Identification by Voice and Sounding Speech
by: T. N. Svirava, et al.
Published: (2025-04-01)

Recent advancements in automatic disordered speech recognition: A survey paper
by: Nada Gohider, et al.
Published: (2024-12-01)

Refined analysis of the Speech-to-Speech Synchronization task reveals subharmonic synchronization
by: Simon Bross, et al.
Published: (2025-07-01)

The impact of interpreting students’ gestures and speech content on speech fluency of consecutive interpreting
by: Qiuya Zhang, et al.
Published: (2025-05-01)

“The Problem of Speech in Merleau-Ponty: My View of ‘Speaking Speech’ and ‘Spoken Speech’ in Light of Ontogenesis”
by: Rajiv Kaushik
Published: (2025-04-01)

Formation of speech culture of primary schoolchildren by means of speech metaphoricity
by: Fatima Kaplanovna Urakova, et al.
Published: (2023-03-01)

Relationship Between Chinese Speech Intelligibility of Elderly and Speech Transmission Index
by: Jianxin PENG, et al.
Published: (2021-06-01)

HATE SPEECH ON MAUDY AYUNDA’S INSTAGRAM POST
by: Cut Novita Srikandi, et al.
Published: (2024-03-01)

CONDITIONS FOR SUCCESS IN APOLOGY SPEECH GENRE
by: Oleksandra M. Shumiatska
Published: (2021-12-01)

Phonological processes in English connected speech: implications for L2 speech learning and communication
by: Sonthaya Rattanasak
Published: (2025-12-01)