A curated crowdsourced dataset of Luganda and Swahili speech for text-to-speech synthesis
Mendeley Data
This data article describes a curated, crowdsourced speech dataset in Luganda and Kiswahili, created to support text-to-speech (TTS) development in low-resource settings. The dataset is derived from Mozilla’s Common Voice corpus and includes only validated utterances from female speakers. A multi-st...
| Main Authors: | Andrew Katumba, Sulaiman Kagumire, Joyce Nakatumba-Nabende, John Quinn, Sudi Murindanyi |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Elsevier, 2025-10-01 |
| Series: | Data in Brief |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340925006390 |
Similar Items
- Building Text‐to‐Speech Models for Low‐Resourced Languages From Crowdsourced Data
  by: Andrew Katumba, et al.
  Published: (2025-04-01)
- Final vowel shortening in Luganda
  by: Larry M. Hyman, et al.
  Published: (1990-04-01)
- Using casual speech phonology in synthetic speech
  by: Linda SHOCKEY
  Published: (2014-04-01)
- NADIEM MAKARIM’S FIRST SPEECH AS THE MINISTER OF INDONESIA EDUCATION AND CULTURE: SPEECH ACT ANALYSIS
  by: Intan Siti Nugraha, et al.
  Published: (2022-04-01)
- Lombard Effect in Polish Speech and its Comparison in English Speech
  by: Piotr KLECZKOWSKI, et al.
  Published: (2017-11-01)