Human voices communicating trustworthy intent: A demographically diverse speech audio dataset

Abstract The multi-disciplinary field of voice perception and trustworthiness lacks accessible and diverse speech audio datasets representing diverse speaker demographics, including age, ethnicity, and sex. Existing datasets primarily feature white, younger adult speakers, limiting generalisability....

Full description

Saved in:

Bibliographic Details
Main Authors:	Constantina Maltezou-Papastylianou, Reinhold Scherer, Silke Paulmann
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-05-01
Series:	Scientific Data
Online Access:	https://doi.org/10.1038/s41597-025-05267-3
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Abstract The multi-disciplinary field of voice perception and trustworthiness lacks accessible and diverse speech audio datasets representing diverse speaker demographics, including age, ethnicity, and sex. Existing datasets primarily feature white, younger adult speakers, limiting generalisability. This paper introduces a novel open-access speech audio dataset with 1,152 utterances from 96 untrained speakers, across white, black and south Asian backgrounds, divided into younger (N = 60, ages 18–45) and older (N = 36, ages 60+) adults. Each speaker recorded both, their natural speech patterns (i.e. “neutral” or no intent), and their attempt to convey their trustworthy intent as they perceive it during speech production. Our dataset is described and evaluated through classification methods between neutral and trustworthy speech. Specifically, extracted acoustic and voice quality features were analysed using linear and non-linear classification models, achieving accuracies of around 70%. This dataset aims to close a crucial gap in the existing literature and provide additional research opportunities that can contribute to the generalisability and applicability of future research results in this field.
ISSN:	2052-4463

Human voices communicating trustworthy intent: A demographically diverse speech audio dataset

Similar Items