A large annotated dataset of vocalizations by common marmosets

Abstract Non-human primates, our closest relatives, use a wide range of complex vocal signals for communication within their species. Previous research on marmoset (Callithrix jacchus) vocalizations has been limited by sampling rates not covering the whole hearing range and insufficient labeling for...

Full description

Saved in:
Bibliographic Details
Main Authors: Charly Lamothe, Manon Obliger-Debouche, Paul Best, Régis Trapeau, Sabrina Ravel, Thierry Artières, Ricard Marxer, Pascal Belin
Format: Article
Language:English
Published: Nature Portfolio 2025-05-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-04951-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850132994405171200
author Charly Lamothe
Manon Obliger-Debouche
Paul Best
Régis Trapeau
Sabrina Ravel
Thierry Artières
Ricard Marxer
Pascal Belin
author_facet Charly Lamothe
Manon Obliger-Debouche
Paul Best
Régis Trapeau
Sabrina Ravel
Thierry Artières
Ricard Marxer
Pascal Belin
author_sort Charly Lamothe
collection DOAJ
description Abstract Non-human primates, our closest relatives, use a wide range of complex vocal signals for communication within their species. Previous research on marmoset (Callithrix jacchus) vocalizations has been limited by sampling rates not covering the whole hearing range and insufficient labeling for advanced analyses using Deep Neural Networks (DNNs). Here, we provide a database of common marmoset vocalizations, which were continuously recorded with a sampling rate of 96 kHz from an animal holding facility housing simultaneously ~20 marmosets in three cages. The dataset comprises more than 800,000 files, amounting to 253 hours of data collected over 40 months. Each recording lasts a few seconds and captures the marmosets’ social vocalizations, encompassing their entire known vocal repertoire during the experimental period. Around 215,000 calls are annotated with the vocalization type. We offer a trained classifier to assist future investigations. Finally, we validated our dataset by sampling 700 representative recordings and cross-examining them with four experts.
format Article
id doaj-art-4cb2602fd9da44e8b7c5a5c4f62dacc7
institution OA Journals
issn 2052-4463
language English
publishDate 2025-05-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-4cb2602fd9da44e8b7c5a5c4f62dacc72025-08-20T02:32:05ZengNature PortfolioScientific Data2052-44632025-05-011211910.1038/s41597-025-04951-8A large annotated dataset of vocalizations by common marmosetsCharly Lamothe0Manon Obliger-Debouche1Paul Best2Régis Trapeau3Sabrina Ravel4Thierry Artières5Ricard Marxer6Pascal Belin7La Timone Neuroscience Institute UMR 7289, CNRS, Aix-Marseille UniversityLa Timone Neuroscience Institute UMR 7289, CNRS, Aix-Marseille UniversityLaboratoire d’Informatique et Systèmes UMR 7020, CNRS, Aix-Marseille UniversityLa Timone Neuroscience Institute UMR 7289, CNRS, Aix-Marseille UniversityLa Timone Neuroscience Institute UMR 7289, CNRS, Aix-Marseille UniversityLaboratoire d’Informatique et Systèmes UMR 7020, CNRS, Aix-Marseille UniversityLaboratoire d’Informatique et Systèmes UMR 7020, CNRS, Aix-Marseille UniversityLa Timone Neuroscience Institute UMR 7289, CNRS, Aix-Marseille UniversityAbstract Non-human primates, our closest relatives, use a wide range of complex vocal signals for communication within their species. Previous research on marmoset (Callithrix jacchus) vocalizations has been limited by sampling rates not covering the whole hearing range and insufficient labeling for advanced analyses using Deep Neural Networks (DNNs). Here, we provide a database of common marmoset vocalizations, which were continuously recorded with a sampling rate of 96 kHz from an animal holding facility housing simultaneously ~20 marmosets in three cages. The dataset comprises more than 800,000 files, amounting to 253 hours of data collected over 40 months. Each recording lasts a few seconds and captures the marmosets’ social vocalizations, encompassing their entire known vocal repertoire during the experimental period. Around 215,000 calls are annotated with the vocalization type. We offer a trained classifier to assist future investigations. Finally, we validated our dataset by sampling 700 representative recordings and cross-examining them with four experts.https://doi.org/10.1038/s41597-025-04951-8
spellingShingle Charly Lamothe
Manon Obliger-Debouche
Paul Best
Régis Trapeau
Sabrina Ravel
Thierry Artières
Ricard Marxer
Pascal Belin
A large annotated dataset of vocalizations by common marmosets
Scientific Data
title A large annotated dataset of vocalizations by common marmosets
title_full A large annotated dataset of vocalizations by common marmosets
title_fullStr A large annotated dataset of vocalizations by common marmosets
title_full_unstemmed A large annotated dataset of vocalizations by common marmosets
title_short A large annotated dataset of vocalizations by common marmosets
title_sort large annotated dataset of vocalizations by common marmosets
url https://doi.org/10.1038/s41597-025-04951-8
work_keys_str_mv AT charlylamothe alargeannotateddatasetofvocalizationsbycommonmarmosets
AT manonobligerdebouche alargeannotateddatasetofvocalizationsbycommonmarmosets
AT paulbest alargeannotateddatasetofvocalizationsbycommonmarmosets
AT registrapeau alargeannotateddatasetofvocalizationsbycommonmarmosets
AT sabrinaravel alargeannotateddatasetofvocalizationsbycommonmarmosets
AT thierryartieres alargeannotateddatasetofvocalizationsbycommonmarmosets
AT ricardmarxer alargeannotateddatasetofvocalizationsbycommonmarmosets
AT pascalbelin alargeannotateddatasetofvocalizationsbycommonmarmosets
AT charlylamothe largeannotateddatasetofvocalizationsbycommonmarmosets
AT manonobligerdebouche largeannotateddatasetofvocalizationsbycommonmarmosets
AT paulbest largeannotateddatasetofvocalizationsbycommonmarmosets
AT registrapeau largeannotateddatasetofvocalizationsbycommonmarmosets
AT sabrinaravel largeannotateddatasetofvocalizationsbycommonmarmosets
AT thierryartieres largeannotateddatasetofvocalizationsbycommonmarmosets
AT ricardmarxer largeannotateddatasetofvocalizationsbycommonmarmosets
AT pascalbelin largeannotateddatasetofvocalizationsbycommonmarmosets