Transformer-based language-independent gender recognition in noisy audio environments
Abstract This study proposes a language-independent method for identifying a speaker's gender from an audio clip recorded in a noisy environment. Each audio clip is processed in two different ways: as a Mel-Spectrogram and as the emission of the Wav2Vec2 acoustic model, examining the...
| Main Authors: | Or Haim Anidjar, Roi Yozevitch |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-04-01 |
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-99011-x |
Similar Items
- Wav2Lip Bridges Communication Gap: Automating Lip Sync and Language Translation for Indian Languages
  by: Vaishnavi Venkataraghavan, et al.
  Published: (2025-01-01)
- Advancing Spanish Speech Emotion Recognition: A Comprehensive Benchmark of Pre-Trained Models
  by: Alex Mares, et al.
  Published: (2025-04-01)
- Depression recognition using voice-based pre-training model
  by: Xiangsheng Huang, et al.
  Published: (2024-06-01)
- w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training
  by: Orlem Lima Dos Santos, et al.
  Published: (2024-01-01)
- Optimasi Teknologi WAV2Vec 2.0 menggunakan Spectral Masking untuk meningkatkan Kualitas Transkripsi Teks Video bagi Tuna Rungu
  by: ACHMAD NOERCHOLIS, et al.
  Published: (2024-12-01)