Transformer-based language-independent gender recognition in noisy audio environments

Abstract This study proposes a language-independent method for identifying a speaker's gender from an audio clip recorded in a noisy environment. Each audio clip is processed in two different ways: one as a Mel-Spectrogram and the other using the Wav2Vec2 acoustic model emission, examining the...

Bibliographic Details
Main Authors:	Or Haim Anidjar, Roi Yozevitch
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-04-01
Series:	Scientific Reports
Subjects:	Automatic speech recognition; Wav2Vec 2.0; Language independent gender recognition
Online Access:	https://doi.org/10.1038/s41598-025-99011-x

Similar Items

Wav2Lip Bridges Communication Gap: Automating Lip Sync and Language Translation for Indian Languages
by: Vaishnavi Venkataraghavan, et al.
Published: (2025-01-01)

Advancing Spanish Speech Emotion Recognition: A Comprehensive Benchmark of Pre-Trained Models
by: Alex Mares, et al.
Published: (2025-04-01)

Depression recognition using voice-based pre-training model
by: Xiangsheng Huang, et al.
Published: (2024-06-01)

w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training
by: Orlem Lima Dos Santos, et al.
Published: (2024-01-01)

Optimasi Teknologi WAV2Vec 2.0 menggunakan Spectral Masking untuk meningkatkan Kualitas Transkripsi Teks Video bagi Tuna Rungu
by: ACHMAD NOERCHOLIS, et al.
Published: (2024-12-01)

Empathetic Deep Learning: Transferring Adult Speech Emotion Models to Children With Gender-Specific Adaptations Using Neural Embeddings
by: Elina Lesyk, et al.
Published: (2024-12-01)

Comparative performance analysis of end-to-end ASR models on Indo-Aryan and Dravidian languages within India’s linguistic landscape
by: Palash Jain, et al.
Published: (2025-02-01)

Speech-Based Parkinson’s Detection Using Pre-Trained Self-Supervised Automatic Speech Recognition (ASR) Models and Supervised Contrastive Learning
by: Hadi Sedigh Malekroodi, et al.
Published: (2025-07-01)

Advanced Identification of Prosodic Boundaries, Speakers, and Accents Through Multi-Task Audio Pre-Processing and Speech Language Models
by: Francisco Javier Lima Florido, et al.
Published: (2025-03-01)

Development of a Baby Cry Identification System Using a Raspberry Pi-Based Embedded System and Machine Learning
by: Mohcin Mekhfioui, et al.
Published: (2025-03-01)

CLFormer: a cross-lingual transformer framework for temporal forgery localization
by: Haonan Cheng, et al.
Published: (2025-07-01)

Study of Audio Frequency Big Data Processing Architecture and Key Technology
by: Zhen Yang, et al.
Published: (2013-11-01)

Recent advancements in automatic disordered speech recognition: A survey paper
by: Nada Gohider, et al.
Published: (2024-12-01)

Konuşma Tanıma için İnsan-makine Karşılaştırması
by: Ayşe Gürel, et al.
Published: (2008-07-01)

Project Euphonia: advancing inclusive speech recognition through expanded data collection and evaluation
by: Alicia Martin, et al.
Published: (2025-06-01)

A Review on Language-Independent Search on Speech and its Applications
by: Sushil Venkatesh Kulkarni, et al.
Published: (2024-01-01)

Machine Learning and Deep Learning Approaches for Accent Recognition: A Review
by: Muzaffar Ahmad Dar, et al.
Published: (2025-01-01)

Suggested Method for Audio File Steganography
by: Saja Mohammed, et al.
Published: (2013-03-01)

Whispered Speech Recognition Based on Audio Data Augmentation and Inverse Filtering
by: Jovan Galić, et al.
Published: (2024-09-01)

Ghost in the Radio: An Audio Adversarial Attack Using Environmental Noise Through Radio
by: Hyeongjun Choi, et al.
Published: (2024-01-01)

Refining maritime Automatic Speech Recognition by leveraging synthetic speech
by: Christoph Martius, et al.
Published: (2024-12-01)

Small Language Models for Speech Emotion Recognition in Text and Audio Modalities
by: José L. Gómez-Sirvent, et al.
Published: (2025-07-01)

Automatic Test Environment of Speech Recognition Software for Intelligent Display
by: ZHONG Li, et al.
Published: (2020-01-01)

Travel Frequent-Route Identification Based on the Snake Algorithm Using License Plate Recognition Data
by: Feiyang Liu, et al.
Published: (2025-08-01)

AFT-SAM: Adaptive Fusion Transformer with a Sparse Attention Mechanism for Audio–Visual Speech Recognition
by: Na Che, et al.
Published: (2024-12-01)

Research development and forecast of automatic speech recognition technologies
by: Haikun WANG, et al.
Published: (2018-02-01)

An Automatic Tagging System Focused on Digital Resources
by: LEI Zhiwen, et al.
Published: (2020-06-01)

On-Device Automatic Speech Recognition for Low-Resource Languages in Mixed Reality Industrial Metaverse Applications: Practical Guidelines and Evaluation of a Shipbuilding Application in Galician
by: Anton Valladares-Poncela, et al.
Published: (2025-01-01)

A comparative study of deep End-to-End Automatic Speech Recognition models for doctor-patient conversations in Polish in a real-life acoustic environment
by: Karolina Pondel-Sycz, et al.
Published: (2025-07-01)

Automatic Speech Recognition: A survey of deep learning techniques and approaches
by: Harsh Ahlawat, et al.
Published: (2025-12-01)

Emotion Recognition from Speech in a Subject-Independent Approach
by: Andrzej Majkowski, et al.
Published: (2025-06-01)

Bridging language gaps: The role of NLP and speech recognition in oral English instruction
by: Parul Dubey, et al.
Published: (2025-06-01)

Building a Gender-Bias-Resistant Super Corpus as a Deep Learning Baseline for Speech Emotion Recognition
by: Babak Abbaschian, et al.
Published: (2025-03-01)

Multichannel speech enhancement for automatic speech recognition: a literature review
by: Zubair Zaland, et al.
Published: (2025-03-01)

An Algorithm for Parameters Estimation of Autoregressive Model of Basic Speech Units
by: I. V. Gubochkin
Published: (2013-04-01)

A Self-Evaluated Bilingual Automatic Speech Recognition System for Mandarin–English Mixed Conversations
by: Xinhe Hai, et al.
Published: (2025-07-01)

Application and Development of Automatic Speech Recognition in Vehicle Field
by: LIU Yue, et al.
Published: (2019-01-01)

Deep Learning Based Automatic Speech Recognition for Turkish
by: Hamit Erdem, et al.
Published: (2020-08-01)

SVIT‐SSR: A sEMG‐based vision transformer approach for silent speech recognition
by: Zhao Li, et al.
Published: (2024-11-01)

Automatic Speech Recognition Errors Detection And Correction: A Review
by: Rahhal Errattahi, et al.
Published: (2016-05-01)