A Unified Approach to Voice Classification: Leveraging Spectrograms, Mel Spectrograms, and Statistical Features

A Unified Approach to Voice Classification: Leveraging Spectrograms, Mel Spectrograms, and Statistical Features

This study presents a multi-input neural network architecture for voice classification that integrates two parallel convolutional neural networks (CNNs) for spectrogram and Mel spectrogram images, along with a fully connected dense network for six handpicked numerical statistical features from time...

Full description

Saved in:

Bibliographic Details
Main Authors:	Muhammad Talha, Huma Ghafoor, Seung Yeob Nam
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Voice classification convolutional neural network (CNN) Mel spectrogram spectrogram statistical features
Online Access:	https://ieeexplore.ieee.org/document/11098792/
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Detection of Abnormal Symptoms Using Acoustic-Spectrogram-Based Deep Learning
by: Seong-Yoon Kim, et al.
Published: (2025-04-01)

Research on automatic assessment of the severity of unilateral vocal cord paralysis based on Mel-spectrogram and convolutional neural networks
by: Shuaichi Ma, et al.
Published: (2025-06-01)

Audio copy-move forgery detection with decreasing convolutional kernel neural network and spectrogram fusion
by: Canghong Shi, et al.
Published: (2025-07-01)

Identification of Environmental Noise Traces in Seismic Recordings Using Vision Transformer and Mel-Spectrogram
by: Qianlong Ding, et al.
Published: (2025-08-01)

Fault Diagnosis of Wind Turbine Gearbox Based on Mel Spectrogram and Improved ResNeXt50 Model
by: Xiaojuan Zhang, et al.
Published: (2025-08-01)

Research on Speech Enhancement Translation and Mel-Spectrogram Mapping Method for the Deaf Based on Pix2PixGANs
by: Shaoting Zeng, et al.
Published: (2025-01-01)

Identification of Elephant Rumbles in Seismic Infrasonic Signals Using Spectrogram-Based Machine Learning
by: Janitha Vidunath, et al.
Published: (2024-11-01)

Underwater Acoustic Signal LOFAR Spectrogram Denoising Based on Enhanced Simulation
by: Tianxiang He, et al.
Published: (2024-11-01)

Retrospective Frailty Assessment in Older Adults Using Inertial Measurement Unit-Based Deep Learning on Gait Spectrograms
by: Julius Griškevičius, et al.
Published: (2025-05-01)

Snoring Sound Recognition Using Multi-Channel Spectrograms
by: Ziqiang YE, et al.
Published: (2024-01-01)

Voice-AttentionNet: Voice-Based Multi-Disease Detection with Lightweight Attention-Based Temporal Convolutional Neural Network
by: Jintao Wang, et al.
Published: (2025-03-01)

A Comparative Study of Deep Audio Models for Spectrogram- and Waveform-Based SingFake Detection
by: Minh Nguyen-Duc, et al.
Published: (2025-01-01)

Developing a multi-variate prediction model for COVID-19 from crowd-sourced respiratory voice data
by: Yuyang Yan, et al.
Published: (2024-08-01)

Research on Pantograph Defect Classification Based on Vibration Signals
by: Vytautas Gargasas, et al.
Published: (2024-12-01)

Detection of Epilepsy Disorder Using Spectrogram Images Generated From Brain EEG Signals
by: Venkatesh Bhandage, et al.
Published: (2024-01-01)

Analysis and Research on Spectrogram-Based Emotional Speech Signal Augmentation Algorithm
by: Huawei Tao, et al.
Published: (2025-06-01)

Helium Speech Recognition Method Based on Spectrogram with Deep Learning
by: Yonghong Chen, et al.
Published: (2025-05-01)

Аўтаматызацыя аналізу галасавых сігналаў птушак
by: Y. S. Hetsevich, et al.
Published: (2024-12-01)

A Transfer Learning-Based Framework for Enhanced Classification of Perceived Mental Stress Using EEG Spectrograms
by: Sheharyar Khan, et al.
Published: (2025-01-01)

A Fault Detection Framework for Rotating Machinery with a Spectrogram and Convolutional Autoencoder
by: Hoyeon Lee, et al.
Published: (2025-07-01)

Changes in Voice Quality after a Pure Tone Stimulation (PTS) Program
by: Lady Catherine Cantor-Cutiva, et al.
Published: (2024-11-01)

Differentiability of voice disorders through explainable AI
by: Fatma Özcan
Published: (2025-05-01)

Simultaneous EEG-fNIRS Data Classification Through Selective Channel Representation and Spectrogram Imaging
by: Chayut Bunterngchit, et al.
Published: (2024-01-01)

Speech Emotion Recognition on MELD and RAVDESS Datasets Using CNN
by: Gheed T. Waleed, et al.
Published: (2025-06-01)

Interpreting CNN models for musical instrument recognition using multi-spectrogram heatmap analysis: a preliminary study
by: Rujia Chen, et al.
Published: (2024-12-01)

Recognition of Sheep Feeding Behavior in Sheepfolds Using Fusion Spectrogram Depth Features and Acoustic Features
by: Youxin Yu, et al.
Published: (2024-11-01)

Hearing vocals to recognize schizophrenia: speech discriminant analysis with fusion of emotions and features based on deep learning
by: Jie Huang, et al.
Published: (2025-05-01)

Spectrogram Zeros Method for Rolling Bearing Fault Diagnosis Under Variable Rotating Speeds
by: Trong-Du Nguyen, et al.
Published: (2025-01-01)

CochleaSpecNet: An Attention-Based Dual Branch Hybrid CNN-GRU Network for Speech Emotion Recognition Using Cochleagram and Spectrogram
by: Atkia Anika Namey, et al.
Published: (2024-01-01)

Development and evaluation of machine learning models for premixed flame classification in different hydrogen-natural gas proportions using images and audio
by: Pedro Narvaez, et al.
Published: (2025-09-01)

Voice authentication module using mel-cepstral coefficients
by: D. A. Elizarov, et al.
Published: (2024-07-01)

Research on Super-resolution Methods for Radar Targets Based on Bat-inspired Spectrogram Correlation and Transformation Models
by: Bohong WANG, et al.
Published: (2025-04-01)

SpectroFusionNet a CNN approach utilizing spectrogram fusion for electric guitar play recognition
by: Ganesh Kumar Chellamani, et al.
Published: (2025-05-01)

Enhancing the Accuracy of Image Classification for Degenerative Brain Diseases with CNN Ensemble Models Using Mel-Spectrograms
by: Sang-Ha Sung, et al.
Published: (2025-06-01)

Classification of Dyslexia Among School Students Using Deep Learning
by: Alia Hussein, et al.
Published: (2024-03-01)

Deep Learning-Enhanced Spectrogram Analysis for Anatomical Region Classification in Biomedical Signals
by: Abdul Karim, et al.
Published: (2025-05-01)

Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition
by: Олеся Барковська, et al.
Published: (2023-12-01)

Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition
by: Olesia Barkovska, et al.
Published: (2023-12-01)

Enhancing parkinson disease detection through feature based deep learning with autoencoders and neural networks
by: P. Valarmathi, et al.
Published: (2025-03-01)

High-Quality Text-to-Speech Implementation via Active Shallow Diffusion Mechanism
by: Junlin Deng, et al.
Published: (2025-01-01)