A Unified Approach to Voice Classification: Leveraging Spectrograms, Mel Spectrograms, and Statistical Features
This study presents a multi-input neural network architecture for voice classification that integrates two parallel convolutional neural networks (CNNs) for spectrogram and Mel spectrogram images, along with a fully connected dense network for six handpicked numerical statistical features from time...
Saved in:
| Main Authors: | Muhammad Talha, Huma Ghafoor, Seung Yeob Nam |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11098792/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Detection of Abnormal Symptoms Using Acoustic-Spectrogram-Based Deep Learning
by: Seong-Yoon Kim, et al.
Published: (2025-04-01) -
Research on automatic assessment of the severity of unilateral vocal cord paralysis based on Mel-spectrogram and convolutional neural networks
by: Shuaichi Ma, et al.
Published: (2025-06-01) -
Audio copy-move forgery detection with decreasing convolutional kernel neural network and spectrogram fusion
by: Canghong Shi, et al.
Published: (2025-07-01) -
Identification of Environmental Noise Traces in Seismic Recordings Using Vision Transformer and Mel-Spectrogram
by: Qianlong Ding, et al.
Published: (2025-08-01) -
Fault Diagnosis of Wind Turbine Gearbox Based on Mel Spectrogram and Improved ResNeXt50 Model
by: Xiaojuan Zhang, et al.
Published: (2025-08-01)