Enhancing Speaker Recognition with CRET Model: a fusion of CONV2D, RESNET and ECAPA-TDNN
Abstract In today’s society, speaker recognition plays an increasingly important role. Currently, neural networks are widely employed for extracting speaker features. Although the Emphasized Channel Attention, Propagation, and Aggregation in Time Delay Neural Network (ECAPA-TDNN) model can obtain te...
Saved in:
| Main Authors: | Pinyan Li, Lap Man Hoi, Yapeng Wang, Xu Yang, Sio Kei Im |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
SpringerOpen
2025-02-01
|
| Series: | EURASIP Journal on Audio, Speech, and Music Processing |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s13636-025-00396-4 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Spoof speech classification using deep speaker embeddings and machine learning models
by: Mohammed Hamzah Alsalihi, et al.
Published: (2025-09-01) -
Feature Integration Strategies for Neural Speaker Diarization in Conversational Telephone Speech
by: Juan Ignacio Alvarez-Trejos, et al.
Published: (2025-04-01) -
Enhanced Localisation and Handwritten Digit Recognition Using ConvCARU
by: Sio-Kei Im, et al.
Published: (2025-06-01) -
Microseismic moment tensor inversion based on ResNet model
by: Jiaqi Yan, et al.
Published: (2025-06-01) -
Improved ResNet algorithm based intelligent interference identification
by: Jian MA, et al.
Published: (2022-10-01)