Enhancing Speaker Recognition with CRET Model: a fusion of CONV2D, RESNET and ECAPA-TDNN

Abstract In today’s society, speaker recognition plays an increasingly important role. Currently, neural networks are widely employed for extracting speaker features. Although the Emphasized Channel Attention, Propagation, and Aggregation in Time Delay Neural Network (ECAPA-TDNN) model can obtain te...

Full description

Saved in:
Bibliographic Details
Main Authors: Pinyan Li, Lap Man Hoi, Yapeng Wang, Xu Yang, Sio Kei Im
Format: Article
Language:English
Published: SpringerOpen 2025-02-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:https://doi.org/10.1186/s13636-025-00396-4
Tags: Add Tag
No Tags, Be the first to tag this record!