TDNN achitecture with efficient channel attention and improved residual blocks for accurate speaker recognition

Abstract In recent years, with the advancement of deep learning, Convolutional Neural Networks (CNNs) have been widely applied in speaker recognition, making CNN-based speaker embedding learning the predominant method for speaker verification. Time Delay Neural Networks (TDNN) have achieved notable...

Full description

Saved in:
Bibliographic Details
Main Authors: Wenzao Li, Sai Yao, Bing Wan, Linsong Xiao, Chengyu Hou, Yanchuan Zhong, Wengang Zhou
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-09386-0
Tags: Add Tag
No Tags, Be the first to tag this record!