Feature Integration Strategies for Neural Speaker Diarization in Conversational Telephone Speech
This paper addresses the challenge of optimizing end-to-end neural diarization systems for conversational telephone speech, focusing on diverse acoustic features beyond traditional Mel-filterbanks. We present a methodological framework for integrating and analyzing different feature types as input t...
Saved in:
| Main Authors: | Juan Ignacio Alvarez-Trejos, Alicia Lozano-Diez, Daniel Ramos |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-04-01
|
| Series: | Applied Sciences |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/15/9/4842 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Spoof speech classification using deep speaker embeddings and machine learning models
by: Mohammed Hamzah Alsalihi, et al.
Published: (2025-09-01) -
Enhancing Speaker Recognition with CRET Model: a fusion of CONV2D, RESNET and ECAPA-TDNN
by: Pinyan Li, et al.
Published: (2025-02-01) -
Application of Fischer semi discriminant analysis for speaker diarization in costa rican radio broadcasts
by: Roberto Sánchez Cárdenas, et al.
Published: (2022-11-01) -
Assessing the effectiveness of diarization algorithms in costa rican children-adult speech according to age group and gender
by: Alejandro Chacón-Vargas, et al.
Published: (2022-11-01) -
An enhanced deep learning approach for speaker diarization using TitaNet, MarbelNet and time delay network
by: Muzamil Ahmed, et al.
Published: (2025-07-01)