Filamentary Convolution for SLI: A Brain-Inspired Approach with High Efficiency

Spoken language identification (SLI) relies on detecting key frequency characteristics like pitch, tone, and rhythm. While the short-time Fourier transform (STFT) generates time–frequency acoustic features (TFAF) for deep learning networks (DLNs), rectangular convolution kernels cause frequency mixi...

Full description

Saved in:
Bibliographic Details
Main Authors: Boyuan Zhang, Xibang Yang, Tong Xie, Shuyuan Zhu, Bing Zeng
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/25/10/3085
Tags: Add Tag
No Tags, Be the first to tag this record!