Beyond Spectrograms: Rethinking Audio Classification from EnCodec’s Latent Space

This paper presents a novel approach to audio classification leveraging the latent representation generated by Meta’s EnCodec neural audio codec. We hypothesize that the compressed latent space representation captures essential audio features more suitable for classification tasks than the tradition...

Bibliographic Details
Main Authors: Jorge Perianez-Pascual, Juan D. Gutiérrez, Laura Escobar-Encinas, Álvaro Rubio-Largo, Roberto Rodriguez-Echeverria
Format: Article
Language: English
Published: MDPI AG 2025-02-01
Series: Algorithms
Online Access: https://www.mdpi.com/1999-4893/18/2/108