Exploring the Impact of Image-Based Audio Representations in Classification Tasks Using Vision Transformers and Explainable AI Techniques

An important hurdle in medical diagnostics is the high-quality and interpretable classification of audio signals. In this study, we present an image-based representation of infant crying audio files to predict abnormal infant cries using a vision transformer and also show significant improvements in...

Full description

Saved in:
Bibliographic Details
Main Authors: Sari Masri, Ahmad Hasasneh, Mohammad Tami, Chakib Tadj
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/15/12/751
Tags: Add Tag
No Tags, Be the first to tag this record!