Exploring the Impact of Image-Based Audio Representations in Classification Tasks Using Vision Transformers and Explainable AI Techniques
An important hurdle in medical diagnostics is the high-quality and interpretable classification of audio signals. In this study, we present an image-based representation of infant crying audio files to predict abnormal infant cries using a vision transformer and also show significant improvements in...
Saved in:
| Main Authors: | Sari Masri, Ahmad Hasasneh, Mohammad Tami, Chakib Tadj |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2024-11-01
|
| Series: | Information |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2078-2489/15/12/751 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
ECG Biometrics on Mobile Devices: High-Accuracy Authentication Using i-Vectors and Cepstral Coefficients
by: F. Saba Kockan, et al.
Published: (2025-01-01) -
MAS-PD: Transferable Adversarial Attack Against Vision-Transformers-Based SAR Image Classification Task
by: Boshi Zheng, et al.
Published: (2025-01-01) -
Apvit: ViT with adaptive patches for scene text recognition
by: Ning Zhang, et al.
Published: (2025-03-01) -
Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition
by: Олеся Барковська, et al.
Published: (2023-12-01) -
Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition
by: Olesia Barkovska, et al.
Published: (2023-12-01)