Improving Audio Recognition With Randomized Area Ratio Patch Masking: A Data Augmentation Perspective
In audio recognition, improving the accuracy and generalizability of Pretrained Audio Neural Networks (PANNs) remains challenging. This study introduces Randomized Area Ratio Patch Masking (RARPM), a novel data augmentation technique that applies random patches with varying transparency to log mel s...
Saved in:
| Main Authors: | Weichun Wong, Yachun Li, Shihan Li |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2024-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10706845/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Audiogmenter: a MATLAB toolbox for audio data augmentation
by: Gianluca Maguolo, et al.
Published: (2025-01-01) -
Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition
by: Олеся Барковська, et al.
Published: (2023-12-01) -
Analysis of the influence of selected audio pre-processing stages on accuracy of speaker language recognition
by: Olesia Barkovska, et al.
Published: (2023-12-01) -
Automatic recognition and representation of text in the form of audio stream
by: L. V. Serebryanaya, et al.
Published: (2021-10-01) -
Audio copy-move forgery detection with decreasing convolutional kernel neural network and spectrogram fusion
by: Canghong Shi, et al.
Published: (2025-07-01)