Sound recurrence analysis for acoustic scene classification

Abstract In everyday life, people experience different soundscapes in which natural sounds, animal noises, and man-made sounds blend together. Although there have been several studies on the importance of recurring sound patterns in music and language, the relevance of this phenomenon in natural sou...

Full description

Saved in:
Bibliographic Details
Main Authors: Jakob Abeßer, Zhiwei Liang, Bernhard Seeber
Format: Article
Language:English
Published: SpringerOpen 2025-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Subjects:
Online Access:https://doi.org/10.1186/s13636-024-00390-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594471628308480
author Jakob Abeßer
Zhiwei Liang
Bernhard Seeber
author_facet Jakob Abeßer
Zhiwei Liang
Bernhard Seeber
author_sort Jakob Abeßer
collection DOAJ
description Abstract In everyday life, people experience different soundscapes in which natural sounds, animal noises, and man-made sounds blend together. Although there have been several studies on the importance of recurring sound patterns in music and language, the relevance of this phenomenon in natural soundscapes is still largely unexplored. In this article, we study the repetition patterns of harmonic and transient sound events as potential cues for acoustic scene classification (ASC). In the first part of our study, our aim is to identify acoustic scene classes that exhibit characteristic sound repetition patterns concerning harmonic and transient sounds. We propose three metrics to measure the overall prevalence of sound repetitions as well as their repetition periods and temporal stability. In the second part, we evaluate three strategies to incorporate self-similarity matrices as an additional input feature to a convolutional neural network architecture for ASC. We observe the characteristic repetition of transient sounds in recordings of “park” and “street traffic” as well as harmonic sound repetitions in acoustic scene classes related to public transportation. In the ASC experiments, hybrid network architectures, which combine spectrogram features and features from sound recurrence analysis, show increased accuracy for those classes with prominent sound repetition patterns. Our findings provide additional perspective on the distinctions among acoustic scenes previously primarily ascribed in the literature to their spectral features.
format Article
id doaj-art-40e18f7ff43d4b3c89bd67b215ba4116
institution Kabale University
issn 1687-4722
language English
publishDate 2025-01-01
publisher SpringerOpen
record_format Article
series EURASIP Journal on Audio, Speech, and Music Processing
spelling doaj-art-40e18f7ff43d4b3c89bd67b215ba41162025-01-19T12:34:00ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47222025-01-012025111510.1186/s13636-024-00390-2Sound recurrence analysis for acoustic scene classificationJakob Abeßer0Zhiwei Liang1Bernhard Seeber2Semantic Music Technologies, Fraunhofer IDMTSemantic Music Technologies, Fraunhofer IDMTAudio Information Processing, TU MünchenAbstract In everyday life, people experience different soundscapes in which natural sounds, animal noises, and man-made sounds blend together. Although there have been several studies on the importance of recurring sound patterns in music and language, the relevance of this phenomenon in natural soundscapes is still largely unexplored. In this article, we study the repetition patterns of harmonic and transient sound events as potential cues for acoustic scene classification (ASC). In the first part of our study, our aim is to identify acoustic scene classes that exhibit characteristic sound repetition patterns concerning harmonic and transient sounds. We propose three metrics to measure the overall prevalence of sound repetitions as well as their repetition periods and temporal stability. In the second part, we evaluate three strategies to incorporate self-similarity matrices as an additional input feature to a convolutional neural network architecture for ASC. We observe the characteristic repetition of transient sounds in recordings of “park” and “street traffic” as well as harmonic sound repetitions in acoustic scene classes related to public transportation. In the ASC experiments, hybrid network architectures, which combine spectrogram features and features from sound recurrence analysis, show increased accuracy for those classes with prominent sound repetition patterns. Our findings provide additional perspective on the distinctions among acoustic scenes previously primarily ascribed in the literature to their spectral features.https://doi.org/10.1186/s13636-024-00390-2Acoustic scene classificationSound recurrence analysisSound repetition patternsSelf-similarity matrixHarmonic-percussive source separationResult fusion
spellingShingle Jakob Abeßer
Zhiwei Liang
Bernhard Seeber
Sound recurrence analysis for acoustic scene classification
EURASIP Journal on Audio, Speech, and Music Processing
Acoustic scene classification
Sound recurrence analysis
Sound repetition patterns
Self-similarity matrix
Harmonic-percussive source separation
Result fusion
title Sound recurrence analysis for acoustic scene classification
title_full Sound recurrence analysis for acoustic scene classification
title_fullStr Sound recurrence analysis for acoustic scene classification
title_full_unstemmed Sound recurrence analysis for acoustic scene classification
title_short Sound recurrence analysis for acoustic scene classification
title_sort sound recurrence analysis for acoustic scene classification
topic Acoustic scene classification
Sound recurrence analysis
Sound repetition patterns
Self-similarity matrix
Harmonic-percussive source separation
Result fusion
url https://doi.org/10.1186/s13636-024-00390-2
work_keys_str_mv AT jakobabeßer soundrecurrenceanalysisforacousticsceneclassification
AT zhiweiliang soundrecurrenceanalysisforacousticsceneclassification
AT bernhardseeber soundrecurrenceanalysisforacousticsceneclassification