Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring

As a complex ecosystem composed of flora and fauna, the forest has always been vulnerable to threats. Previous researchers utilized environmental audio collections, such as the ESC-50 and UrbanSound8k datasets, as proximate representatives of sounds potentially present in forests. This study focuses...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad Qurthobi, Robertas Damasevicius, Vytautas Barzdaitis, Rytis Maskeliunas
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10856116/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540517724848128
author Ahmad Qurthobi
Robertas Damasevicius
Vytautas Barzdaitis
Rytis Maskeliunas
author_facet Ahmad Qurthobi
Robertas Damasevicius
Vytautas Barzdaitis
Rytis Maskeliunas
author_sort Ahmad Qurthobi
collection DOAJ
description As a complex ecosystem composed of flora and fauna, the forest has always been vulnerable to threats. Previous researchers utilized environmental audio collections, such as the ESC-50 and UrbanSound8k datasets, as proximate representatives of sounds potentially present in forests. This study focuses on the application of deep learning models for forest sound classification as an effort to establish an early threats detection system. The research evaluates the performance of several pre-trained deep learning models, including MobileNet, GoogleNet, and ResNet, on the limited FSC22 dataset, which consists of 2,025 forest sound recordings classified into 27 categories. To improve classification capabilities, the study introduces a hybrid model that combines neural network (CNN) with a Bidirectional Long-Short-Term Memory (BiLSTM) layer, designed to capture both spatial and temporal features of the sound data. The research also employs Pareto-Mordukhovich-optimized Mel Frequency Cepstral Coefficients (MFCC) for feature extraction, improving the representation of audio signals. Data augmentation and dimensionality reduction techniques were also explored to assess their impact on model performance. The results indicate that the proposed hybrid CNN-BiLSTM model significantly improved classification loss and accuracy scores compared to the standalone pre-trained models. GoogleNet, with an added BiLSTM layer and augmented data, achieved an average reduced loss score of 0.7209 and average accuracy of 0.7852, demonstrating its potential to classify forest sounds. Improvements in loss score and classification performance highlight the potential of hybrid models in environmental sound analysis, particularly in scenarios with limited data availability.
format Article
id doaj-art-329c4ca1b37b46faba4e2a1c1308a663
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-329c4ca1b37b46faba4e2a1c1308a6632025-02-05T00:01:02ZengIEEEIEEE Access2169-35362025-01-0113209232094410.1109/ACCESS.2025.353579610856116Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental MonitoringAhmad Qurthobi0https://orcid.org/0000-0003-3882-2897Robertas Damasevicius1https://orcid.org/0000-0001-9990-1084Vytautas Barzdaitis2Rytis Maskeliunas3https://orcid.org/0000-0002-2809-2213Faculty of Informatics, Kaunas University of Technology, Kaunas, LithuaniaDepartment of Applied Informatics, Center of Excellence Forest 4.0, Vytautas Magnus University, Kaunas, LithuaniaDepartment of Applied Informatics, Center of Excellence Forest 4.0, Vytautas Magnus University, Kaunas, LithuaniaFaculty of Informatics, Kaunas University of Technology, Kaunas, LithuaniaAs a complex ecosystem composed of flora and fauna, the forest has always been vulnerable to threats. Previous researchers utilized environmental audio collections, such as the ESC-50 and UrbanSound8k datasets, as proximate representatives of sounds potentially present in forests. This study focuses on the application of deep learning models for forest sound classification as an effort to establish an early threats detection system. The research evaluates the performance of several pre-trained deep learning models, including MobileNet, GoogleNet, and ResNet, on the limited FSC22 dataset, which consists of 2,025 forest sound recordings classified into 27 categories. To improve classification capabilities, the study introduces a hybrid model that combines neural network (CNN) with a Bidirectional Long-Short-Term Memory (BiLSTM) layer, designed to capture both spatial and temporal features of the sound data. The research also employs Pareto-Mordukhovich-optimized Mel Frequency Cepstral Coefficients (MFCC) for feature extraction, improving the representation of audio signals. Data augmentation and dimensionality reduction techniques were also explored to assess their impact on model performance. The results indicate that the proposed hybrid CNN-BiLSTM model significantly improved classification loss and accuracy scores compared to the standalone pre-trained models. GoogleNet, with an added BiLSTM layer and augmented data, achieved an average reduced loss score of 0.7209 and average accuracy of 0.7852, demonstrating its potential to classify forest sounds. Improvements in loss score and classification performance highlight the potential of hybrid models in environmental sound analysis, particularly in scenarios with limited data availability.https://ieeexplore.ieee.org/document/10856116/Forest soundsclassificationMordukhovich subdifferentialPareto optimization
spellingShingle Ahmad Qurthobi
Robertas Damasevicius
Vytautas Barzdaitis
Rytis Maskeliunas
Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
IEEE Access
Forest sounds
classification
Mordukhovich subdifferential
Pareto optimization
title Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_full Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_fullStr Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_full_unstemmed Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_short Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_sort robust forest sound classification using pareto mordukhovich optimized mfcc in environmental monitoring
topic Forest sounds
classification
Mordukhovich subdifferential
Pareto optimization
url https://ieeexplore.ieee.org/document/10856116/
work_keys_str_mv AT ahmadqurthobi robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring
AT robertasdamasevicius robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring
AT vytautasbarzdaitis robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring
AT rytismaskeliunas robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring