Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring

As a complex ecosystem composed of flora and fauna, the forest has always been vulnerable to threats. Previous researchers utilized environmental audio collections, such as the ESC-50 and UrbanSound8k datasets, as proximate representatives of sounds potentially present in forests. This study focuses...

Full description

Saved in:

Bibliographic Details
Main Authors:	Ahmad Qurthobi, Robertas Damasevicius, Vytautas Barzdaitis, Rytis Maskeliunas
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Forest sounds classification Mordukhovich subdifferential Pareto optimization
Online Access:	https://ieeexplore.ieee.org/document/10856116/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832540517724848128
author	Ahmad Qurthobi Robertas Damasevicius Vytautas Barzdaitis Rytis Maskeliunas
author_facet	Ahmad Qurthobi Robertas Damasevicius Vytautas Barzdaitis Rytis Maskeliunas
author_sort	Ahmad Qurthobi
collection	DOAJ
description	As a complex ecosystem composed of flora and fauna, the forest has always been vulnerable to threats. Previous researchers utilized environmental audio collections, such as the ESC-50 and UrbanSound8k datasets, as proximate representatives of sounds potentially present in forests. This study focuses on the application of deep learning models for forest sound classification as an effort to establish an early threats detection system. The research evaluates the performance of several pre-trained deep learning models, including MobileNet, GoogleNet, and ResNet, on the limited FSC22 dataset, which consists of 2,025 forest sound recordings classified into 27 categories. To improve classification capabilities, the study introduces a hybrid model that combines neural network (CNN) with a Bidirectional Long-Short-Term Memory (BiLSTM) layer, designed to capture both spatial and temporal features of the sound data. The research also employs Pareto-Mordukhovich-optimized Mel Frequency Cepstral Coefficients (MFCC) for feature extraction, improving the representation of audio signals. Data augmentation and dimensionality reduction techniques were also explored to assess their impact on model performance. The results indicate that the proposed hybrid CNN-BiLSTM model significantly improved classification loss and accuracy scores compared to the standalone pre-trained models. GoogleNet, with an added BiLSTM layer and augmented data, achieved an average reduced loss score of 0.7209 and average accuracy of 0.7852, demonstrating its potential to classify forest sounds. Improvements in loss score and classification performance highlight the potential of hybrid models in environmental sound analysis, particularly in scenarios with limited data availability.
format	Article
id	doaj-art-329c4ca1b37b46faba4e2a1c1308a663
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-329c4ca1b37b46faba4e2a1c1308a6632025-02-05T00:01:02ZengIEEEIEEE Access2169-35362025-01-0113209232094410.1109/ACCESS.2025.353579610856116Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental MonitoringAhmad Qurthobi0https://orcid.org/0000-0003-3882-2897Robertas Damasevicius1https://orcid.org/0000-0001-9990-1084Vytautas Barzdaitis2Rytis Maskeliunas3https://orcid.org/0000-0002-2809-2213Faculty of Informatics, Kaunas University of Technology, Kaunas, LithuaniaDepartment of Applied Informatics, Center of Excellence Forest 4.0, Vytautas Magnus University, Kaunas, LithuaniaDepartment of Applied Informatics, Center of Excellence Forest 4.0, Vytautas Magnus University, Kaunas, LithuaniaFaculty of Informatics, Kaunas University of Technology, Kaunas, LithuaniaAs a complex ecosystem composed of flora and fauna, the forest has always been vulnerable to threats. Previous researchers utilized environmental audio collections, such as the ESC-50 and UrbanSound8k datasets, as proximate representatives of sounds potentially present in forests. This study focuses on the application of deep learning models for forest sound classification as an effort to establish an early threats detection system. The research evaluates the performance of several pre-trained deep learning models, including MobileNet, GoogleNet, and ResNet, on the limited FSC22 dataset, which consists of 2,025 forest sound recordings classified into 27 categories. To improve classification capabilities, the study introduces a hybrid model that combines neural network (CNN) with a Bidirectional Long-Short-Term Memory (BiLSTM) layer, designed to capture both spatial and temporal features of the sound data. The research also employs Pareto-Mordukhovich-optimized Mel Frequency Cepstral Coefficients (MFCC) for feature extraction, improving the representation of audio signals. Data augmentation and dimensionality reduction techniques were also explored to assess their impact on model performance. The results indicate that the proposed hybrid CNN-BiLSTM model significantly improved classification loss and accuracy scores compared to the standalone pre-trained models. GoogleNet, with an added BiLSTM layer and augmented data, achieved an average reduced loss score of 0.7209 and average accuracy of 0.7852, demonstrating its potential to classify forest sounds. Improvements in loss score and classification performance highlight the potential of hybrid models in environmental sound analysis, particularly in scenarios with limited data availability.https://ieeexplore.ieee.org/document/10856116/Forest soundsclassificationMordukhovich subdifferentialPareto optimization
spellingShingle	Ahmad Qurthobi Robertas Damasevicius Vytautas Barzdaitis Rytis Maskeliunas Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring IEEE Access Forest sounds classification Mordukhovich subdifferential Pareto optimization
title	Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_full	Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_fullStr	Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_full_unstemmed	Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_short	Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring
title_sort	robust forest sound classification using pareto mordukhovich optimized mfcc in environmental monitoring
topic	Forest sounds classification Mordukhovich subdifferential Pareto optimization
url	https://ieeexplore.ieee.org/document/10856116/
work_keys_str_mv	AT ahmadqurthobi robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring AT robertasdamasevicius robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring AT vytautasbarzdaitis robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring AT rytismaskeliunas robustforestsoundclassificationusingparetomordukhovichoptimizedmfccinenvironmentalmonitoring

Robust Forest Sound Classification Using Pareto-Mordukhovich Optimized MFCC in Environmental Monitoring

Similar Items