Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks

Detecting anomalies in industrial sound is critical for maintaining operational efficiency, preventing costly equipment failures, and ensuring workplace safety. However, it presents significant challenges due to the complexity and variability of industrial environments, including background noise an...

Full description

Saved in:
Bibliographic Details
Main Authors: Thi-Thu-Huong Le, Andro Aprila Adiputra, Jiwon Yun, Howon Kim
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10980326/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850191994212581376
author Thi-Thu-Huong Le
Andro Aprila Adiputra
Jiwon Yun
Howon Kim
author_facet Thi-Thu-Huong Le
Andro Aprila Adiputra
Jiwon Yun
Howon Kim
author_sort Thi-Thu-Huong Le
collection DOAJ
description Detecting anomalies in industrial sound is critical for maintaining operational efficiency, preventing costly equipment failures, and ensuring workplace safety. However, it presents significant challenges due to the complexity and variability of industrial environments, including background noise and fluctuating operating conditions. This paper proposes a comprehensive approach that leverages machine learning (ML) and deep learning (DL) techniques to address these challenges. Using three datasets, the Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization 2022 (MIMII DG 2022), the Detection and Classification of Acoustic Scenes and Events (DCASE) 2022, and DCASE 2024, we evaluate the performance of various ML and DL models under different experimental conditions. Our study focuses on feature extraction methods such as Mel-spectrograms and Mel-frequency Cepstral Coefficients (MFCCs) to capture critical acoustic characteristics of industrial machinery. Both supervised ML and DL techniques are employed to explore effective anomaly detection strategies. Extensive experimentation and evaluation using metrics such as confusion matrix, accuracy, precision, recall, and F1 score highlight the effectiveness of our approach in real-world industrial scenarios. Experimental results demonstrate that eXtreme Gradient Boosting (XGBoost) outperforms Support Vector Machine (SVM) and Decision Tree (DT) models in the ML approach across both feature sets. In the DL approach, Gated Recurrent Units (GRU) perform better on MFCC features than Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks. GRU emerges as the best-performing model on testing datasets, achieving a precision of 99.56% and an F1 score of 99.55% on the DCASE 2024 dataset, a precision of 94% and an F1 score of 93.95% on the DCASE 2022 dataset, and a precision of 92.2% and an F1 score of 92.06% on the MIMII DG 2022 dataset. These results underscore the potential of DL for real-time industrial sound analysis in predictive maintenance, offering significant improvements over traditional methods.
format Article
id doaj-art-efd6056fcfcf49ce8514717c7d43ba76
institution OA Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-efd6056fcfcf49ce8514717c7d43ba762025-08-20T02:14:42ZengIEEEIEEE Access2169-35362025-01-0113771657718610.1109/ACCESS.2025.356581210980326Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit NetworksThi-Thu-Huong Le0https://orcid.org/0000-0002-8366-9396Andro Aprila Adiputra1https://orcid.org/0009-0000-9467-2841Jiwon Yun2https://orcid.org/0009-0007-3447-6567Howon Kim3https://orcid.org/0000-0001-8475-7294Blockchain Platform Research Center, Pusan National University, Busan, South KoreaSchool of Computer Science and Engineering, Pusan National University, Busan, South KoreaSchool of Computer Science and Engineering, Pusan National University, Busan, South KoreaSchool of Computer Science and Engineering, Pusan National University, Busan, South KoreaDetecting anomalies in industrial sound is critical for maintaining operational efficiency, preventing costly equipment failures, and ensuring workplace safety. However, it presents significant challenges due to the complexity and variability of industrial environments, including background noise and fluctuating operating conditions. This paper proposes a comprehensive approach that leverages machine learning (ML) and deep learning (DL) techniques to address these challenges. Using three datasets, the Malfunctioning Industrial Machine Investigation and Inspection for Domain Generalization 2022 (MIMII DG 2022), the Detection and Classification of Acoustic Scenes and Events (DCASE) 2022, and DCASE 2024, we evaluate the performance of various ML and DL models under different experimental conditions. Our study focuses on feature extraction methods such as Mel-spectrograms and Mel-frequency Cepstral Coefficients (MFCCs) to capture critical acoustic characteristics of industrial machinery. Both supervised ML and DL techniques are employed to explore effective anomaly detection strategies. Extensive experimentation and evaluation using metrics such as confusion matrix, accuracy, precision, recall, and F1 score highlight the effectiveness of our approach in real-world industrial scenarios. Experimental results demonstrate that eXtreme Gradient Boosting (XGBoost) outperforms Support Vector Machine (SVM) and Decision Tree (DT) models in the ML approach across both feature sets. In the DL approach, Gated Recurrent Units (GRU) perform better on MFCC features than Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks. GRU emerges as the best-performing model on testing datasets, achieving a precision of 99.56% and an F1 score of 99.55% on the DCASE 2024 dataset, a precision of 94% and an F1 score of 93.95% on the DCASE 2022 dataset, and a precision of 92.2% and an F1 score of 92.06% on the MIMII DG 2022 dataset. These results underscore the potential of DL for real-time industrial sound analysis in predictive maintenance, offering significant improvements over traditional methods.https://ieeexplore.ieee.org/document/10980326/Anomalous sound detectionsupervised learninghigh-frequency featureMel-SpectrogramMel-frequency cepstral coefficientsgated recurrent units
spellingShingle Thi-Thu-Huong Le
Andro Aprila Adiputra
Jiwon Yun
Howon Kim
Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks
IEEE Access
Anomalous sound detection
supervised learning
high-frequency feature
Mel-Spectrogram
Mel-frequency cepstral coefficients
gated recurrent units
title Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks
title_full Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks
title_fullStr Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks
title_full_unstemmed Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks
title_short Anomaly Detection in Industrial Machine Sounds Using High-Frequency Features and Gate Recurrent Unit Networks
title_sort anomaly detection in industrial machine sounds using high frequency features and gate recurrent unit networks
topic Anomalous sound detection
supervised learning
high-frequency feature
Mel-Spectrogram
Mel-frequency cepstral coefficients
gated recurrent units
url https://ieeexplore.ieee.org/document/10980326/
work_keys_str_mv AT thithuhuongle anomalydetectioninindustrialmachinesoundsusinghighfrequencyfeaturesandgaterecurrentunitnetworks
AT androaprilaadiputra anomalydetectioninindustrialmachinesoundsusinghighfrequencyfeaturesandgaterecurrentunitnetworks
AT jiwonyun anomalydetectioninindustrialmachinesoundsusinghighfrequencyfeaturesandgaterecurrentunitnetworks
AT howonkim anomalydetectioninindustrialmachinesoundsusinghighfrequencyfeaturesandgaterecurrentunitnetworks