SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
Fault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed at minimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervised FDD methods offer great performance while heavily relying on large volumes of labeled data, whereas...
Saved in:
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10869347/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823857139818430464 |
---|---|
author | Petr Ivanov Maria Shtark Alexander Kozhevnikov Maksim Golyadkin Dmitry Botov Ilya Makarov |
author_facet | Petr Ivanov Maria Shtark Alexander Kozhevnikov Maksim Golyadkin Dmitry Botov Ilya Makarov |
author_sort | Petr Ivanov |
collection | DOAJ |
description | Fault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed at minimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervised FDD methods offer great performance while heavily relying on large volumes of labeled data, whereas unsupervised methods do not depend on labeled data, though are inferior in performance compared to supervised ones. In this paper, we propose SensorDBSCAN, a novel semi-supervised method for anomaly detection and diagnosis. The key innovation lies in achieving good performance with minimal labeled data - less than 1% of the dataset - by leveraging active and contrastive learning techniques. The proposed approach combines a transformer-based encoder trained with a triplet-based contrastive learning objective and the classical density-based clustering algorithm DBSCAN, enabling strong feature extraction, efficient and interpretable feature space organization and simple clustering algorithm. Unlike existing methods, SensorDBSCAN eliminates the need for manual labeling large amounts of data, cluster analysis, and pre-defining cluster numbers, providing greater usability in real-world cases. We validate the effectiveness of our method on the Tennessee Eastman Process (TEP) and its advanced simulations (TEP Rieth and TEP Rieker). SensorDBSCAN demonstrates better performance on well-known and realistic datasets, reducing labeling requirements while maintaining high accuracy of fault detection and diagnostics. The code is available at <uri>https://github.com/K0mp0t/sensordbscan</uri>. |
format | Article |
id | doaj-art-7ea0952dee4a41d885de2f832fe067b0 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-7ea0952dee4a41d885de2f832fe067b02025-02-12T00:02:14ZengIEEEIEEE Access2169-35362025-01-0113251862519710.1109/ACCESS.2025.353764910869347SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and DiagnosisPetr Ivanov0https://orcid.org/0000-0003-0527-8354Maria Shtark1Alexander Kozhevnikov2https://orcid.org/0009-0008-7840-8651Maksim Golyadkin3Dmitry Botov4Ilya Makarov5https://orcid.org/0000-0002-3308-8825AI Talent Hub, ITMO University, Saint Petersburg, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaArtificial Intelligence Research Institute (AIRI), Moscow, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaFault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed at minimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervised FDD methods offer great performance while heavily relying on large volumes of labeled data, whereas unsupervised methods do not depend on labeled data, though are inferior in performance compared to supervised ones. In this paper, we propose SensorDBSCAN, a novel semi-supervised method for anomaly detection and diagnosis. The key innovation lies in achieving good performance with minimal labeled data - less than 1% of the dataset - by leveraging active and contrastive learning techniques. The proposed approach combines a transformer-based encoder trained with a triplet-based contrastive learning objective and the classical density-based clustering algorithm DBSCAN, enabling strong feature extraction, efficient and interpretable feature space organization and simple clustering algorithm. Unlike existing methods, SensorDBSCAN eliminates the need for manual labeling large amounts of data, cluster analysis, and pre-defining cluster numbers, providing greater usability in real-world cases. We validate the effectiveness of our method on the Tennessee Eastman Process (TEP) and its advanced simulations (TEP Rieth and TEP Rieker). SensorDBSCAN demonstrates better performance on well-known and realistic datasets, reducing labeling requirements while maintaining high accuracy of fault detection and diagnostics. The code is available at <uri>https://github.com/K0mp0t/sensordbscan</uri>.https://ieeexplore.ieee.org/document/10869347/Active learningsemi-supervised learningtime series anomaly detection and diagnosis |
spellingShingle | Petr Ivanov Maria Shtark Alexander Kozhevnikov Maksim Golyadkin Dmitry Botov Ilya Makarov SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis IEEE Access Active learning semi-supervised learning time series anomaly detection and diagnosis |
title | SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis |
title_full | SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis |
title_fullStr | SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis |
title_full_unstemmed | SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis |
title_short | SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis |
title_sort | sensordbscan semi supervised active learning powered method for anomaly detection and diagnosis |
topic | Active learning semi-supervised learning time series anomaly detection and diagnosis |
url | https://ieeexplore.ieee.org/document/10869347/ |
work_keys_str_mv | AT petrivanov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis AT mariashtark sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis AT alexanderkozhevnikov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis AT maksimgolyadkin sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis AT dmitrybotov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis AT ilyamakarov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis |