SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis

Fault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed at minimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervised FDD methods offer great performance while heavily relying on large volumes of labeled data, whereas...

Full description

Saved in:
Bibliographic Details
Main Authors: Petr Ivanov, Maria Shtark, Alexander Kozhevnikov, Maksim Golyadkin, Dmitry Botov, Ilya Makarov
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10869347/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823857139818430464
author Petr Ivanov
Maria Shtark
Alexander Kozhevnikov
Maksim Golyadkin
Dmitry Botov
Ilya Makarov
author_facet Petr Ivanov
Maria Shtark
Alexander Kozhevnikov
Maksim Golyadkin
Dmitry Botov
Ilya Makarov
author_sort Petr Ivanov
collection DOAJ
description Fault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed at minimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervised FDD methods offer great performance while heavily relying on large volumes of labeled data, whereas unsupervised methods do not depend on labeled data, though are inferior in performance compared to supervised ones. In this paper, we propose SensorDBSCAN, a novel semi-supervised method for anomaly detection and diagnosis. The key innovation lies in achieving good performance with minimal labeled data - less than 1% of the dataset - by leveraging active and contrastive learning techniques. The proposed approach combines a transformer-based encoder trained with a triplet-based contrastive learning objective and the classical density-based clustering algorithm DBSCAN, enabling strong feature extraction, efficient and interpretable feature space organization and simple clustering algorithm. Unlike existing methods, SensorDBSCAN eliminates the need for manual labeling large amounts of data, cluster analysis, and pre-defining cluster numbers, providing greater usability in real-world cases. We validate the effectiveness of our method on the Tennessee Eastman Process (TEP) and its advanced simulations (TEP Rieth and TEP Rieker). SensorDBSCAN demonstrates better performance on well-known and realistic datasets, reducing labeling requirements while maintaining high accuracy of fault detection and diagnostics. The code is available at <uri>https://github.com/K0mp0t/sensordbscan</uri>.
format Article
id doaj-art-7ea0952dee4a41d885de2f832fe067b0
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-7ea0952dee4a41d885de2f832fe067b02025-02-12T00:02:14ZengIEEEIEEE Access2169-35362025-01-0113251862519710.1109/ACCESS.2025.353764910869347SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and DiagnosisPetr Ivanov0https://orcid.org/0000-0003-0527-8354Maria Shtark1Alexander Kozhevnikov2https://orcid.org/0009-0008-7840-8651Maksim Golyadkin3Dmitry Botov4Ilya Makarov5https://orcid.org/0000-0002-3308-8825AI Talent Hub, ITMO University, Saint Petersburg, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaArtificial Intelligence Research Institute (AIRI), Moscow, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaAI Talent Hub, ITMO University, Saint Petersburg, RussiaFault detection and diagnosis (FDD) is a critical challenge in industrial processes aimed at minimizing risks such as safety hazards, costly downtime, and suboptimal production. Traditional supervised FDD methods offer great performance while heavily relying on large volumes of labeled data, whereas unsupervised methods do not depend on labeled data, though are inferior in performance compared to supervised ones. In this paper, we propose SensorDBSCAN, a novel semi-supervised method for anomaly detection and diagnosis. The key innovation lies in achieving good performance with minimal labeled data - less than 1% of the dataset - by leveraging active and contrastive learning techniques. The proposed approach combines a transformer-based encoder trained with a triplet-based contrastive learning objective and the classical density-based clustering algorithm DBSCAN, enabling strong feature extraction, efficient and interpretable feature space organization and simple clustering algorithm. Unlike existing methods, SensorDBSCAN eliminates the need for manual labeling large amounts of data, cluster analysis, and pre-defining cluster numbers, providing greater usability in real-world cases. We validate the effectiveness of our method on the Tennessee Eastman Process (TEP) and its advanced simulations (TEP Rieth and TEP Rieker). SensorDBSCAN demonstrates better performance on well-known and realistic datasets, reducing labeling requirements while maintaining high accuracy of fault detection and diagnostics. The code is available at <uri>https://github.com/K0mp0t/sensordbscan</uri>.https://ieeexplore.ieee.org/document/10869347/Active learningsemi-supervised learningtime series anomaly detection and diagnosis
spellingShingle Petr Ivanov
Maria Shtark
Alexander Kozhevnikov
Maksim Golyadkin
Dmitry Botov
Ilya Makarov
SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
IEEE Access
Active learning
semi-supervised learning
time series anomaly detection and diagnosis
title SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
title_full SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
title_fullStr SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
title_full_unstemmed SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
title_short SensorDBSCAN: Semi-Supervised Active Learning Powered Method for Anomaly Detection and Diagnosis
title_sort sensordbscan semi supervised active learning powered method for anomaly detection and diagnosis
topic Active learning
semi-supervised learning
time series anomaly detection and diagnosis
url https://ieeexplore.ieee.org/document/10869347/
work_keys_str_mv AT petrivanov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis
AT mariashtark sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis
AT alexanderkozhevnikov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis
AT maksimgolyadkin sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis
AT dmitrybotov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis
AT ilyamakarov sensordbscansemisupervisedactivelearningpoweredmethodforanomalydetectionanddiagnosis