An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation

Sample clustering techniques play a crucial role in the data-driven state evaluation of electromechanical equipment, and selecting an appropriate similarity measurement method for sample sets helps improve the clustering performance. The Jaccard coefficient is a commonly employed indicator of simila...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaoqing Li, Hao Tang, Hai Wang, Gangzhong Miao, Mingang Cheng
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:IET Signal Processing
Online Access:http://dx.doi.org/10.1049/2024/6586622
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841561599289065472
author Xiaoqing Li
Hao Tang
Hai Wang
Gangzhong Miao
Mingang Cheng
author_facet Xiaoqing Li
Hao Tang
Hai Wang
Gangzhong Miao
Mingang Cheng
author_sort Xiaoqing Li
collection DOAJ
description Sample clustering techniques play a crucial role in the data-driven state evaluation of electromechanical equipment, and selecting an appropriate similarity measurement method for sample sets helps improve the clustering performance. The Jaccard coefficient is a commonly employed indicator of similarity for scalar set-type samples. In this paper, we propose an incremental clustering algorithm for matrix-type samples by defining an improved Jaccard coefficient. First, a new binary relation is formulated to derive a relationship matrix between samples. Second, an undirected graph is given by using the relationship matrix, and an improved pruning operation is provided to simplify the graph by eliminating redundant edges. Then, a new relationship matrix is generated according to the modified graph, which enables the calculation of the improved Jaccard coefficient. By using the improved Jaccard coefficient, the improved incremental clustering algorithm updates cluster centers by selecting a particular sample to maximize the sum of similarities between the selected sample and other samples within the same cluster. Finally, the effectiveness of the proposed incremental clustering algorithm is demonstrated in fault diagnosis and remaining useful life estimation application scenarios, respectively. The experimental results indicate that the improved algorithm outperforms traditional clustering methods.
format Article
id doaj-art-f248e80b0e484d4d91a76832a0c5b956
institution Kabale University
issn 1751-9683
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series IET Signal Processing
spelling doaj-art-f248e80b0e484d4d91a76832a0c5b9562025-01-03T01:31:54ZengWileyIET Signal Processing1751-96832024-01-01202410.1049/2024/6586622An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL EstimationXiaoqing Li0Hao Tang1Hai Wang2Gangzhong Miao3Mingang Cheng4School of Electrical Engineering and AutomationSchool of Electrical Engineering and AutomationDiscipline of Engineering and EnergySchool of Electrical Engineering and AutomationSchool of Electrical Engineering and AutomationSample clustering techniques play a crucial role in the data-driven state evaluation of electromechanical equipment, and selecting an appropriate similarity measurement method for sample sets helps improve the clustering performance. The Jaccard coefficient is a commonly employed indicator of similarity for scalar set-type samples. In this paper, we propose an incremental clustering algorithm for matrix-type samples by defining an improved Jaccard coefficient. First, a new binary relation is formulated to derive a relationship matrix between samples. Second, an undirected graph is given by using the relationship matrix, and an improved pruning operation is provided to simplify the graph by eliminating redundant edges. Then, a new relationship matrix is generated according to the modified graph, which enables the calculation of the improved Jaccard coefficient. By using the improved Jaccard coefficient, the improved incremental clustering algorithm updates cluster centers by selecting a particular sample to maximize the sum of similarities between the selected sample and other samples within the same cluster. Finally, the effectiveness of the proposed incremental clustering algorithm is demonstrated in fault diagnosis and remaining useful life estimation application scenarios, respectively. The experimental results indicate that the improved algorithm outperforms traditional clustering methods.http://dx.doi.org/10.1049/2024/6586622
spellingShingle Xiaoqing Li
Hao Tang
Hai Wang
Gangzhong Miao
Mingang Cheng
An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation
IET Signal Processing
title An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation
title_full An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation
title_fullStr An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation
title_full_unstemmed An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation
title_short An Improved Jaccard Coefficient-Based Clustering Approach with Application to Diagnosis and RUL Estimation
title_sort improved jaccard coefficient based clustering approach with application to diagnosis and rul estimation
url http://dx.doi.org/10.1049/2024/6586622
work_keys_str_mv AT xiaoqingli animprovedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT haotang animprovedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT haiwang animprovedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT gangzhongmiao animprovedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT mingangcheng animprovedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT xiaoqingli improvedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT haotang improvedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT haiwang improvedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT gangzhongmiao improvedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation
AT mingangcheng improvedjaccardcoefficientbasedclusteringapproachwithapplicationtodiagnosisandrulestimation