Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks

The clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of...

Full description

Saved in:
Bibliographic Details
Main Author: Shihua Liu
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2022/6742120
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832552620889210880
author Shihua Liu
author_facet Shihua Liu
author_sort Shihua Liu
collection DOAJ
description The clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of such an algorithm and proposed an adaptive mixed-attribute data clustering method based on density peaks called AMDPC. In this algorithm, we used the unified distance metric of mixed-attribute data to construct the distance matrix, calculated the local density based on K-nearest neighbors, and proposed the automatic determination method of cluster centers based on three inflection points. Experimental results on real University of California-Irvine (UCI) datasets showed that the proposed AMDPC algorithm could realize adaptive clustering of mixed-attribute data, can automatically obtain the correct number of clusters, and improved the clustering accuracy of all datasets by more than 22.58%, by 24.25%, by 28.03%, by 22.5%, and by 10.12% for the Heart, Cleveland, Credit, Acute, and Adult datasets compared to that of the traditional K-prototype algorithm, respectively. It also outperformed a modified density peaks clustering algorithm for mixed-attribute data (DPC_M) algorithms.
format Article
id doaj-art-1d3f1e41190a498db607245e1ec00cdd
institution Kabale University
issn 1099-0526
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-1d3f1e41190a498db607245e1ec00cdd2025-02-03T05:58:10ZengWileyComplexity1099-05262022-01-01202210.1155/2022/6742120Adaptive Mixed-Attribute Data Clustering Method Based on Density PeaksShihua Liu0School of Artificial IntelligenceThe clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of such an algorithm and proposed an adaptive mixed-attribute data clustering method based on density peaks called AMDPC. In this algorithm, we used the unified distance metric of mixed-attribute data to construct the distance matrix, calculated the local density based on K-nearest neighbors, and proposed the automatic determination method of cluster centers based on three inflection points. Experimental results on real University of California-Irvine (UCI) datasets showed that the proposed AMDPC algorithm could realize adaptive clustering of mixed-attribute data, can automatically obtain the correct number of clusters, and improved the clustering accuracy of all datasets by more than 22.58%, by 24.25%, by 28.03%, by 22.5%, and by 10.12% for the Heart, Cleveland, Credit, Acute, and Adult datasets compared to that of the traditional K-prototype algorithm, respectively. It also outperformed a modified density peaks clustering algorithm for mixed-attribute data (DPC_M) algorithms.http://dx.doi.org/10.1155/2022/6742120
spellingShingle Shihua Liu
Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
Complexity
title Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
title_full Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
title_fullStr Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
title_full_unstemmed Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
title_short Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
title_sort adaptive mixed attribute data clustering method based on density peaks
url http://dx.doi.org/10.1155/2022/6742120
work_keys_str_mv AT shihualiu adaptivemixedattributedataclusteringmethodbasedondensitypeaks