Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks
The clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2022-01-01
|
Series: | Complexity |
Online Access: | http://dx.doi.org/10.1155/2022/6742120 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832552620889210880 |
---|---|
author | Shihua Liu |
author_facet | Shihua Liu |
author_sort | Shihua Liu |
collection | DOAJ |
description | The clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of such an algorithm and proposed an adaptive mixed-attribute data clustering method based on density peaks called AMDPC. In this algorithm, we used the unified distance metric of mixed-attribute data to construct the distance matrix, calculated the local density based on K-nearest neighbors, and proposed the automatic determination method of cluster centers based on three inflection points. Experimental results on real University of California-Irvine (UCI) datasets showed that the proposed AMDPC algorithm could realize adaptive clustering of mixed-attribute data, can automatically obtain the correct number of clusters, and improved the clustering accuracy of all datasets by more than 22.58%, by 24.25%, by 28.03%, by 22.5%, and by 10.12% for the Heart, Cleveland, Credit, Acute, and Adult datasets compared to that of the traditional K-prototype algorithm, respectively. It also outperformed a modified density peaks clustering algorithm for mixed-attribute data (DPC_M) algorithms. |
format | Article |
id | doaj-art-1d3f1e41190a498db607245e1ec00cdd |
institution | Kabale University |
issn | 1099-0526 |
language | English |
publishDate | 2022-01-01 |
publisher | Wiley |
record_format | Article |
series | Complexity |
spelling | doaj-art-1d3f1e41190a498db607245e1ec00cdd2025-02-03T05:58:10ZengWileyComplexity1099-05262022-01-01202210.1155/2022/6742120Adaptive Mixed-Attribute Data Clustering Method Based on Density PeaksShihua Liu0School of Artificial IntelligenceThe clustering of mixed-attribute data is a vital and challenging issue. The density peaks clustering algorithm brings us a simple and efficient solution, but it mainly focuses on numerical attribute data clustering and cannot be adaptive. In this paper, we studied the adaptive improvement method of such an algorithm and proposed an adaptive mixed-attribute data clustering method based on density peaks called AMDPC. In this algorithm, we used the unified distance metric of mixed-attribute data to construct the distance matrix, calculated the local density based on K-nearest neighbors, and proposed the automatic determination method of cluster centers based on three inflection points. Experimental results on real University of California-Irvine (UCI) datasets showed that the proposed AMDPC algorithm could realize adaptive clustering of mixed-attribute data, can automatically obtain the correct number of clusters, and improved the clustering accuracy of all datasets by more than 22.58%, by 24.25%, by 28.03%, by 22.5%, and by 10.12% for the Heart, Cleveland, Credit, Acute, and Adult datasets compared to that of the traditional K-prototype algorithm, respectively. It also outperformed a modified density peaks clustering algorithm for mixed-attribute data (DPC_M) algorithms.http://dx.doi.org/10.1155/2022/6742120 |
spellingShingle | Shihua Liu Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks Complexity |
title | Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks |
title_full | Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks |
title_fullStr | Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks |
title_full_unstemmed | Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks |
title_short | Adaptive Mixed-Attribute Data Clustering Method Based on Density Peaks |
title_sort | adaptive mixed attribute data clustering method based on density peaks |
url | http://dx.doi.org/10.1155/2022/6742120 |
work_keys_str_mv | AT shihualiu adaptivemixedattributedataclusteringmethodbasedondensitypeaks |