A Systematic Survey of Sparse Clustering
Handling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/11015960/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850232358706348032 |
|---|---|
| author | Josephine Bernadette M. Benjamin Miin-Shen Yang |
| author_facet | Josephine Bernadette M. Benjamin Miin-Shen Yang |
| author_sort | Josephine Bernadette M. Benjamin |
| collection | DOAJ |
| description | Handling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and feature reduction algorithms have been proposed to process the data. In this paper, we review sparse clustering approaches that aim to cluster data sets while selecting and removing redundant and irrelevant features as well as noisy points and outliers. This paper surveys existing sparse clustering algorithms and explores their effectiveness in the analysis of high-dimensional data. The exploration and analysis of these sparse clustering approaches outperform the existing conventional clustering algorithms when faced with large and high-dimensional data. We also investigate the use of regularization terms in the sparse clustering algorithms. In this survey, we consider an extensive search on published papers as well as textbooks and discuss important results reported in the literature. We further explore and compare their strengths, limitations, adaptability, interpretability, complexity, and usability in handling high-dimensional data, and more research directions and topics on sparse clustering are also analyzed. |
| format | Article |
| id | doaj-art-8e1bb16142984e8bbd1c3cdc5bfdcf31 |
| institution | OA Journals |
| issn | 2169-3536 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Access |
| spelling | doaj-art-8e1bb16142984e8bbd1c3cdc5bfdcf312025-08-20T02:03:13ZengIEEEIEEE Access2169-35362025-01-0113949829501810.1109/ACCESS.2025.357406611015960A Systematic Survey of Sparse ClusteringJosephine Bernadette M. Benjamin0https://orcid.org/0000-0002-2995-7207Miin-Shen Yang1https://orcid.org/0000-0002-4907-3548Department of Mathematics and Physics, University of Santo Tomas, Manila, PhilippinesDepartment of Applied Mathematics, Chung Yuan Christian University, Taoyuan, TaiwanHandling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and feature reduction algorithms have been proposed to process the data. In this paper, we review sparse clustering approaches that aim to cluster data sets while selecting and removing redundant and irrelevant features as well as noisy points and outliers. This paper surveys existing sparse clustering algorithms and explores their effectiveness in the analysis of high-dimensional data. The exploration and analysis of these sparse clustering approaches outperform the existing conventional clustering algorithms when faced with large and high-dimensional data. We also investigate the use of regularization terms in the sparse clustering algorithms. In this survey, we consider an extensive search on published papers as well as textbooks and discuss important results reported in the literature. We further explore and compare their strengths, limitations, adaptability, interpretability, complexity, and usability in handling high-dimensional data, and more research directions and topics on sparse clustering are also analyzed.https://ieeexplore.ieee.org/document/11015960/ClusteringK-meansfuzzy c-meanssparsityLassofeature selection |
| spellingShingle | Josephine Bernadette M. Benjamin Miin-Shen Yang A Systematic Survey of Sparse Clustering IEEE Access Clustering K-means fuzzy c-means sparsity Lasso feature selection |
| title | A Systematic Survey of Sparse Clustering |
| title_full | A Systematic Survey of Sparse Clustering |
| title_fullStr | A Systematic Survey of Sparse Clustering |
| title_full_unstemmed | A Systematic Survey of Sparse Clustering |
| title_short | A Systematic Survey of Sparse Clustering |
| title_sort | systematic survey of sparse clustering |
| topic | Clustering K-means fuzzy c-means sparsity Lasso feature selection |
| url | https://ieeexplore.ieee.org/document/11015960/ |
| work_keys_str_mv | AT josephinebernadettembenjamin asystematicsurveyofsparseclustering AT miinshenyang asystematicsurveyofsparseclustering AT josephinebernadettembenjamin systematicsurveyofsparseclustering AT miinshenyang systematicsurveyofsparseclustering |