A Systematic Survey of Sparse Clustering

Handling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and...

Full description

Saved in:
Bibliographic Details
Main Authors: Josephine Bernadette M. Benjamin, Miin-Shen Yang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11015960/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850232358706348032
author Josephine Bernadette M. Benjamin
Miin-Shen Yang
author_facet Josephine Bernadette M. Benjamin
Miin-Shen Yang
author_sort Josephine Bernadette M. Benjamin
collection DOAJ
description Handling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and feature reduction algorithms have been proposed to process the data. In this paper, we review sparse clustering approaches that aim to cluster data sets while selecting and removing redundant and irrelevant features as well as noisy points and outliers. This paper surveys existing sparse clustering algorithms and explores their effectiveness in the analysis of high-dimensional data. The exploration and analysis of these sparse clustering approaches outperform the existing conventional clustering algorithms when faced with large and high-dimensional data. We also investigate the use of regularization terms in the sparse clustering algorithms. In this survey, we consider an extensive search on published papers as well as textbooks and discuss important results reported in the literature. We further explore and compare their strengths, limitations, adaptability, interpretability, complexity, and usability in handling high-dimensional data, and more research directions and topics on sparse clustering are also analyzed.
format Article
id doaj-art-8e1bb16142984e8bbd1c3cdc5bfdcf31
institution OA Journals
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-8e1bb16142984e8bbd1c3cdc5bfdcf312025-08-20T02:03:13ZengIEEEIEEE Access2169-35362025-01-0113949829501810.1109/ACCESS.2025.357406611015960A Systematic Survey of Sparse ClusteringJosephine Bernadette M. Benjamin0https://orcid.org/0000-0002-2995-7207Miin-Shen Yang1https://orcid.org/0000-0002-4907-3548Department of Mathematics and Physics, University of Santo Tomas, Manila, PhilippinesDepartment of Applied Mathematics, Chung Yuan Christian University, Taoyuan, TaiwanHandling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and feature reduction algorithms have been proposed to process the data. In this paper, we review sparse clustering approaches that aim to cluster data sets while selecting and removing redundant and irrelevant features as well as noisy points and outliers. This paper surveys existing sparse clustering algorithms and explores their effectiveness in the analysis of high-dimensional data. The exploration and analysis of these sparse clustering approaches outperform the existing conventional clustering algorithms when faced with large and high-dimensional data. We also investigate the use of regularization terms in the sparse clustering algorithms. In this survey, we consider an extensive search on published papers as well as textbooks and discuss important results reported in the literature. We further explore and compare their strengths, limitations, adaptability, interpretability, complexity, and usability in handling high-dimensional data, and more research directions and topics on sparse clustering are also analyzed.https://ieeexplore.ieee.org/document/11015960/ClusteringK-meansfuzzy c-meanssparsityLassofeature selection
spellingShingle Josephine Bernadette M. Benjamin
Miin-Shen Yang
A Systematic Survey of Sparse Clustering
IEEE Access
Clustering
K-means
fuzzy c-means
sparsity
Lasso
feature selection
title A Systematic Survey of Sparse Clustering
title_full A Systematic Survey of Sparse Clustering
title_fullStr A Systematic Survey of Sparse Clustering
title_full_unstemmed A Systematic Survey of Sparse Clustering
title_short A Systematic Survey of Sparse Clustering
title_sort systematic survey of sparse clustering
topic Clustering
K-means
fuzzy c-means
sparsity
Lasso
feature selection
url https://ieeexplore.ieee.org/document/11015960/
work_keys_str_mv AT josephinebernadettembenjamin asystematicsurveyofsparseclustering
AT miinshenyang asystematicsurveyofsparseclustering
AT josephinebernadettembenjamin systematicsurveyofsparseclustering
AT miinshenyang systematicsurveyofsparseclustering