A Systematic Survey of Sparse Clustering

Handling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and...

Full description

Saved in:

Bibliographic Details
Main Authors:	Josephine Bernadette M. Benjamin, Miin-Shen Yang
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Clustering K-means fuzzy c-means sparsity Lasso feature selection
Online Access:	https://ieeexplore.ieee.org/document/11015960/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850232358706348032
author	Josephine Bernadette M. Benjamin Miin-Shen Yang
author_facet	Josephine Bernadette M. Benjamin Miin-Shen Yang
author_sort	Josephine Bernadette M. Benjamin
collection	DOAJ
description	Handling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and feature reduction algorithms have been proposed to process the data. In this paper, we review sparse clustering approaches that aim to cluster data sets while selecting and removing redundant and irrelevant features as well as noisy points and outliers. This paper surveys existing sparse clustering algorithms and explores their effectiveness in the analysis of high-dimensional data. The exploration and analysis of these sparse clustering approaches outperform the existing conventional clustering algorithms when faced with large and high-dimensional data. We also investigate the use of regularization terms in the sparse clustering algorithms. In this survey, we consider an extensive search on published papers as well as textbooks and discuss important results reported in the literature. We further explore and compare their strengths, limitations, adaptability, interpretability, complexity, and usability in handling high-dimensional data, and more research directions and topics on sparse clustering are also analyzed.
format	Article
id	doaj-art-8e1bb16142984e8bbd1c3cdc5bfdcf31
institution	OA Journals
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-8e1bb16142984e8bbd1c3cdc5bfdcf312025-08-20T02:03:13ZengIEEEIEEE Access2169-35362025-01-0113949829501810.1109/ACCESS.2025.357406611015960A Systematic Survey of Sparse ClusteringJosephine Bernadette M. Benjamin0https://orcid.org/0000-0002-2995-7207Miin-Shen Yang1https://orcid.org/0000-0002-4907-3548Department of Mathematics and Physics, University of Santo Tomas, Manila, PhilippinesDepartment of Applied Mathematics, Chung Yuan Christian University, Taoyuan, TaiwanHandling a vast amount of high-dimensional data has always been challenging. The advancement of computer technology has led to an exponential growth of accumulated information where storing and processing are to be carefully handled since not all information gathered is useful. Feature selection and feature reduction algorithms have been proposed to process the data. In this paper, we review sparse clustering approaches that aim to cluster data sets while selecting and removing redundant and irrelevant features as well as noisy points and outliers. This paper surveys existing sparse clustering algorithms and explores their effectiveness in the analysis of high-dimensional data. The exploration and analysis of these sparse clustering approaches outperform the existing conventional clustering algorithms when faced with large and high-dimensional data. We also investigate the use of regularization terms in the sparse clustering algorithms. In this survey, we consider an extensive search on published papers as well as textbooks and discuss important results reported in the literature. We further explore and compare their strengths, limitations, adaptability, interpretability, complexity, and usability in handling high-dimensional data, and more research directions and topics on sparse clustering are also analyzed.https://ieeexplore.ieee.org/document/11015960/ClusteringK-meansfuzzy c-meanssparsityLassofeature selection
spellingShingle	Josephine Bernadette M. Benjamin Miin-Shen Yang A Systematic Survey of Sparse Clustering IEEE Access Clustering K-means fuzzy c-means sparsity Lasso feature selection
title	A Systematic Survey of Sparse Clustering
title_full	A Systematic Survey of Sparse Clustering
title_fullStr	A Systematic Survey of Sparse Clustering
title_full_unstemmed	A Systematic Survey of Sparse Clustering
title_short	A Systematic Survey of Sparse Clustering
title_sort	systematic survey of sparse clustering
topic	Clustering K-means fuzzy c-means sparsity Lasso feature selection
url	https://ieeexplore.ieee.org/document/11015960/
work_keys_str_mv	AT josephinebernadettembenjamin asystematicsurveyofsparseclustering AT miinshenyang asystematicsurveyofsparseclustering AT josephinebernadettembenjamin systematicsurveyofsparseclustering AT miinshenyang systematicsurveyofsparseclustering

A Systematic Survey of Sparse Clustering

Similar Items