Feature selection in single-cell RNA sequencing data: a comprehensive evaluation

Bibliographic Details
Main Authors: Petros Paplomatas, Konstantinos Lazaros, Georgios N. Dimitrakopoulos, Aristidis Vrahatis
Format: Article
Language: English
Published: Academia.edu Journals 2024-09-01
Series: Academia Biology
Online Access: https://www.academia.edu/123921464/Feature_selection_in_single_cell_RNA_sequencing_data_a_comprehensive_evaluation
collection DOAJ
description Single-cell RNA sequencing (scRNA-seq) has revolutionized biological and medical research, providing unique insights into the intricate cell-type compositions within various tissues. Unlike bulk RNA sequencing, scRNA-seq allows for examining gene expression at the individual cell level, revealing cellular heterogeneity and identifying rare cell types. However, the high dimensionality and inherent noise in scRNA-seq data pose significant analytical challenges. This study focuses on dimensionality reduction and cell-type identification in scRNA-seq data analysis. We developed the GenesRanking package, which offers 20 techniques for dimensionality reduction, including filter-based and embedding machine learning–based methods. By integrating feature selection methods from both statistics and machine learning, we provide a robust framework for improving data interpretation. Our comprehensive evaluation across five diverse scRNA-seq datasets demonstrates that although some methods show consistent performance, the technique should be chosen according to specific datasets for obtaining optimal results. Our findings underscore the enduring necessity for further refinement and continuous innovation in the field of scRNA-seq analysis, aiming to enhance the accuracy of cell-type identification and improve overall data interpretation.
id doaj-art-a9b9d8b5215e412185ea8356a62c5b27
institution Kabale University
issn 2837-4010
doi 10.20935/AcadBiol7324
affiliation Bioinformatics and Human Electrophysiology Laboratory, Department of Informatics, Ionian University, 49100 Corfu, Greece (all four authors)