Population-Level Cell Trajectory Inference Based on Gaussian Distributions

In the past decade, inferring developmental trajectories from single-cell data has become a significant challenge in bioinformatics. RNA velocity, with its incorporation of directional dynamics, has significantly advanced the study of single-cell trajectories. However, as single-cell RNA sequencing...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiang Chen, Yibing Ma, Yongle Shi, Yuhan Fu, Mengdi Nan, Qing Ren, Jie Gao
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Biomolecules
Subjects:
Online Access:https://www.mdpi.com/2218-273X/14/11/1396
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850145631798034432
author Xiang Chen
Yibing Ma
Yongle Shi
Yuhan Fu
Mengdi Nan
Qing Ren
Jie Gao
author_facet Xiang Chen
Yibing Ma
Yongle Shi
Yuhan Fu
Mengdi Nan
Qing Ren
Jie Gao
author_sort Xiang Chen
collection DOAJ
description In the past decade, inferring developmental trajectories from single-cell data has become a significant challenge in bioinformatics. RNA velocity, with its incorporation of directional dynamics, has significantly advanced the study of single-cell trajectories. However, as single-cell RNA sequencing technology evolves, it generates complex, high-dimensional data with high noise levels. Existing trajectory inference methods, which overlook cell distribution characteristics, may perform inadequately under such conditions. To address this, we introduce CPvGTI, a Gaussian distribution-based trajectory inference method. CPvGTI utilizes a Gaussian mixture model, optimized by the Expectation–Maximization algorithm, to construct new cell populations in the original data space. By integrating RNA velocity, CPvGTI employs Gaussian Process Regression to analyze the differentiation trajectories of these cell populations. To evaluate the performance of CPvGTI, we assess CPvGTI’s performance against several state-of-the-art methods using four structurally diverse simulated datasets and four real datasets. The simulation studies indicate that CPvGTI excels in pseudo-time prediction and structural reconstruction compared to existing methods. Furthermore, the discovery of new branch trajectories in human forebrain and mouse hematopoiesis datasets confirms CPvGTI’s superior performance.
format Article
id doaj-art-2494018b86c548bd8c7629520bf5ff6e
institution OA Journals
issn 2218-273X
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Biomolecules
spelling doaj-art-2494018b86c548bd8c7629520bf5ff6e2025-08-20T02:28:02ZengMDPI AGBiomolecules2218-273X2024-11-011411139610.3390/biom14111396Population-Level Cell Trajectory Inference Based on Gaussian DistributionsXiang Chen0Yibing Ma1Yongle Shi2Yuhan Fu3Mengdi Nan4Qing Ren5Jie Gao6School of Science, Jiangnan University, Wuxi 214122, ChinaSchool of Science, Jiangnan University, Wuxi 214122, ChinaSchool of Science, Jiangnan University, Wuxi 214122, ChinaSchool of Science, Jiangnan University, Wuxi 214122, ChinaSchool of Science, Jiangnan University, Wuxi 214122, ChinaSchool of Science, Jiangnan University, Wuxi 214122, ChinaSchool of Science, Jiangnan University, Wuxi 214122, ChinaIn the past decade, inferring developmental trajectories from single-cell data has become a significant challenge in bioinformatics. RNA velocity, with its incorporation of directional dynamics, has significantly advanced the study of single-cell trajectories. However, as single-cell RNA sequencing technology evolves, it generates complex, high-dimensional data with high noise levels. Existing trajectory inference methods, which overlook cell distribution characteristics, may perform inadequately under such conditions. To address this, we introduce CPvGTI, a Gaussian distribution-based trajectory inference method. CPvGTI utilizes a Gaussian mixture model, optimized by the Expectation–Maximization algorithm, to construct new cell populations in the original data space. By integrating RNA velocity, CPvGTI employs Gaussian Process Regression to analyze the differentiation trajectories of these cell populations. To evaluate the performance of CPvGTI, we assess CPvGTI’s performance against several state-of-the-art methods using four structurally diverse simulated datasets and four real datasets. The simulation studies indicate that CPvGTI excels in pseudo-time prediction and structural reconstruction compared to existing methods. Furthermore, the discovery of new branch trajectories in human forebrain and mouse hematopoiesis datasets confirms CPvGTI’s superior performance.https://www.mdpi.com/2218-273X/14/11/1396trajectory inferenceRNA velocityGaussian distributionpseudo-timesingle-cell data
spellingShingle Xiang Chen
Yibing Ma
Yongle Shi
Yuhan Fu
Mengdi Nan
Qing Ren
Jie Gao
Population-Level Cell Trajectory Inference Based on Gaussian Distributions
Biomolecules
trajectory inference
RNA velocity
Gaussian distribution
pseudo-time
single-cell data
title Population-Level Cell Trajectory Inference Based on Gaussian Distributions
title_full Population-Level Cell Trajectory Inference Based on Gaussian Distributions
title_fullStr Population-Level Cell Trajectory Inference Based on Gaussian Distributions
title_full_unstemmed Population-Level Cell Trajectory Inference Based on Gaussian Distributions
title_short Population-Level Cell Trajectory Inference Based on Gaussian Distributions
title_sort population level cell trajectory inference based on gaussian distributions
topic trajectory inference
RNA velocity
Gaussian distribution
pseudo-time
single-cell data
url https://www.mdpi.com/2218-273X/14/11/1396
work_keys_str_mv AT xiangchen populationlevelcelltrajectoryinferencebasedongaussiandistributions
AT yibingma populationlevelcelltrajectoryinferencebasedongaussiandistributions
AT yongleshi populationlevelcelltrajectoryinferencebasedongaussiandistributions
AT yuhanfu populationlevelcelltrajectoryinferencebasedongaussiandistributions
AT mengdinan populationlevelcelltrajectoryinferencebasedongaussiandistributions
AT qingren populationlevelcelltrajectoryinferencebasedongaussiandistributions
AT jiegao populationlevelcelltrajectoryinferencebasedongaussiandistributions