Correlation and distinction between stochastic gene transcription models with and without polymerase dynamics

Polymerase dynamics (PD) is an important model for explaining transcriptional regulation in gene perturbation data. In this study, we conducted a detailed analysis of the dynamic behavior of stochastic gene transcription models with PD. We first derived an exact time-dependent formula of mRNA distri...

Full description

Saved in:
Bibliographic Details
Main Authors: Chunjuan Zhu, Liang Chen, Zhishan Qiu, Jiaxin Chen, Feng Jiao, Jianshe Yu
Format: Article
Language:English
Published: American Physical Society 2025-04-01
Series:Physical Review Research
Online Access:http://doi.org/10.1103/PhysRevResearch.7.023050
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Polymerase dynamics (PD) is an important model for explaining transcriptional regulation in gene perturbation data. In this study, we conducted a detailed analysis of the dynamic behavior of stochastic gene transcription models with PD. We first derived an exact time-dependent formula of mRNA distribution for the classical telegraph model with PD, then revealed a different regulation mechanism whereby PD simultaneously suppresses the Fano factor and enhances the bimodal distribution. For deeper insights into PD regulation, we established optimal effective models without PD to approximate the steady-state mRNA distribution in models with PD. Optimized effective parameters reliably captured input gene initiation and mRNA production rates and reflected the parameter variations of complex systems with PD under biological perturbations. The effective models also revealed that PD introduced quantitatively distinct kurtosis values and dynamics of mRNA distribution. By fitting transcriptome-wide mRNA distribution data from mouse fibroblast and embryonic stem cells, we found that over 1000 data sets may be better captured by integrating PD into the telegraph model. By fitting synthetic mRNA distribution data, we showed that the combinations of cell sample size N and number of time points, n, required for the reliable selection between telegraph models with and without PD are N=10^{3} and n≥8, N=10^{4} and n≥2, or N=10^{5} and steady-state data, whereas the parameter combinations required for the reliable estimation of polymerase recruitment and pause release rates are N=10^{4} and n≥8 or N=10^{5} and n≥4. Our proposed method can also be used to determine the regulatory roles of other biological compounds.
ISSN:2643-1564