Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.

Short tandem repeats (STRs) are abundant in the human genome with approximately 300,000 embedded in gene introns, exons, and untranslated regions. High penetrance STR variants cause human diseases such as Myotonic dystrophy, Baratela-Scott syndrome, and various ataxias. The possibility that STRs con...

Full description

Saved in:
Bibliographic Details
Main Authors: Nick Kinney, Dikshya Pathak, Emma Evans, Paola Arias
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0326355
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849329388938592256
author Nick Kinney
Dikshya Pathak
Emma Evans
Paola Arias
author_facet Nick Kinney
Dikshya Pathak
Emma Evans
Paola Arias
author_sort Nick Kinney
collection DOAJ
description Short tandem repeats (STRs) are abundant in the human genome with approximately 300,000 embedded in gene introns, exons, and untranslated regions. High penetrance STR variants cause human diseases such as Myotonic dystrophy, Baratela-Scott syndrome, and various ataxias. The possibility that STRs contribute to polygenic disease is supported by recent high-powered datasets that link STRs to more subtle effects on gene expression. Indeed, STR variants can induce Z-DNA and H-DNA folding; alter nucleosome positioning; and change the spacing of DNA binding sites. On the other hand, little is known about how STR variants affect RNA secondary structure and accessibility. These factors could affect rates of splicing, nuclear export, and translation. We hypothesize that effects on RNA structure can be predicted using computational tools and associated with gene expression using DNA and RNA sequencing data. We test this hypothesis using data from the 1000 Genomes Project and ViennaRNA. We identify 17,255 transcribed STRs that affect RNA folding (fSTRs); 356 are possibly associated with gene expression. We characterize fSTRs by repeat motif, length, and gene level annotation. Transcribed fSTR variants tend to affect RNA multiloops and external loops. Effects on RNA accessibility depends on the repeat motif: a surprising result that is checked against simulation. These results shed light on how transcribed STRs affect RNA structure and pave the way for experimental validation.
format Article
id doaj-art-6674512e138f49e481a84ebe37c5a689
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-6674512e138f49e481a84ebe37c5a6892025-08-20T03:47:16ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01206e032635510.1371/journal.pone.0326355Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.Nick KinneyDikshya PathakEmma EvansPaola AriasShort tandem repeats (STRs) are abundant in the human genome with approximately 300,000 embedded in gene introns, exons, and untranslated regions. High penetrance STR variants cause human diseases such as Myotonic dystrophy, Baratela-Scott syndrome, and various ataxias. The possibility that STRs contribute to polygenic disease is supported by recent high-powered datasets that link STRs to more subtle effects on gene expression. Indeed, STR variants can induce Z-DNA and H-DNA folding; alter nucleosome positioning; and change the spacing of DNA binding sites. On the other hand, little is known about how STR variants affect RNA secondary structure and accessibility. These factors could affect rates of splicing, nuclear export, and translation. We hypothesize that effects on RNA structure can be predicted using computational tools and associated with gene expression using DNA and RNA sequencing data. We test this hypothesis using data from the 1000 Genomes Project and ViennaRNA. We identify 17,255 transcribed STRs that affect RNA folding (fSTRs); 356 are possibly associated with gene expression. We characterize fSTRs by repeat motif, length, and gene level annotation. Transcribed fSTR variants tend to affect RNA multiloops and external loops. Effects on RNA accessibility depends on the repeat motif: a surprising result that is checked against simulation. These results shed light on how transcribed STRs affect RNA structure and pave the way for experimental validation.https://doi.org/10.1371/journal.pone.0326355
spellingShingle Nick Kinney
Dikshya Pathak
Emma Evans
Paola Arias
Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.
PLoS ONE
title Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.
title_full Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.
title_fullStr Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.
title_full_unstemmed Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.
title_short Short tandem repeat variants are possibly associated with RNA secondary structure and gene expression.
title_sort short tandem repeat variants are possibly associated with rna secondary structure and gene expression
url https://doi.org/10.1371/journal.pone.0326355
work_keys_str_mv AT nickkinney shorttandemrepeatvariantsarepossiblyassociatedwithrnasecondarystructureandgeneexpression
AT dikshyapathak shorttandemrepeatvariantsarepossiblyassociatedwithrnasecondarystructureandgeneexpression
AT emmaevans shorttandemrepeatvariantsarepossiblyassociatedwithrnasecondarystructureandgeneexpression
AT paolaarias shorttandemrepeatvariantsarepossiblyassociatedwithrnasecondarystructureandgeneexpression