Short tandem repeats delineate gene bodies across eukaryotes

Abstract Short tandem repeats (STRs) have emerged as important and hypermutable sites where genetic variation correlates with gene expression in plant and animal systems. Recently, it has been shown that a broad range of transcription factors (TFs) are affected by STRs near or in the DNA target bind...

Full description

Saved in:
Bibliographic Details
Main Authors: William B. Reinar, Anders K. Krabberød, Vilde O. Lalun, Melinka A. Butenko, Kjetill S. Jakobsen
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-024-55276-w
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823861800301494272
author William B. Reinar
Anders K. Krabberød
Vilde O. Lalun
Melinka A. Butenko
Kjetill S. Jakobsen
author_facet William B. Reinar
Anders K. Krabberød
Vilde O. Lalun
Melinka A. Butenko
Kjetill S. Jakobsen
author_sort William B. Reinar
collection DOAJ
description Abstract Short tandem repeats (STRs) have emerged as important and hypermutable sites where genetic variation correlates with gene expression in plant and animal systems. Recently, it has been shown that a broad range of transcription factors (TFs) are affected by STRs near or in the DNA target binding site. Despite this, the distribution of STR motif repetitiveness in eukaryote genomes is still largely unknown. Here, we identify monomer and dimer STR motif repetitiveness in 5.1 billion 10-bp windows upstream of translation starts and downstream of translation stops in 25 million genes spanning 1270 species across the eukaryotic Tree of Life. We report that all surveyed genomes have gene-proximal shifts in motif repetitiveness. Within genomes, variation in gene-proximal repetitiveness landscapes correlated to the function of genes; genes with housekeeping functions were depleted in upstream and downstream repetitiveness. Furthermore, the repetitiveness landscapes correlated with TF binding sites, indicating that gene function has evolved in conjunction with cis-regulatory STRs and TFs that recognize repetitive sites. These results suggest that the hypermutability inherent to STRs is canalized along the genome sequence and contributes to regulatory and eco-evolutionary dynamics in all eukaryotes.
format Article
id doaj-art-4bc4730d28da466894bf0f271703a44d
institution Kabale University
issn 2041-1723
language English
publishDate 2024-12-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj-art-4bc4730d28da466894bf0f271703a44d2025-02-09T12:43:47ZengNature PortfolioNature Communications2041-17232024-12-0115111310.1038/s41467-024-55276-wShort tandem repeats delineate gene bodies across eukaryotesWilliam B. Reinar0Anders K. Krabberød1Vilde O. Lalun2Melinka A. Butenko3Kjetill S. Jakobsen4Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloCentre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloCentre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloCentre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloCentre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of OsloAbstract Short tandem repeats (STRs) have emerged as important and hypermutable sites where genetic variation correlates with gene expression in plant and animal systems. Recently, it has been shown that a broad range of transcription factors (TFs) are affected by STRs near or in the DNA target binding site. Despite this, the distribution of STR motif repetitiveness in eukaryote genomes is still largely unknown. Here, we identify monomer and dimer STR motif repetitiveness in 5.1 billion 10-bp windows upstream of translation starts and downstream of translation stops in 25 million genes spanning 1270 species across the eukaryotic Tree of Life. We report that all surveyed genomes have gene-proximal shifts in motif repetitiveness. Within genomes, variation in gene-proximal repetitiveness landscapes correlated to the function of genes; genes with housekeeping functions were depleted in upstream and downstream repetitiveness. Furthermore, the repetitiveness landscapes correlated with TF binding sites, indicating that gene function has evolved in conjunction with cis-regulatory STRs and TFs that recognize repetitive sites. These results suggest that the hypermutability inherent to STRs is canalized along the genome sequence and contributes to regulatory and eco-evolutionary dynamics in all eukaryotes.https://doi.org/10.1038/s41467-024-55276-w
spellingShingle William B. Reinar
Anders K. Krabberød
Vilde O. Lalun
Melinka A. Butenko
Kjetill S. Jakobsen
Short tandem repeats delineate gene bodies across eukaryotes
Nature Communications
title Short tandem repeats delineate gene bodies across eukaryotes
title_full Short tandem repeats delineate gene bodies across eukaryotes
title_fullStr Short tandem repeats delineate gene bodies across eukaryotes
title_full_unstemmed Short tandem repeats delineate gene bodies across eukaryotes
title_short Short tandem repeats delineate gene bodies across eukaryotes
title_sort short tandem repeats delineate gene bodies across eukaryotes
url https://doi.org/10.1038/s41467-024-55276-w
work_keys_str_mv AT williambreinar shorttandemrepeatsdelineategenebodiesacrosseukaryotes
AT anderskkrabberød shorttandemrepeatsdelineategenebodiesacrosseukaryotes
AT vildeolalun shorttandemrepeatsdelineategenebodiesacrosseukaryotes
AT melinkaabutenko shorttandemrepeatsdelineategenebodiesacrosseukaryotes
AT kjetillsjakobsen shorttandemrepeatsdelineategenebodiesacrosseukaryotes