Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies

Summary: Repeated sequences spread throughout the genome play important roles in shaping the structure of chromosomes and facilitating the generation of new genomic variation through structural rearrangements. Several mechanisms of structural variation formation use shared nucleotide similarity betw...

Full description

Saved in:
Bibliographic Details
Main Authors: Luis Fernandez-Luna, Carlos Aguilar-Perez, Christopher M. Grochowski, Michele G. Mehaffey, Claudia M.B. Carvalho, Claudia Gonzaga-Jauregui
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:HGG Advances
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666247724001362
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590895492366336
author Luis Fernandez-Luna
Carlos Aguilar-Perez
Christopher M. Grochowski
Michele G. Mehaffey
Claudia M.B. Carvalho
Claudia Gonzaga-Jauregui
author_facet Luis Fernandez-Luna
Carlos Aguilar-Perez
Christopher M. Grochowski
Michele G. Mehaffey
Claudia M.B. Carvalho
Claudia Gonzaga-Jauregui
author_sort Luis Fernandez-Luna
collection DOAJ
description Summary: Repeated sequences spread throughout the genome play important roles in shaping the structure of chromosomes and facilitating the generation of new genomic variation through structural rearrangements. Several mechanisms of structural variation formation use shared nucleotide similarity between repeated sequences as substrate for ectopic recombination. We performed genome-wide analyses of direct and inverted intrachromosomal repeated sequence pairs with 200 bp or more and 80% or greater sequence identity in three human genome assemblies, GRCh37, GRCh38, and T2T-CHM13. Overall, the composition and distribution of direct and inverted repeated sequences identified was similar among the three assemblies involving 13%–15% of the haploid genome, with an increased, albeit not significant, number of repeated sequences in T2T-CHM13. Interestingly, the majority of repeated sequences are below 1 kb in length with a median of 84.2% identity, highlighting the potential relevance of smaller, less identical repeats, such as Alu-Alu pairs, for ectopic recombination. We cross-referenced the identified repeated sequences with protein-coding genes to identify those at risk for being involved in genomic rearrangements. Olfactory receptors and immune response genes were enriched among those impacted.
format Article
id doaj-art-8c5b66e3f40244ddb578dead08d77052
institution Kabale University
issn 2666-2477
language English
publishDate 2025-04-01
publisher Elsevier
record_format Article
series HGG Advances
spelling doaj-art-8c5b66e3f40244ddb578dead08d770522025-01-23T05:27:47ZengElsevierHGG Advances2666-24772025-04-0162100396Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assembliesLuis Fernandez-Luna0Carlos Aguilar-Perez1Christopher M. Grochowski2Michele G. Mehaffey3Claudia M.B. Carvalho4Claudia Gonzaga-Jauregui5International Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, MéxicoInternational Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, MéxicoDepartment of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USAPacific Northwest Research Institute, Seattle, WA, USAPacific Northwest Research Institute, Seattle, WA, USA; Corresponding authorInternational Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México; Pacific Northwest Research Institute, Seattle, WA, USA; Corresponding authorSummary: Repeated sequences spread throughout the genome play important roles in shaping the structure of chromosomes and facilitating the generation of new genomic variation through structural rearrangements. Several mechanisms of structural variation formation use shared nucleotide similarity between repeated sequences as substrate for ectopic recombination. We performed genome-wide analyses of direct and inverted intrachromosomal repeated sequence pairs with 200 bp or more and 80% or greater sequence identity in three human genome assemblies, GRCh37, GRCh38, and T2T-CHM13. Overall, the composition and distribution of direct and inverted repeated sequences identified was similar among the three assemblies involving 13%–15% of the haploid genome, with an increased, albeit not significant, number of repeated sequences in T2T-CHM13. Interestingly, the majority of repeated sequences are below 1 kb in length with a median of 84.2% identity, highlighting the potential relevance of smaller, less identical repeats, such as Alu-Alu pairs, for ectopic recombination. We cross-referenced the identified repeated sequences with protein-coding genes to identify those at risk for being involved in genomic rearrangements. Olfactory receptors and immune response genes were enriched among those impacted.http://www.sciencedirect.com/science/article/pii/S2666247724001362repeatsgenomic rearrangementsstructural variationectopic recombinationmicrohomologysegmental duplications
spellingShingle Luis Fernandez-Luna
Carlos Aguilar-Perez
Christopher M. Grochowski
Michele G. Mehaffey
Claudia M.B. Carvalho
Claudia Gonzaga-Jauregui
Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
HGG Advances
repeats
genomic rearrangements
structural variation
ectopic recombination
microhomology
segmental duplications
title Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
title_full Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
title_fullStr Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
title_full_unstemmed Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
title_short Genome-wide maps of highly-similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
title_sort genome wide maps of highly similar intrachromosomal repeats that can mediate ectopic recombination in three human genome assemblies
topic repeats
genomic rearrangements
structural variation
ectopic recombination
microhomology
segmental duplications
url http://www.sciencedirect.com/science/article/pii/S2666247724001362
work_keys_str_mv AT luisfernandezluna genomewidemapsofhighlysimilarintrachromosomalrepeatsthatcanmediateectopicrecombinationinthreehumangenomeassemblies
AT carlosaguilarperez genomewidemapsofhighlysimilarintrachromosomalrepeatsthatcanmediateectopicrecombinationinthreehumangenomeassemblies
AT christophermgrochowski genomewidemapsofhighlysimilarintrachromosomalrepeatsthatcanmediateectopicrecombinationinthreehumangenomeassemblies
AT michelegmehaffey genomewidemapsofhighlysimilarintrachromosomalrepeatsthatcanmediateectopicrecombinationinthreehumangenomeassemblies
AT claudiambcarvalho genomewidemapsofhighlysimilarintrachromosomalrepeatsthatcanmediateectopicrecombinationinthreehumangenomeassemblies
AT claudiagonzagajauregui genomewidemapsofhighlysimilarintrachromosomalrepeatsthatcanmediateectopicrecombinationinthreehumangenomeassemblies