Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations

Abstract Background The oceanic microbiome is dominated by members of the SAR11 clade. Despite their abundance, challenges in recovering the full genetic diversity of natural populations have hindered our understanding of the eco-evolutionary mechanisms driving intra-species variation. In this study...

Full description

Saved in:
Bibliographic Details
Main Authors: Carmen Molina-Pardines, Jose M. Haro-Moreno, Francisco Rodriguez-Valera, Mario López-Pérez
Format: Article
Language:English
Published: BMC 2025-02-01
Series:Microbiome
Subjects:
Online Access:https://doi.org/10.1186/s40168-025-02037-6
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823861837107560448
author Carmen Molina-Pardines
Jose M. Haro-Moreno
Francisco Rodriguez-Valera
Mario López-Pérez
author_facet Carmen Molina-Pardines
Jose M. Haro-Moreno
Francisco Rodriguez-Valera
Mario López-Pérez
author_sort Carmen Molina-Pardines
collection DOAJ
description Abstract Background The oceanic microbiome is dominated by members of the SAR11 clade. Despite their abundance, challenges in recovering the full genetic diversity of natural populations have hindered our understanding of the eco-evolutionary mechanisms driving intra-species variation. In this study, we employed a combination of single-amplified genomes and long-read metagenomics to recover the genomic diversity of natural populations within the SAR11 genomospecies Ia.3/VII, the dominant group in the Mediterranean Sea. Results The reconstruction of the first complete genome within this genomospecies revealed that the core genome represents a significant proportion of the genome (~ 81%), with highly divergent areas that allow for greater strain-dependent metabolic flexibility. The flexible genome was concentrated in small regions, typically containing a single gene, and was located in equivalent regions within the genomospecies. Each variable region was associated with a specific set of genes that, despite exhibiting some divergence, maintained equivalent biological functionality within the population. The environmental pangenome is large and enriched in genes involved in nutrient transport, as well as cell wall synthesis and modification, showing an extremely high degree of functional redundancy in the flexible genome (i.e. paralogisms). Conclusions This genomic architecture promotes polyclonality, preserving genetic variation within the population. This, in turn, mitigates intraspecific competition and enables the population to thrive under variable environmental conditions and selective pressures. Furthermore, this study demonstrates the power of long-read metagenomics in capturing the full genetic diversity of environmental SAR11 populations, overcoming the limitations of second-generation sequencing technologies in genome assembly. Video Abstract
format Article
id doaj-art-b3158aeac9b6430d93de13aa9d53f974
institution Kabale University
issn 2049-2618
language English
publishDate 2025-02-01
publisher BMC
record_format Article
series Microbiome
spelling doaj-art-b3158aeac9b6430d93de13aa9d53f9742025-02-09T12:46:50ZengBMCMicrobiome2049-26182025-02-0113111910.1186/s40168-025-02037-6Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populationsCarmen Molina-Pardines0Jose M. Haro-Moreno1Francisco Rodriguez-Valera2Mario López-Pérez3Evolutionary Genomics Group, División de Microbiología, Universidad Miguel HernándezEvolutionary Genomics Group, División de Microbiología, Universidad Miguel HernándezEvolutionary Genomics Group, División de Microbiología, Universidad Miguel HernándezEvolutionary Genomics Group, División de Microbiología, Universidad Miguel HernándezAbstract Background The oceanic microbiome is dominated by members of the SAR11 clade. Despite their abundance, challenges in recovering the full genetic diversity of natural populations have hindered our understanding of the eco-evolutionary mechanisms driving intra-species variation. In this study, we employed a combination of single-amplified genomes and long-read metagenomics to recover the genomic diversity of natural populations within the SAR11 genomospecies Ia.3/VII, the dominant group in the Mediterranean Sea. Results The reconstruction of the first complete genome within this genomospecies revealed that the core genome represents a significant proportion of the genome (~ 81%), with highly divergent areas that allow for greater strain-dependent metabolic flexibility. The flexible genome was concentrated in small regions, typically containing a single gene, and was located in equivalent regions within the genomospecies. Each variable region was associated with a specific set of genes that, despite exhibiting some divergence, maintained equivalent biological functionality within the population. The environmental pangenome is large and enriched in genes involved in nutrient transport, as well as cell wall synthesis and modification, showing an extremely high degree of functional redundancy in the flexible genome (i.e. paralogisms). Conclusions This genomic architecture promotes polyclonality, preserving genetic variation within the population. This, in turn, mitigates intraspecific competition and enables the population to thrive under variable environmental conditions and selective pressures. Furthermore, this study demonstrates the power of long-read metagenomics in capturing the full genetic diversity of environmental SAR11 populations, overcoming the limitations of second-generation sequencing technologies in genome assembly. Video Abstracthttps://doi.org/10.1186/s40168-025-02037-6SAR11PangenomeMediterranean SeaLong-read metagenomicsFlexible genomeParalogs
spellingShingle Carmen Molina-Pardines
Jose M. Haro-Moreno
Francisco Rodriguez-Valera
Mario López-Pérez
Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations
Microbiome
SAR11
Pangenome
Mediterranean Sea
Long-read metagenomics
Flexible genome
Paralogs
title Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations
title_full Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations
title_fullStr Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations
title_full_unstemmed Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations
title_short Extensive paralogism in the environmental pangenome: a key factor in the ecological success of natural SAR11 populations
title_sort extensive paralogism in the environmental pangenome a key factor in the ecological success of natural sar11 populations
topic SAR11
Pangenome
Mediterranean Sea
Long-read metagenomics
Flexible genome
Paralogs
url https://doi.org/10.1186/s40168-025-02037-6
work_keys_str_mv AT carmenmolinapardines extensiveparalogismintheenvironmentalpangenomeakeyfactorintheecologicalsuccessofnaturalsar11populations
AT josemharomoreno extensiveparalogismintheenvironmentalpangenomeakeyfactorintheecologicalsuccessofnaturalsar11populations
AT franciscorodriguezvalera extensiveparalogismintheenvironmentalpangenomeakeyfactorintheecologicalsuccessofnaturalsar11populations
AT mariolopezperez extensiveparalogismintheenvironmentalpangenomeakeyfactorintheecologicalsuccessofnaturalsar11populations