Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages

Abstract Background Viruses that infect prokaryotes (phages) constitute the most abundant group of biological agents, playing pivotal roles in microbial systems. They are known to impact microbial community dynamics, microbial ecology, and evolution. Efforts to document the diversity, host range, in...

Full description

Saved in:
Bibliographic Details
Main Authors: Etan Dieppa-Colón, Cody Martin, James C. Kosmopoulos, Karthik Anantharaman
Format: Article
Language:English
Published: BMC 2025-01-01
Series:Environmental Microbiome
Online Access:https://doi.org/10.1186/s40793-024-00659-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832594428070461440
author Etan Dieppa-Colón
Cody Martin
James C. Kosmopoulos
Karthik Anantharaman
author_facet Etan Dieppa-Colón
Cody Martin
James C. Kosmopoulos
Karthik Anantharaman
author_sort Etan Dieppa-Colón
collection DOAJ
description Abstract Background Viruses that infect prokaryotes (phages) constitute the most abundant group of biological agents, playing pivotal roles in microbial systems. They are known to impact microbial community dynamics, microbial ecology, and evolution. Efforts to document the diversity, host range, infection dynamics, and effects of bacteriophage infection on host cell metabolism are extremely underexplored. Phages are classified as virulent or temperate based on their life cycles. Temperate phages adopt the lysogenic mode of infection, where the genome integrates into the host cell genome forming a prophage. Prophages enable viral genome replication without host cell lysis, and often contribute novel and beneficial traits to the host genome. Current phage research predominantly focuses on lytic phages, leaving a significant gap in knowledge regarding prophages, including their biology, diversity, and ecological roles. Results Here we develop and describe Prophage-DB, a database of prophages, their proteins, and associated metadata that will serve as a resource for viral genomics and microbial ecology. To create the database, we identified and characterized prophages from genomes in three of the largest publicly available databases. We applied several state-of-the-art tools in our pipeline to annotate these viruses, cluster them, taxonomically classify them, and detect their respective auxiliary metabolic genes. In total, we identify and characterize over 350,000 prophages and 35,000 auxiliary metabolic genes. Our prophage database is highly representative based on statistical results and contains prophages from a diverse set of archaeal and bacterial hosts which show a wide environmental distribution. Conclusion Given that prophages are particularly overlooked and merit increased attention due to their vital implications for microbiomes and their hosts, we created Prophage-DB to advance our understanding of prophages in microbiomes through a comprehensive characterization of prophages in publicly available genomes. We propose that Prophage-DB will serve as a valuable resource for advancing phage research, offering insights into viral taxonomy, host relationships, auxiliary metabolic genes, and environmental distribution.
format Article
id doaj-art-9f4bda8ae1734075a9b3002ff9817834
institution Kabale University
issn 2524-6372
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series Environmental Microbiome
spelling doaj-art-9f4bda8ae1734075a9b3002ff98178342025-01-19T12:39:19ZengBMCEnvironmental Microbiome2524-63722025-01-0120111410.1186/s40793-024-00659-1Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophagesEtan Dieppa-Colón0Cody Martin1James C. Kosmopoulos2Karthik Anantharaman3Department of Bacteriology, University of Wisconsin–MadisonDepartment of Bacteriology, University of Wisconsin–MadisonDepartment of Bacteriology, University of Wisconsin–MadisonDepartment of Bacteriology, University of Wisconsin–MadisonAbstract Background Viruses that infect prokaryotes (phages) constitute the most abundant group of biological agents, playing pivotal roles in microbial systems. They are known to impact microbial community dynamics, microbial ecology, and evolution. Efforts to document the diversity, host range, infection dynamics, and effects of bacteriophage infection on host cell metabolism are extremely underexplored. Phages are classified as virulent or temperate based on their life cycles. Temperate phages adopt the lysogenic mode of infection, where the genome integrates into the host cell genome forming a prophage. Prophages enable viral genome replication without host cell lysis, and often contribute novel and beneficial traits to the host genome. Current phage research predominantly focuses on lytic phages, leaving a significant gap in knowledge regarding prophages, including their biology, diversity, and ecological roles. Results Here we develop and describe Prophage-DB, a database of prophages, their proteins, and associated metadata that will serve as a resource for viral genomics and microbial ecology. To create the database, we identified and characterized prophages from genomes in three of the largest publicly available databases. We applied several state-of-the-art tools in our pipeline to annotate these viruses, cluster them, taxonomically classify them, and detect their respective auxiliary metabolic genes. In total, we identify and characterize over 350,000 prophages and 35,000 auxiliary metabolic genes. Our prophage database is highly representative based on statistical results and contains prophages from a diverse set of archaeal and bacterial hosts which show a wide environmental distribution. Conclusion Given that prophages are particularly overlooked and merit increased attention due to their vital implications for microbiomes and their hosts, we created Prophage-DB to advance our understanding of prophages in microbiomes through a comprehensive characterization of prophages in publicly available genomes. We propose that Prophage-DB will serve as a valuable resource for advancing phage research, offering insights into viral taxonomy, host relationships, auxiliary metabolic genes, and environmental distribution.https://doi.org/10.1186/s40793-024-00659-1
spellingShingle Etan Dieppa-Colón
Cody Martin
James C. Kosmopoulos
Karthik Anantharaman
Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages
Environmental Microbiome
title Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages
title_full Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages
title_fullStr Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages
title_full_unstemmed Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages
title_short Prophage-DB: a comprehensive database to explore diversity, distribution, and ecology of prophages
title_sort prophage db a comprehensive database to explore diversity distribution and ecology of prophages
url https://doi.org/10.1186/s40793-024-00659-1
work_keys_str_mv AT etandieppacolon prophagedbacomprehensivedatabasetoexplorediversitydistributionandecologyofprophages
AT codymartin prophagedbacomprehensivedatabasetoexplorediversitydistributionandecologyofprophages
AT jamesckosmopoulos prophagedbacomprehensivedatabasetoexplorediversitydistributionandecologyofprophages
AT karthikanantharaman prophagedbacomprehensivedatabasetoexplorediversitydistributionandecologyofprophages