Removal of sequencing adapter contamination improves microbial genome databases

Bibliographic Details
Main Authors: Andrew H. Moeller, Brian A. Dillard, Samantha L. Goldman, Madalena V. F. Real, Daniel D. Sprockett
Format: Article
Language: English
Published: BMC 2024-11-01
Series: BMC Genomics
Online Access: https://doi.org/10.1186/s12864-024-10956-1
Description
Summary: Advances in assembling microbial genomes have led to growth of reference genome databases, which have been transformative for applied and basic microbiome research. Here we show that published microbial genome databases from humans, mice, cows, pigs, fish, honeybees, and marine environments contain significant sequencing-adapter contamination that systematically reduces assembly accuracy and contiguousness. By removing the adapter-contaminated ends of contiguous sequences and reassembling MGnify reference genomes, we improve the quality of assemblies in these databases.
ISSN: 1471-2164