When old metagenomic data meet newly sequenced genomes, a case study.

Dozens of computational methods are developed to identify species present in a metagenomic dataset. Many of these computational methods depend on available sequenced microbial species, which are still far from being representative. To see how newly sequenced genomes affect the analysis results, we r...

Full description

Saved in:
Bibliographic Details
Main Authors: Xin Li, Saleh A Naser, Annette Khaled, Haiyan Hu, Xiaoman Li
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:https://storage.googleapis.com/plos-corpus-prod/10.1371/journal.pone.0198773/1/pone.0198773.pdf?X-Goog-Algorithm=GOOG4-RSA-SHA256&X-Goog-Credential=wombat-sa%40plos-prod.iam.gserviceaccount.com%2F20210217%2Fauto%2Fstorage%2Fgoog4_request&X-Goog-Date=20210217T202131Z&X-Goog-Expires=3600&X-Goog-SignedHeaders=host&X-Goog-Signature=d7f869ae7713e9e96a6df5aebb7604873c15c013a6852241b0ff432467ce3691e820e7da902b6f5c0d9db5f279224ae82f321335799cae94c831a2b665c4facff7452cc97facfd8bc9d8a28f6f327be7cc9995a692c284c157035981d55028e1a121bd3abf98648076ff337ff7b92c83243bb16c8d6d8d9dee7ad0cf049835e89aed2a0f8f8acba5fc4496fbb1f062ed7fa1ea02d89ecdd6b7756f189d74b4039162694c16eb38ce9f720d4a37ba361cb93bb5342220ae49c399442bc3c08db0983486468612e18f26a808440d512fa46b1dfe4d64cb456dae30f09f03a3bdf12c22ecba2ce16bf6537198c775ecbfec5c6646d2844f0d4c678c73d96f85633f
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850230734287011840
author Xin Li
Saleh A Naser
Annette Khaled
Haiyan Hu
Xiaoman Li
author_facet Xin Li
Saleh A Naser
Annette Khaled
Haiyan Hu
Xiaoman Li
author_sort Xin Li
collection DOAJ
description Dozens of computational methods are developed to identify species present in a metagenomic dataset. Many of these computational methods depend on available sequenced microbial species, which are still far from being representative. To see how newly sequenced genomes affect the analysis results, we re-analyzed a shotgun metagenomic dataset composed of twelve colitis free metagenomic samples and ten colitis-related metagenomic samples. Unexpectedly, we identified at least two new phyla that may relate to colitis development in patients, together with the phylum identified previously. Compared with the previously identified phylum that differed between the two types of samples, the differences associated with the two new phyla are statistically more significant. Moreover, the abundance of the two new phyla correlates more with the severity of colitis. Surprisingly, even by repeating the analyses implemented in the previous study, we found that at least one main conclusion in the previous study is not supported. Our study indicates the importance of re-analysis of the generated metagenomic datasets and the necessity of considering multiple updated tools in metagenomic studies. It also sheds light on the limitations of the popular tools used currently and the importance to infer the presence of taxa without relying upon available sequenced genomes.
format Article
id doaj-art-ce7ffb93769f4974b3a1e0f8abb740ce
institution OA Journals
issn 1932-6203
language English
publishDate 2018-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-ce7ffb93769f4974b3a1e0f8abb740ce2025-08-20T02:03:46ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01136e019877310.1371/journal.pone.0198773When old metagenomic data meet newly sequenced genomes, a case study.Xin LiSaleh A NaserAnnette KhaledHaiyan HuXiaoman LiDozens of computational methods are developed to identify species present in a metagenomic dataset. Many of these computational methods depend on available sequenced microbial species, which are still far from being representative. To see how newly sequenced genomes affect the analysis results, we re-analyzed a shotgun metagenomic dataset composed of twelve colitis free metagenomic samples and ten colitis-related metagenomic samples. Unexpectedly, we identified at least two new phyla that may relate to colitis development in patients, together with the phylum identified previously. Compared with the previously identified phylum that differed between the two types of samples, the differences associated with the two new phyla are statistically more significant. Moreover, the abundance of the two new phyla correlates more with the severity of colitis. Surprisingly, even by repeating the analyses implemented in the previous study, we found that at least one main conclusion in the previous study is not supported. Our study indicates the importance of re-analysis of the generated metagenomic datasets and the necessity of considering multiple updated tools in metagenomic studies. It also sheds light on the limitations of the popular tools used currently and the importance to infer the presence of taxa without relying upon available sequenced genomes.https://storage.googleapis.com/plos-corpus-prod/10.1371/journal.pone.0198773/1/pone.0198773.pdf?X-Goog-Algorithm=GOOG4-RSA-SHA256&X-Goog-Credential=wombat-sa%40plos-prod.iam.gserviceaccount.com%2F20210217%2Fauto%2Fstorage%2Fgoog4_request&X-Goog-Date=20210217T202131Z&X-Goog-Expires=3600&X-Goog-SignedHeaders=host&X-Goog-Signature=d7f869ae7713e9e96a6df5aebb7604873c15c013a6852241b0ff432467ce3691e820e7da902b6f5c0d9db5f279224ae82f321335799cae94c831a2b665c4facff7452cc97facfd8bc9d8a28f6f327be7cc9995a692c284c157035981d55028e1a121bd3abf98648076ff337ff7b92c83243bb16c8d6d8d9dee7ad0cf049835e89aed2a0f8f8acba5fc4496fbb1f062ed7fa1ea02d89ecdd6b7756f189d74b4039162694c16eb38ce9f720d4a37ba361cb93bb5342220ae49c399442bc3c08db0983486468612e18f26a808440d512fa46b1dfe4d64cb456dae30f09f03a3bdf12c22ecba2ce16bf6537198c775ecbfec5c6646d2844f0d4c678c73d96f85633f
spellingShingle Xin Li
Saleh A Naser
Annette Khaled
Haiyan Hu
Xiaoman Li
When old metagenomic data meet newly sequenced genomes, a case study.
PLoS ONE
title When old metagenomic data meet newly sequenced genomes, a case study.
title_full When old metagenomic data meet newly sequenced genomes, a case study.
title_fullStr When old metagenomic data meet newly sequenced genomes, a case study.
title_full_unstemmed When old metagenomic data meet newly sequenced genomes, a case study.
title_short When old metagenomic data meet newly sequenced genomes, a case study.
title_sort when old metagenomic data meet newly sequenced genomes a case study
url https://storage.googleapis.com/plos-corpus-prod/10.1371/journal.pone.0198773/1/pone.0198773.pdf?X-Goog-Algorithm=GOOG4-RSA-SHA256&X-Goog-Credential=wombat-sa%40plos-prod.iam.gserviceaccount.com%2F20210217%2Fauto%2Fstorage%2Fgoog4_request&X-Goog-Date=20210217T202131Z&X-Goog-Expires=3600&X-Goog-SignedHeaders=host&X-Goog-Signature=d7f869ae7713e9e96a6df5aebb7604873c15c013a6852241b0ff432467ce3691e820e7da902b6f5c0d9db5f279224ae82f321335799cae94c831a2b665c4facff7452cc97facfd8bc9d8a28f6f327be7cc9995a692c284c157035981d55028e1a121bd3abf98648076ff337ff7b92c83243bb16c8d6d8d9dee7ad0cf049835e89aed2a0f8f8acba5fc4496fbb1f062ed7fa1ea02d89ecdd6b7756f189d74b4039162694c16eb38ce9f720d4a37ba361cb93bb5342220ae49c399442bc3c08db0983486468612e18f26a808440d512fa46b1dfe4d64cb456dae30f09f03a3bdf12c22ecba2ce16bf6537198c775ecbfec5c6646d2844f0d4c678c73d96f85633f
work_keys_str_mv AT xinli whenoldmetagenomicdatameetnewlysequencedgenomesacasestudy
AT salehanaser whenoldmetagenomicdatameetnewlysequencedgenomesacasestudy
AT annettekhaled whenoldmetagenomicdatameetnewlysequencedgenomesacasestudy
AT haiyanhu whenoldmetagenomicdatameetnewlysequencedgenomesacasestudy
AT xiaomanli whenoldmetagenomicdatameetnewlysequencedgenomesacasestudy