Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.

The German Barcoding initiatives BFB and GBOL have generated a reference library of more than 16,000 metazoan species, which is now ready for applications concerning next generation molecular biodiversity assessments. To streamline the barcoding process, we have developed a meta-barcoding pipeline:...

Full description

Saved in:
Bibliographic Details
Main Authors: Jérôme Morinière, Bruno Cancian de Araujo, Athena Wai Lam, Axel Hausmann, Michael Balke, Stefan Schmidt, Lars Hendrich, Dieter Doczkal, Berthold Fartmann, Samuel Arvidsson, Gerhard Haszprunar
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0155497&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850189000598355968
author Jérôme Morinière
Bruno Cancian de Araujo
Athena Wai Lam
Axel Hausmann
Michael Balke
Stefan Schmidt
Lars Hendrich
Dieter Doczkal
Berthold Fartmann
Samuel Arvidsson
Gerhard Haszprunar
author_facet Jérôme Morinière
Bruno Cancian de Araujo
Athena Wai Lam
Axel Hausmann
Michael Balke
Stefan Schmidt
Lars Hendrich
Dieter Doczkal
Berthold Fartmann
Samuel Arvidsson
Gerhard Haszprunar
author_sort Jérôme Morinière
collection DOAJ
description The German Barcoding initiatives BFB and GBOL have generated a reference library of more than 16,000 metazoan species, which is now ready for applications concerning next generation molecular biodiversity assessments. To streamline the barcoding process, we have developed a meta-barcoding pipeline: We pre-sorted a single malaise trap sample (obtained during one week in August 2014, southern Germany) into 12 arthropod orders and extracted DNA from pooled individuals of each order separately, in order to facilitate DNA extraction and avoid time consuming single specimen selection. Aliquots of each ordinal-level DNA extract were combined to roughly simulate a DNA extract from a non-sorted malaise sample. Each DNA extract was amplified using four primer sets targeting the CO1-5' fragment. The resulting PCR products (150-400bp) were sequenced separately on an Illumina Mi-SEQ platform, resulting in 1.5 million sequences and 5,500 clusters (coverage ≥10; CD-HIT-EST, 98%). Using a total of 120,000 DNA barcodes of identified, Central European Hymenoptera, Coleoptera, Diptera, and Lepidoptera downloaded from BOLD we established a reference sequence database for a local CUSTOM BLAST. This allowed us to identify 529 Barcode Index Numbers (BINs) from our sequence clusters derived from pooled Malaise trap samples. We introduce a scoring matrix based on the sequence match percentages of each amplicon in order to gain plausibility for each detected BIN, leading to 390 high score BINs in the sorted samples; whereas 268 of these high score BINs (69%) could be identified in the combined sample. The results indicate that a time consuming presorting process will yield approximately 30% more high score BINs compared to the non-sorted sample in our case. These promising results indicate that a fast, efficient and reliable analysis of next generation data from malaise trap samples can be achieved using this pipeline.
format Article
id doaj-art-c8e1e8bef400460e9515460e89d9d352
institution OA Journals
issn 1932-6203
language English
publishDate 2016-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-c8e1e8bef400460e9515460e89d9d3522025-08-20T02:15:42ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01115e015549710.1371/journal.pone.0155497Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.Jérôme MorinièreBruno Cancian de AraujoAthena Wai LamAxel HausmannMichael BalkeStefan SchmidtLars HendrichDieter DoczkalBerthold FartmannSamuel ArvidssonGerhard HaszprunarThe German Barcoding initiatives BFB and GBOL have generated a reference library of more than 16,000 metazoan species, which is now ready for applications concerning next generation molecular biodiversity assessments. To streamline the barcoding process, we have developed a meta-barcoding pipeline: We pre-sorted a single malaise trap sample (obtained during one week in August 2014, southern Germany) into 12 arthropod orders and extracted DNA from pooled individuals of each order separately, in order to facilitate DNA extraction and avoid time consuming single specimen selection. Aliquots of each ordinal-level DNA extract were combined to roughly simulate a DNA extract from a non-sorted malaise sample. Each DNA extract was amplified using four primer sets targeting the CO1-5' fragment. The resulting PCR products (150-400bp) were sequenced separately on an Illumina Mi-SEQ platform, resulting in 1.5 million sequences and 5,500 clusters (coverage ≥10; CD-HIT-EST, 98%). Using a total of 120,000 DNA barcodes of identified, Central European Hymenoptera, Coleoptera, Diptera, and Lepidoptera downloaded from BOLD we established a reference sequence database for a local CUSTOM BLAST. This allowed us to identify 529 Barcode Index Numbers (BINs) from our sequence clusters derived from pooled Malaise trap samples. We introduce a scoring matrix based on the sequence match percentages of each amplicon in order to gain plausibility for each detected BIN, leading to 390 high score BINs in the sorted samples; whereas 268 of these high score BINs (69%) could be identified in the combined sample. The results indicate that a time consuming presorting process will yield approximately 30% more high score BINs compared to the non-sorted sample in our case. These promising results indicate that a fast, efficient and reliable analysis of next generation data from malaise trap samples can be achieved using this pipeline.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0155497&type=printable
spellingShingle Jérôme Morinière
Bruno Cancian de Araujo
Athena Wai Lam
Axel Hausmann
Michael Balke
Stefan Schmidt
Lars Hendrich
Dieter Doczkal
Berthold Fartmann
Samuel Arvidsson
Gerhard Haszprunar
Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.
PLoS ONE
title Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.
title_full Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.
title_fullStr Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.
title_full_unstemmed Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.
title_short Species Identification in Malaise Trap Samples by DNA Barcoding Based on NGS Technologies and a Scoring Matrix.
title_sort species identification in malaise trap samples by dna barcoding based on ngs technologies and a scoring matrix
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0155497&type=printable
work_keys_str_mv AT jeromemoriniere speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT brunocanciandearaujo speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT athenawailam speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT axelhausmann speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT michaelbalke speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT stefanschmidt speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT larshendrich speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT dieterdoczkal speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT bertholdfartmann speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT samuelarvidsson speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix
AT gerhardhaszprunar speciesidentificationinmalaisetrapsamplesbydnabarcodingbasedonngstechnologiesandascoringmatrix