De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.

Caragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no previous genomic information or EST-SSR marker has been reported in Caragana Fabr. genus. In this study,...

Full description

Saved in:
Bibliographic Details
Main Authors: Yan Long, Yanyan Wang, Shanshan Wu, Jiao Wang, Xinjie Tian, Xinwu Pei
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0115805&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850023110080724992
author Yan Long
Yanyan Wang
Shanshan Wu
Jiao Wang
Xinjie Tian
Xinwu Pei
author_facet Yan Long
Yanyan Wang
Shanshan Wu
Jiao Wang
Xinjie Tian
Xinwu Pei
author_sort Yan Long
collection DOAJ
description Caragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no previous genomic information or EST-SSR marker has been reported in Caragana Fabr. genus. In this study, more than two billion bases of high-quality sequence of C. korshinskii were generated by using illumina sequencing technology and demonstrated the de novo assembly and annotation of genes without prior genome information. These reads were assembled into 86,265 unigenes (mean length = 709 bp). The similarity search indicated that 33,955 and 21,978 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 26,232 a unigenes were separately assigned to Gene Ontology (GO) database. When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 5,598 unigenes were assigned to 5 main categories including 32 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (2,862, 43.7%), suggesting the active metabolic processes in the desert tree. In addition, a total of 19,150 EST-SSRs were identified from 15,484 unigenes, and the characterizations of EST-SSRs were further compared with other four species in Fabraceae. 126 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among the 9 germplasms in Caranaga Fabr. genus, PCR success rate were 93.7% and the phylogenic tree was constructed based on the genotypic data. This research generated a substantial fraction of transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding.
format Article
id doaj-art-25438254d7b14d9aa3360eec9422e34f
institution DOAJ
issn 1932-6203
language English
publishDate 2015-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-25438254d7b14d9aa3360eec9422e34f2025-08-20T03:01:28ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01101e011580510.1371/journal.pone.0115805De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.Yan LongYanyan WangShanshan WuJiao WangXinjie TianXinwu PeiCaragana korshinskii Kom. is widely distributed in various habitats, including gravel desert, clay desert, fixed and semi-fixed sand, and saline land in the Asian and African deserts. To date, no previous genomic information or EST-SSR marker has been reported in Caragana Fabr. genus. In this study, more than two billion bases of high-quality sequence of C. korshinskii were generated by using illumina sequencing technology and demonstrated the de novo assembly and annotation of genes without prior genome information. These reads were assembled into 86,265 unigenes (mean length = 709 bp). The similarity search indicated that 33,955 and 21,978 unigenes showed significant similarities to known proteins from NCBI non-redundant and Swissprot protein databases, respectively. Among these annotated unigenes, 26,232 a unigenes were separately assigned to Gene Ontology (GO) database. When 22,756 unigenes searched against the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database, 5,598 unigenes were assigned to 5 main categories including 32 KEGG pathways. Among the main KEGG categories, metabolism was the biggest category (2,862, 43.7%), suggesting the active metabolic processes in the desert tree. In addition, a total of 19,150 EST-SSRs were identified from 15,484 unigenes, and the characterizations of EST-SSRs were further compared with other four species in Fabraceae. 126 potential marker sites were randomly selected to validate the assembly quality and develop EST-SSR markers. Among the 9 germplasms in Caranaga Fabr. genus, PCR success rate were 93.7% and the phylogenic tree was constructed based on the genotypic data. This research generated a substantial fraction of transcriptome sequences, which were very useful resources for gene annotation and discovery, molecular markers development, genome assembly and annotation. The EST-SSR markers identified and developed in this study will facilitate marker-assisted selection breeding.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0115805&type=printable
spellingShingle Yan Long
Yanyan Wang
Shanshan Wu
Jiao Wang
Xinjie Tian
Xinwu Pei
De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.
PLoS ONE
title De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.
title_full De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.
title_fullStr De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.
title_full_unstemmed De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.
title_short De novo assembly of transcriptome sequencing in Caragana korshinskii Kom. and characterization of EST-SSR markers.
title_sort de novo assembly of transcriptome sequencing in caragana korshinskii kom and characterization of est ssr markers
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0115805&type=printable
work_keys_str_mv AT yanlong denovoassemblyoftranscriptomesequencingincaraganakorshinskiikomandcharacterizationofestssrmarkers
AT yanyanwang denovoassemblyoftranscriptomesequencingincaraganakorshinskiikomandcharacterizationofestssrmarkers
AT shanshanwu denovoassemblyoftranscriptomesequencingincaraganakorshinskiikomandcharacterizationofestssrmarkers
AT jiaowang denovoassemblyoftranscriptomesequencingincaraganakorshinskiikomandcharacterizationofestssrmarkers
AT xinjietian denovoassemblyoftranscriptomesequencingincaraganakorshinskiikomandcharacterizationofestssrmarkers
AT xinwupei denovoassemblyoftranscriptomesequencingincaraganakorshinskiikomandcharacterizationofestssrmarkers