Bioinformatics services for analyzing massive genomic datasets

The explosive growth of next-generation sequencing data has resulted in ultra-large-scale datasets and ensuing computational problems. In Korea, the amount of genomic data has been increasing rapidly in the recent years. Leveraging these big data requires researchers to use large-scale computational...

Full description

Saved in:
Bibliographic Details
Main Authors: Gunhwan Ko, Pan-Gyu Kim, Youngbum Cho, Seongmun Jeong, Jae-Yoon Kim, Kyoung Hyoun Kim, Ho-Yeon Lee, Jiyeon Han, Namhee Yu, Seokjin Ham, Insoon Jang, Byunghee Kang, Sunguk Shin, Lian Kim, Seung-Won Lee, Dougu Nam, Jihyun F. Kim, Namshin Kim, Seon-Young Kim, Sanghyuk Lee, Tae-Young Roh, Byungwook Lee
Format: Article
Language:English
Published: BioMed Central 2020-03-01
Series:Genomics & Informatics
Subjects:
Online Access:http://genominfo.org/upload/pdf/gi-2020-18-1-e8.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832573722172588032
author Gunhwan Ko
Pan-Gyu Kim
Youngbum Cho
Seongmun Jeong
Jae-Yoon Kim
Kyoung Hyoun Kim
Ho-Yeon Lee
Jiyeon Han
Namhee Yu
Seokjin Ham
Insoon Jang
Byunghee Kang
Sunguk Shin
Lian Kim
Seung-Won Lee
Dougu Nam
Jihyun F. Kim
Namshin Kim
Seon-Young Kim
Sanghyuk Lee
Tae-Young Roh
Byungwook Lee
author_facet Gunhwan Ko
Pan-Gyu Kim
Youngbum Cho
Seongmun Jeong
Jae-Yoon Kim
Kyoung Hyoun Kim
Ho-Yeon Lee
Jiyeon Han
Namhee Yu
Seokjin Ham
Insoon Jang
Byunghee Kang
Sunguk Shin
Lian Kim
Seung-Won Lee
Dougu Nam
Jihyun F. Kim
Namshin Kim
Seon-Young Kim
Sanghyuk Lee
Tae-Young Roh
Byungwook Lee
author_sort Gunhwan Ko
collection DOAJ
description The explosive growth of next-generation sequencing data has resulted in ultra-large-scale datasets and ensuing computational problems. In Korea, the amount of genomic data has been increasing rapidly in the recent years. Leveraging these big data requires researchers to use large-scale computational resources and analysis pipelines. A promising solution for addressing this computational challenge is cloud computing, where CPUs, memory, storage, and programs are accessible in the form of virtual machines. Here, we present a cloud computing-based system, Bio-Express, that provides user-friendly, cost-effective analysis of massive genomic datasets. Bio-Express is loaded with predefined multi-omics data analysis pipelines, which are divided into genome, transcriptome, epigenome, and metagenome pipelines. Users can employ predefined pipelines or create a new pipeline for analyzing their own omics data. We also developed several web-based services for facilitating downstream analysis of genome data. Bio-Express web service is freely available at https://www.bioexpress.re.kr/.
format Article
id doaj-art-cedf9d54a4f84b7d80e78b149c2bcf1a
institution Kabale University
issn 2234-0742
language English
publishDate 2020-03-01
publisher BioMed Central
record_format Article
series Genomics & Informatics
spelling doaj-art-cedf9d54a4f84b7d80e78b149c2bcf1a2025-02-02T03:20:36ZengBioMed CentralGenomics & Informatics2234-07422020-03-0118110.5808/GI.2020.18.1.e8599Bioinformatics services for analyzing massive genomic datasetsGunhwan Ko0Pan-Gyu Kim1Youngbum Cho2Seongmun Jeong3Jae-Yoon Kim4Kyoung Hyoun Kim5Ho-Yeon Lee6Jiyeon Han7Namhee Yu8Seokjin Ham9Insoon Jang10Byunghee Kang11Sunguk Shin12Lian Kim13Seung-Won Lee14Dougu Nam15Jihyun F. Kim16Namshin Kim17Seon-Young Kim18Sanghyuk Lee19Tae-Young Roh20Byungwook Lee21 Korea Bioinformation Center (KOBIC), KRIBB, Daejeon 34141, Korea Korea Bioinformation Center (KOBIC), KRIBB, Daejeon 34141, Korea Genome Editing Research Center, KRIBB, Daejeon 34141, Korea Genome Editing Research Center, KRIBB, Daejeon 34141, Korea Genome Editing Research Center, KRIBB, Daejeon 34141, Korea Genome Editing Research Center, KRIBB, Daejeon 34141, Korea Genome Editing Research Center, KRIBB, Daejeon 34141, Korea Department of BioInformation Science, Ewha Womans University, Seoul 03760, Korea Department of BioInformation Science, Ewha Womans University, Seoul 03760, Korea Department of Life Sciences and Division of Integrative Biosciences & Biotechnology, Pohang University of Science & Technology (POSTECH), Pohang 37673, Korea Department of Life Sciences and Division of Integrative Biosciences & Biotechnology, Pohang University of Science & Technology (POSTECH), Pohang 37673, Korea Department of Life Sciences and Division of Integrative Biosciences & Biotechnology, Pohang University of Science & Technology (POSTECH), Pohang 37673, Korea Department of Systems, Biology Division of Life Sciences, and Institute for Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea Bioposh Inc., Daejeon 34016, Korea SeqGenesis, Daejeon 34016, Korea School of Life Sciences, Ulsan National Institute of Science and Technology, Ulsan 44919, Korea Department of Systems, Biology Division of Life Sciences, and Institute for Life Science and Biotechnology, Yonsei University, Seoul 03722, Korea Genome Editing Research Center, KRIBB, Daejeon 34141, Korea Genome Structure Research Center, KRIBB, Daejeon 34141, Korea Department of BioInformation Science, Ewha Womans University, Seoul 03760, Korea Department of Life Sciences and Division of Integrative Biosciences & Biotechnology, Pohang University of Science & Technology (POSTECH), Pohang 37673, Korea Korea Bioinformation Center (KOBIC), KRIBB, Daejeon 34141, KoreaThe explosive growth of next-generation sequencing data has resulted in ultra-large-scale datasets and ensuing computational problems. In Korea, the amount of genomic data has been increasing rapidly in the recent years. Leveraging these big data requires researchers to use large-scale computational resources and analysis pipelines. A promising solution for addressing this computational challenge is cloud computing, where CPUs, memory, storage, and programs are accessible in the form of virtual machines. Here, we present a cloud computing-based system, Bio-Express, that provides user-friendly, cost-effective analysis of massive genomic datasets. Bio-Express is loaded with predefined multi-omics data analysis pipelines, which are divided into genome, transcriptome, epigenome, and metagenome pipelines. Users can employ predefined pipelines or create a new pipeline for analyzing their own omics data. We also developed several web-based services for facilitating downstream analysis of genome data. Bio-Express web service is freely available at https://www.bioexpress.re.kr/.http://genominfo.org/upload/pdf/gi-2020-18-1-e8.pdfanalysis pipelinecloud computinggenomic dataweb serverworkflow system
spellingShingle Gunhwan Ko
Pan-Gyu Kim
Youngbum Cho
Seongmun Jeong
Jae-Yoon Kim
Kyoung Hyoun Kim
Ho-Yeon Lee
Jiyeon Han
Namhee Yu
Seokjin Ham
Insoon Jang
Byunghee Kang
Sunguk Shin
Lian Kim
Seung-Won Lee
Dougu Nam
Jihyun F. Kim
Namshin Kim
Seon-Young Kim
Sanghyuk Lee
Tae-Young Roh
Byungwook Lee
Bioinformatics services for analyzing massive genomic datasets
Genomics & Informatics
analysis pipeline
cloud computing
genomic data
web server
workflow system
title Bioinformatics services for analyzing massive genomic datasets
title_full Bioinformatics services for analyzing massive genomic datasets
title_fullStr Bioinformatics services for analyzing massive genomic datasets
title_full_unstemmed Bioinformatics services for analyzing massive genomic datasets
title_short Bioinformatics services for analyzing massive genomic datasets
title_sort bioinformatics services for analyzing massive genomic datasets
topic analysis pipeline
cloud computing
genomic data
web server
workflow system
url http://genominfo.org/upload/pdf/gi-2020-18-1-e8.pdf
work_keys_str_mv AT gunhwanko bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT pangyukim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT youngbumcho bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT seongmunjeong bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT jaeyoonkim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT kyounghyounkim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT hoyeonlee bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT jiyeonhan bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT namheeyu bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT seokjinham bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT insoonjang bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT byungheekang bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT sungukshin bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT liankim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT seungwonlee bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT dougunam bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT jihyunfkim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT namshinkim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT seonyoungkim bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT sanghyuklee bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT taeyoungroh bioinformaticsservicesforanalyzingmassivegenomicdatasets
AT byungwooklee bioinformaticsservicesforanalyzingmassivegenomicdatasets