Fast Processing RNA-Seq on Multicore Processor

RNA Sequencing (RNA-Seq) is the sequencing and analysis of transcriptomes. The main purpose of RNA-Seq analysis is to find out the presence and quantity of RNA in an experimental sample under a specific condition. Essentially, RNA raw sequence data was massive. It can be as big as hundreds of Gigaby...

Full description

Saved in:
Bibliographic Details
Main Authors: Lee Jia Bin, Nor Asilah Wati Abdul Hamid, Zurita Ismail, Mohamed Faris Laham
Format: Article
Language:English
Published: University of Baghdad, College of Science for Women 2021-12-01
Series:مجلة بغداد للعلوم
Subjects:
Online Access:https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6642
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850057333219000320
author Lee Jia Bin
Nor Asilah Wati Abdul Hamid
Zurita Ismail
Mohamed Faris Laham
author_facet Lee Jia Bin
Nor Asilah Wati Abdul Hamid
Zurita Ismail
Mohamed Faris Laham
author_sort Lee Jia Bin
collection DOAJ
description RNA Sequencing (RNA-Seq) is the sequencing and analysis of transcriptomes. The main purpose of RNA-Seq analysis is to find out the presence and quantity of RNA in an experimental sample under a specific condition. Essentially, RNA raw sequence data was massive. It can be as big as hundreds of Gigabytes (GB). This massive data always makes the processing time become longer and take several days. A multicore processor can speed up a program by separating the tasks and running the tasks’ errands concurrently. Hence, a multicore processor will be a suitable choice to overcome this problem. Therefore, this study aims to use an Intel multicore processor to improve the RNA-Seq speed and analyze RNA-Seq analysis's performance with a multiprocessor. This study only processed RNA-Seq from quality control analysis until sorted the BAM (Binary Alignment/Map) file content. Three different sizes of RNA paired end has been used to make the comparison. The final experiment results showed that the implementation of RNA-Seq on an Intel multicore processor could achieve a higher speedup. The total processing time of RNA-Seq with the largest size of RNA raw sequence data (66.3 Megabytes) decreased from 317.638 seconds to 211.916 seconds. The reduced processing time was 105 seconds and near to 2 minutes. Furthermore, for the smallest RNA raw sequence data size, the total processing time decreased from 212.380 seconds to 163.961 seconds which reduced 48 seconds.
format Article
id doaj-art-dc7ce8302bc14f24885182a64800b485
institution DOAJ
issn 2078-8665
2411-7986
language English
publishDate 2021-12-01
publisher University of Baghdad, College of Science for Women
record_format Article
series مجلة بغداد للعلوم
spelling doaj-art-dc7ce8302bc14f24885182a64800b4852025-08-20T02:51:27ZengUniversity of Baghdad, College of Science for Womenمجلة بغداد للعلوم2078-86652411-79862021-12-01184(Suppl.)10.21123/bsj.2021.18.4(Suppl.).1413Fast Processing RNA-Seq on Multicore ProcessorLee Jia Bin0Nor Asilah Wati Abdul Hamid1Zurita Ismail2Mohamed Faris Laham3Department of Communication Technology and Network, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, MalaysiaDepartment of Communication Technology and Network, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia & Institute for Mathematical Research, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, MalaysiaInstitute for Mathematical Research, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, MalaysiaInstitute for Mathematical Research, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, MalaysiaRNA Sequencing (RNA-Seq) is the sequencing and analysis of transcriptomes. The main purpose of RNA-Seq analysis is to find out the presence and quantity of RNA in an experimental sample under a specific condition. Essentially, RNA raw sequence data was massive. It can be as big as hundreds of Gigabytes (GB). This massive data always makes the processing time become longer and take several days. A multicore processor can speed up a program by separating the tasks and running the tasks’ errands concurrently. Hence, a multicore processor will be a suitable choice to overcome this problem. Therefore, this study aims to use an Intel multicore processor to improve the RNA-Seq speed and analyze RNA-Seq analysis's performance with a multiprocessor. This study only processed RNA-Seq from quality control analysis until sorted the BAM (Binary Alignment/Map) file content. Three different sizes of RNA paired end has been used to make the comparison. The final experiment results showed that the implementation of RNA-Seq on an Intel multicore processor could achieve a higher speedup. The total processing time of RNA-Seq with the largest size of RNA raw sequence data (66.3 Megabytes) decreased from 317.638 seconds to 211.916 seconds. The reduced processing time was 105 seconds and near to 2 minutes. Furthermore, for the smallest RNA raw sequence data size, the total processing time decreased from 212.380 seconds to 163.961 seconds which reduced 48 seconds.https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6642Bioinformatics, High-performance computing, Multicore processors, RNA Sequencing
spellingShingle Lee Jia Bin
Nor Asilah Wati Abdul Hamid
Zurita Ismail
Mohamed Faris Laham
Fast Processing RNA-Seq on Multicore Processor
مجلة بغداد للعلوم
Bioinformatics, High-performance computing, Multicore processors, RNA Sequencing
title Fast Processing RNA-Seq on Multicore Processor
title_full Fast Processing RNA-Seq on Multicore Processor
title_fullStr Fast Processing RNA-Seq on Multicore Processor
title_full_unstemmed Fast Processing RNA-Seq on Multicore Processor
title_short Fast Processing RNA-Seq on Multicore Processor
title_sort fast processing rna seq on multicore processor
topic Bioinformatics, High-performance computing, Multicore processors, RNA Sequencing
url https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6642
work_keys_str_mv AT leejiabin fastprocessingrnaseqonmulticoreprocessor
AT norasilahwatiabdulhamid fastprocessingrnaseqonmulticoreprocessor
AT zuritaismail fastprocessingrnaseqonmulticoreprocessor
AT mohamedfarislaham fastprocessingrnaseqonmulticoreprocessor