Discovering biological progression underlying microarray samples.

In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes th...

Full description

Saved in:
Bibliographic Details
Main Authors: Peng Qiu, Andrew J Gentles, Sylvia K Plevritis
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-04-01
Series:PLoS Computational Biology
Online Access:https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1001123&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850192240768450560
author Peng Qiu
Andrew J Gentles
Sylvia K Plevritis
author_facet Peng Qiu
Andrew J Gentles
Sylvia K Plevritis
author_sort Peng Qiu
collection DOAJ
description In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the candidate genes that regulate that progression.
format Article
id doaj-art-032c5863cac7400eb86d6845440c0ef3
institution OA Journals
issn 1553-734X
1553-7358
language English
publishDate 2011-04-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj-art-032c5863cac7400eb86d6845440c0ef32025-08-20T02:14:37ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582011-04-0174e100112310.1371/journal.pcbi.1001123Discovering biological progression underlying microarray samples.Peng QiuAndrew J GentlesSylvia K PlevritisIn biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (i.e., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information of the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells with their cell origin preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the candidate genes that regulate that progression.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1001123&type=printable
spellingShingle Peng Qiu
Andrew J Gentles
Sylvia K Plevritis
Discovering biological progression underlying microarray samples.
PLoS Computational Biology
title Discovering biological progression underlying microarray samples.
title_full Discovering biological progression underlying microarray samples.
title_fullStr Discovering biological progression underlying microarray samples.
title_full_unstemmed Discovering biological progression underlying microarray samples.
title_short Discovering biological progression underlying microarray samples.
title_sort discovering biological progression underlying microarray samples
url https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1001123&type=printable
work_keys_str_mv AT pengqiu discoveringbiologicalprogressionunderlyingmicroarraysamples
AT andrewjgentles discoveringbiologicalprogressionunderlyingmicroarraysamples
AT sylviakplevritis discoveringbiologicalprogressionunderlyingmicroarraysamples