Robust Microarray Meta-Analysis Identifies Differentially Expressed Genes for Clinical Prediction

Bibliographic Details
Main Authors: John H. Phan, Andrew N. Young, May D. Wang
Format: Article
Language: English
Published: Wiley 2012-01-01
Series: The Scientific World Journal
Online Access: http://dx.doi.org/10.1100/2012/989637
Description
Summary: Combining multiple microarray datasets increases sample size and leads to improved reproducibility in identification of informative genes and subsequent clinical prediction. Although microarrays have increased the rate of genomic data collection, sample size is still a major issue when identifying informative genetic biomarkers. As a result, feature selection methods often suffer from false discoveries, which lead to poorly performing predictive models. We develop a simple meta-analysis-based feature selection method that captures the knowledge in each individual dataset and combines the results using a simple rank average. In a comprehensive study that measures robustness in terms of clinical application (i.e., breast, renal, and pancreatic cancer), microarray platform heterogeneity, and classifier (i.e., logistic regression, diagonal LDA, and linear SVM), we compare the rank average meta-analysis method to five other meta-analysis methods. Results indicate that rank average meta-analysis consistently performs well relative to these five methods.
ISSN: 1537-744X
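
Illustration: The summary describes ranking genes within each dataset and combining the results with a simple rank average. The following is a minimal sketch of that idea, not the authors' implementation. It assumes that every dataset measures the same genes in the same column order, that class labels are binary (0/1), and that genes are scored within each dataset by an absolute Welch t-statistic; the record does not state which per-dataset score the article uses. The function names `welch_t_scores`, `ranks_descending`, and `rank_average_select` are hypothetical.

```python
import numpy as np


def welch_t_scores(X, y):
    """Absolute Welch t-statistic per gene (assumed scoring choice).

    X: array of shape (samples, genes); y: binary labels (0/1).
    Each class needs at least two samples.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    a, b = X[y == 0], X[y == 1]
    var_term = a.var(axis=0, ddof=1) / len(a) + b.var(axis=0, ddof=1) / len(b)
    # Small constant guards against division by zero for constant genes.
    return np.abs(a.mean(axis=0) - b.mean(axis=0)) / np.sqrt(var_term + 1e-12)


def ranks_descending(scores):
    """Rank genes so that rank 1 is the highest score (ties broken by index)."""
    scores = np.asarray(scores, dtype=float)
    order = np.argsort(-scores, kind="stable")
    ranks = np.empty(len(scores), dtype=float)
    ranks[order] = np.arange(1, len(scores) + 1)
    return ranks


def rank_average_select(datasets, k):
    """Average per-dataset gene ranks and return the k best genes.

    datasets: list of (X, y) pairs sharing the same gene columns.
    Returns (indices of the k genes with lowest mean rank, mean ranks).
    """
    per_dataset_ranks = [ranks_descending(welch_t_scores(X, y)) for X, y in datasets]
    mean_rank = np.mean(per_dataset_ranks, axis=0)
    selected = np.argsort(mean_rank, kind="stable")[:k]
    return selected, mean_rank


if __name__ == "__main__":
    # Synthetic data only: genes 0 and 1 are shifted in class 1 in every dataset.
    rng = np.random.default_rng(0)
    datasets = []
    for _ in range(3):
        y = np.array([0] * 20 + [1] * 20)
        X = rng.normal(size=(40, 50))
        X[y == 1, :2] += 5.0
        datasets.append((X, y))
    selected, mean_rank = rank_average_select(datasets, k=2)
    print("selected genes:", sorted(selected.tolist()))
```

Averaging ranks rather than raw scores keeps datasets from different platforms on a common scale, which is one plausible reason a rank-based combination suits heterogeneous microarray data. The selected genes would then be passed to a classifier such as the ones named in the summary.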