Multi-platform data integration in microarray analysis

IEEE Trans Inf Technol Biomed. 2011 Nov;15(6):806-12. doi: 10.1109/TITB.2011.2158232.

Abstract

An increasing number of studies have profiled gene expressions in tumor specimens using distinct microarray platforms and analysis techniques. One challenging task is to develop robust statistical models in order to integrate multi-platform findings. We compare some methodologies on the field with respect to estrogen receptor (ER) status, and focus on a unified-among-platforms scale implemented by Shen et al. in 2004, which is based on a Bayesian mixture model. Under this scale, we study the ER intensity similarities between four breast cancer datasets derived from various platforms. We evaluate our results with an independent dataset in terms of ER sample classification, given the derived gene ER signatures of the integrated data. We found that integrated multi-platform gene signatures and fold-change variability similarities between different platform measurements can assist the statistical analysis of independent microarray datasets in terms of ER classification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Bayes Theorem
  • Breast Neoplasms / genetics
  • Computer Simulation
  • Data Mining / methods*
  • Databases, Genetic*
  • Female
  • Gene Expression Profiling
  • Gene Expression Regulation, Neoplastic
  • Humans
  • Microarray Analysis / methods*
  • Models, Molecular*
  • Models, Statistical*
  • Receptors, Estrogen / analysis*
  • Reproducibility of Results
  • Systems Integration*

Substances

  • Receptors, Estrogen