A novel strategy for microarray quality control using Bayesian networks

Bioinformatics. 2003 Nov 1;19(16):2031-8. doi: 10.1093/bioinformatics/btg275.

Abstract

Motivation: High-throughput microarray technologies enable measurements of the expression levels of thousands of genes in parallel. However, microarray printing, hybridization and washing may create substantial variability in the quality of the data. As erroneous measurements may have a drastic impact on the results by disturbing the normalization schemes and by introducing expression patterns that lead to incorrect conclusions, it is crucial to discard low quality observations in the early phases of a microarray experiment. A typical microarray experiment consists of tens of thousands of spots on a microarray, making manual extraction of poor quality spots impossible. Thus, there is a need for a reliable and general microarray spot quality control strategy.

Results: We suggest a novel strategy for spot quality control by using Bayesian networks, which contain many appealing properties in the spot quality control context. We illustrate how a non-linear least squares based Gaussian fitting procedure can be used in order to extract features for a spot on a microarray. The features we used in this study are: spot intensity, size of the spot, roundness of the spot, alignment error, background intensity, background noise, and bleeding. We conclude that Bayesian networks are a reliable and useful model for microarray spot quality assessment.

Supplementary information: http://sigwww.cs.tut.fi/TICSP/SpotQuality/.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • Base Sequence
  • Bayes Theorem
  • Computer Simulation
  • Gene Expression Profiling / methods*
  • Models, Genetic*
  • Models, Statistical*
  • Molecular Sequence Data
  • Normal Distribution
  • Oligonucleotide Array Sequence Analysis / methods*
  • Quality Control
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Sequence Analysis, DNA / methods*