Improved analysis of bacterial CGH data beyond the log-ratio paradigm

BMC Bioinformatics. 2009 Mar 19:10:91. doi: 10.1186/1471-2105-10-91.

Abstract

Background: Existing methods for analyzing bacterial CGH data from two-color arrays are based on log-ratios only, a paradigm inherited from expression studies. We propose an alternative approach, where microarray signals are used in a different way and sequence identity is predicted using a supervised learning approach.

Results: A data set containing 32 hybridizations of sequenced versus sequenced genomes have been used to test and compare methods. A ROC-analysis has been performed to illustrate the ability to rank probes with respect to Present/Absent calls. Classification into Present and Absent is compared with that of a gaussian mixture model.

Conclusion: The results indicate our proposed method is an improvement of existing methods with respect to ranking and classification of probes, especially for multi-genome arrays.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Comparative Genomic Hybridization*
  • Computational Biology / methods*
  • Gene Expression Profiling / methods
  • Genome, Bacterial*
  • Oligonucleotide Array Sequence Analysis / methods
  • ROC Curve