Visualization and statistical comparisons of microbial communities using R packages on Phylochip data

Pac Symp Biocomput. 2011:142-53. doi: 10.1142/9789814335058_0016.

Abstract

This article explains the statistical and computational methodology used to analyze species abundances collected using the LNBL Phylochip in a study of Irritable Bowel Syndrome (IBS) in rats. Some tools already available for the analysis of ordinary microarray data are useful in this type of statistical analysis. For instance in correcting for multiple testing we use Family Wise Error rate control and step-down tests (available in the multtest package). Once the most significant species are chosen we use the hypergeometric tests familiar for testing GO categories to test specific phyla and families. We provide examples of normalization, multivariate projections, batch effect detection and integration of phylogenetic covariation, as well as tree equalization and robustification methods.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology
  • Data Interpretation, Statistical
  • Humans
  • Irritable Bowel Syndrome / microbiology
  • Metagenome* / genetics
  • Metagenomics / statistics & numerical data*
  • Multivariate Analysis
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Phylogeny
  • RNA, Ribosomal, 16S / genetics
  • Rats
  • Software

Substances

  • RNA, Ribosomal, 16S