A model-based analysis to infer the functional content of a gene list

Stat Appl Genet Mol Biol. 2012 Jan 6;11(2):10.2202/1544-6115.1716 /j/sagmb.2012.11.issue-2/1544-6115.1716/1544-6115.1716.xml. doi: 10.2202/1544-6115.1716.

Abstract

An important challenge in statistical genomics concerns integrating experimental data with exogenous information about gene function. A number of statistical methods are available to address this challenge, but most do not accommodate complexities in the functional record. To infer activity of a functional category (e.g., a gene ontology term), most methods use gene-level data on that category, but do not use other functional properties of the same genes. Not doing so creates undue errors in inference. Recent developments in model-based category analysis aim to overcome this difficulty, but in attempting to do so they are faced with serious computational problems. This paper investigates statistical properties and the structure of posterior computation in one such model for the analysis of functional category data. We examine the graphical structures underlying posterior computation in the original parameterization and in a new parameterization aimed at leveraging elements of the model. We characterize identifiability of the underlying activation states, describe a new prior distribution, and introduce approximations that aim to support numerical methods for posterior inference.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Gene Expression Profiling
  • Gene Regulatory Networks
  • Genomics / methods*
  • Models, Genetic*
  • Signal Transduction
  • Software