MRHMMs: multivariate regression hidden Markov models and the variantS

Bioinformatics. 2014 Jun 15;30(12):1755-6. doi: 10.1093/bioinformatics/btu070. Epub 2014 Feb 19.

Abstract

Summary: Hidden Markov models (HMMs) are flexible and widely used in scientific studies. Particularly in genomics and genetics, there are multiple distinct regimes in the genome within each of which the relationships among multivariate features are distinct. Examples include differential gene regulation depending on gene functions and experimental conditions, and varying combinatorial patterns of multiple transcription factors. We developed a software package called MRHMMs (Multivariate Regression Hidden Markov Models and the variantS) that accommodates a variety of HMMs that can be flexibly applied to many biological studies and beyond. MRHMMs supplements existing HMM software packages in two aspects. First, MRHMMs provides a diverse set of emission probability structures, including mixture of multivariate normal distributions and (logistic) regression models. Second, MRHMMs is computationally efficient for analyzing large data-sets generated in current genome-wide studies. Especially, the software is written in C for the speed advantage and further amenable to implement alternative models to meet users' own purposes.

Availability and implementation: http://sourceforge.net/projects/mrhmms/

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Animals
  • GATA1 Transcription Factor / metabolism
  • Genome
  • Genomics / methods*
  • Logistic Models
  • Markov Chains
  • Mice
  • Multivariate Analysis
  • Normal Distribution
  • Software*

Substances

  • GATA1 Transcription Factor
  • Gata1 protein, mouse