A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology

Weixing Feng; Yunlong Liu; Jiejun Wu; Kenneth P Nephew; Tim H M Huang; Lang Li

doi:10.1186/1471-2164-9-S2-S23

A Poisson mixture model to identify changes in RNA polymerase II binding quantity using high-throughput sequencing technology

BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S23. doi: 10.1186/1471-2164-9-S2-S23.

Authors

Weixing Feng¹, Yunlong Liu, Jiejun Wu, Kenneth P Nephew, Tim H M Huang, Lang Li

Affiliation

¹ Division of Biostatistics, Indiana University School of Medicine, Indianapolis, IN 46202, USA. [email protected]

Abstract

We present a mixture model-based analysis for identifying differences in the distribution of RNA polymerase II (Pol II) in transcribed regions, measured using ChIP-seq (chromatin immunoprecipitation following massively parallel sequencing technology). The statistical model assumes that the number of Pol II-targeted sequences contained within each genomic region follows a Poisson distribution. A Poisson mixture model was then developed to distinguish Pol II binding changes in transcribed region using an empirical approach and an expectation-maximization (EM) algorithm developed for estimation and inference. In order to achieve a global maximum in the M-step, a particle swarm optimization (PSO) was implemented. We applied this model to Pol II binding data generated from hormone-dependent MCF7 breast cancer cells and antiestrogen-resistant MCF7 breast cancer cells before and after treatment with 17beta-estradiol (E2). We determined that in the hormone-dependent cells, approximately 9.9% (2527) genes showed significant changes in Pol II binding after E2 treatment. However, only approximately 0.7% (172) genes displayed significant Pol II binding changes in E2-treated antiestrogen-resistant cells. These results show that a Poisson mixture model can be used to analyze ChIP-seq data.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Bayes Theorem
Cell Line, Tumor
Chromatin Immunoprecipitation
Estradiol / pharmacology
Genome, Human
Humans
Models, Statistical*
Neoplasms, Hormone-Dependent / genetics
Neoplasms, Hormone-Dependent / metabolism
Oligonucleotide Array Sequence Analysis / methods*
Poisson Distribution
Protein Binding
RNA Polymerase II / genetics
RNA Polymerase II / metabolism*

Substances

Estradiol
RNA Polymerase II

Abstract

Publication types

MeSH terms

Substances

Grants and funding