Low-rank regularization for learning gene expression programs

Guibo Ye; Mengfan Tang; Jian-Feng Cai; Qing Nie; Xiaohui Xie

doi:10.1371/journal.pone.0082146

Low-rank regularization for learning gene expression programs

PLoS One. 2013 Dec 17;8(12):e82146. doi: 10.1371/journal.pone.0082146. eCollection 2013.

Authors

Guibo Ye¹, Mengfan Tang², Jian-Feng Cai³, Qing Nie⁴, Xiaohui Xie⁵

Affiliations

¹ Department of Computer Science, University of California Irvine, Irvine, California, United States of America ; Department of Mathematics, University of California Irvine, Irvine, California, United States of America.
² Department of Computer Science, University of California Irvine, Irvine, California, United States of America.
³ Department of Mathematics, University of Iowa, Iowa City, Iowa, United States of America.
⁴ Department of Mathematics, University of California Irvine, Irvine, California, United States of America ; Center for Complex Biological Systems, University of California Irvine, Irvine, California, United States of America.
⁵ Department of Computer Science, University of California Irvine, Irvine, California, United States of America ; Center for Complex Biological Systems, University of California Irvine, Irvine, California, United States of America.

Abstract

Learning gene expression programs directly from a set of observations is challenging due to the complexity of gene regulation, high noise of experimental measurements, and insufficient number of experimental measurements. Imposing additional constraints with strong and biologically motivated regularizations is critical in developing reliable and effective algorithms for inferring gene expression programs. Here we propose a new form of regulation that constrains the number of independent connectivity patterns between regulators and targets, motivated by the modular design of gene regulatory programs and the belief that the total number of independent regulatory modules should be small. We formulate a multi-target linear regression framework to incorporate this type of regulation, in which the number of independent connectivity patterns is expressed as the rank of the connectivity matrix between regulators and targets. We then generalize the linear framework to nonlinear cases, and prove that the generalized low-rank regularization model is still convex. Efficient algorithms are derived to solve both the linear and nonlinear low-rank regularized problems. Finally, we test the algorithms on three gene expression datasets, and show that the low-rank regularization improves the accuracy of gene expression prediction in these three datasets.

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms
Gene Expression Regulation*
Gene Expression*
Humans
Models, Genetic*
Software*

Abstract

Publication types

MeSH terms

Grants and funding