The correlation of gene expression and co-regulated gene patterns in characteristic KEGG pathways

J Theor Biol. 2010 Sep 21;266(2):242-9. doi: 10.1016/j.jtbi.2010.06.029. Epub 2010 Jun 25.

Abstract

There is great interest in chromosome- and pathway-based techniques for genomics data analysis in the current work in order to understand the mechanism of disease. However, there are few studies addressing the abilities of machine learning methods in incorporating pathway information for analyzing microarray data. In this paper, we identified the characteristic pathways by combining the classification error rates of out-of-bag (OOB) in random forests with pathways information. At each characteristic pathway, the correlation of gene expression was studied and the co-regulated gene patterns in different biological conditions were mined by Mining Attribute Profile (MAP) algorithm. The discovered co-regulated gene patterns were clustered by the average-linkage hierarchical clustering technique. The results showed that the expression of genes at the same characteristic pathway were approximate. Furthermore, two characteristic pathways were discovered to present co-regulated gene patterns in which one contained 108 patterns and the other contained one pattern. The results of cluster analysis showed that the smallest similarity coefficient of clusters was more than 0.623, which indicated that the co-regulated patterns in different biological conditions were more approximate at the same characteristic pathway. The methods discussed in this paper can provide additional insight into the study of microarray data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Cluster Analysis
  • Computational Biology / methods*
  • Data Mining / methods*
  • Databases, Genetic
  • Gene Expression Regulation / genetics*
  • Genes / genetics*
  • Genomics / methods*
  • Metabolic Networks and Pathways / genetics*
  • Microarray Analysis / methods*