Discovery of sequence motifs related to coexpression of genes using evolutionary computation

Nucleic Acids Res. 2004 Jul 20;32(13):3826-35. doi: 10.1093/nar/gkh713. Print 2004.

Abstract

Transcription factors are key regulatory elements that control gene expression. Recognition of transcription factor binding site (TFBS) motifs in the upstream region of coexpressed genes is therefore critical towards a true understanding of the regulations of gene expression. The task of discovering eukaryotic TFBSs remains a challenging problem. Here, we demonstrate that evolutionary computation can be used to search for TFBSs in upstream regions of genes known to be coexpressed. Evolutionary computation was used to search for TFBSs of genes regulated by octamer-binding factor and nuclear factor kappa B. The discovered binding sites included experimentally determined known binding motifs as well as lists of putative, previously unknown TFBSs. We believe that this method to search nucleotide sequence information efficiently for similar motifs will be useful for discovering TFBSs that affect gene regulation.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms
  • Base Sequence
  • Binding Sites
  • Computational Biology / methods*
  • Databases, Nucleic Acid
  • Evolution, Molecular
  • Gene Expression Regulation
  • Molecular Sequence Data
  • NF-kappa B / metabolism
  • Regulatory Sequences, Nucleic Acid*
  • Sequence Analysis, DNA / methods*
  • Transcription Factors / metabolism*

Substances

  • NF-kappa B
  • Transcription Factors

Associated data

  • GENBANK/C00158