Condition specific transcription factor binding site characterization in Saccharomyces cerevisiae

Bioinformatics. 2002 Oct;18(10):1289-96. doi: 10.1093/bioinformatics/18.10.1289.

Abstract

Motivation: We demonstrate a computational process by which transcription factor binding sites can be elucidated using genome-wide expression and binding profiles. The profiles direct us to the intergenic locations likely to contain the promoter regions for a given factor. These sequences are multiply and locally aligned to give an anchor motif from which further characterization can take place.

Results: We present bases for and assumptions about the variability within these motifs which give rise to potentially more accurate motifs, capture complex binding sites built upon the basis motif, and eliminate the constraints of the currently employed promoter searching protocols. We also present a measure of motif quality based on the occurrence of the putative motifs in regions observed to contain the binding sites. The assumptions, motif generation, quality assessment and comparison allow the user as much control as their a priori knowledge allows.

Availability: IGRDB and the datasets mentioned herein are available at http://chipdb.wi.mit.edu/

Publication types

  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Motifs
  • Base Sequence
  • Binding Sites / genetics
  • Consensus Sequence / genetics
  • DNA-Binding Proteins
  • Database Management Systems*
  • Databases, Genetic
  • Databases, Protein*
  • Fungal Proteins / genetics
  • Fungal Proteins / metabolism
  • Gene Expression Regulation, Fungal
  • Genome, Fungal
  • Information Storage and Retrieval / methods*
  • Internet
  • Molecular Sequence Data
  • Open Reading Frames
  • Protein Binding / genetics
  • Quality Control
  • Reproducibility of Results
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae / metabolism*
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism
  • Sensitivity and Specificity
  • Sequence Alignment / methods
  • Sequence Analysis, Protein / methods
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism*

Substances

  • DNA-Binding Proteins
  • Fungal Proteins
  • GAL4 protein, S cerevisiae
  • STE12 protein, S cerevisiae
  • Saccharomyces cerevisiae Proteins
  • Transcription Factors