Identifying operons and untranslated regions of transcripts using Escherichia coli RNA expression analysis

Bioinformatics. 2002:18 Suppl 1:S337-44. doi: 10.1093/bioinformatics/18.suppl_1.s337.

Abstract

Microarrays traditionally have been used to assay the transcript expression of coding regions of genes. Here, we use Escherichia coli oligonucleotide microarrays to assay transcript expression of both open reading frames (ORFs) and intergenic regions. We then use hidden Markov models to analyse this expression data and estimate transcription boundaries of genes. This approach allows us to identify 5' untranslated regions (5' UTRs) of transcripts as well as genes that are likely to be operon members. The operon elements we identify correspond to documented operons with 99% specificity and 63% sensitivity. Similarly we find that our 5' UTR results accurately coincide with experimentally verified promoter regions for most genes.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • 5' Untranslated Regions / genetics*
  • Algorithms
  • Base Sequence
  • Chromosome Mapping / methods
  • Escherichia coli / genetics*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Bacterial / genetics
  • Genome, Bacterial
  • Models, Genetic
  • Models, Statistical
  • Molecular Sequence Data
  • Oligonucleotide Array Sequence Analysis / methods*
  • Operon / genetics*
  • Prokaryotic Cells
  • RNA, Bacterial / genetics*
  • Sequence Analysis, DNA / methods
  • Sequence Homology, Amino Acid
  • Transcription Factors / genetics*

Substances

  • 5' Untranslated Regions
  • RNA, Bacterial
  • Transcription Factors