Patterns in interspecies similarity correlate with nucleotide composition in mammalian 3'UTRs

Nucleic Acids Res. 2003 Sep 15;31(18):5433-9. doi: 10.1093/nar/gkg751.

Abstract

Post-transcriptional regulation and the formation of mRNA 3' ends are crucial for gene expression in eukaryotes. Interspecies conservation of many sequences within 3'UTRs reveals selective constraint due to similar function. To study the pattern of conservation within 3'UTRs, we compiled and aligned 50 sets of complete orthologous 3'UTRs from four orders of mammals. We observed a mosaic pattern of conservation, with alternating regions of high (phylogenetic footprints) and low similarity. Conservation in 3'UTRs correlates with their base composition and also with the synonymous substitution rate in corresponding coding regions. The non-uniform distribution of conservation is more pronounced for 3'UTRs with a moderate or low level of overall conservation, where invariant nucleotides are more numerous, and their runs of lengths 4-7 occur more frequently than if conservation were random. Many runs of invariant nucleotides are AU-rich or pyrimidine-rich. Some of these runs coincide with known functional cis- elements of eukaryotic mRNAs, such as the U-rich upstream element, polyadenylation signal and DICE regulatory signal. More divergent regions of multiple alignments of 3'UTRs are often more G- and/or C-rich. Our results provide evidence on the importance of moderately conserved regions in 3'UTRs and suggest that regulatory functions of 3'UTRs might utilize gene-specific information in these regions.

Publication types

  • Comparative Study

MeSH terms

  • 3' Untranslated Regions / genetics*
  • 3',5'-Cyclic-GMP Phosphodiesterases / genetics
  • Animals
  • Base Composition
  • Base Sequence
  • Conserved Sequence / genetics
  • Cyclic Nucleotide Phosphodiesterases, Type 6
  • Eye Proteins / genetics
  • Humans
  • Mammals / genetics*
  • Molecular Sequence Data
  • RNA, Messenger / genetics
  • Regression Analysis
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Species Specificity

Substances

  • 3' Untranslated Regions
  • Eye Proteins
  • RNA, Messenger
  • 3',5'-Cyclic-GMP Phosphodiesterases
  • Cyclic Nucleotide Phosphodiesterases, Type 6
  • PDE6B protein, human