pen repeat sequences are GGN clusters and encode a glycine-rich domain in a Drosophila cDNA homologous to the rat helix destabilizing protein

Proc Natl Acad Sci U S A. 1987 Apr;84(7):1819-23. doi: 10.1073/pnas.84.7.1819.

Abstract

Several cDNA clones that contain the pen repeat have been isolated and sequenced; pen consists of clusters of GGN triplets, where N can be any nucleotide. Some of the pen repeat sequences are found within long open reading frames in which they encode oligoglycine stretches. For one of the clones, the deduced amino acid sequence of the entire open reading frame, especially in the region preceding the glycine-rich domain, shows strong homology to the rat helix destabilizing protein [Cobianchi, F., SenGupta, D. N., Zmudzka, B. Z. & Wilson, S. H. (1986) J. Biol. Chem. 261, 3536-3543]. The rat protein and homologs in other organisms are single-stranded nucleic acid binding proteins, some of which are major components of heterogeneous nuclear ribonucleoprotein particles. We suggest that we have cloned a cDNA encoding a Drosophila single-stranded nucleic acid binding protein.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • DNA Helicases / genetics*
  • DNA*
  • DNA-Binding Proteins / genetics*
  • Drosophila / genetics*
  • Drosophila / growth & development
  • Genes*
  • Glycine
  • Repetitive Sequences, Nucleic Acid
  • Transcription, Genetic
  • Viral Proteins*

Substances

  • DNA-Binding Proteins
  • Viral Proteins
  • gp32 protein, Enterobacteria phage T4
  • helix-destabilizing proteins
  • DNA
  • DNA Helicases
  • Glycine

Associated data

  • GENBANK/M15762
  • GENBANK/M15763
  • GENBANK/M15764
  • GENBANK/M15765
  • GENBANK/M15766
  • GENBANK/M15767