A motif within SET-domain proteins binds single-stranded nucleic acids and transcribed and supercoiled DNAs and can interfere with assembly of nucleosomes

Mol Cell Biol. 2005 Mar;25(5):1891-9. doi: 10.1128/MCB.25.5.1891-1899.2005.

Abstract

The evolutionary conserved SET domain is present in many eukaryotic chromatin-associated proteins, including some members of the trithorax (TrxG) group and the polycomb (PcG) group of epigenetic transcriptional regulators and modifiers of position effect variegation. All SET domains examined exhibited histone lysine methyltransferase activity, implicating these proteins in the generation of epigenetic marks. However, the mode of the initial recruitment of SET proteins to target genes and the way that their association with the genes is maintained after replication are not known. We found that SET-containing proteins of the SET1 and SET2 families contain motifs in the pre-SET region or at the pre-SET-SET and SET-post-SET boundaries which very tightly bind single-stranded DNA (ssDNA) and RNA. These motifs also bind stretches of ssDNA generated by superhelical tension or during the in vitro transcription of duplex DNA. Importantly, such binding withstands nucleosome assembly, interfering with the formation of regular nucleosomal arrays. Two representatives of the SUV39 SET family, SU(VAR)3-9 and G9a, did not bind ssDNA. The trxZ11 homeotic point mutation, which is located within TRX SET and disrupts embryonic development, impairs the ssDNA binding capacity of the protein. We suggest that the motifs described here may be directly involved in the biological function(s) of SET-containing proteins. The binding of single-stranded nucleic acids might play a role in the initial recruitment of the proteins to target genes, in the maintenance of their association after DNA replication, or in sustaining DNA stretches in a single-stranded configuration to allow for continuous transcription.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Motifs / genetics
  • Amino Acid Motifs / physiology
  • Animals
  • Chromosomal Proteins, Non-Histone / chemistry*
  • Chromosomal Proteins, Non-Histone / genetics
  • Chromosomal Proteins, Non-Histone / metabolism*
  • DNA, Single-Stranded / metabolism*
  • DNA, Superhelical / metabolism*
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • DNA-Binding Proteins / physiology
  • Escherichia coli Proteins / chemistry
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / physiology
  • Humans
  • Mutation / genetics
  • Nucleosomes / metabolism*
  • Peptides / chemistry
  • Peptides / genetics
  • Peptides / physiology
  • Protein Structure, Tertiary / genetics
  • Protein Structure, Tertiary / physiology
  • RNA / metabolism
  • Transcription Factors / chemistry
  • Transcription Factors / genetics
  • Transcription Factors / physiology
  • Transcription, Genetic / physiology*

Substances

  • Chromosomal Proteins, Non-Histone
  • DNA, Single-Stranded
  • DNA, Superhelical
  • DNA-Binding Proteins
  • Escherichia coli Proteins
  • Nucleosomes
  • Peptides
  • SSB protein, E coli
  • Transcription Factors
  • RNA