Recognition rules for binding of homeodomains to operator DNA

J Biomol Struct Dyn. 2012;29(4):715-31. doi: 10.1080/073911012010525019.

Abstract

We examined the spatial arrangement of the interfaces between homeodomain transcription factors and operator DNA. Binding contacts in the region of the major groove were analyzed for a representative set of 22 complexes of homeodomain transcription factors with double-stranded operator DNA. Recognition of DNA by the recognizing α-helix of the protein is governed by two groups of contacts. The invariant group of protein-DNA contacts includes six contacts formed between atomic groups of the coding and non-coding DNA chains and groups of amino acid residues. The recognizing α-helix forms contacts through polar groups of residues Trp2 (NE1), Asn5, and Lys9 with the canonical sequence T(1)A(2)A(3)T(4) of the coding DNA chain, and through residues Lys0, Arg7, and Lys11 with the sequence A(4)X(5)X(6)X(7) of the non-coding DNA chain, where X is any nucleotide. The variable group of protein-DNA contacts comprises two groups bound to the sequence T(3)A(4)X(5)X(6) of the non-coding DNA chain. These contacts are mainly with the bases and specify the binding pattern of individual homeodomains. The invariant contact group represents a recognition pattern for transcription factors of the homeodomain family: a multiple adenine-asparagine contact and six position-specific phosphate contacts, mainly with lysine or arginine. Within this group, we found the three most significant invariant contacts, which allow the recognition rules for homeodomains to be deduced. These rules hold across different taxonomic groups of the homeodomain family and can distinguish members of this family from any other family of transcription factors.
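As an illustration only, the canonical core T(1)A(2)A(3)T(4) named in the abstract can be located in a DNA sequence with a short scan of both strands. The sketch below is not the authors' method, which is based on atomic contacts in solved complexes; the function names `reverse_complement` and `find_core_sites` are hypothetical, and a sequence match alone does not imply homeodomain binding.

```python
# Minimal sketch: locate the canonical TAAT core on either strand.
# All names here are illustrative assumptions, not taken from the paper.

CORE = "TAAT"  # canonical coding-chain sequence T(1)A(2)A(3)T(4)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_core_sites(seq: str, core: str = CORE) -> list[tuple[str, int]]:
    """Return (strand, start) for every occurrence of the core motif.

    '+' means the core is read on the given strand; '-' means it is read
    on the complementary strand. Positions are 0-based offsets into `seq`.
    """
    seq = seq.upper()
    rc_core = reverse_complement(core)
    hits = []
    for i in range(len(seq) - len(core) + 1):
        window = seq[i:i + len(core)]
        if window == core:
            hits.append(("+", i))
        if window == rc_core:
            hits.append(("-", i))
    return hits


if __name__ == "__main__":
    print(find_core_sites("GCTAATGG"))
```

The positions X(5)X(6)X(7) that follow each hit are the ones the abstract associates with variable, homeodomain-specific contacts, so a fuller analysis would inspect those flanking bases rather than the core alone.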

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA / chemistry
  • DNA-Binding Proteins* / metabolism
  • Lysine
  • Molecular Sequence Data
  • Transcription Factors* / chemistry

Substances

  • DNA-Binding Proteins
  • Transcription Factors
  • DNA
  • Lysine