Human centromeric alpha-satellite DNA is composed of tandem arrays of two types of 171 bp monomers; type A and type B. The differences between these types are concentrated in a 17 bp region of the monomer called the A/B box. Here, we have determined the solution structure of the C-rich strand of the two main variants of the human alpha-satellite A box. We show that, under acidic conditions, the C-rich strands of two A boxes self-recognize and form a head-to-tail dimeric i-motif stabilized by four intercalated hemi-protonated C:C(+) base pairs. Interestingly, the stack of C:C(+) base pairs is capped by T:T and Hoogsteen A:T base pairs. The two main variants of the A box adopt a similar three-dimensional structure, although the residues involved in the formation of the i-motif core are different in each case. Together with previous studies showing that the B box (known as the CENP-B box) also forms dimeric i-motif structures, our finding of this non-canonical structure in the A box shows that centromeric alpha satellites in all human chromosomes are able to form i-motifs, which consequently raises the possibility that these structures may play a role in the structural organization of the centromere.
Keywords: DNA recognition; DNA structures; helical structures; nucleic acids; structural biology.
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.