Transcription activator-like effector proteins (TALEs) contain large numbers of repeats that bind double-stranded DNA, wrapping around DNA to form a continuous superhelix. Since unbound TALEs retain superhelical structure, it seems likely that DNA binding requires a significant structural distortion or partial unfolding. In this study, we use nearest-neighbor "Ising" analysis of consensus TALE (cTALE) repeat unfolding to quantify intrinsic folding free energies, coupling energies between repeats, and the free energy distribution of partly unfolded states, and to determine how those energies depend on the sequence that determines DNA-specificity (called the "RVD"). We find a moderate level of cooperativity for both the HD and NS RVD sequences (stabilizing interfaces combined with unstable repeats), as has been seen in other linear repeat proteins. Surprisingly, RVD sequence identity influences both the overall stability and the balance of intrinsic repeat stability and interfacial coupling energy. Using parameters from the Ising analysis, we have analyzed the distribution of partly folded states as a function of cTALE length and RVD sequence. We find partly unfolded states where one or more repeats are unfolded to be energetically accessible. Mixing repeats with different RVD sequences increases the population of partially folded states. Local folding free energies plateau for central repeats, suggesting that TALEs access partially folded states where a single internal repeat is unfolded while adjacent repeats remain folded. This breakage should allow TALEs to access superhelically-broken states, and may facilitate DNA binding.
Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.