Combining Evolutionary Covariance and NMR Data for Protein Structure Determination

Methods Enzymol. 2019:614:363-392. doi: 10.1016/bs.mie.2018.11.004. Epub 2018 Dec 23.

Abstract

Accurate protein structure determination by solution-state NMR is challenging for proteins greater than about 20kDa, for which extensive perdeuteration is generally required, providing experimental data that are incomplete (sparse) and ambiguous. However, the massive increase in evolutionary sequence information coupled with advances in methods for sequence covariance analysis can provide reliable residue-residue contact information for a protein from sequence data alone. These "evolutionary couplings (ECs)" can be combined with sparse NMR data to determine accurate 3D protein structures. This hybrid "EC-NMR" method has been developed using NMR data for several soluble proteins and validated by comparison with corresponding reference structures determined by X-ray crystallography and/or conventional NMR methods. For small proteins, only backbone resonance assignments are utilized, while for larger proteins both backbone and some sidechain methyl resonance assignments are generally required. ECs can be combined with sparse NMR data obtained on deuterated, selectively protonated protein samples to provide structures that are more accurate and complete than those obtained using such sparse NMR data alone. EC-NMR also has significant potential for analysis of protein structures from solid-state NMR data and for studies of integral membrane proteins. The requirement that ECs are consistent with NMR data recorded on a specific member of a protein family, under specific conditions, also allows identification of ECs that reflect alternative allosteric or excited states of the protein structure.

Keywords: AutoStructure/ASDP; Automated NMR data analysis; Evolutionary couplings; Hybrid methods; Maximum entropy; Multiple sequence alignment; Protein NMR spectroscopy; Protein families.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Analysis of Variance
  • Binding Sites
  • Crystallography, X-Ray
  • Databases, Protein
  • Deuterium / chemistry
  • Escherichia coli / chemistry*
  • Escherichia coli / metabolism
  • Escherichia coli Proteins / chemistry*
  • Escherichia coli Proteins / metabolism
  • Evolution, Molecular*
  • Humans
  • Isotope Labeling
  • Models, Molecular
  • Nuclear Magnetic Resonance, Biomolecular / methods*
  • Periplasmic Binding Proteins / chemistry*
  • Periplasmic Binding Proteins / metabolism
  • Protein Binding
  • Protein Conformation, alpha-Helical
  • Protein Conformation, beta-Strand
  • Protein Interaction Domains and Motifs
  • Software*
  • Structural Homology, Protein
  • Thermodynamics

Substances

  • Escherichia coli Proteins
  • MalE protein, E coli
  • Periplasmic Binding Proteins
  • Deuterium