Protein three-dimensional structure determination and sequence-specific assignment of 13C and 15N-separated NOE data. A novel real-space ab initio approach

J Mol Biol. 1994 Nov 4;243(4):696-718. doi: 10.1016/0022-2836(94)90042-6.

Abstract

The sequence-specific assignment of resonances is considered to be a requirement for the determination of the three-dimensional (3D) structure of a protein in solution by nuclear magnetic resonance methods. The main source of structural information is the nuclear Overhauser effect spectroscopy (NOESY) spectrum, which contains information about spatially close pairs of protons. Currently, various J-correlated spectra must be recorded in order to obtain the sequence-specific assignments necessary to interpret the NOESY spectra. In this work, a novel procedure to determine the 3D structure and the sequence-specific assignments of a protein using only data from 13C and 15N-separated multidimensional NOESY spectra is described. No information from J-correlated spectra is required. The algorithm is called ANSRS (Assignment of NOESY Spectra in Real Space) and is based on an inversion of the traditional strategy. A 3D real-space structure of detected, but unassigned, 1H spins is calculated from the nuclear Overhauser effect (NOE) distance restraints using a dynamical simulated annealing procedure. The sequence-specific assignments are then determined by searching among the 1H spins in the 3D real-space structure for plausible residue assignments. The search uses a Monte Carlo simulated annealing algorithm based on assignment probabilities derived from the 1H, 15N and 13C chemical shifts, various spatial constraints, and the known sequence of the protein. The procedure has been tested on semi-synthetic data sets comprising published experimental chemical shifts and NOE distance restraints derived from the known 3D structures of the two proteins GAL4 (residues 9 to 41) and bovine pancreatic trypsin inhibitor. The ANSRS procedure was able to determine the sequence-specific assignments for more than 95% of the spins, and was fairly robust with respect to missing NOE data. The potential of the ANSRS approach with respect to automated assignment, reduction of the number of NMR spectra required for a structure determination, assignment of homologous and mutant proteins, and the possibility of analysing spectra recorded at high pH is discussed.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acids / chemistry
  • Animals
  • Aprotinin / chemistry*
  • Carbon Radioisotopes
  • Computer Simulation
  • DNA-Binding Proteins / chemistry
  • Data Interpretation, Statistical
  • Databases, Factual
  • Fungal Proteins / chemistry*
  • Hydrogen-Ion Concentration
  • Magnetic Resonance Spectroscopy
  • Metalloproteins / chemistry
  • Molecular Weight
  • Monte Carlo Method
  • Nitrogen Radioisotopes
  • Protein Conformation
  • Protein Structure, Secondary*
  • Reference Standards
  • Saccharomyces cerevisiae Proteins*
  • Temperature
  • Transcription Factors / chemistry

Substances

  • Amino Acids
  • Carbon Radioisotopes
  • DNA-Binding Proteins
  • Fungal Proteins
  • GAL4 protein, S cerevisiae
  • Metalloproteins
  • Nitrogen Radioisotopes
  • Saccharomyces cerevisiae Proteins
  • Transcription Factors
  • Aprotinin