Statistical and visual morph movie analysis of crystallographic mutant selection bias in protein mutation resource data

J Bioinform Comput Biol. 2004 Mar;2(1):61-75. doi: 10.1142/s0219720004000478.

Abstract

Structural studies of the effects of non-silent mutations on protein conformational change are an important key in deciphering the language that relates protein amino acid primary structure to tertiary structure. Elsewhere, we presented the Protein Mutant Resource (PMR) database, a set of online tools that systematically identified groups of related mutant structures in the Protein DataBank (PDB), accurately inferred mutant classifications in the Gene Ontology using an innovative, statistically rigorous data-mining algorithm with more general applicability, and illustrated the relationship of these mutant structures via an intuitive user interface. Here, we perform a comprehensive statistical analysis of the effect of PMR mutations on protein tertiary structure. We find that, although the PMR does contain spectacular examples of conformational change, in general there is a counter-intuitive inverse relationship between conformational change (measured as C-alpha displacement or RMS of the core structure) and the number of mutations in a structure. That is, point mutations by structural biologists present in the PDB contrast naturally evolved mutations. We compare the frequency of mutations in the PMR/PDB datasets against the accepted PAM250 natural amino acid mutation frequency to confirm these observations. We generated morph movies from PMR structure pairs using technology previously developed for the Macromolecular Motions Database (http://molmovdb.org), allowing bioinformaticians, geneticists, protein engineers, and rational drug designers to analyze visually the mechanisms of protein conformational change and distinguish between conformational change due to motions (e.g., ligand binding) and mutations. The PMR morph movies and statistics can be freely viewed from the PMR website, http://pmr.sdsc.edu.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Amino Acid Substitution
  • Bias
  • Computer Simulation
  • Crystallography / methods*
  • Databases, Protein*
  • Models, Chemical
  • Models, Molecular*
  • Models, Statistical
  • Molecular Sequence Data
  • Mutation
  • Protein Conformation
  • Proteins / chemistry*
  • Proteins / genetics
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • User-Computer Interface*

Substances

  • Proteins