Matthews coefficient probabilities: Improved estimates for unit cell contents of proteins, DNA, and protein-nucleic acid complex crystals

Protein Sci. 2003 Sep;12(9):1865-71. doi: 10.1110/ps.0350503.

Abstract

Estimating the number of molecules in the crystallographic asymmetric unit is one of the first steps in a macromolecular structure determination. Based on a survey of 15641 crystallographic Protein Data Bank (PDB) entries the distribution of V(M), the crystal volume per unit of protein molecular weight, known as Matthews coefficient, has been reanalyzed. The range of values and frequencies has changed in the 30 years since Matthews first analysis of protein crystal solvent content. In the statistical analysis, complexes of proteins and nucleic acids have been treated as a separate group. In addition, the V(M) distribution for nucleic acid crystals has been examined for the first time. Observing that resolution is a significant discriminator of V(M), an improved estimator for the probabilities of the number of molecules in the crystallographic asymmetric unit has been implemented, using resolution as additional information.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Crystallization
  • Crystallography, X-Ray
  • DNA / chemistry*
  • DNA-Binding Proteins / chemistry*
  • Databases as Topic
  • Models, Molecular
  • Models, Statistical
  • Models, Theoretical
  • Nucleic Acids / chemistry
  • Probability
  • Protein Binding
  • Proteins / chemistry*

Substances

  • DNA-Binding Proteins
  • Nucleic Acids
  • Proteins
  • DNA