Massive integration of diverse protein quality assessment methods to improve template based modeling in CASP11

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):247-59. doi: 10.1002/prot.24924. Epub 2015 Sep 29.

Abstract

Model evaluation and selection is an important step and a big challenge in template-based protein structure prediction. Individual model quality assessment methods designed for recognizing some specific properties of protein structures often fail to consistently select good models from a model pool because of their limitations. Therefore, combining multiple complimentary quality assessment methods is useful for improving model ranking and consequently tertiary structure prediction. Here, we report the performance and analysis of our human tertiary structure predictor (MULTICOM) based on the massive integration of 14 diverse complementary quality assessment methods that was successfully benchmarked in the 11th Critical Assessment of Techniques of Protein Structure prediction (CASP11). The predictions of MULTICOM for 39 template-based domains were rigorously assessed by six scoring metrics covering global topology of Cα trace, local all-atom fitness, side chain quality, and physical reasonableness of the model. The results show that the massive integration of complementary, diverse single-model and multi-model quality assessment methods can effectively leverage the strength of single-model methods in distinguishing quality variation among similar good models and the advantage of multi-model quality assessment methods of identifying reasonable average-quality models. The overall excellent performance of the MULTICOM predictor demonstrates that integrating a large number of model quality assessment methods in conjunction with model clustering is a useful approach to improve the accuracy, diversity, and consequently robustness of template-based protein structure prediction. Proteins 2016; 84(Suppl 1):247-259. © 2015 Wiley Periodicals, Inc.

Keywords: CASP; integration; model quality assessment; protein structure prediction; template-based modeling.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Benchmarking*
  • Computational Biology / methods
  • Computational Biology / statistics & numerical data*
  • Computer Simulation
  • Databases, Protein
  • Humans
  • Internet
  • Models, Molecular*
  • Models, Statistical*
  • Protein Folding
  • Protein Interaction Domains and Motifs
  • Protein Structure, Secondary
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Quality Control
  • Software*
  • Structural Homology, Protein
  • Thermodynamics

Substances

  • Proteins