DeepCDpred: Inter-residue distance and contact prediction for improved prediction of protein structure

PLoS One. 2019 Jan 8;14(1):e0205214. doi: 10.1371/journal.pone.0205214. eCollection 2019.

Abstract

Rapid, accurate prediction of protein structure from amino acid sequence would accelerate fields as diverse as drug discovery, synthetic biology and disease diagnosis. Massively improved prediction of protein structures has been driven by improving the prediction of which amino acid residues are in contact in the 3D structure. For an average globular protein, around 92% of all residue pairs are non-contacting; therefore, accurate prediction of only a small percentage of inter-amino acid distances could increase the number of constraints to guide structure determination. We have trained deep neural networks to predict inter-residue contacts and distances. Distances are predicted with an accuracy better than most contact prediction techniques. Addition of distance constraints improved de novo structure predictions for test sets of 158 protein structures, as compared to using the best contact prediction methods alone. Importantly, use of distance predictions allows the selection of better models from the structure pool without the need for an external model assessment tool. The results also indicate how the accuracy of distance prediction methods might be improved further.
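To make the distinction between contacts and distances concrete, the following is a minimal sketch of how a distance matrix, a binary contact map, and the fraction of non-contacting residue pairs could be computed from per-residue coordinates. It is not the DeepCDpred method. The 8 Å cutoff, the function names, and the toy coordinates are assumptions chosen for illustration; the abstract does not state which cutoff or atom definition the authors used.

```python
import numpy as np


def distance_and_contact_maps(coords, threshold=8.0):
    """Return the pairwise distance matrix and a boolean contact map.

    coords: (N, 3) array with one representative atom position per residue.
    threshold: contact cutoff in angstroms (8.0 is an assumed value).
    """
    coords = np.asarray(coords, dtype=float)
    # Broadcasting gives an (N, N, 3) array of coordinate differences.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    return dist, dist < threshold


def non_contact_fraction(contacts, min_separation=1):
    """Fraction of residue pairs (i < j, j - i >= min_separation) not in contact."""
    n = contacts.shape[0]
    i, j = np.triu_indices(n, k=min_separation)
    return 1.0 - contacts[i, j].mean()


# Toy example: four residues on a straight line, 5 angstroms apart.
toy_coords = [[0.0, 0.0, 0.0], [5.0, 0.0, 0.0], [10.0, 0.0, 0.0], [15.0, 0.0, 0.0]]
dist, contacts = distance_and_contact_maps(toy_coords)
frac = non_contact_fraction(contacts)
```

In this toy chain only adjacent residues fall under the cutoff, so half of the six pairs are non-contacting. A binary contact map discards how far apart the remaining pairs are, whereas the distance matrix retains that information for every pair.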

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Computational Biology / methods*
  • Databases, Protein
  • Deep Learning*
  • Models, Molecular
  • Protein Structure, Tertiary*
  • Proteins / chemistry*
  • Sequence Analysis, Protein / methods
  • Support Vector Machine

Substances

  • Proteins

Grants and funding

Tuğçe Oruç was funded by the Darwin Trust of Edinburgh and Shuangxi Ji by an Elite Scholarship from the University of Birmingham.