Structured States of Disordered Proteins from Genomic Sequences

Cell. 2016 Sep 22;167(1):158-170.e12. doi: 10.1016/j.cell.2016.09.010.

Abstract

Protein flexibility ranges from simple hinge movements to functional disorder. Around half of all human proteins contain apparently disordered regions with little 3D or functional information, and many of these proteins are associated with disease. Building on the evolutionary couplings approach previously successful in predicting 3D states of ordered proteins and RNA, we developed a method to predict the potential for ordered states for all apparently disordered proteins with sufficiently rich evolutionary information. The approach is highly accurate (79%) for residue interactions as tested in more than 60 known disordered regions captured in a bound or specific condition. Assessing the potential for structure of more than 1,000 apparently disordered regions of human proteins reveals a continuum of structural order with at least 50% with clear propensity for three- or two-dimensional states. Co-evolutionary constraints reveal hitherto unseen structures of functional importance in apparently disordered proteins.

Keywords: EVfold; Evolutionary couplings; bioinformatics; computational biology; conformational flexibility; disorder; maximum entropy; statistical physics; structure prediction.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Directed Molecular Evolution / methods
  • Genomics
  • Humans
  • Intrinsically Disordered Proteins / chemistry*
  • Intrinsically Disordered Proteins / genetics
  • Protein Structure, Secondary
  • Protein Structure, Tertiary
  • Proteome / chemistry
  • Proteome / genetics

Substances

  • Intrinsically Disordered Proteins
  • Proteome