SODA: prediction of protein solubility from disorder and aggregation propensity

Nucleic Acids Res. 2017 Jul 3;45(W1):W236-W240. doi: 10.1093/nar/gkx412.

Abstract

Solubility is an important, albeit not well understood, feature determining protein behavior. It is of paramount importance in protein engineering, where similar folded proteins may behave in very different ways in solution. Here we present SODA, a novel method to predict the changes of protein solubility based on several physico-chemical properties of the protein. SODA uses the propensity of the protein sequence to aggregate as well as intrinsic disorder, plus hydrophobicity and secondary structure preferences to estimate changes in solubility. It has been trained and benchmarked on two different datasets. The comparison to other recently published methods shows that SODA has state-of-the-art performance and is particularly well suited to predict mutations decreasing solubility. The method is fast, returning results for single mutations in seconds. A usage example estimating the full repertoire of mutations for a human germline antibody highlights several solubility hotspots on the surface. The web server, complete with RESTful interface and extensive help, can be accessed from URL: http://protein.bio.unipd.it/soda.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Antibodies / chemistry
  • Antibodies / genetics
  • Hydrophobic and Hydrophilic Interactions
  • Internet
  • Intrinsically Disordered Proteins / chemistry
  • Mutation*
  • Protein Aggregates
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Sequence Analysis, Protein / methods*
  • Software*
  • Solubility

Substances

  • Antibodies
  • Intrinsically Disordered Proteins
  • Protein Aggregates
  • Proteins