AggreRATE-Pred: a mathematical model for the prediction of change in aggregation rate upon point mutation

Puneet Rawat; R Prabakaran; Sandeep Kumar; M Michael Gromiha

doi:10.1093/bioinformatics/btz764

AggreRATE-Pred: a mathematical model for the prediction of change in aggregation rate upon point mutation

Bioinformatics. 2020 Mar 1;36(5):1439-1444. doi: 10.1093/bioinformatics/btz764.

Authors

Puneet Rawat¹, R Prabakaran¹, Sandeep Kumar², M Michael Gromiha^{1

3}

Affiliations

¹ Protein Bioinformatics Lab, Department of Biotechnology, Indian Institute of Technology Madras, Chennai, Tamil Nadu 600036, India.
² Biotherapeutics Discovery, Boehringer-Ingelheim Pharmaceutical Inc. Ridgefield, CT, USA.
³ Advanced Computational Drug Discovery Unit (ACDD), Tokyo Tech World Research Hub Initiative (WRHI), Institute of Innovative Research, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Midori-ku, Yokohama, Japan.

PMID: 31599925
DOI: 10.1093/bioinformatics/btz764

Abstract

Motivation: Protein aggregation is a major unsolved problem in biochemistry with implications for several human diseases, biotechnology and biomaterial sciences. A majority of sequence-structural properties known for their mechanistic roles in protein aggregation do not correlate well with the aggregation kinetics. This limits the practical utility of predictive algorithms.

Results: We analyzed experimental data on 183 unique single point mutations that lead to change in aggregation rates for 23 polypeptides and proteins. Our initial mathematical model obtained a correlation coefficient of 0.43 between predicted and experimental change in aggregation rate upon mutation (P-value <0.0001). However, when the dataset was classified based on protein length and conformation at the mutation sites, the average correlation coefficient almost doubled to 0.82 (range: 0.74-0.87; P-value <0.0001). We observed that distinct sequence and structure-based properties determine protein aggregation kinetics in each class. In conclusion, the protein aggregation kinetics are impacted by local factors and not by global ones, such as overall three-dimensional protein fold, or mechanistic factors such as the presence of aggregation-prone regions.

Availability and implementation: The web server is available at http://www.iitm.ac.in/bioinfo/aggrerate-pred/.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Humans
Models, Theoretical
Mutation
Point Mutation*
Prednisolone / analogs & derivatives
Proteins / genetics*
Software

Substances

Proteins
Prednisolone
prednylidene