Progress on the development of prediction tools for detecting disease causing mutations in proteins

Comput Biol Med. 2024 Dec 4:185:109510. doi: 10.1016/j.compbiomed.2024.109510. Online ahead of print.

Abstract

Proteins are involved in a variety of functions in living organisms. The mutation of amino acid residues in a protein alters its structure, stability, binding, and function, with some mutations leading to diseases. Understanding the influence of mutations on protein structure and function help to gain deep insights on the molecular mechanism of diseases and devising therapeutic strategies. Hence, several generic and disease-specific methods have been proposed to reveal pathogenic effects on mutations. In this review, we focus on the development of prediction methods for identifying disease causing mutations in proteins. We briefly outline the existing databases for disease-causing mutations, followed by a discussion on sequence- and structure-based features used for prediction. Further, we discuss computational tools based on machine learning, deep learning and large language models for detecting disease-causing mutations. Specifically, we emphasize the advances in predicting hotspots and mutations for targets involved in cancer, neurodegenerative and infectious diseases as well as in membrane proteins. The computational resources including databases and algorithms understanding/predicting the effect of mutations will be listed. Moreover, limitations of existing methods and possible improvements will be discussed.

Keywords: Algorithms; Binding affinity; Cancer hotspots; Databases; Disease-causing mutations; Infectious diseases; Machine-learning; Membrane proteins; Network; Neurodegenerative; Sequence-based features; Stability; Structure.

Publication types

  • Review