Pair-EGRET: enhancing the prediction of protein-protein interaction sites through graph attention networks and protein language models

Ramisa Alam; Sazan Mahbub; Md Shamsuzzoha Bayzid

doi:10.1093/bioinformatics/btae588

Bioinformatics. 2024 Oct 1;40(10):btae588. doi: 10.1093/bioinformatics/btae588.

Authors

Ramisa Alam¹, Sazan Mahbub¹,², Md Shamsuzzoha Bayzid¹

Affiliations

¹ Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.
² Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, United States.

Abstract

Motivation: Proteins are responsible for most biological functions, many of which require the interaction of more than one protein molecule. However, accurately predicting protein-protein interaction (PPI) sites (the interfacial residues of a protein that interact with other protein molecules) remains a challenge. The growing demand and cost associated with the reliable identification of PPI sites using conventional experimental methods call for computational tools for automated prediction and understanding of PPIs.

Results: We present Pair-EGRET, an edge-aggregated graph attention network that leverages the features extracted from pretrained transformer-like models to accurately predict PPI sites. Pair-EGRET works on a k-nearest neighbor graph, representing the 3D structure of a protein, and utilizes the cross-attention mechanism for accurate identification of interfacial residues of a pair of proteins. Through an extensive evaluation study using a diverse array of experimental data, evaluation metrics, and case studies on representative protein sequences, we demonstrate that Pair-EGRET can achieve remarkable performance in predicting PPI sites. Moreover, Pair-EGRET can provide interpretable insights from the learned cross-attention matrix.
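To make the two ingredients named above more concrete, the following is a minimal sketch, not the authors' implementation. It assumes each residue is represented by a single 3D coordinate and a small feature vector. The function names `knn_graph` and `cross_attention` are hypothetical, and the attention uses plain scaled dot products with no learned projection weights or edge aggregation, both of which a trained model such as Pair-EGRET would include.

```python
import math


def knn_graph(coords, k):
    """For each residue, list the indices of its k nearest residues.

    coords: list of (x, y, z) tuples, one per residue (assumed representation).
    Returns a list of neighbor-index lists, ordered by increasing distance.
    """
    n = len(coords)
    neighbors = []
    for i in range(n):
        # Euclidean distance from residue i to every other residue.
        dists = [(math.dist(coords[i], coords[j]), j) for j in range(n) if j != i]
        dists.sort()  # ties are broken by the lower residue index
        neighbors.append([j for _, j in dists[:k]])
    return neighbors


def cross_attention(feats_a, feats_b):
    """Scaled dot-product attention from residues of protein A to those of B.

    feats_a, feats_b: lists of equal-length feature vectors.
    Returns a len(feats_a) x len(feats_b) matrix whose rows sum to 1.
    """
    scale = math.sqrt(len(feats_a[0]))
    attn = []
    for query in feats_a:
        scores = [sum(q * v for q, v in zip(query, key)) / scale for key in feats_b]
        # Numerically stable softmax over the residues of protein B.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        attn.append([e / total for e in exps])
    return attn


# Toy inputs (illustrative values only, not real protein data).
coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
graph = knn_graph(coords, k=2)

feats_a = [[1.0, 0.0], [0.0, 1.0]]
feats_b = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attn = cross_attention(feats_a, feats_b)
```

In this sketch, a large entry `attn[i][j]` means residue `i` of the first protein attends strongly to residue `j` of the second. That is the kind of pairwise signal one might inspect when reading a learned cross-attention matrix for interpretability.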

Availability and implementation: Pair-EGRET is freely available as open-source software on GitHub at https://github.com/1705004/Pair-EGRET.

MeSH terms

Algorithms
Computational Biology / methods
Databases, Protein
Protein Interaction Mapping* / methods
Proteins* / chemistry
Proteins* / metabolism
Software

Substances

Proteins

Grants and funding

BUET