Structure-based prediction of nucleic acid binding residues by merging deep learning- and template-based approaches

PLoS Comput Biol. 2023 Sep 6;19(9):e1011428. doi: 10.1371/journal.pcbi.1011428. eCollection 2023 Sep.

Abstract

Accurate prediction of nucleic binding residues is essential for the understanding of transcription and translation processes. Integration of feature- and template-based strategies could improve the prediction of these key residues in proteins. Nevertheless, traditional hybrid algorithms have been surpassed by recently developed deep learning-based methods, and the possibility of integrating deep learning- and template-based approaches to improve performance remains to be explored. To address these issues, we developed a novel structure-based integrative algorithm called NABind that can accurately predict DNA- and RNA-binding residues. A deep learning module was built based on the diversified sequence and structural descriptors and edge aggregated graph attention networks, while a template module was constructed by transforming the alignments between the query and its multiple templates into features for supervised learning. Furthermore, the stacking strategy was adopted to integrate the above two modules for improving prediction performance. Finally, a post-processing module dependent on the random walk algorithm was proposed to further correct the integrative predictions. Extensive evaluations indicated that our approach could not only achieve excellent performance on both native and predicted structures but also outperformed existing hybrid algorithms and recent deep learning methods. The NABind server is available at http://liulab.hzau.edu.cn/NABind/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cell Nucleus
  • Deep Learning*
  • Nucleic Acids*
  • Walking

Substances

  • Nucleic Acids

Grants and funding

This work was supported by the National Natural Science Foundation of China (32071249 to RL). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.