Deep neural learning based protein function prediction

Math Biosci Eng. 2022 Jan 7;19(3):2471-2488. doi: 10.3934/mbe.2022114.

Abstract

It is vital for the annotation of uncharacterized proteins by protein function prediction. At present, Deep Neural Network based protein function prediction is mainly carried out for dataset of small scale proteins or Gene Ontology, and usually explore the relationships between single protein feature and function tags. The practical methods for large-scale multi-features protein prediction still need to be studied in depth. This paper proposes a DNN based protein function prediction approach IGP-DNN. This method uses Grasshopper Optimization Algorithm (GOA) and Intuitionistic Fuzzy c-Means clustering (IFCM) based protein function modules extracting algorithm to extract the features of protein modules, utilizing Kernel Principal Component Analysis (KPCA) method to reduce the dimensionality of the protein attribute information, and integrating module features and attribute features. Inputting integrated data into DNN through multiple hidden layers to classify proteins and predict protein functions. In the experiments, the F-measure value of IGP-DNN on the DIP dataset reaches 0.4436, which shows better performance.

Keywords: DEEP Neural Network(DNN); Grasshopper Optimization Algorithm(GOA); Kernel Principal Component Analysis(KPCA); protein function prediction; protein-protein Interaction(PPI).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Gene Ontology
  • Neural Networks, Computer*
  • Proteins*

Substances

  • Proteins