Fragments quantum descriptors in classification of bio-accumulative compounds

J Mol Graph Model. 2023 Dec:125:108584. doi: 10.1016/j.jmgm.2023.108584. Epub 2023 Aug 2.

Abstract

The aim of the following research is to assess the applicability of calculated quantum properties of molecular fragments as molecular descriptors in machine learning classification task. The research is based on bio-concentration and QM9-extended databases. A number of compounds with results from quantum-chemical calculations conducted with Psi4 quantum chemistry package was also added to the quantum properties database. Classification results are compared with a baseline of random guesses and predictions obtained with the traditional RDKit generated molecular descriptors. Chosen classification metrics show that results obtained with fragments quantum descriptors fall between results from baseline and those provided by molecular descriptors widely applied in cheminformatics. According to the results, the implementation of principal component analysis, causes a drop in categorization metrics.

Keywords: Cheminformatics; Fragments quantum descriptors; Machine learning; Molecular descriptors; Quantum computing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bioaccumulation
  • Cheminformatics*
  • Databases, Factual
  • Machine Learning*
  • Principal Component Analysis