QSAR modeling of datasets with enantioselective compounds using chirality sensitive molecular descriptors

SAR QSAR Environ Res. 2005 Feb-Apr;16(1-2):93-102. doi: 10.1080/10629360412331319844.

Abstract

Shape descriptors used in 3D QSAR studies naturally take into account chirality; however, for flexible and structurally diverse molecules such studies require extensive conformational searching and alignment. QSAR modeling studies of two datasets of fragrance compounds with complex stereochemistry using simple alignment-free chirality sensitive descriptors developed in our laboratories are presented. In the first investigation, 44 alpha-campholenic derivatives with sandalwood odor were represented as derivatives of several common structural templates with substituents numbered according to their relative spatial positions in the molecules. Both molecular and substituent descriptors were used as independent variables in MLR calculations, and the best model was characterized by the training set q2 of 0.79 and external test set r2 of 0.95. In the second study, several types of chirality descriptors were employed in combinatorial QSAR modeling of 98 ambergris fragrance compounds. Among 28 possible combinations of seven types of descriptors and four statistical modeling techniques, k nearest neighbor classification with CoMFA descriptors was initially found to generate the best models with the internal and external accuracies of 76 and 89%, respectively. The same dataset was then studied using novel atom pair chirality descriptors (cAP). The cAP are based on a modified definition of the atomic chirality, in which the seniority of the substituents is defined by their relative partial charge values: higher values correspond to higher seniorities. The resulting models were found to have higher predictive power than those developed with CoMFA descriptors; the best model was characterized by the internal and external accuracies of 82 and 94%, respectively. The success of modeling studies using simple alignment free chirality descriptors discussed in this paper suggests that they should be applied broadly to QSAR studies of many datasets when compound stereochemistry plays an important role in defining their activity.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Ambergris / chemistry
  • Combinatorial Chemistry Techniques
  • Cyclopentanes / chemistry
  • Ecdysteroids / chemistry
  • Models, Biological
  • Molecular Conformation
  • Odorants
  • Predictive Value of Tests
  • Quantitative Structure-Activity Relationship*
  • Santalum
  • Stereoisomerism*

Substances

  • Ambergris
  • Cyclopentanes
  • Ecdysteroids