TyPol - a new methodology for organic compounds clustering based on their molecular characteristics and environmental behavior

Chemosphere. 2014 Sep:111:613-22. doi: 10.1016/j.chemosphere.2014.05.020. Epub 2014 Jun 5.

Abstract

Following legislation, the assessment of the environmental risks of 30000-100000 chemical substances is required for their registration dossiers. However, their behavior in the environment and their transfer to environmental components such as water or atmosphere are studied for only a very small proportion of the chemical in laboratory tests or monitoring studies because it is time-consuming and/or cost prohibitive. Therefore, the objective of this work was to develop a new methodology, TyPol, to classify organic compounds, and their degradation products, according to both their behavior in the environment and their molecular properties. The strategy relies on partial least squares analysis and hierarchical clustering. The calculation of molecular descriptors is based on an in silico approach, and the environmental endpoints (i.e. environmental parameters) are extracted from several available databases and literature. The classification of 215 organic compounds inputted in TyPol for this proof-of-concept study showed that the combination of some specific molecular descriptors could be related to a particular behavior in the environment. TyPol also provided an analysis of similarities (or dissimilarities) between organic compounds and their degradation products. Among the 24 degradation products that were inputted, 58% were found in the same cluster as their parents. The robustness of the method was tested and shown to be good. TyPol could help to predict the environmental behavior of a "new" compound (parent compound or degradation product) from its affiliation to one cluster, but also to select representative substances from a large data set in order to answer some specific questions regarding their behavior in the environment.

Keywords: Clustering; Degradation products; Environmental fate; Molecular modeling; Partial least squares; Pesticides.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Environment
  • Environmental Pollutants / chemistry*
  • Environmental Pollutants / classification
  • Least-Squares Analysis
  • Organic Chemicals / chemistry*
  • Organic Chemicals / classification

Substances

  • Environmental Pollutants
  • Organic Chemicals