In the present study, ninety-five halogenated dioxins and related chemicals (dibenzo-p-dioxins, dibenzofurans, biphenyls, and naphthalene) with endpoint pEC50 were used to develop twelve quantitative structure toxicity relationship (QSTR) models using inbuilt Monte Carlo algorithm of CORAL software. The hybrid optimal descriptor of correlation weights (DCW) using a combination of SMILES and HSG (hydrogen suppressed graph) was employed to generate QSTR models. Three target functions i.e. TF1 (WIIC=WCII=0), TF2 (WIIC= 0.3 & WCII=0) and TF3 (WIIC= 0.0 &WCII=0.3) were employed to develop robust QSTR models and the statistical outcomes of each target function were compared with each other. The correlation intensity index (CII) was found a reliable benchmark of the predictive potential for QSTR models. The numerical value of the determination coefficient of the validation set of split 1 computed by TF3 was found highest (RValid2=0.8438). The fragments responsible for the toxicity of dioxins and related chemicals were also identified in terms of the promoter of increase/decrease for pEC50. Three random splits (Split 1, Split 2 and Split 4) were selected for the extraction of the promoter of increase/decrease for pEC50. In the last, consensus modelling was performed using the intelligent consensus tool of DTC lab (https://dtclab.webs.com/software-tools). The original consensus model, which was created by combining four distinct models employing the split 4 arrangement, was more predictive for the validation set and the numerical value of the determination coefficient of the test set (validation set) was increased from 0.8133 to 0.9725. For the validation set of split 4, the mean absolute error (MAE 100%) was also lowered from 0.513 to 0.2739.
Keywords: CII; CORAL; Consensus modelling; Dioxins; IIC; QSTR.
Copyright © 2022 Elsevier B.V. All rights reserved.