Bayesian analysis and inference from QSAR predictive model results

R M McDowell; J S Jaworska

doi:10.1080/10629360290002280

SAR QSAR Environ Res. 2002 Mar;13(1):111-25. doi: 10.1080/10629360290002280.

Authors

R M McDowell¹, J S Jaworska

Affiliation

¹ Animal and Plant Health Inspection Service, US Department of Agriculture, Riverdale, MD 20737, USA.

PMID: 12074380
DOI: 10.1080/10629360290002280

Abstract

QSAR models have been under development for decades, but acceptance and use of model results have been slow, in part because there is no widely accepted metric for assessing their reliability. We reapply a method commonly used in quantitative epidemiology and medical decision-making for evaluating the results of screening tests to assess the reliability of a QSAR model. It quantifies the accuracy (expressed as sensitivity and specificity) of QSAR models as conditional probabilities of correct and incorrect classification of a chemical characteristic, given the true characteristic. Using Bayes' formula, these conditional probabilities are combined with prior information to generate a posterior distribution that gives the probability that a specific chemical has a particular characteristic, given a model prediction. As an example, we apply this approach to evaluate the predictive reliability of a CATABOL model and of the "ready" and "not ready" biodegradability classification based on it. Finally, we show how the predictive capability of the model can be improved by sequential use of two models, the first one with high sensitivity and the second with high specificity.
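The Bayesian update described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the prior of 0.3, and the sensitivity and specificity values are hypothetical and are not taken from the paper or from CATABOL. The sequential step additionally assumes that the two models err independently given the true characteristic, which the abstract does not state.

```python
def posterior_given_positive(sensitivity, specificity, prior):
    """P(chemical has the characteristic | model predicts it has it)."""
    true_pos = sensitivity * prior                    # P(predict +, truly +)
    false_pos = (1.0 - specificity) * (1.0 - prior)   # P(predict +, truly -)
    return true_pos / (true_pos + false_pos)


def posterior_given_negative(sensitivity, specificity, prior):
    """P(chemical lacks the characteristic | model predicts it lacks it)."""
    true_neg = specificity * (1.0 - prior)            # P(predict -, truly -)
    false_neg = (1.0 - sensitivity) * prior           # P(predict -, truly +)
    return true_neg / (true_neg + false_neg)


# Hypothetical inputs (illustrative only, not values from the paper).
prior = 0.30  # assumed prior probability of the characteristic

# First model: high sensitivity, modest specificity.
after_first = posterior_given_positive(0.95, 0.70, prior)

# Second model: high specificity. The posterior from the first model is
# reused as the prior, assuming the two models' errors are conditionally
# independent given the true characteristic.
after_second = posterior_given_positive(0.70, 0.95, after_first)

print(f"after first model:  {after_first:.3f}")
print(f"after second model: {after_second:.3f}")
```

With these illustrative inputs, a positive prediction from the second model raises the probability well above what the first model alone supports. This is the kind of gain the sequential scheme is meant to provide, though the size of the gain depends entirely on the assumed values.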

MeSH terms

Bayes Theorem
Decision Making
Environmental Pollutants / adverse effects
Environmental Pollutants / pharmacology*
Forecasting
Humans
Models, Chemical*
Public Health
Sensitivity and Specificity
Structure-Activity Relationship

Substances

Environmental Pollutants