Motivation: Data mining tools are proposed to establish mechanistic connections between chemotypes and specific cellular functions. Drawing on a previous study that classified the cellular response patterns of growth inhibition measurements log( GI(50)) from the National Cancer Institute's (NCI's) anticancer screen, we have examined additional data for mRNA expression, sets of known molecular targets and mutational status against these same tumor cell lines to relate chemosensitivity more precisely to biochemical pathways.
Results: Our analysis finds that gene expression levels do not, in general, correlate with log(GI(50)) measurements, instead they reflect a generic toxic condition. Within the remaining set of non-generic conditions, examples were found where a correlation suggesting a biochemical basis for cellular cytotoxicity could be supported. These included reconfirmation of previously observed associations between mutant and wild-type status of p53, and chemosensitivity to alkylating agents, while extending these results to reveal associations with gamma-induced expressions of MDM2, WAF1 and GADD45, signals that were not apparent in measurements of basal mRNA expression levels for any of these genes. Additional examinations revealed that mRNA expression levels directly correlated with paclitaxel chemosensitivity to mitosis, while also identifying additional chemotypes as P-glycoprotein substrates. Our analysis revealed well-known direct associations between p16 mutant status and chemotypes implicated in cell cycle control, and extended these results to include expression levels for three additional tyrosine kinase proteins (TEK, transgelin and hCdc4). Links were also found that suggested associations between chemosensitivity and the endocrine, paracrine ligand-receptor loops, via expression of the adrenergic receptor, calcium second messenger pathways via expression levels of carbonic anhydrase and cellular communication pathways via fibrillin.