Knowledge Generation with Rule Induction in Cancer Omics

Int J Mol Sci. 2019 Dec 18;21(1):18. doi: 10.3390/ijms21010018.

Abstract

The explosion of omics data availability in cancer research has boosted the knowledge of the molecular basis of cancer, although the strategies for its definitive resolution are still not well established. The complexity of cancer biology, given by the high heterogeneity of cancer cells, leads to the development of pharmacoresistance for many patients, hampering the efficacy of therapeutic approaches. Machine learning techniques have been implemented to extract knowledge from cancer omics data in order to address fundamental issues in cancer research, as well as the classification of clinically relevant sub-groups of patients and for the identification of biomarkers for disease risk and prognosis. Rule induction algorithms are a group of pattern discovery approaches that represents discovered relationships in the form of human readable associative rules. The application of such techniques to the modern plethora of collected cancer omics data can effectively boost our understanding of cancer-related mechanisms. In fact, the capability of these methods to extract a huge amount of human readable knowledge will eventually help to uncover unknown relationships between molecular attributes and the malignant phenotype. In this review, we describe applications and strategies for the usage of rule induction approaches in cancer omics data analysis. In particular, we explore the canonical applications and the future challenges and opportunities posed by multi-omics integration problems.

Keywords: TCGA (The Cancer Genome Atlas); cancer; machine learning; omics data; patients classification; rule induction.

Publication types

  • Review

MeSH terms

  • Computational Biology / methods
  • Databases, Genetic
  • Genomics* / methods
  • Humans
  • Machine Learning
  • Metabolomics* / methods
  • Neoplasms / etiology*
  • Neoplasms / metabolism*
  • Proteomics* / methods

Grants and funding