Better diagnostic signatures from RNAseq data through use of auxiliary co-data

Bioinformatics. 2017 May 15;33(10):1572-1574. doi: 10.1093/bioinformatics/btw837.

Abstract

Summary: Our aim is to improve omics based prediction and feature selection using multiple sources of auxiliary information: co-data. Adaptive group regularized ridge regression (GRridge) was proposed to achieve this by estimating additional group-based penalty parameters through an empirical Bayes method at a low computational cost. We illustrate the GRridge method and software on RNA sequencing datasets. The method boosts the performance of an ordinary ridge regression and outperforms other classifiers. Post-hoc feature selection maintains the predictive ability of the classifier with far fewer markers.

Availability and implementation: GRridge is an R package that includes a vignette. It is freely available at ( https://bioconductor.org/packages/GRridge/ ). All information and R scripts used in this study, including those on retrieval and processing of the co-data, are available from http://github.com/markvdwiel/GRridgeCodata .

Contact: [email protected].

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Bayes Theorem
  • Early Detection of Cancer / methods
  • Female
  • Genomics / methods*
  • Humans
  • Models, Genetic*
  • Molecular Sequence Annotation
  • Sequence Analysis, RNA / methods*
  • Software*
  • Uterine Cervical Neoplasms / diagnosis
  • Uterine Cervical Neoplasms / genetics