TIVAN: tissue-specific cis-eQTL single nucleotide variant annotation and prediction

Bioinformatics. 2019 May 1;35(9):1573-1575. doi: 10.1093/bioinformatics/bty872.

Abstract

Summary: Predicting genetic regulatory variants, most of which locate in non-coding genomic regions, still remain a challenge in genetic research. Among all non-coding regulatory variants, cis-eQTL single nucleotide variants (SNVs) are of particular interest for their crucial role in regulating gene expression. Since different gene expression patterns are believed to contribute to the etiologies of different phenotypes, it is desirable to characterize the impact of cis-eQTL SNVs in a context-specific manner. Though computational methods for predicting the potential of variants being pathogenic or deleterious are well-established, methods for annotating and predicting cis-eQTL SNVs are under-developed. Here, we present TIVAN (TIssue-specific Variant ANnotation and prediction), an ensemble method of decision trees, to predict tissue-specific cis-eQTL SNVs. TIVAN is trained based on a comprehensive collection of features, including genome-wide genomic and epigenomic profiling data. As a result, TIVAN has been shown to accurately discriminate cis-eQTL SNVs from non-eQTL SNVs and perform favorably to other methods by obtaining higher five-fold cross-validation AUC values (CV-AUC) and Leave-One-Chromosome-Out predicted AUC values (LOCO-AUC) across 44 different tissues belonging to 27 different tissue classes. Finally, TIVAN consistently maintains top performance on an independent testing dataset, which includes 7 tissues in 11 studies.

Availability and implementation: TIVAN software is available at https://github.com/lichen-lab/TIVAN.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome-Wide Association Study
  • Genomics
  • Nucleotides
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci*
  • Software

Substances

  • Nucleotides