Expression quantitative trait loci (eQTLs) have been touted as the missing piece that can bridge the gap between genetic variants and phenotypes. Over the past decade, we have witnessed a sharp rise in efforts to identify and apply eQTLs. The successful application of eQTLs relies heavily on their reproducibility. Current eQTL databases, such as Genotype-Tissue Expression (GTEx), were populated primarily with eQTLs derived from germline single nucleotide polymorphisms and normal tissue gene expression. Novel scenarios that employ eQTL models for prediction purposes, however, often involve disease phenotypes characterized by altered gene expression. To evaluate eQTL reproducibility across diverse data sources and the effect of disease-specific gene expression alteration on eQTL identification, we conducted an eQTL study using 5178 samples from The Cancer Genome Atlas (TCGA). We found that the reproducibility of eQTLs between normal and tumor tissues was low in terms of the number of shared eQTLs. However, among the shared eQTLs, the effect directions were generally concordant. This suggests that the source of the gene expression (normal or tumor tissue) has a strong effect on the detectable eQTLs and the effect direction of the eQTLs. Additional analyses demonstrated good directional concordance of eQTLs between GTEx and TCGA. Furthermore, we found that multi-tissue eQTLs may exert opposite effects across multiple tissue types. In summary, our results suggest that eQTL prediction models need to carefully address the tissue and disease dependency of eQTLs. Tissue-disease-specific eQTL databases could enable more accurate prediction models in future studies.
Keywords: SNP; disease specificity; eQTL; tissue specificity.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved.