Population-scale proteome variation in human induced pluripotent stem cells

Elife. 2020 Aug 10:9:e57390. doi: 10.7554/eLife.57390.

Abstract

Human disease phenotypes are driven primarily by alterations in protein expression and/or function. To date, relatively little is known about the variability of the human proteome in populations and how this relates to variability in mRNA expression and to disease loci. Here, we present the first comprehensive proteomic analysis of human induced pluripotent stem cells (iPSC), a key cell type for disease modelling, analysing 202 iPSC lines derived from 151 donors, with integrated transcriptome and genomic sequence data from the same lines. We characterised the major genetic and non-genetic determinants of proteome variation across iPSC lines and assessed key regulatory mechanisms affecting variation in protein abundance. We identified 654 protein quantitative trait loci (pQTLs) in iPSCs, including disease-linked variants in protein-coding sequences and variants with trans regulatory effects. These include pQTL linked to GWAS variants that cannot be detected at the mRNA level, highlighting the utility of dissecting pQTL at peptide level resolution.

Keywords: deleterious variants; genetics; genomics; human; induced pluripotent stem cells; proteomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Child
  • Child, Preschool
  • Disease / genetics*
  • Female
  • Genetic Variation*
  • Genetics, Population
  • Genotype
  • Humans
  • Induced Pluripotent Stem Cells / metabolism*
  • Infant
  • Infant, Newborn
  • Male
  • Middle Aged
  • Phenotype
  • Proteome*
  • Proteomics
  • Quantitative Trait Loci
  • RNA, Messenger / genetics
  • Transcriptome*
  • Young Adult

Substances

  • Proteome
  • RNA, Messenger