Establishing the yeast Saccharomyces cerevisiae as a system for expression of human proteins on a proteome-scale

J Struct Funct Genomics. 2003;4(2-3):97-108. doi: 10.1023/a:1026226429429.

Abstract

Structural genomics requires the application of a standardised process for overexpression of soluble proteins that allows high-throughput purification and analysis of protein products. We have developed a highly parallel approach to protein expression, including the simultaneous expression screening of a large number of cDNA clones in an appropriate vector system and the use of a protease-deficient host strain. A set of 221 human genes coding for proteins of various sizes with unknown structures was selected to evaluate the system. We transferred the cDNAs from an E. coli vector to the yeast expression vector by recombinational cloning, avoiding time-consuming recloning steps and the use of restriction enzymes in the cloning process. The subcloning yield was 95%, provided that a PCR fragment of the correct size could be obtained. Sixty percent of these proteins were expressed as soluble products at detectable levels and 48% were successfully purified under native conditions using the His6 tag fusion. The advantages of the developed yeast-based expression system are the ease of manipulation and cultivation of S. cerevisiae in the same way as with prokaryotic hosts and the ability to introduce post-translational modifications of proteins if required, thus being an attractive system for heterologous expression of mammalian proteins. The expression clones selected in this screening process are passed on to the fermentation process in order to provide milligram amounts of proteins for structure analysis within the 'Berlin Protein Structure Factory'. All data generated is stored in a relational database and is available on our website (http://www.proteinstrukturfabrik.de).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aspartic Acid Endopeptidases / genetics
  • Base Sequence
  • Cloning, Molecular / methods
  • Escherichia coli
  • Gene Expression
  • Genetic Vectors
  • Histidine / genetics
  • Humans
  • Molecular Sequence Data
  • Protein Engineering / methods*
  • Proteins / genetics*
  • Proteins / metabolism
  • Proteome
  • Recombinant Fusion Proteins / genetics
  • Recombinant Fusion Proteins / metabolism
  • Saccharomyces cerevisiae / genetics*
  • Saccharomyces cerevisiae / metabolism

Substances

  • Proteins
  • Proteome
  • Recombinant Fusion Proteins
  • Histidine
  • aspartic proteinase A
  • Aspartic Acid Endopeptidases