The high-throughput protein sample production platform of the Northeast Structural Genomics Consortium

J Struct Biol. 2010 Oct;172(1):21-33. doi: 10.1016/j.jsb.2010.07.011. Epub 2010 Aug 3.

Abstract

We describe the core Protein Production Platform of the Northeast Structural Genomics Consortium (NESG) and outline the strategies used for producing high-quality protein samples. The platform is centered on the cloning, expression and purification of 6X-His-tagged proteins using T7-based Escherichia coli systems. The 6X-His tag allows for similar purification procedures for most targets and implementation of high-throughput (HTP) parallel methods. In most cases, the 6X-His-tagged proteins are sufficiently purified (>97% homogeneity) using a HTP two-step purification protocol for most structural studies. Using this platform, the open reading frames of over 16,000 different targeted proteins (or domains) have been cloned as>26,000 constructs. Over the past 10 years, more than 16,000 of these expressed protein, and more than 4400 proteins (or domains) have been purified to homogeneity in tens of milligram quantities (see Summary Statistics, http://nesg.org/statistics.html). Using these samples, the NESG has deposited more than 900 new protein structures to the Protein Data Bank (PDB). The methods described here are effective in producing eukaryotic and prokaryotic protein samples in E. coli. This paper summarizes some of the updates made to the protein production pipeline in the last 5 years, corresponding to phase 2 of the NIGMS Protein Structure Initiative (PSI-2) project. The NESG Protein Production Platform is suitable for implementation in a large individual laboratory or by a small group of collaborating investigators. These advanced automated and/or parallel cloning, expression, purification, and biophysical screening technologies are of broad value to the structural biology, functional proteomics, and structural genomics communities.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Cloning, Molecular
  • Databases, Protein
  • Electrophoresis, Polyacrylamide Gel
  • Escherichia coli / genetics
  • Genomics / methods*
  • Magnetic Resonance Spectroscopy
  • Proteins / chemistry
  • Proteins / genetics
  • Proteins / metabolism*
  • Proteomics / methods*
  • Recombinant Proteins / chemistry
  • Recombinant Proteins / isolation & purification
  • Recombinant Proteins / metabolism
  • Reproducibility of Results
  • Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization

Substances

  • Proteins
  • Recombinant Proteins