Strategy to design improved proteomic experiments based on statistical analyses of the chemical properties of identified peptides

J Proteome Res. 2005 Nov-Dec;4(6):2201-6. doi: 10.1021/pr050290o.

Abstract

Proteomics is an emerging field that uses many types of proteomic platforms however has few standardized procedures. Deciding which platform to use to perform large-scale proteomic studies is either based on personal preference or on so-called "figures of merit" such as dynamic range, resolution, and the limit of detection; these factors are often insufficient to predict the outcome of the experiment as the detection of peptides correlates to the chemical properties of each peptide. There is a need for a novel figure of merit that describes the overall performance of a platform based on measured output, which in proteomics is often a list of identified peptides. We report the development of such a figure of merit based on a predictive genetic algorithm. This algorithm takes into account the properties of the observed peptides such as length, hydrophobicity, and pI. Several large-scale studies that differed in sample type or platform were used to demonstrate the usefulness of the algorithm for improved experimental design. The figures that were obtained were clustered to find platforms that were biased in similar ways. Even though some platforms are different, they lead to the identification of similar peptide types and are thus redundant. The algorithm can thus be used as an exploratory tool to suggest a minimal number of complementary experiments in order to maximize experimental efficiency.

MeSH terms

  • Algorithms
  • Blood Proteins / chemistry*
  • Cluster Analysis
  • Computational Biology
  • Genomics
  • Humans
  • Mass Spectrometry
  • Models, Genetic
  • Molecular Weight
  • Peptides / chemistry*
  • Proteome
  • Proteomics / methods*
  • Trypsin / pharmacology

Substances

  • Blood Proteins
  • Peptides
  • Proteome
  • Trypsin