The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: II. Evaluation of tandem mass spectrometry methodologies for large-scale protein analysis, and the application of statistical tools for data analysis and interpretation

Priska D von Haller; Eugene Yi; Samuel Donohoe; Kelly Vaughn; Andrew Keller; Alexey I Nesvizhskii; Jimmy Eng; Xiao-jun Li; David R Goodlett; Ruedi Aebersold; Julian D Watts

doi:10.1074/mcp.M300041-MCP200

The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: II. Evaluation of tandem mass spectrometry methodologies for large-scale protein analysis, and the application of statistical tools for data analysis and interpretation

Mol Cell Proteomics. 2003 Jul;2(7):428-42. doi: 10.1074/mcp.M300041-MCP200. Epub 2003 Jun 25.

Authors

Priska D von Haller¹, Eugene Yi, Samuel Donohoe, Kelly Vaughn, Andrew Keller, Alexey I Nesvizhskii, Jimmy Eng, Xiao-jun Li, David R Goodlett, Ruedi Aebersold, Julian D Watts

Affiliation

¹ Institute for Systems Biology, 1441 North 34th Street, Seattle, WA 98103, USA.

PMID: 12832459
DOI: 10.1074/mcp.M300041-MCP200

Abstract

Proteomic approaches to biological research that will prove the most useful and productive require robust, sensitive, and reproducible technologies for both the qualitative and quantitative analysis of complex protein mixtures. Here we applied the isotope-coded affinity tag (ICAT) approach to quantitative protein profiling, in this case proteins that copurified with lipid raft plasma membrane domains isolated from control and stimulated Jurkat human T cells. With the ICAT approach, cysteine residues of the two related protein isolates were covalently labeled with isotopically normal and heavy versions of the same reagent, respectively. Following proteolytic cleavage of combined labeled proteins, peptides were fractionated by multidimensional chromatography and subsequently analyzed via automated tandem mass spectrometry. Individual tandem mass spectrometry spectra were searched against a human sequence database, and a variety of recently developed, publicly available software applications were used to sort, filter, analyze, and compare the results of two repetitions of the same experiment. In particular, robust statistical modeling algorithms were used to assign measures of confidence to both peptide sequences and the proteins from which they were likely derived, identified via the database searches. We show that by applying such statistical tools to the identification of T cell lipid raft-associated proteins, we were able to estimate the accuracy of peptide and protein identifications made. These tools also allow for determination of the false positive rate as a function of user-defined data filtering parameters, thus giving the user significant control over and information about the final output of large-scale proteomic experiments. With the ability to assign probabilities to all identifications, the need for manual verification of results is substantially reduced, thus making the rapid evaluation of large proteomic datasets possible. Finally, by repeating the experiment, information relating to the general reproducibility and validity of this approach to large-scale proteomic analyses was also obtained.

Publication types

Evaluation Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Amino Acid Sequence
Cysteine / chemistry
Data Interpretation, Statistical*
Databases, Protein
Evaluation Studies as Topic
Humans
Isotope Labeling*
Isotopes / chemistry
Jurkat Cells
Mass Spectrometry*
Membrane Microdomains / chemistry
Proteins / analysis*
Proteins / chemistry
Proteome / analysis
Proteomics
Software*
T-Lymphocytes / chemistry

Substances

Isotopes
Proteins
Proteome
Cysteine

Abstract

Publication types

MeSH terms

Substances

Grants and funding