Data management and preliminary data analysis in the pilot phase of the HUPO Plasma Proteome Project

Marcin Adamski; Thomas Blackwell; Rajasree Menon; Lennart Martens; Henning Hermjakob; Chris Taylor; Gilbert S Omenn; David J States

doi:10.1002/pmic.200500186

Data management and preliminary data analysis in the pilot phase of the HUPO Plasma Proteome Project

Proteomics. 2005 Aug;5(13):3246-61. doi: 10.1002/pmic.200500186.

Authors

Marcin Adamski¹, Thomas Blackwell, Rajasree Menon, Lennart Martens, Henning Hermjakob, Chris Taylor, Gilbert S Omenn, David J States

Affiliation

¹ University of Michigan, Ann Arbor, MI 48109-2218, USA.

PMID: 16104057
DOI: 10.1002/pmic.200500186

Abstract

The pilot phase of the HUPO Plasma Proteome Project (PPP) is an international collaboration to catalog the protein composition of human blood plasma and serum by analyzing standardized aliquots of reference serum and plasma specimens using a variety of experimental techniques. Data management for this project included collection, integration, analysis, and dissemination of findings from participating organizations world-wide. Accomplishing this task required a communication and coordination infrastructure specific enough to support meaningful integration of results from all participants, but flexible enough to react to changing requirements and new insights gained during the course of the project and to allow participants with varying informatics capabilities to contribute. Challenges included integrating heterogeneous data, reducing redundant information to minimal identification sets, and data annotation. Our data integration workflow assembles a minimal and representative set of protein identifications, which account for the contributed data. It accommodates incomplete concordance of results from different laboratories, ambiguity and redundancy in contributed identifications, and redundancy in the protein sequence databases. Recommendations of the PPP for future large-scale proteomics endeavors are described.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms
Blood Proteins / chemistry*
Databases, Protein
False Positive Reactions
Humans
International Cooperation
Models, Statistical
Peptides / chemistry
Pilot Projects
Poisson Distribution
Proteome / chemistry*
Proteomics / methods*
Reference Values
Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization

Substances

Blood Proteins
Peptides
Proteome