The iProClass integrated database for protein functional analysis

Comput Biol Chem. 2004 Feb;28(1):87-96. doi: 10.1016/j.compbiolchem.2003.10.003.

Abstract

Increasingly, scientists have begun to tackle gene functions and other complex regulatory processes by studying organisms at a global scale across various levels of biological organization, ranging from genomes to metabolomes and physiomes. Meanwhile, new bioinformatics methods have been developed for inferring protein function using associative analysis of functional properties to complement the traditional sequence homology-based methods. Fully exploiting the value of high-throughput systems biology data and facilitating protein functional studies requires bioinformatics infrastructures that support both data integration and associative analysis. The iProClass database, designed to serve as a framework for data integration in a distributed networking environment, provides comprehensive descriptions of all proteins, with rich links to over 50 databases of protein family, function, pathway, interaction, modification, structure, genome, ontology, literature, and taxonomy. In particular, the database is organized with PIRSF family classification and maps to other family, function, and structure classification schemes. Coupled with the underlying taxonomic information for complete genomes, the iProClass system (http://pir.georgetown.edu/iproclass/) supports associative studies of protein family, domain, function, and structure. A case study of the phosphoglycerate mutases illustrates a systematic approach for protein family and phylogenetic analysis. Such studies may serve as a basis for further analysis of protein functional evolution and its relationship to the co-evolution of metabolic pathways, cellular networks, and organisms.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Computational Biology
  • Databases, Factual*
  • Genome, Human*
  • Humans
  • Molecular Biology / methods
  • Molecular Sequence Data
  • Phosphoglycerate Mutase / chemistry
  • Phosphoglycerate Mutase / genetics
  • Phosphoglycerate Mutase / metabolism
  • Phylogeny
  • Proteins / chemistry
  • Proteins / genetics
  • Proteins / metabolism*

Substances

  • Proteins
  • Phosphoglycerate Mutase