The InterPro Database, 2003 brings increased coverage and new features

Nucleic Acids Res. 2003 Jan 1;31(1):315-8. doi: 10.1093/nar/gkg046.

Abstract

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created in 1999 as a means of amalgamating the major protein signature databases into one comprehensive resource. PROSITE, Pfam, PRINTS, ProDom, SMART and TIGRFAMs have been manually integrated and curated and are available in InterPro for text- and sequence-based searching. Search results are provided in a single format that rationalises the output that would be obtained by searching the member databases individually. The latest release of InterPro contains 5629 entries describing 4280 families, 1239 domains, 95 repeats and 15 post-translational modifications. Currently, the combined signatures in InterPro cover more than 74% of all proteins in SWISS-PROT and TrEMBL, an increase of nearly 15% since the inception of InterPro. New features of the database include improved searching capabilities and enhanced graphical user interfaces for visualisation of the data. The database is available via a web server (http://www.ebi.ac.uk/interpro) and anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computer Graphics
  • Databases, Protein*
  • Protein Processing, Post-Translational
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteins / metabolism
  • Repetitive Sequences, Amino Acid
  • User-Computer Interface

Substances

  • Proteins