The CATH protein family database: a resource for structural and functional annotation of genomes

Christine A Orengo; James E Bray; Daniel W A Buchan; Andrew Harrison; David Lee; Frances M G Pearl; Ian Sillitoe; Annabel E Todd; Janet M Thornton

The CATH protein family database: a resource for structural and functional annotation of genomes

Proteomics. 2002 Jan;2(1):11-21.

Authors

Christine A Orengo¹, James E Bray, Daniel W A Buchan, Andrew Harrison, David Lee, Frances M G Pearl, Ian Sillitoe, Annabel E Todd, Janet M Thornton

Affiliation

¹ Department of Biochemistry and Molecular Biology, University College, London, UK. [email protected]

PMID: 11788987

Abstract

Over the last decade, there have been huge increases in the numbers of protein sequences and structures determined. In parallel, many methods have been developed for recognising similarities between these proteins, arising from their common evolutionary background, and for clustering such relatives into protein families. Here we review some of the protein family resources available to the biologist and describe how these can be used to provide structural and functional annotations for newly determined sequences. In particular we describe recent developments to the CATH domain database of protein structural families which have facilitated genome annotation and which have also revealed important caveats that must be considered when transferring functional data between homologous proteins.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Databases, Protein*
Genome*
Protein Conformation