Is protein classification necessary? Toward alternative approaches to function annotation

Curr Opin Struct Biol. 2009 Jun;19(3):363-8. doi: 10.1016/j.sbi.2009.02.001. Epub 2009 Mar 5.

Abstract

The current nonredundant protein sequence database contains over seven million entries and the number of individual functional domains is significantly larger than this value. The vast quantity of data associated with these proteins poses enormous challenges to any attempt at function annotation. Classification of proteins into sequence and structural groups has been widely used as an approach to simplifying the problem. In this article we question such strategies. We describe how the multifunctionality and structural diversity of even closely related proteins confounds efforts to assign function on the basis of overall sequence or structural similarity. Rather, we suggest that strategies that avoid classification may offer a more robust approach to protein function annotation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Amino Acid Sequence
  • Computational Biology / methods*
  • Humans
  • Molecular Sequence Data
  • Proteins / chemistry
  • Proteins / classification*
  • Proteins / metabolism*

Substances

  • Proteins