Protein structural alignments and functional genomics

Proteins. 2001 Feb 15;42(3):378-82. doi: 10.1002/1097-0134(20010215)42:3<378::aid-prot70>3.0.co;2-3.

Abstract

Structural genomics-the systematic solution of structures of the proteins of an organism-will increasingly often produce molecules of unknown function with no close relative of known function. Prediction of protein function from structure has thereby become a challenging problem of computational molecular biology. The strong conservation of active site conformations in homologous proteins suggests a method for identifying them. This depends on the relationship between size and goodness-of-fit of aligned substructures in homologous proteins. For all pairs of proteins studied, the root-mean-square deviation (RMSD) as a function of the number of residues aligned varies exponentially for large common substructures and linearly for small common substructures. The exponent of the dependence at large common substructures is well correlated with the RMSD of the core as originally calculated by Chothia and Lesk (EMBO J 1986;5:823-826), affording the possibility of reconciling different structural alignment procedures. In the region of small common substructures, reduced aligned subsets define active sites and can be used to suggest the locations of active sites in homologous proteins.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacillus subtilis
  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / genetics
  • Binding Sites
  • Computational Biology*
  • Escherichia coli
  • Escherichia coli Proteins*
  • Genomics
  • Papain / chemistry*
  • Protein Conformation

Substances

  • Bacterial Proteins
  • Escherichia coli Proteins
  • YabJ protein, Bacillus subtilis
  • YjgF protein, Bacteria
  • YjgF protein, E coli
  • Papain