Amino acid substitution matrices from protein blocks

S Henikoff; J G Henikoff

doi:10.1073/pnas.89.22.10915

Amino acid substitution matrices from protein blocks

Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9. doi: 10.1073/pnas.89.22.10915.

Authors

S Henikoff¹, J G Henikoff

Affiliation

¹ Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, WA 98104.

Abstract

Methods for alignment of protein sequences typically measure similarity by using a substitution matrix with scores for all possible exchanges of one amino acid with another. The most widely used matrices are based on the Dayhoff model of evolutionary rates. Using a different approach, we have derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins. This led to marked improvements in alignments and in searches using queries from each of the groups.

Publication types

Comparative Study
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms
Amino Acid Sequence*
Animals
Caenorhabditis elegans / genetics
Drosophila / genetics
Lod Score
Mathematics
Molecular Sequence Data
Probability
Proteins / chemistry
Proteins / genetics*
Sequence Homology, Amino Acid*
Software*

Substances

Proteins