Application of constraint programming techniques for structure prediction of lattice proteins with extended alphabets

Bioinformatics. 1999 Mar;15(3):234-42. doi: 10.1093/bioinformatics/15.3.234.

Abstract

Motivation: Predicting the ground state of biopolymers is a notoriously hard problem in biocomputing. Model systems, such as lattice proteins, are simple tools and valuable to test and improve new methods. Best known are models with sequences composed from a binary (hydrophobic and polar) alphabet. The major drawback is the degeneracy, i.e. the number of different ground state conformations.

Results: We show how recently developed constraint programming techniques can be used to solve the structure prediction problem efficiently for a higher order alphabet. To our knowledge it is the first report of an exact and computationally feasible solution to model proteins of length up to 36 and without resorting to maximally compact states. We further show that degeneracy is reduced by more than one order of magnitude and that ground state conformations are not necessarily compact. Therefore, more realistic protein simulations become feasible with our model.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Computer Simulation*
  • Models, Molecular*
  • Molecular Sequence Data
  • Protein Conformation*
  • Proteins / genetics
  • Software
  • Thermodynamics

Substances

  • Proteins