Protein secondary structure modelling with probabilistic networks

A L Delcher; S Kasif; H R Goldberg; W H Hsu

Protein secondary structure modelling with probabilistic networks

Proc Int Conf Intell Syst Mol Biol. 1993:1:109-17.

Authors

A L Delcher¹, S Kasif, H R Goldberg, W H Hsu

Affiliation

¹ Computer Science Dept., Loyola College, Baltimore, MD 21210, USA.

PMID: 7584325

Abstract

In this paper we study the performance of probabilistic networks in the context of protein sequence analysis in molecular biology. Specifically, we report the results of our initial experiments applying this framework to the problem of protein secondary structure prediction. One of the main advantages of the probabilistic approach we describe here is our ability to perform detailed experiments where we can experiment with different models. We can easily perform local substitutions (mutations) and measure (probabilistically) their effect on the global structure. Window-based methods do not support such experimentation as readily. Our method is efficient both during training and during prediction, which is important in order to be able to perform many experiments with different networks. We believe that probabilistic methods are comparable to other methods in prediction quality. In addition, the predictions generated by our methods have precise quantitative semantics which is not shared by other classification methods. Specifically, all the causal and statistical independence assumptions are made explicit in our networks thereby allowing biologists to study and experiment with different causal models in a convenient manner.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms
Bayes Theorem
Decision Trees
Markov Chains
Models, Genetic
Models, Molecular*
Mutation
Neural Networks, Computer
Protein Structure, Secondary*
Reproducibility of Results