Approximate p-values for local sequence alignments: numerical studies

J D Storey; D Siegmund

doi:10.1089/106652701753216530

Approximate p-values for local sequence alignments: numerical studies

J Comput Biol. 2001;8(5):549-56. doi: 10.1089/106652701753216530.

Authors

J D Storey¹, D Siegmund

Affiliation

¹ Department of Statistics, Stanford University, Stanford, CA 94305, USA.

PMID: 11694182
DOI: 10.1089/106652701753216530

Abstract

Siegmund and Yakir (2000) have given an approximate p-value when two independent, identically distributed sequences from a finite alphabet are optimally aligned based on a scoring system that rewards similarities according to a general scoring matrix and penalizes gaps (insertions and deletions). The approximation involves an infinite sequence of difficult-to-compute parameters. In this paper, it is shown by numerical studies that these reduce to essentially two numerically distinct parameters, which can be computed as one-dimensional numerical integrals. For an arbitrary scoring matrix and affine gap penalty, this modified approximation is easily evaluated. Comparison with published numerical results show that it is reasonably accurate.

Publication types

Comparative Study
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Models, Molecular*
Sequence Alignment / methods*