Efficient and accurate P-value computation for Position Weight Matrices |
| |
Authors: | Hélène Touzet Jean-Stéphane Varré |
| |
Affiliation: | 1. LIFL, UMR CNRS 8022, Université des Sciences et Technologies de Lille, 59655, Villeneuve d'Ascq, France 2. INRIA, 40 avenue Halley, 59650, Villeneuve d'Ascq, France
|
| |
Abstract: | Background Position Weight Matrices (PWMs) are probabilistic representations of signals in sequences. They are widely used to model approximate patterns in DNA or in protein sequences. The usage of PWMs needs as a prerequisite to knowing the statistical significance of a word according to its score. This is done by defining the P-value of a score, which is the probability that the background model can achieve a score larger than or equal to the observed value. This gives rise to the following problem: Given a P-value, find the corresponding score threshold. Existing methods rely on dynamic programming or probability generating functions. For many examples of PWMs, they fail to give accurate results in a reasonable amount of time. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|