Model identification for DNA sequence-structure relationships |
| |
Authors: | Hawley Stephen Dwyer Chiu Anita Chizeck Howard Jay |
| |
Affiliation: | Cogent Systems Research Laboratory, Department of Electrical Engineering, University of Washington, Seattle, WA 98195, USA. |
| |
Abstract: | We investigate the use of algebraic state-space models for the sequence dependent properties of DNA. By considering the DNA sequence as an input signal, rather than using an all atom physical model, computational efficiency is achieved. A challenge in deriving this type of model is obtaining its structure and estimating its parameters. Here we present two candidate model structures for the sequence dependent structural property Slide and a method of encoding the models so that a recursive least squares algorithm can be applied for parameter estimation. These models are based on the assumption that the value of Slide at a base-step is determined by the surrounding tetranucleotide sequence. The first model takes the four bases individually as inputs and has a median root mean square deviation of 0.90 A. The second model takes the four bases pairwise and has a median root mean square deviation of 0.88 A. These values indicate that the accuracy of these models is within the useful range for structure prediction. Performance is comparable to published predictions of a more physically derived model, at significantly less computational cost. |
| |
Keywords: | DNA Sequence dependent structure Algebraic systems |
本文献已被 ScienceDirect PubMed 等数据库收录! |