Remote homolog detection using local sequence-structure correlations |
| |
Authors: | Hou Yuna Hsu Wynne Lee Mong Li Bystroff Christopher |
| |
Affiliation: | School of Computing, National University of Singapore, Singapore. houyuna@comp.nus.edu.sg |
| |
Abstract: | Remote homology detection refers to the detection of structural homology in proteins when there is little or no sequence similarity. In this article, we present a remote homolog detection method called SVM-HMMSTR that overcomes the reliance on detectable sequence similarity by transforming the sequences into strings of hidden Markov states that represent local folding motif patterns. These state strings are transformed into fixed-dimension feature vectors for input to a support vector machine. Two sets of features are defined: an order-independent feature set that captures the amino acid and local structure composition; and an order-dependent feature set that captures the sequential ordering of the local structures. Tests using the Structural Classification of Proteins (SCOP) 1.53 data set show that the SVM-HMMSTR gives a significant improvement over several current methods. |
| |
Keywords: | remote homology local structure support vector machines hidden Markov model protein folding I-sites HMMSTR |
本文献已被 PubMed 等数据库收录! |