Hidden Markov models-based system (HMMSPECTR) for detecting structural homologies on the basis of sequential information |
| |
Authors: | Tsigelny Igor Sharikov Yuriy Ten Eyck Lynn F |
| |
Affiliation: | Department of Chemistry and Biochemistry 0654, University of California-San Diego, La Jolla, CA 92093, USA. itsigeln@ucsd.edi |
| |
Abstract: | HMMSPECTR is a tool for finding putative structural homologs for proteins with known primary sequences. HMMSPECTR contains four major components: a data warehouse with the hidden Markov models (HMM) and alignment libraries; a search program which compares the initial protein sequences with the libraries of HMMs; a secondary structure prediction and comparison program; and a dominant protein selection program that prepares the set of 10-15 "best" proteins from the chosen HMMs. The data warehouse contains four libraries of HMMs. The first two libraries were constructed using different HHM preparation options of the HAMMER program. The third library contains parts ("partial HMM") of initial alignments. The fourth library contains trained HMMs. We tested our program against all of the protein targets proposed in the CASP4 competition. The data warehouse included libraries of structural alignments and HMMs constructed on the basis of proteins publicly available in the Protein Data Bank before the CASP4 meeting. The newest fully automated versions of HMMSPECTR 1.02 and 1.02ss produced better results than the best result reported at CASP4 either by r.m.s.d. or by length (or both) in 64% (HMMSPECTR 1.02) and 79% (HMMSPECTR 1.02ss) of the cases. The improvement is most notable for the targets with complexity 4 (difficult fold recognition cases). |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|