Evaluation performance of substitution matrices,based on contacts between residue terminal groups |
| |
Authors: | Boris Vishnepolsky Grigol Managadze Maya Grigolava |
| |
Institution: | 1. Life Science Research Centre, Laboratory of Bioinformatics , 14 Gotua St, Tbilisi , 0160 , Georgia b.vishnepolsky@lifescience.org.ge;3. Life Science Research Centre, Laboratory of Bioinformatics , 14 Gotua St, Tbilisi , 0160 , Georgia |
| |
Abstract: | Sequence alignment is a standard method for the estimation of the evolutionary, structural, and functional relationships among amino acid sequences. The quality of alignments depends on the used similarity matrix. Statistical contact potentials (CPs) contain information on contact propensities among residues in native protein structures. Substitution matrices (SMs) based on CPs are applicable for the comparison of distantly related sequences. Here, contact between amino acids was estimated on the basis of the evaluation of the distances between side-chain terminal groups (SCTGs), which are defined as the group of the side-chain heavy atoms with fixed distances between them. In this paper, two new types of CPs and similarity matrices have been constructed: one based on fixed cutoff distance obtained from geometric characteristics of the SCTGs (TGC1), while the other is distance-dependent potential (TGC2). These matrices are compared with other popular SMs. The performance of the matrices was evaluated by comparing sequence with structural alignments. The obtained results show that TGC2 has the best performance among contact-based matrices, but on the whole, contact-based matrices have slightly lower performance than other SMs except fold-level similarity. |
| |
Keywords: | contact potentials alignment protein structure prediction twilight zone fold recognition |
|
|