A Study of Residue Correlation within Protein Sequences and Its Application to Sequence Classification |
| |
Authors: | Chris Hemmerich, Sun Kim |
| |
Affiliation: | 1. Center for Genomics and Bioinformatics, Indiana University, 1001 E. 3rd Street, Bloomington, Indiana, IN 47405-3700, USA; 2. School of Informatics, Center for Genomics and Bioinformatics, Indiana University, 901 E. 10th Street, Bloomington, Indiana, IN 47408-3912, USA |
| |
Abstract: | We investigate methods of estimating residue correlation within protein sequences. We begin with the mutual information (MI) of adjacent residues, then improve the methodology by defining the mutual information vector (MIV), which estimates long-range correlations between nonadjacent residues. We also consider correlation based on residue hydropathy rather than protein-specific interactions. Finally, in family classification experiments, the modeling power of MIV was significantly better than that of the classic MI method, reaching the level where proteins can be classified without alignment information. |
| |
Keywords: | |
This article is indexed in SpringerLink and other databases. |
|
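The two quantities named in the abstract, the mutual information of adjacent residues and the mutual information vector over nonadjacent residues, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the `gap` parameter, the default `max_gap=19`, and the choice to estimate marginal frequencies from the observed residue pairs are all assumptions made for illustration; the record above does not specify them.

```python
from collections import Counter
from math import log2


def mutual_information(seq: str, gap: int = 0) -> float:
    """Plug-in estimate of MI (in bits) between residues that have
    `gap` positions between them; gap=0 means adjacent residues.

    Assumption: marginals are estimated from the observed pairs,
    which may differ from the estimator used in the paper.
    """
    step = gap + 1
    pairs = [(seq[i], seq[i + step]) for i in range(len(seq) - step)]
    if not pairs:
        return 0.0  # sequence too short for this gap

    n = len(pairs)
    pair_counts = Counter(pairs)
    left_counts = Counter(a for a, _ in pairs)
    right_counts = Counter(b for _, b in pairs)

    mi = 0.0
    for (a, b), count in pair_counts.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written with raw counts
        ratio = p_ab * n * n / (left_counts[a] * right_counts[b])
        mi += p_ab * log2(ratio)
    return mi


def mutual_information_vector(seq: str, max_gap: int = 19) -> list[float]:
    """Hypothetical MIV: one MI estimate per gap from 0 to max_gap."""
    return [mutual_information(seq, gap) for gap in range(max_gap + 1)]


if __name__ == "__main__":
    example = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"  # arbitrary toy sequence
    print(mutual_information(example))
    print(mutual_information_vector(example, max_gap=4))
```

A vector like this could serve as a fixed-length feature representation of a sequence, which is the sense in which classification "without alignment information" becomes possible. For the hydropathy variant mentioned in the abstract, each residue would first be mapped to a hydropathy class before the pairs are counted; that mapping is omitted here because the record does not give it.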