首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Construction of non-symmetric substitution matrices derived from proteomes with biased amino acid distributions
Authors:Bastien Olivier  Roy Sylvaine  Maréchal Eric
Institution:Laboratoire de Physiologie Cellulaire Végétale, Département Réponse et Dynamique Cellulaire, UMR 5019, CNRS-CEA-INRA-Université Joseph-Fourier, CEA Grenoble, 17, rue des Martyrs, 38054 Grenoble, France.
Abstract:Automatic comparison of compositionally biased genomes, such as that of the malarial causative agent Plasmodium falciparum (82% adenosine + thymidine), with genomes of average composition, is currently limited. Indeed, popular tools such as BLAST require that amino acid distributions be similar in aligned sequences. However, the P. falciparum genome is so biased that six amino acids account for more than 50% of the protein composition. One reason for the comparison methods failure lies in the compositional difference between the query and the subject proteomes, which is not taken into account in the amino acid substitution matrices. This paper introduces a method to derive substitution matrices, in particular BLOSUM 62, in the frame of the information theory. It allows the construction of non-symmetrical matrices, taking into account the non-symmetric amino acid distributions. The dirAtPf family of matrices allowing the comparison of P. falciparum and A. thaliana is given as an example. This paper further provides an analysis of the obtained matrices in the frame of the information theory, supporting the discrimination advantage they bring.
Keywords:Substitution matrix  BLOSUM  Biased genome  Information theory  Mutual information  Matrice de substitution  BLOSUM  Génome biaisé  Théorie de l'information  Information mutuelle
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号