P-value based visualization of codon usage data
Authors: Peter Meinicke, Thomas Brodag, Wolfgang Florian Fricke, Stephan Waack
Institution: 1. Abteilung Bioinformatik, Institut für Mikrobiologie und Genetik, Georg-August-Universität Göttingen, Goldschmidtstr. 1, 37077, Göttingen, Germany
2. Institut für Numerische und Angewandte Mathematik, Universität Göttingen, Lotzestr. 16, 37083, Göttingen, Germany
3. Göttingen Genomics Laboratory, Universität Göttingen, Grisebachstr. 8, 37077, Göttingen, Germany
Abstract: Two important unsolved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular, dimensionality reduction methods for the visualization of multivariate data have proven to be effective tools for codon usage analysis. Here we propose a multidimensional scaling approach that uses a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for the comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well suited to the analysis of horizontal gene transfer and translational selection. Compared with the widely used correspondence analysis, our method did not suffer from outlier sensitivity and, in most cases, showed a better clustering of putative alien genes.
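The abstract does not spell out how the P-value similarity is computed, so the following is only a minimal sketch of the general idea, assuming the standard two-sample chi-square test of homogeneity applied to two vectors of codon counts. The function names (`chi2_sf`, `pvalue_similarity`, `similarity_matrix`) and the handling of empty categories are illustrative assumptions, not the authors' implementation, and the paper may define the measure differently (for example, per amino acid).

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability P(X >= x) of a chi-square variable with df degrees of freedom."""
    if df <= 0:
        raise ValueError("df must be positive")
    if x <= 0:
        return 1.0
    h = x / 2.0
    # Starting value of the regularized upper incomplete gamma function Q(a, h):
    # Q(1/2, h) = erfc(sqrt(h)) for odd df, Q(1, h) = exp(-h) for even df.
    if df % 2:
        a, q = 0.5, math.erfc(math.sqrt(h))
    else:
        a, q = 1.0, math.exp(-h)
    # Recurrence Q(a + 1, h) = Q(a, h) + h**a * exp(-h) / Gamma(a + 1).
    while a < df / 2.0:
        q += math.exp(a * math.log(h) - h - math.lgamma(a + 1.0))
        a += 1.0
    return min(q, 1.0)


def pvalue_similarity(counts_a, counts_b):
    """P-value of the two-sample chi-square test between two codon count vectors.

    Assumption: both vectors list counts for the same codons in the same order.
    A value near 1 means the two usage profiles are statistically indistinguishable.
    """
    total_a, total_b = sum(counts_a), sum(counts_b)
    if total_a == 0 or total_b == 0:
        raise ValueError("each table needs at least one observed codon")
    stat, used = 0.0, 0
    for a, b in zip(counts_a, counts_b):
        if a + b == 0:
            continue  # codon unseen in both tables: contributes nothing
        used += 1
        stat += (a * total_b - b * total_a) ** 2 / (total_a * total_b * (a + b))
    df = used - 1
    return chi2_sf(stat, df) if df >= 1 else 1.0


def similarity_matrix(tables):
    """Symmetric matrix of pairwise P-value similarities for a list of count vectors."""
    n = len(tables)
    return [[pvalue_similarity(tables[i], tables[j]) for j in range(n)] for i in range(n)]
```

A matrix like this could then be converted to dissimilarities and passed to a multidimensional scaling routine. How that conversion is done in the paper is not stated here, so it is left out of the sketch.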
This article is indexed in PubMed, SpringerLink, and other databases.