A model-based approach to selection of tag SNPs |
| |
Authors: | Pierre Nicolas Fengzhu Sun Lei M Li |
| |
Affiliation: | 1. Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, USA 2. Mathématique, Informatique et Génome, INRA, Jouy-en-Josas, France 3. Department of Mathematics, University of Southern California, Los Angeles, USA
|
| |
Abstract: | Background Single Nucleotide Polymorphisms (SNPs) are the most common type of polymorphisms found in the human genome. Effective genetic association studies require the identification of sets of tag SNPs that capture as much haplotype information as possible. Tag SNP selection is analogous to the problem of data compression in information theory. According to Shannon's framework, the optimal tag set maximizes the entropy of the tag SNPs subject to constraints on the number of SNPs. This approach requires an appropriate probabilistic model. Compared to simple measures of Linkage Disequilibrium (LD), a good model of haplotype sequences can more accurately account for LD structure. It also provides a machinery for the prediction of tagged SNPs and thereby to assess the performances of tag sets through their ability to predict larger SNP sets. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|