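Multivariate Nonlinear Analysis Methods for Population Genetic Structure Based on Allelic Polymorphism
Cite this article: XUE Fu-Zhong, WANG Jie-Zhen, GUO Yi-Shou, HU Ping, WU Xue-Sen. Multivariate Nonlinear Analysis Methods for Population Genetic Structure Based on Allelic Polymorphism[J]. Acta Genetica Sinica (遗传学报), 2004, 31(2): 202-211
Authors: XUE Fu-Zhong (薛付忠), WANG Jie-Zhen (王洁贞), GUO Yi-Shou (郭亦寿), HU Ping (胡平), WU Xue-Sen (吴学森)
Affiliations: 1. Institute of Epidemiology and Medical Statistics, Shandong University, Jinan 250012
2. Institute of Genetics, Shandong University, Jinan 250012
Funding: Supported by the National Natural Science Foundation of China (No. 30170527)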
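Abstract: Multivariate statistical analyses of multidimensional gene polymorphism data, such as the cluster analysis used to compute genetic distances and the principal component analysis, factor analysis, and canonical correlation analysis used to study population genetic structure, have long relied on classical multivariate linear methods designed for unconstrained data, without regard to the problems caused by the "closure effect" in such data. Starting from the distribution and structural characteristics of gene polymorphism data, this paper points out that their distribution has the character of "closed data" and analyzes the difficulties that the closure effect creates when classical multivariate linear methods are applied to population genetic structure. Drawing on the theory and methods of compositional data analysis, it proposes a basic multivariate nonlinear approach to analyzing population genetic structure from gene polymorphism data. Taking principal component analysis as an example, it compares, on a worked example, the results of classical linear principal component analysis with those of "log-ratio" nonlinear principal component analysis, and shows that the log-ratio method is a good method for studying population genetic structure from gene polymorphism data: it is specific and sensitive, and its results agree with the laws of population genetics.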
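
To make the closure effect concrete, here is a brief worked sketch; it is not the paper's own derivation. The symbols $x_i$ (frequency of allele $i$ at a locus), $D$ (number of alleles) and $g(\mathbf{x})$ (geometric mean) are notation assumed here. The centered log-ratio shown is one common form of the log-ratio transforms from compositional data analysis; the abstract does not say which form the authors use.

```latex
% Closure: the allele frequencies at one locus sum to a constant.
\[
\sum_{i=1}^{D} x_i = 1
\quad\Longrightarrow\quad
\operatorname{cov}\Bigl(x_i,\ \sum_{j=1}^{D} x_j\Bigr) = 0
\quad\Longrightarrow\quad
\sum_{j \neq i} \operatorname{cov}(x_i, x_j) = -\operatorname{var}(x_i).
\]
% Hence every allele with nonzero variance has at least one negative
% covariance that is forced by the constraint, not by the genetics.

% Centered log-ratio transform (assumed form), defined for x_i > 0:
\[
\operatorname{clr}(\mathbf{x})_i = \ln\frac{x_i}{g(\mathbf{x})},
\qquad
g(\mathbf{x}) = \Bigl(\prod_{j=1}^{D} x_j\Bigr)^{1/D}.
\]
```

The first line shows why correlations computed directly on frequencies are biased toward negative values; the second moves the data off the simplex so that standard linear tools can be applied to the transformed values.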

Keywords: gene polymorphism; population genetic structure; multivariate nonlinear analysis
Article ID: 0379-4172(2004)02-0202-10

Multiple Nonlinear Statistical Method of Population Genetic Structure Based on the Allelic Polymorphism Data
XUE Fu-Zhong. Multiple Nonlinear Statistical Method of Population Genetic Structure Based on the Allelic Polymorphism Data[J]. Journal of Genetics and Genomics, 2004, 31(2): 202-211
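Authors: XUE Fu-Zhong, WANG Jie-Zhen, GUO Yi-Shou, HU Ping, WU Xue-Sen
Affiliations: 1. Institute of Epidemiology and Medical Statistics, Shandong University, Jinan 250012; 2. Institute of Genetics, Shandong University, Jinan 250012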
Abstract: The distribution and structure of allelic polymorphism data are analyzed, and it is shown that such data have the character of closed data (also called compositional data or constant-sum data). The correlation structure of allelic polymorphism data contains null correlations introduced by "closure", and the data are not normally distributed because each row sums to a constant. Both properties make it very difficult to analyze the data with traditional multivariate linear statistical methods such as principal component analysis, factor analysis, cluster analysis, and canonical correlation analysis. Based on the theory of compositional data analysis proposed by Aitchison in 1982, this paper puts forward a multivariate nonlinear statistical method derived from the "log-ratio" approach to compositional data. As an example, the log-ratio method was used to analyze the genetic structure of the TH01 polymorphic locus in Chinese populations, and the results were compared with those of multivariate linear methods such as classical principal component analysis. It is concluded that log-ratio nonlinear principal component analysis is a better method, with the advantages of sensitivity and specificity, for analyzing population genetic structure from allelic polymorphism data.
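
The comparison described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the frequency table is made up (it is not the TH01 data from the paper), the helper names `closure`, `clr` and `pca` are hypothetical, and the centered log-ratio is assumed as the log-ratio transform.

```python
import numpy as np


def closure(x):
    """Rescale each row to sum to 1 (allele frequencies within a population)."""
    x = np.asarray(x, dtype=float)
    return x / x.sum(axis=1, keepdims=True)


def clr(x):
    """Centered log-ratio: log of each part minus the row mean of the logs.

    Requires strictly positive frequencies; zero counts need separate handling.
    """
    logx = np.log(x)
    return logx - logx.mean(axis=1, keepdims=True)


def pca(z):
    """PCA via SVD of the column-centered matrix.

    Returns the component scores and the share of variance per component.
    """
    zc = z - z.mean(axis=0, keepdims=True)
    u, s, _ = np.linalg.svd(zc, full_matrices=False)
    var = s ** 2
    return u * s, var / var.sum()


# Hypothetical allele frequencies (rows: populations, columns: alleles).
# Illustrative numbers only, NOT taken from the paper.
freqs = closure([
    [0.10, 0.25, 0.05, 0.45, 0.15],
    [0.12, 0.22, 0.06, 0.47, 0.13],
    [0.08, 0.30, 0.04, 0.40, 0.18],
    [0.20, 0.18, 0.10, 0.38, 0.14],
    [0.15, 0.28, 0.03, 0.42, 0.12],
    [0.09, 0.24, 0.07, 0.48, 0.12],
])

# Classical linear PCA applied directly to the closed frequencies.
linear_scores, linear_ratio = pca(freqs)

# Log-ratio PCA: transform first, then run the same linear PCA.
logratio_scores, logratio_ratio = pca(clr(freqs))

print("linear PCA variance shares:   ", np.round(linear_ratio, 3))
print("log-ratio PCA variance shares:", np.round(logratio_ratio, 3))
```

In both runs the last component carries essentially no variance, because the rows of the frequency table sum to 1 and the rows of the clr-transformed table sum to 0. The difference lies in what the remaining components measure: raw frequencies in the linear case, relative (ratio) differences between alleles in the log-ratio case.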
Keywords:allelic polymorphism  genetic structure of populations  multiple nonlinear statistical method
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.