首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
X Chen  D Min  TA Yasir  YG Hu 《PloS one》2012,7(9):e44510
To ascertain genetic diversity, population structure and linkage disequilibrium (LD) among a representative collection of Chinese winter wheat cultivars and lines, 90 winter wheat accessions were analyzed with 269 SSR markers distributed throughout the wheat genome. A total of 1,358 alleles were detected, with 2 to 10 alleles per locus and a mean genetic richness of 5.05. The average genetic diversity index was 0.60, with values ranging from 0.05 to 0.86. Of the three genomes of wheat, ANOVA revealed that the B genome had the highest genetic diversity (0.63) and the D genome the lowest (0.56); significant differences were observed between these two genomes (P<0.01). The 90 Chinese winter wheat accessions could be divided into three subgroups based on STRUCTURE, UPGMA cluster and principal coordinate analyses. The population structure derived from STRUCTURE clustering was positively correlated to some extent with geographic eco-type. LD analysis revealed that there was a shorter LD decay distance in Chinese winter wheat compared with other wheat germplasm collections. The maximum LD decay distance, estimated by curvilinear regression, was 17.4 cM (r(2)>0.1), with a whole genome LD decay distance of approximately 2.2 cM (r(2)>0.1, P<0.001). Evidence from genetic diversity analyses suggest that wheat germplasm from other countries should be introduced into Chinese winter wheat and distant hybridization should be adopted to create new wheat germplasm with increased genetic diversity. The results of this study should provide valuable information for future association mapping using this Chinese winter wheat collection.  相似文献   

2.

Background  

Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation.  相似文献   

3.
Gordon D  Simonic I  Ott J 《Genomics》2000,66(1):87-92
We explore the extent of deviations from Hardy-Weinberg equilibrium (HWE) at a marker locus and linkage disequilibrium (LD) between pairs of marker loci in the Afrikaner population of South Africa. DNA samples were used for genotyping of 23 loci on six chromosomes. The samples were collected from 91 healthy unrelated Afrikaner adults. Exact tests were used to determine evidence for deviations from HWE at a single marker locus or LD between pairs of marker loci. At the 0.05 level of significance, evidence was found for deviation from HWE at only one of the 23 loci. At the same level of significance, LD was found among 8 of the 34 intrachromosomal pairs of loci. On chromosome 21, there was evidence for LD (P = 0.02) between a pair of loci with a genetic distance of 5.51 cM. On chromosome 2, there was evidence for LD between a pair of loci with a genetic distance of 5.28 cM (P = 0.002) and a pair of loci with a genetic distance of 3.68 cM (P = 0.0004). Detailed analysis of LD for one locus pair indicated that only a few of all alleles participated in the LD and that strong LD was most often positive. Our findings indicate that Afrikaans-speaking Afrikaners represent one of those special populations deemed particularly suitable for disequilibrium mapping.  相似文献   

4.
Domesticated materials with well-known wild relatives provide an experimental system to reveal how human selection during cultivation affects genetic composition and adaptation to novel environments. In this paper, our goal was to elucidate how two geographically distinct domestication events modified the structure and level of genetic diversity in common bean. Specifically, we analyzed the genome-wide genetic composition at 26, mostly unlinked microsatellite loci in 349 accessions of wild and domesticated common bean from the Andean and Mesoamerican gene pools. Using a model-based approach, implemented in the software STRUCTURE, we identified nine wild or domesticated populations in common bean, including four of Andean and four of Mesoamerican origins. The ninth population was the putative wild ancestor of the species, which was classified as a Mesoamerican population. A neighbor-joining analysis and a principal coordinate analysis confirmed genetic relationships among accessions and populations observed with the STRUCTURE analysis. Geographic and genetic distances in wild populations were congruent with the exception of a few putative hybrids identified in this study, suggesting a predominant effect of isolation by distance. Domesticated common bean populations possessed lower genetic diversity, higher F ST, and generally higher linkage disequilibrium (LD) than wild populations in both gene pools; their geographic distributions were less correlated with genetic distance, probably reflecting seed-based gene flow after domestication. The LD was reduced when analyzed in separate Andean and Mesoamerican germplasm samples. The Andean domesticated race Nueva Granada had the highest F ST value and widest geographic distribution compared to other domesticated races, suggesting a very recent origin or a selection event, presumably associated with a determinate growth habit, which predominates in this race. Electronic supplementary material  The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

5.
Zhang P  Li J  Li X  Liu X  Zhao X  Lu Y 《PloS one》2011,6(12):e27565
The assessment of genetic diversity and population structure of a core collection would benefit to make use of these germplasm as well as applying them in association mapping. The objective of this study were to (1) examine the population structure of a rice core collection; (2) investigate the genetic diversity within and among subgroups of the rice core collection; (3) identify the extent of linkage disequilibrium (LD) of the rice core collection. A rice core collection consisting of 150 varieties which was established from 2260 varieties of Ting's collection of rice germplasm were genotyped with 274 SSR markers and used in this study. Two distinct subgroups (i.e. SG 1 and SG 2) were detected within the entire population by different statistical methods, which is in accordance with the differentiation of indica and japonica rice. MCLUST analysis might be an alternative method to STRUCTURE for population structure analysis. A percentage of 26% of the total markers could detect the population structure as the whole SSR marker set did with similar precision. Gene diversity and MRD between the two subspecies varied considerably across the genome, which might be used to identify candidate genes for the traits under domestication and artificial selection of indica and japonica rice. The percentage of SSR loci pairs in significant (P<0.05) LD is 46.8% in the entire population and the ratio of linked to unlinked loci pairs in LD is 1.06. Across the entire population as well as the subgroups and sub-subgroups, LD decays with genetic distance, indicating that linkage is one main cause of LD. The results of this study would provide valuable information for association mapping using the rice core collection in future.  相似文献   

6.
Tomato (Solanum lycopersicum L.) has undergone intensive selection during and following domestication. We investigated population structure and genetic differentiation within a collection of 70 tomato lines representing contemporary (processing and fresh-market) varieties, vintage varieties and landraces. The model-based Bayesian clustering software, STRUCTURE, was used to detect subpopulations. Six independent analyses were conducted using all marker data (173 markers) and five subsets of markers based on marker type (single-nucleotide polymorphisms, simple sequence repeats and insertion/deletions) and location (exon and intron sequences) within genes. All of these analyses consistently separated four groups predefined by market niche and age into distinct subpopulations. Furthermore, we detected at least two subpopulations within the processing varieties. These subpopulations correspond to historical patterns of breeding conducted for specific production environments. We found no subpopulation within fresh-market varieties, vintage varieties and landraces when using all marker data. High levels of admixture were shown in several varieties representing a transition in the demarcation between processing and fresh-market breeding. The genetic clustering detected by using the STRUCTURE software was confirmed by two statistics, pairwise F(st) (θ) and Nei's standard genetic distance. We also identified a total of 19 loci under positive selection between processing, fresh-market and vintage germplasm by using an F(st)-outlier method based on the deviation from the expected distribution of F(st) and heterozygosity. The markers and genome locations we identified are consistent with known patterns of selection and linkage to traits that differentiate the market classes. These results demonstrate how human selection through breeding has shaped genetic variation within cultivated tomato.  相似文献   

7.
Population structure, extent of linkage disequilibrium (LD) as well as signatures of selection were investigated in sorghum using a core sample representative of worldwide diversity. A total of 177 accessions were genotyped with 1122 informative physically anchored DArT markers. The properties of DArTs to describe sorghum genetic structure were compared to those of SSRs and of previously published RFLP markers. Model-based (STRUCTURE software) and Neighbor-Joining diversity analyses led to the identification of 6 groups and confirmed previous evolutionary hypotheses. Results were globally consistent between the different marker systems. However, DArTs appeared more robust in terms of data resolution and bayesian group assignment. Whole genome linkage disequilibrium as measured by mean r(2) decreased from 0.18 (between 0 to 10 kb) to 0.03 (between 100 kb to 1 Mb), stabilizing at 0.03 after 1 Mb. Effects on LD estimations of sample size and genetic structure were tested using i. random sampling, ii. the Maximum Length SubTree algorithm (MLST), and iii. structure groups. Optimizing population composition by the MLST reduced the biases in small samples and seemed to be an efficient way of selecting samples to make the best use of LD as a genome mapping approach in structured populations. These results also suggested that more than 100,000 markers may be required to perform genome-wide association studies in collections covering worldwide sorghum diversity. Analysis of DArT markers differentiation between the identified genetic groups pointed out outlier loci potentially linked to genes controlling traits of interest, including disease resistance genes for which evidence of selection had already been reported. In addition, evidence of selection near a homologous locus of FAR1 concurred with sorghum phenotypic diversity for sensitivity to photoperiod.  相似文献   

8.
Populations with two sexes are vulnerable to a pair of genetic conflicts: sexual antagonism that can arise when alleles have opposing fitness effects on females and males; and parental antagonism that arises when alleles have opposing fitness effects when maternally and paternally inherited. This paper extends previous theoretical work that found stable linkage disequilibrium (LD) between sexually antagonistic loci. We find that LD is also generated between parentally antagonistic loci, and between sexually and parentally antagonistic loci, without any requirement of epistasis. We contend that the LD in these models arises from the admixture of gene pools subject to different selective histories. We also find that polymorphism maintained by parental antagonism at one locus expands the opportunity for polymorphism at a linked locus experiencing parental or sexual antagonism. Taken together, our results predict the chromosomal clustering of loci that segregate for sexually and parentally antagonistic alleles. Thus, genetic conflict may play a role in the evolution of genomic architecture.  相似文献   

9.
Rapeseed (Brassica napus L.) is the leading European oilseed crop serving as source for edible oil and renewable energy. The objectives of our study were to (i) examine the population structure of a large and diverse set of B. napus inbred lines, (ii) investigate patterns of genetic diversity within and among different germplasm types, (iii) compare the two genomes of B. napus with regard to genetic diversity, and (iv) assess the extent of linkage disequilibrium (LD) between simple sequence repeat (SSR) markers. Our study was based on 509 B. napus inbred lines genotyped with 89 genome-specific SSR primer combinations. Both a principal coordinate analysis and software STRUCTURE revealed that winter types, spring types, and swedes were assigned to three major clusters. The genetic diversity of winter oilseed rape was lower than the diversity found in other germplasm types. Within winter oilseed rape types, a decay of genetic diversity with more recent release dates and reduced levels of erucic acid and glucosinolates was observed. The percentage of linked SSR loci pairs in significant (r 2 > Q 95 unlinked loci pairs) LD was 6.29% for the entire germplasm set. Furthermore, LD decayed rapidly with distance, which will allow a relatively high mapping resolution in genome-wide association studies using our germplasm set, but, on the other hand, will require a high number of markers.  相似文献   

10.
豌豆种质表型性状SSR标记关联分析   总被引:2,自引:0,他引:2  
关联分析是以连锁不平衡原理为基础,鉴定某一群体内表型性状与遗传标记或候选基因间关系的遗传分析方法。本研究利用59个多态性SSR标记,对192份豌豆种质进行全基因组扫描,以分析SSR位点遗传多样性,寻找其连锁不平衡位点;采用TASSEL软件的一般线性模型,利用59个SSR标记对19个形态性状进行关联分析。结果显示SSR位点间有较高的多态性和一定程度的连锁不平衡,共检测出32个SSR标记位点与14个表形性状相关联,一些SSR标记与2个或多个形态性状相关联。  相似文献   

11.
Detecting QTLs (quantitative trait loci) that enhance cotton yield and fiber quality traits and accelerate breeding has been the focus of many cotton breeders. In the present study, 359 SSR (simple sequence repeat) markers were used for the association mapping of 241 Upland cotton collections. A total of 333 markers, representing 733 polymorphic loci, were detected. The average linkage disequilibrium (LD) decay distances were 8.58 cM (r2 > 0.1) and 5.76 cM (r2 > 0.2). 241 collections were arranged into two subgroups using STRUCTURE software. Mixed linear modeling (MLM) methods (with population structure (Q) and relative kinship matrix (K)) were applied to analyze four phenotypic datasets obtained from four environments (two different locations and two years). Forty-six markers associated with the number of bolls per plant (NB), boll weight (BW), lint percentage (LP), fiber length (FL), fiber strength (FS) and fiber micornaire value (FM) were repeatedly detected in at least two environments. Of 46 associated markers, 32 were identified as new association markers, and 14 had been previously reported in the literature. Nine association markers were near QTLs (at a distance of less than 1–2 LD decay on the reference map) that had been previously described. These results provide new useful markers for marker-assisted selection in breeding programs and new insights for understanding the genetic basis of Upland cotton yields and fiber quality traits at the whole-genome level.  相似文献   

12.
The population of Costa Rica (CR) represents an admixture of major continental populations. An investigation of the CR population structure would provide an important foundation for mapping genetic variants underlying common diseases and traits. We conducted an analysis of 1,301 women from the Guanacaste region of CR using 27,904 single nucleotide polymorphisms (SNPs) genotyped on a custom Illumina InfiniumII iSelect chip. The program STRUCTURE was used to compare the CR Guanacaste sample with four continental reference samples, including HapMap Europeans (CEU), East Asians (JPT+CHB), West African Yoruba (YRI), as well as Native Americans (NA) from the Illumina iControl database. Our results show that the CR Guanacaste sample comprises a three-way admixture estimated to be 43% European, 38% Native American and 15% West African. An estimated 4% residual Asian ancestry may be within the error range. Results from principal components analysis reveal a correlation between genetic and geographic distance. The magnitude of linkage disequilibrium (LD) measured by the number of tagging SNPs required to cover the same region in the genome in the CR Guanacaste sample appeared to be weaker than that observed in CEU, JPT+CHB and NA reference samples but stronger than that of the HapMap YRI sample. Based on the clustering pattern observed in both STRUCTURE and principal components analysis, two subpopulations were identified that differ by approximately 20% in LD block size averaged over all LD blocks identified by Haploview. We also show in a simulated association study conducted within the two subpopulations, that the failure to account for population stratification (PS) could lead to a noticeable inflation in the false positive rate. However, we further demonstrate that existing PS adjustment approaches can reduce the inflation to an acceptable level for gene discovery.  相似文献   

13.
Gao H  Williamson S  Bustamante CD 《Genetics》2007,176(3):1635-1651
Nonrandom mating induces correlations in allelic states within and among loci that can be exploited to understand the genetic structure of natural populations (Wright 1965). For many species, it is of considerable interest to quantify the contribution of two forms of nonrandom mating to patterns of standing genetic variation: inbreeding (mating among relatives) and population substructure (limited dispersal of gametes). Here, we extend the popular Bayesian clustering approach STRUCTURE (Pritchard et al. 2000) for simultaneous inference of inbreeding or selfing rates and population-of-origin classification using multilocus genetic markers. This is accomplished by eliminating the assumption of Hardy-Weinberg equilibrium within clusters and, instead, calculating expected genotype frequencies on the basis of inbreeding or selfing rates. We demonstrate the need for such an extension by showing that selfing leads to spurious signals of population substructure using the standard STRUCTURE algorithm with a bias toward spurious signals of admixture. We gauge the performance of our method using extensive coalescent simulations and demonstrate that our approach can correct for this bias. We also apply our approach to understanding the population structure of the wild relative of domesticated rice, Oryza rufipogon, an important partially selfing grass species. Using a sample of n = 16 individuals sequenced at 111 random loci, we find strong evidence for existence of two subpopulations, which correlates well with geographic location of sampling, and estimate selfing rates for both groups that are consistent with estimates from experimental data (s approximately 0.48-0.70).  相似文献   

14.
Fan R  Jung J 《Human heredity》2003,56(4):166-187
This paper proposes variance component models for high resolution joint linkage disequilibrium (LD) and linkage mapping of quantitative trait loci (QTL) based on sibship data; this can include population data if independent individuals are treated as single sibships. One application of these models is late onset complex disease gene mapping, when parental data are not available. The models simultaneously incorporate both LD and linkage information. The LD information is contained in mean coefficients of sibship data. The linkage information is contained in the variance-covariance matrices of trait values for sibships with at least two siblings. We derive formulas for calculating the probability of sharing two trait alleles identical by descent (IBD) for sibpairs in interval mapping of QTL; this is the coefficient of dominant variance of the trait covariance of sibpairs on major QTL. To investigate the performance of the formulas, we calculate the numerical values via the formulas and get satisfactory approximations. We compare the power and sample sizes for both LD and linkage mapping. By simulation and theoretical analysis, we compare the results with those of Fulker and Abecasis "AbAw" approach. It is well known that the resolution of linkage analysis can be low for complex disease gene mapping. LD mapping, on the other hand, can increase mapping precision and is useful in high resolution mapping. Linkage analysis is less sensitive to population subdivisions and admixtures. The level of LD is sensitive to population stratification which may easily lead to spurious association. Performing a joint analysis of LD and linkage mapping can help to overcome the limits of both approaches. Moreover, the advantages of the two complementary strategies can be utilized maximally. In practice, linkage analysis may be performed using pedigree data to identify suggestive linkage between markers and trait loci based on a sparse marker map. In the presence of linkage, joint LD and linkage mapping can be carried out to do fine gene mapping based on a dense genetic map using both pedigree and population data. Population and pedigree data of any type can be combined to perform a joint analysis of high resolution LD and linkage mapping of QTL by generalizing the method.  相似文献   

15.
This study analyzes population structure and linkage disequilibrium (LD) among 187 commonly used Chinese maize inbred lines, representing the genetic diversity among public, commercial and historically important lines for corn breeding. Seventy SSR loci, evenly distributed over 10 chromosomes, were assayed for polymorphism. The identified 290 alleles served to estimate population structure and analyze the genome-wide LD. The population of lines was highly structured, showing 6 subpopulations: BSSS (American BSSS including Reid), PA (group A germplasm derived from modern U.S. hybrids in China), PB (group B germplasm derived from modern U.S. hybrid in China), Lan (Lancaster Surecrop), LRC (derivative lines from Lvda Reb Cob, a Chinese landrace) and SPT (derivative lines from Si-ping-tou, a Chinese landrace). Forty lines, which formerly had an unknown and/or miscellaneous origin and pedigree record, were assigned to the appropriate group. Relationship estimates based on SSR marker data were quantified in a Q matrix, and this information will inform breeder’s decisions regarding crosses. Extensive inter- and intra-chromosomal LD was detected between 70 microsatellite loci for the investigated maize lines (2109 loci pairs in LD with D′ > 0.1 and 93 out of them at P < 0.01).This suggests that rapidly evolving microsatellites may track recent population structure. Interlocus LD decay among the diverse maize germplasm indicated that association studies in QTLs and/or candidate genes might avoid nonfunctional and spurious associations since most of the LD blocks were broken between diverse germplasm. The defined population structure and the LD analysis present the basis for future association mapping. Electronic supplementary material The online version of this article (doi:) contains supplementary material, which is available to authorized users.  相似文献   

16.
Bayesian clustering methods have been widely used for studying species delimitation and genetic introgression. In order to test the effect of phylogenetic relationships and sampling scheme on the inferred clustering solution and on the performance of Bayesian clustering analysis, I simulated genotypes of the interfertile oak species Quercus robur, Quercus petraea, and Quercus pubescens and I run analyses using two popular software programs, STRUCTURE and BAPS. First, based on purebred simulations, I compared clustering solutions resulting from different sample size configurations. While clustering solution generally reflected the taxonomic relationships when equal samples of each species were included, spurious partition was inferred by STRUCTURE when some species were represented by larger and others by smaller samples. In very unbalanced configurations, STRUCTURE failed to identify the three species, even if three subpopulations were assumed. By contrast, BAPS could properly identify the three species under any sampling scheme. Second, based on simulations of purebreds and hybrids, I tested the performance of individual assignments with variable number of loci. This analysis showed that STRUCTURE can detect introgressed individuals more efficiently than BAPS. However, BAPS could assign purebreds more efficiently with a lower number of loci. Method performance also depended on phylogenetic relationships. In the case of Q. petraea, Q. pubescens, and their hybrids, method performance was lower due to their phylogenetic affinity. Inclusion of three instead of two species into the analysis led to reduction of performance, and to misclassification of hybrids, which often reflected the phylogenetic affinity between Q. petraea and Q. pubescens.  相似文献   

17.
Linkage disequilibrium in related breeding lines of chickens   总被引:2,自引:1,他引:1       下载免费PDF全文
High-density genotyping of single-nucleotide polymorphisms (SNPs) enables detection of quantitative trait loci (QTL) by linkage disequilibrium (LD) mapping using LD between markers and QTL and the subsequent use of this information for marker-assisted selection (MAS). The success of LD mapping and MAS depends on the extent of LD in the populations of interest and the use of associations across populations requires LD between loci to be consistent across populations. To assess the extent and consistency of LD in commercial broiler breeding populations, we used genotype data for 959 and 398 SNPs on chromosomes 1 and 4 on 179-244 individuals from each of nine commercial broiler chicken breeding lines. Results show that LD measured by r(2) extends over shorter distances than reported previously in other livestock breeding populations. The LD at short distance (within 1 cM) tended to be consistent across related populations; correlations of LD measured by r for pairs of lines ranged from 0.17 to 0.94 and closely matched the line relationships based on marker allele frequencies. In conclusion, LD-based correlations are good estimates of line relationships and the relationship between a pair of lines a good predictor of LD consistency between the lines.  相似文献   

18.
Semon M  Nielsen R  Jones MP  McCouch SR 《Genetics》2005,169(3):1639-1647
Genome-wide linkage disequilibrium (LD) was investigated for 198 accessions of Oryza glaberrima using 93 nuclear microsatellite markers. Significantly elevated levels of LD were detected, even among distantly located markers. Free recombination among loci at the population genetic level was shown (1) by a lack of decay in LD among markers on the same chromosome and (2) by a strictly increasing composite likelihood function for the recombination parameter. This suggested that the elevation in LD was due not to physical linkage but to other factors, such as population structure. A Bayesian clustering analysis confirmed this hypothesis, indicating that the sample of O. glaberrima in this study was subdivided into at least five cryptic subpopulations. Two of these subpopulations clustered with control samples of O. sativa, subspecies indica and japonica, indicating that some O. glaberrima accessions represent admixtures. The remaining three O. glaberrima subpopulations were significantly associated with specific combinations of phenotypic traits-possibly reflecting ecological adaptation to different growing environments.  相似文献   

19.
推测187份玉米自交系基因组血统与分子亲缘关系   总被引:13,自引:0,他引:13  
为提高育种效率以及开展重要 QTL 和关键基因关联性分析研究, 以 187 份生产上重要玉米自交系为材料, 以 70 个均匀分布于全基因组简单重复序列(SSR)基因座鉴定出的290个等位基因多态性为分析数据, 采用联合连锁位点与混合模型分析, 推测这些自交系的基因组血缘构成以及分子亲缘关系, 并分析了全基因连锁不平衡。当亚群数目 K=5 时, 导致似然值 P 明显下降, 亚群数据 K > 6 时, 似然值 P 没有明显上升, 表明群体结构的亚群数 K 最佳推测为 6。六个亚群分别为 PA、BSSS (含 Reid)、PB、兰卡斯特 (Lancaster)、旅大红骨 (旅大红骨及其衍生系)、四平头 (唐四平头及其衍生系)。亚群间的Kullback-Leibler 距离自 0.13 至 1.06 不等, 平均为 0.599, 各亚群间区分度较好。全基因组连锁不平衡(LD)分析表明: 与四平头种质类群内相比, 遗传基础宽泛种质的基因组内存在 LD"区块"(LD block)少且小, 对重要 QTLs 与基因的关联性分析可以避免假阳性。本研究群体结构与基因组构成分析数据为这些材料育种应用与改良提供了重要信息, 也为基于这些材料的关联性分析奠定了分析基础。  相似文献   

20.
Jiang N  Wang M  Jia T  Wang L  Leach L  Hackett C  Marshall D  Luo Z 《PloS one》2011,6(8):e23192

Background

It has been well established that theoretical kernel for recently surging genome-wide association study (GWAS) is statistical inference of linkage disequilibrium (LD) between a tested genetic marker and a putative locus affecting a disease trait. However, LD analysis is vulnerable to several confounding factors of which population stratification is the most prominent. Whilst many methods have been proposed to correct for the influence either through predicting the structure parameters or correcting inflation in the test statistic due to the stratification, these may not be feasible or may impose further statistical problems in practical implementation.

Methodology

We propose here a novel statistical method to control spurious LD in GWAS from population structure by incorporating a control marker into testing for significance of genetic association of a polymorphic marker with phenotypic variation of a complex trait. The method avoids the need of structure prediction which may be infeasible or inadequate in practice and accounts properly for a varying effect of population stratification on different regions of the genome under study. Utility and statistical properties of the new method were tested through an intensive computer simulation study and an association-based genome-wide mapping of expression quantitative trait loci in genetically divergent human populations.

Results/Conclusions

The analyses show that the new method confers an improved statistical power for detecting genuine genetic association in subpopulations and an effective control of spurious associations stemmed from population structure when compared with other two popularly implemented methods in the literature of GWAS.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号