Similar articles
20 similar articles found.
1.
Association mapping enables the detection of marker-trait associations in unstructured populations by taking advantage of historical linkage disequilibrium (LD) that exists between a marker and the true causative polymorphism of the trait phenotype. Our first objective was to understand the pattern of LD decay in the diploid alfalfa genome. We used 89 highly polymorphic SSR loci in 374 unimproved diploid alfalfa (Medicago sativa L.) genotypes from 120 accessions to infer chromosome-wide patterns of LD. We also sequenced four lignin biosynthesis candidate genes (caffeoyl-CoA 3-O-methyltransferase (CCoAoMT), ferulate-5-hydroxylase (F5H), caffeic acid-O-methyltransferase (COMT), and phenylalanine ammonia-lyase (PAL 1)) to identify single nucleotide polymorphisms (SNPs) and infer within-gene estimates of LD. As the second objective of this study, we conducted association mapping for cell wall components and agronomic traits using the SSR markers and SNPs from the four candidate genes. We found very little LD among SSR markers, implying limited value for genome-wide association studies. In contrast, within-gene LD decayed within 300 bp to below an r² of 0.2 in three of four candidate genes. We identified one SSR and two highly significant SNPs associated with biomass yield. Based on our results, focusing association mapping on candidate gene sequences will be necessary until a dense set of genome-wide markers is available for alfalfa.
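The r² statistic reported in these abstracts can be illustrated with a minimal sketch. It assumes phased, biallelic haplotypes coded as 0/1, which is a simplification (the SSR loci above are multi-allelic and the genotypes unphased); the function name `r_squared` is hypothetical and not taken from any of the cited studies.

```python
import math


def r_squared(hap_a, hap_b):
    """Squared allele-frequency correlation (r^2) between two biallelic loci.

    hap_a, hap_b: equal-length sequences of 0/1 alleles, one entry per
    phased haplotype (an assumption made for this illustration).
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype vectors must be non-empty and equal in length")
    p_a = sum(hap_a) / n                                  # frequency of allele 1 at locus A
    p_b = sum(hap_b) / n                                  # frequency of allele 1 at locus B
    p_ab = sum(a * b for a, b in zip(hap_a, hap_b)) / n   # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                                  # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return math.nan                                   # undefined when a locus is monomorphic
    return d * d / denom
```

Two loci that always co-occur give r² = 1, and two independent loci give r² = 0; plotting r² against the physical or genetic distance between locus pairs yields the LD decay curves these studies describe.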

2.
Hao C  Wang L  Ge H  Dong Y  Zhang X 《PloS one》2011,6(2):e17279
Two hundred and fifty bread wheat lines, mainly Chinese mini core accessions, were assayed for polymorphism and linkage disequilibrium (LD) based on 512 whole-genome microsatellite loci representing a mean marker density of 5.1 cM. A total of 6,724 alleles ranging from 1 to 49 per locus were identified in all collections. The mean PIC value was 0.650, ranging from 0 to 0.965. Population structure and principal coordinate analysis revealed that landraces and modern varieties were two relatively independent genetic sub-groups. Landraces had a higher allelic diversity than modern varieties with respect to both genomes and chromosomes in terms of total number of alleles and allelic richness. 3,833 (57.0%) and 2,788 (41.5%) rare alleles with frequencies of <5% were found in the landrace and modern variety gene pools, respectively, indicating greater numbers of rare variants, or likely new alleles, in landraces. Analysis of molecular variance (AMOVA) showed that the A genome had the largest genetic differentiation and the D genome the lowest. In contrast to genetic diversity, modern varieties displayed a wider average LD decay across the whole genome for locus pairs with r²>0.05 (P<0.001) than the landraces. Mean LD decay distance for the landraces at the whole genome level was <5 cM, while a higher LD decay distance of 5-10 cM was found in modern varieties. LD decay distances were also somewhat different for each of the 21 chromosomes, being higher for most of the chromosomes in modern varieties (<5 ~ 25 cM) compared to landraces (<5 ~ 15 cM), presumably indicating the influences of domestication and breeding. This study facilitates predicting the marker density required to effectively associate genotypes with traits in Chinese wheat genetic resources.

3.
High-density genetic markers are the prerequisite for understanding linkage disequilibrium (LD) and genome-wide association studies (GWASs) of complex traits in crops. To evaluate the LD pattern in oilseed rape, we sequenced a previous association panel containing 189 B. napus inbred lines using double-digested restriction-site associated DNA (ddRAD) and genotyped 19,327 RAD tags. A total of 15,921 RAD tags were assigned to a published genetic linkage map and the majority (71.1%) of these tags was uniquely mapped to the draft reference genome “Darmor-bzh.” The distance of LD decay was 1,214 kb across the genome at the background level (r² = 0.26), with the distances of LD decay being 405 kb and 2,111 kb in the A and C subgenomes, respectively. A total of 361 haplotype blocks with length > 100 kb were identified in the entire genome. The association panel could be classified into two groups, P1 and P2, which are essentially consistent with the geographical origins of varieties. A large number of group-specific haplotypes were identified, reflecting that varieties in the P1 and P2 groups experienced distinct selection in breeding programs to adapt to their different growth habitats. GWAS repeatedly detected two loci significantly associated with oil content of seeds based on the developed SNPs, suggesting that the high-density SNPs were useful for understanding the genetic determinants of complex traits in GWAS.

4.
Zhang P  Li J  Li X  Liu X  Zhao X  Lu Y 《PloS one》2011,6(12):e27565
The assessment of genetic diversity and population structure of a core collection would help in making use of this germplasm and in applying it to association mapping. The objectives of this study were to (1) examine the population structure of a rice core collection; (2) investigate the genetic diversity within and among subgroups of the rice core collection; (3) identify the extent of linkage disequilibrium (LD) of the rice core collection. A rice core collection consisting of 150 varieties, established from 2260 varieties of Ting's collection of rice germplasm, was genotyped with 274 SSR markers and used in this study. Two distinct subgroups (i.e. SG 1 and SG 2) were detected within the entire population by different statistical methods, which is in accordance with the differentiation of indica and japonica rice. MCLUST analysis might be an alternative method to STRUCTURE for population structure analysis. A subset of 26% of the total markers could detect the population structure with precision similar to that of the whole SSR marker set. Gene diversity and MRD between the two subspecies varied considerably across the genome, which might be used to identify candidate genes for the traits under domestication and artificial selection of indica and japonica rice. The percentage of SSR loci pairs in significant (P<0.05) LD is 46.8% in the entire population and the ratio of linked to unlinked loci pairs in LD is 1.06. Across the entire population as well as the subgroups and sub-subgroups, LD decays with genetic distance, indicating that linkage is one main cause of LD. The results of this study provide valuable information for future association mapping using the rice core collection.

5.
6.
Xu P  Wu X  Wang B  Luo J  Liu Y  Ehlers JD  Close TJ  Roberts PA  Lu Z  Wang S  Li G 《Heredity》2012,109(1):34-40
Association mapping of important traits of crop plants relies on first understanding the extent and patterns of linkage disequilibrium (LD) in the particular germplasm being investigated. We characterize here the genetic diversity, population structure and genome-wide LD patterns in a set of asparagus bean (Vigna unguiculata ssp. sesquipedalis) germplasm from China. A diverse collection of 99 asparagus bean and normal cowpea accessions were genotyped with 1127 expressed sequence tag-derived single nucleotide polymorphism markers (SNPs). The proportion of polymorphic SNPs across the collection was relatively low (39%), with an average number of SNPs per locus of 1.33. Bayesian population structure analysis indicated two subdivisions within the collection sampled that generally represented the 'standard vegetable' type (subgroup SV) and the 'non-standard vegetable' type (subgroup NSV), respectively. Level of LD (r²) was higher and extent of LD persisted longer in subgroup SV than in subgroup NSV, whereas LD decayed rapidly (0-2 cM) in both subgroups. LD decay distance varied among chromosomes, with the longest (≈ 5 cM) five times longer than the shortest (≈ 1 cM). Partitioning of LD variance into within- and between-subgroup components coupled with comparative LD decay analysis suggested that linkage groups 5, 7 and 10 may have undergone the most intensive epistatic selection toward traits favorable for vegetable use. This work provides a first population genetic insight into domestication history of asparagus bean and demonstrates the feasibility of mapping complex traits by genome-wide association study in asparagus bean using a currently available cowpea SNP marker platform.

7.
The fluctuation of population size has not been well addressed in previous studies of theoretical linkage disequilibrium (LD) expectation. In this study, an improved theoretical prediction of LD decay was derived to account for the effects of changes in effective population sizes. The equation was used to estimate effective population size (Ne) assuming a constant Ne and LD at equilibrium, and these Ne estimates implied the past changes of Ne for a certain number of generations until equilibrium, which differed based on recombination rate. As the influence of recent population history on the Ne estimates is larger than that of old population history, recent changes in population size can be inferred more accurately than old changes. The theoretical predictions based on this improved expression showed accurate agreement with the simulated values. When applied to human genome data, the detailed recent history of human populations was obtained. The inferred past population history of each population showed good correspondence with historical studies. Specifically, four populations (three African ancestries and one Mexican ancestry) showed population growth that was significantly less than that of other populations, and two populations originating from China showed prominent exponential growth. During the examination of overall LD decay in the human genome, a selection pressure on chromosome 14, the gephyrin gene, was observed in all populations.
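The abstract above does not give its improved expression, so it is not reproduced here. As a point of comparison only, the following sketch uses the classic constant-Ne equilibrium approximation usually attributed to Sved (1971), E[r²] ≈ 1 / (1 + 4·Ne·c), where c is the recombination distance in Morgans. The function names are hypothetical, and the formula is a swapped-in textbook baseline, not the method of the study.

```python
def expected_r2(ne, c):
    """Expected r^2 at equilibrium under constant Ne (Sved-type approximation).

    ne: effective population size; c: recombination distance in Morgans.
    """
    return 1.0 / (1.0 + 4.0 * ne * c)


def ne_from_r2(r2, c):
    """Invert the approximation to estimate Ne from an observed mean r^2."""
    if not 0.0 < r2 <= 1.0:
        raise ValueError("r2 must lie in (0, 1]")
    if c <= 0.0:
        raise ValueError("c must be positive")
    return (1.0 / r2 - 1.0) / (4.0 * c)
```

For example, a mean r² of 0.2 between markers 0.1 cM apart (c = 0.001) corresponds to Ne of roughly 1000 under this approximation. Applying the inversion to marker pairs at different distances is one common way to obtain Ne estimates that reflect different time depths.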

8.

Key message

The number of SNPs required for QTL discovery is justified by the distance at which linkage disequilibrium has decayed. Simulations and real potato SNP data showed how to estimate and interpret LD decay.

Abstract

The magnitude of linkage disequilibrium (LD) and its decay with genetic distance determine the resolution of association mapping, and are useful for assessing the desired numbers of SNPs on arrays. To study LD and LD decay in tetraploid potato, we simulated autotetraploid genotypes and used them to explore the dependence on: (1) the number of haplotypes in the population (the amount of genetic variation) and (2) the percentage of haplotype-specific SNPs (hs-SNPs). Several estimators for short-range LD were explored, such as the average r², median r², and other percentiles of r² (80, 90, and 95 %). For LD decay, we looked at LD½,90, the distance at which the short-range LD is halved when using the 90th percentile of r² at short range as the estimator for LD. Simulations showed that the performance of various estimators for LD decay strongly depended on the number of haplotypes, although the real value of LD decay was not influenced very much by this number. The estimator LD½,90 was chosen to evaluate LD decay in 537 tetraploid varieties. LD½,90 values were 1.5 Mb for varieties released before 1945 and 0.6 Mb in varieties released after 2005. LD½,90 values within three different subpopulations ranged from 0.7 to 0.9 Mb. LD½,90 was 2.5 Mb for introgressed regions, indicating large haplotype blocks. In pericentromeric heterochromatin, LD decay was negligible. This study demonstrates that several related factors influencing LD decay could be disentangled, that no universal approach can be suggested, and that the estimation of LD decay has to be performed with great care and knowledge of the sampled material.
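The LD½,90 idea described in the abstract above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the authors' procedure: it bins marker pairs by distance, takes the 90th percentile of r² in each bin, and reports the midpoint of the first bin in which that percentile has dropped to half its short-range value. The binning scheme, the function names, and the absence of any curve smoothing or background-LD correction are all assumptions made for the example.

```python
import math


def percentile(values, q):
    """q-th percentile (0-100), linearly interpolated between order statistics."""
    ordered = sorted(values)
    if not ordered:
        raise ValueError("values must be non-empty")
    pos = (len(ordered) - 1) * q / 100.0
    lo, hi = math.floor(pos), math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def ld_half_decay(pairs, bin_width, q=90.0):
    """Distance at which the q-th percentile of r^2 halves (binned sketch).

    pairs: iterable of (distance, r2) for marker pairs.
    Returns the midpoint of the first distance bin whose q-th percentile of
    r^2 is at most half the value in the shortest-distance bin, or None if
    LD never halves within the data.
    """
    bins = {}
    for dist, r2 in pairs:
        bins.setdefault(int(dist // bin_width), []).append(r2)
    if not bins:
        return None
    ordered_bins = sorted(bins)
    short_range = percentile(bins[ordered_bins[0]], q)   # short-range LD estimate
    for idx in ordered_bins[1:]:
        if percentile(bins[idx], q) <= short_range / 2.0:
            return (idx + 0.5) * bin_width
    return None
```

The result depends on the bin width and on how many pairs fall into each bin, which is in line with the abstract's caution that LD decay estimates are sensitive to the estimator and the sampled material.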

9.
We have previously shown that linkage disequilibrium (LD) in the elite cultivated barley (Hordeum vulgare) gene pool extends, on average, for <1-5 cM. Based on this information, we have developed a platform for whole genome association studies that comprises a collection of elite lines that we have characterized at 3060 genome-wide single nucleotide polymorphism (SNP) marker loci. Interrogating this data set shows that significant population substructure is present within the elite gene pool and that diversity and LD vary considerably across each of the seven barley chromosomes. However, we also show that a subpopulation comprised of only the two-rowed spring germplasm is less structured and well suited to whole genome association studies without the need for extensive statistical intervention to account for structure. At the current marker density, the two-rowed spring population is suited for fine mapping simple traits that are located outside of the genetic centromeres with a resolution that is sufficient for candidate gene identification by exploiting conservation of synteny with fully sequenced model genomes and the emerging barley physical map.

10.
The Ethiopian plateau hosts thousands of durum wheat (Triticum turgidum subsp. durum) farmer varieties (FV) with high adaptability and breeding potential. To harness their unique allelic diversity, we produced a large nested association mapping (NAM) population intercrossing fifty Ethiopian FVs with an international elite durum wheat variety (Asassa). The Ethiopian NAM population (EtNAM) is composed of fifty interconnected bi‐parental families, totalling 6280 recombinant inbred lines (RILs) that represent both a powerful quantitative trait loci (QTL) mapping tool, and a large pre‐breeding panel. Here, we discuss the molecular and phenotypic diversity of the EtNAM founder lines, then we use an array featuring 13 000 single nucleotide polymorphisms (SNPs) to characterize a subset of 1200 EtNAM RILs from 12 families. Finally, we test the usefulness of the population by mapping phenology traits and plant height using a genome wide association (GWA) approach. EtNAM RILs showed high allelic variation and a genetic makeup combining genetic diversity from Ethiopian FVs with the international durum wheat allele pool. EtNAM SNP data were projected on the fully sequenced AB genome of wild emmer wheat, and were used to estimate pairwise linkage disequilibrium (LD) measures that reported an LD decay distance of 7.4 Mb on average, and balanced founder contributions across EtNAM families. GWA analyses identified 11 genomic loci individually affecting up to 3 days in flowering time and more than 1.6 cm in height. We argue that the EtNAM is a powerful tool to support the production of new durum wheat varieties targeting local and global agriculture.

11.
The linkage disequilibrium (LD) structure of the human genome is now well understood and characterised for a number of human populations. The LD structure underpins the design and execution of candidate gene and genome-wide association mapping studies. Successful association mapping studies completed to date provide vital new insights into the genetic influences on common diseases, such as diabetes, some cancers and heart disease. The LD structure also presents new avenues of research into the genetic history of human populations, the effects of natural selection and the impact of recombination on the genomic landscape. This review introduces this exciting and complex field by encompassing this range of topics.

12.
X Chen  D Min  TA Yasir  YG Hu 《PloS one》2012,7(9):e44510
To ascertain genetic diversity, population structure and linkage disequilibrium (LD) among a representative collection of Chinese winter wheat cultivars and lines, 90 winter wheat accessions were analyzed with 269 SSR markers distributed throughout the wheat genome. A total of 1,358 alleles were detected, with 2 to 10 alleles per locus and a mean genetic richness of 5.05. The average genetic diversity index was 0.60, with values ranging from 0.05 to 0.86. Of the three genomes of wheat, ANOVA revealed that the B genome had the highest genetic diversity (0.63) and the D genome the lowest (0.56); significant differences were observed between these two genomes (P<0.01). The 90 Chinese winter wheat accessions could be divided into three subgroups based on STRUCTURE, UPGMA cluster and principal coordinate analyses. The population structure derived from STRUCTURE clustering was positively correlated to some extent with geographic eco-type. LD analysis revealed that there was a shorter LD decay distance in Chinese winter wheat compared with other wheat germplasm collections. The maximum LD decay distance, estimated by curvilinear regression, was 17.4 cM (r²>0.1), with a whole genome LD decay distance of approximately 2.2 cM (r²>0.1, P<0.001). Evidence from genetic diversity analyses suggests that wheat germplasm from other countries should be introduced into Chinese winter wheat and distant hybridization should be adopted to create new wheat germplasm with increased genetic diversity. The results of this study should provide valuable information for future association mapping using this Chinese winter wheat collection.

13.
European beech (Fagus sylvatica L.) is one of the most economically and ecologically important deciduous trees in Europe, yet little is known about its genomic diversity and its adaptive potential. Here, we detail the discovery and analysis of 573 single nucleotide polymorphisms (SNPs) from 58 candidate gene fragments that are potentially involved in abiotic stress response and budburst phenology using a panel of 96 individuals from southeastern France. The mean nucleotide diversity was low (θπ = 2.2 × 10⁻³) but extremely variable among gene fragments (range from 0.02 to 10), with genes carrying insertion/deletion mutations exhibiting significantly higher diversity. The decay of linkage disequilibrium (LD) measured at gene fragments >800 base pairs was moderate (the half distance of r² was 154 bp), consistent with the low average population-scaled recombination rate (ρ = 5.4 × 10⁻³). Overall, the population-scaled recombination rate estimated in F. sylvatica was lower than for other angiosperm tree genera (such as Quercus or Populus) and similar to conifers. As a methodological perspective, we explored the effect of minimum allele frequency (MAF) on LD and showed that higher MAF resulted in slower decay of LD. It is thus essential that the same MAF is used when comparing the decay of LD among different studies and species. Our results suggest that genome-wide association mapping can be a potentially efficient approach in F. sylvatica, which has a relatively small genome size.

14.
There is presently much interest in utilizing patterns of linkage disequilibrium (LD) to further genetic association studies. This is particularly pertinent in the class III region of the human major histocompatibility complex (MHC), which has been extensively studied as a disease susceptibility locus in a number of ethnic groups. To date, however, few studies of LD in the MHC have considered non-Caucasian populations. With the advent of large-scale haplotyping of the human genome, the question of utilizing LD patterns across populations has come to the fore. We have previously used LD mapping to direct an MHC class III association study in a UK Caucasian population. As an extension of this, we sought to determine to what extent the pattern of LD observed in that study could be used to conduct a similar study in a West African Gambian population. We found that broad patterns of LD were similar in the two populations, resulting in similar candidate region delineations, but at a higher resolution, marker-specific patterns of LD and population-dependent allele frequencies confounded the choice of regional tagging SNPs. Our results have implications for the applicability of large-scale haplotype maps such as the HapMap to complex regions like the MHC.

15.
To understand the genetic basis of tolerance to drought and heat stresses in chickpea, a comprehensive association mapping approach has been undertaken. Phenotypic data were generated on the reference set (300 accessions, including 211 mini-core collection accessions) for drought tolerance related root traits, heat tolerance, yield and yield component traits from 1–7 seasons and 1–3 locations in India (Patancheru, Kanpur, Bangalore) and three locations in Africa (Nairobi, Egerton in Kenya and Debre Zeit in Ethiopia). Diversity Array Technology (DArT) markers equally distributed across chickpea genome were used to determine population structure and three sub-populations were identified using admixture model in STRUCTURE. The pairwise linkage disequilibrium (LD) estimated using the squared-allele frequency correlations (r²; when r²<0.20) was found to decay rapidly with the genetic distance of 5 cM. For establishing marker-trait associations (MTAs), both genome-wide and candidate gene-sequencing based association mapping approaches were conducted using 1,872 markers (1,072 DArTs, 651 single nucleotide polymorphisms [SNPs], 113 gene-based SNPs and 36 simple sequence repeats [SSRs]) and phenotyping data mentioned above employing mixed linear model (MLM) analysis with optimum compression with P3D method and kinship matrix. As a result, 312 significant MTAs were identified and a maximum number of MTAs (70) was identified for 100-seed weight. A total of 18 SNPs from 5 genes (ERECTA, 11 SNPs; ASR, 4 SNPs; DREB, 1 SNP; CAP2 promoter, 1 SNP and AMDH, 1 SNP) were significantly associated with different traits. This study provides significant MTAs for drought and heat tolerance in chickpea that can be used, after validation, in molecular breeding for developing superior varieties with enhanced drought and heat tolerance.

16.
This review gives an overview of linkage disequilibrium (LD), its measures and its different uses in human genetics studies. In the first part, we provide a detailed yet simplified presentation focusing on the definition of LD, its measures and the major software for its evaluation. Thereafter, we describe and discuss the biological and evolutionary mechanisms which create, remodel, maintain or destroy LD in human populations. Consensus has now emerged on the pattern of LD in the genome, which has a block-like organization with blocks of high disequilibrium interrupted by recombination hotspots. However, no standard method exists for the determination of such blocks and, more importantly, for the identification of TagSNPs. This would yield inconsistencies between different studies of the same genes, compromising the practical use of TagSNPs in association studies. The ACE gene is used to illustrate this. Will it be possible to identify consensus TagSNPs that could be used consistently in all populations for testing association of candidate genes in common diseases? What is the part of myth and reality in what is called "individualized medicine"? We conclude that further LD studies are needed to get clear insights into this matter.

17.
Linkage disequilibrium (LD) mapping is commonly used as a fine mapping tool in human genome mapping and has been used with some success for initial disease gene isolation in certain isolated inbred human populations. An understanding of the population history of domestic dog breeds suggests that LD mapping could be routinely utilized in this species for initial genome-wide scans. Such an approach offers significant advantages over traditional linkage analysis. Here, we demonstrate, using canine copper toxicosis in the Bedlington terrier as the model, that LD mapping could be reasonably expected to be a useful strategy in low-resolution, genome-wide scans in pure-bred dogs. Significant LD was demonstrated over distances up to 33.3 cM. It is very unlikely, for a number of reasons discussed, that this result could be extrapolated to the rest of the genome. It is, however, consistent with the expectation given the population structure of canine breeds and, in this breed at least, with the hypothesis that it may be possible to utilize LD in a genome-wide scan. In this study, LD mapping confirmed the location of the copper toxicosis in Bedlington terrier gene (CT-BT) and was able to do so in a population that was refractory to traditional linkage analysis.

18.
The effects of selection on genome variation were investigated and visualized in tomato using a high-density single nucleotide polymorphism (SNP) array. 7,720 SNPs were genotyped on a collection of 426 tomato accessions (410 inbreds and 16 hybrids) and over 97% of the markers were polymorphic in the entire collection. Principal component analysis (PCA) and pairwise estimates of F_ST supported that the inbred accessions represented seven sub-populations including processing, large-fruited fresh market, large-fruited vintage, cultivated cherry, landrace, wild cherry, and S. pimpinellifolium. Further divisions were found within both the contemporary processing and fresh market sub-populations. These sub-populations showed higher levels of genetic diversity relative to the vintage sub-population. The array provided a large number of polymorphic SNP markers across each sub-population, ranging from 3,159 in the vintage accessions to 6,234 in the cultivated cherry accessions. Visualization of minor allele frequency revealed regions of the genome that distinguished three representative sub-populations of cultivated tomato (processing, fresh market, and vintage), particularly on chromosomes 2, 4, 5, 6, and 11. The PCA loadings and F_ST outlier analysis between these three sub-populations identified a large number of candidate loci under positive selection on chromosomes 4, 5, and 11. The extent of linkage disequilibrium (LD) was examined within each chromosome for these sub-populations. LD decay varied between chromosomes and sub-populations, with large differences reflective of breeding history. For example, on chromosome 11, decay occurred over 0.8 cM for processing accessions and over 19.7 cM for fresh market accessions. The observed SNP variation and LD decay suggest that different patterns of genetic variation in cultivated tomato are due to introgression from wild species and selection for market specialization.

19.
Perennial ryegrass (Lolium perenne L.) is a highly valued temperate climate grass species grown as forage crop and for amenity uses. Due to its outbreeding nature and recent domestication, a high degree of genetic diversity is expected among cultivars. The aim of this study was to assess the extent of linkage disequilibrium (LD) within European elite germplasm and to evaluate the appropriate methodology for genetic association mapping in perennial ryegrass. A high level of genetic diversity was observed in a set of 380 perennial ryegrass elite genotypes when genotyped with 40 SSRs and 2 STS markers. A Bayesian structure analysis identified two subpopulations, which were confirmed by principal coordinate analysis (PCoA). One subpopulation consisted mainly of genotypes originating from the UK, while germplasm mostly from Continental Europe was grouped into the second subpopulation. LD (r²) decay was rapid and occurred within 0.4 cM across European varieties, when population structure was taken into consideration. However, an extended LD of up to 6.6 cM was detected within the variety Aberdart. High genetic diversity and rapid LD decay provide means for high resolution association mapping in elite materials of perennial ryegrass. However, different strategies need to be applied depending on the material used. Genome-wide association study (GWAS) with several hundred markers can be applied within synthetic varieties to identify large (up to 10 cM) genomic regions affecting trait variation. A combination of available and novel DNA markers is needed to achieve resolution required for GWAS in elite breeding materials. An even higher marker density of several million SNPs might be needed for GWAS in diverse ecotype collections, potentially resulting in quantitative trait polymorphism (QTP) identification.

20.
The identification of molecular markers associated with economic and quality traits will help improve breeding for new apple (Malus × domestica Borkh.) cultivars. Tools such as the 8K apple SNP array developed by the RosBREED consortium allow for high-throughput genotyping of SNP polymorphisms within collections. However, genetic characterization and the identification of population stratification and kinship within germplasm collections is a fundamental prerequisite for identifying robust marker–trait associations. In this study, a collection of apple germplasm originally developed for plant architectural studies and consisting of both non-commercial/local and elite accessions was genotyped using the 8K apple SNP array to identify cryptic relationships between accessions, to analyze population structure and to calculate the linkage disequilibrium (LD). A total of nine pairs of synonyms and several triploid accessions were identified within the 130 accessions genotyped. In addition, most of the known parent-child relations were confirmed, and several putative, previously unknown parent-child relations were identified among the local accessions. No clear subgroups could be identified although some separation between local and elite accessions was evident. The study of LD showed a rapid decay in our collection, indicating that a larger number of SNPs is necessary to perform whole genome association mapping. Finally, an association mapping effort for architectural traits was carried out on a small number of accessions to estimate the feasibility of this approach.
