Similar literature
 20 similar articles found (search time: 31 ms)
1.
Blocks of linkage disequilibrium (LD) in the human genome represent segments of ancestral chromosomes. To investigate the relationship between LD and genealogy, we analysed diversity associated with restriction fragment length polymorphism (RFLP) haplotypes of the 5' beta-globin gene complex. Genealogical analyses were based on sequence alleles that spanned a 12.2-kb interval, covering 3.1 kb around the psibeta gene and 6.2 kb of the delta-globin gene and its 5' flanking sequence known as the R/T region. Diversity was sampled from a Kenyan Luo population where recent malarial selection has contributed to substantial LD. A single common sequence allele spanning the 12.2-kb interval exclusively identified the ancestral chromosome bearing the "Bantu" beta(s) (sickle-cell) RFLP haplotype. Other common 5' RFLP haplotypes comprised interspersed segments from multiple ancestral chromosomes. Nucleotide diversity was similar between psibeta and R/T-delta-globin but was non-uniformly distributed within the R/T-delta-globin region. High diversity associated with the 5' R/T identified two ancestral lineages that probably date back more than 2 million years. Within this genealogy, variation has been introduced into the 3' R/T by gene conversion from other ancestral chromosomes. Diversity in delta-globin was found to lead through parts of the main genealogy but to coalesce in a more recent ancestor. The well-known recombination hotspot is clearly restricted to the region 3' of delta-globin. Our analyses show that, whereas one common haplotype in a block of high LD represents a long segment from a single ancestral chromosome, others are mosaics of short segments from multiple ancestors related in genealogies of unsuspected complexity.

2.
Stephan W  Song YS  Langley CH 《Genetics》2006,172(4):2647-2663
We analyzed a three-locus model of genetic hitchhiking with one locus experiencing positive directional selection and two partially linked neutral loci. Following the original hitchhiking approach by Maynard Smith and Haigh, our analysis is purely deterministic. In the first half of the selected phase after a favored mutation has entered the population, hitchhiking may lead to a strong increase of linkage disequilibrium (LD) between the two neutral sites if both are < 0.1s away from the selected site (where s is the selection coefficient). In the second half of the selected phase, the main effect of hitchhiking is to destroy LD. This occurs very quickly (before the end of the selected phase) when the selected site is between both neutral loci. This pattern cannot be attributed to the well-known variation-reducing effect of hitchhiking but is a consequence of secondary hitchhiking effects on the recombinants created in the selected phase. When the selected site is outside the neutral loci (which are, say, < 0.1s apart), however, a fast decay of LD is observed only if the selected site is in the immediate neighborhood of one of the neutral sites (i.e., if the recombination rate r between the selected site and one of the neutral sites satisfies r < 0.1s). If the selected site is far away from the neutral sites (say, r > 0.3s), the decay rate of LD approaches that of neutrality. Averaging over a uniform distribution of initial gamete frequencies shows that the expected LD at the end of the hitchhiking phase is driven toward zero, while the variance is increased when the selected site is well outside the two neutral sites. When the direction of LD is polarized with respect to the more common allele at each neutral site, hitchhiking creates more positive than negative linkage disequilibrium. Thus, hitchhiking may have a distinctively patterned LD-reducing effect, in particular near the target of selection.
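
As a point of comparison for the "decay rate of LD under neutrality" mentioned in the abstract above, the following is a minimal sketch of the textbook deterministic result for a large random-mating population, in which LD between two neutral loci shrinks by a factor (1 - r) each generation. It is not the three-locus model of the paper; the function name `ld_decay` and its parameters are illustrative assumptions.

```python
def ld_decay(d0: float, r: float, generations: int) -> float:
    """Neutral, deterministic LD after a number of generations.

    d0          -- initial linkage disequilibrium D between two neutral loci
    r           -- recombination rate between the loci (0 <= r <= 0.5)
    generations -- number of generations of random mating

    Assumes an effectively infinite random-mating population with no
    selection, drift, mutation, or migration.
    """
    if not 0.0 <= r <= 0.5:
        raise ValueError("recombination rate must lie in [0, 0.5]")
    if generations < 0:
        raise ValueError("generations must be non-negative")
    # Each generation, recombination removes a fraction r of the remaining D.
    return d0 * (1.0 - r) ** generations
```

With unlinked loci (r = 0.5) the disequilibrium halves every generation, whereas tightly linked loci retain most of their initial D for many generations.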

3.
We have identified, in four diverse human populations, five common single-nucleotide polymorphisms (SNPs) in the coding region of the gene for the blood coagulation protease factor XI. Each SNP has an allele frequency >5% in at least one population. Three of the SNPs (C472T, A844G, and T1234C), spread out over approximately 10 kb of genomic DNA, are in marked linkage disequilibrium (LD) with one another (P < 10^-4). Interestingly, haplotypes associated with the linked SNPs are conserved across all populations studied, despite significantly different allele frequencies between populations. The presence of such common, widely dispersed haplotypes could complicate the interpretation of LD studies and emphasizes the need for a better understanding of general patterns of LD to facilitate identification of genes for common disorders.

4.
Strong selection within a given population locally reduces genetic variability not only in the selected gene itself but also in neighbouring loci. This so-called hitch-hiking effect is related to the initial linkage disequilibrium between markers and the selected gene, and depends mainly on the number of copies of the beneficial allele at the start of the selection phase. Contrary to the classical case, in which selection acts on a single, newly arisen beneficial mutation, we considered selection from standing variation (soft selective sweeps) on a gene (Rht-B1) with a major effect on plant height, a selected trait in an experimental wheat population grown for 17 generations, and we documented the evolution of gene diversity and linkage disequilibrium near this gene. As expected, Rht-B1 was found to be under strong selection (s = 0.15) and its variation in frequency accounted for 15% of the total trait evolution. This led to a smaller genetic effective population size at Rht-B1 (Neg = 18) compared to the whole genome estimation (Neg = 167). When compared with expectations under genetic drift only, no significant decrease in gene diversity was found at the closest loci. We computed expected di-locus frequencies for any linked marker–Rht-B1 pair due to hitch-hiking effects. We found that hitch-hiking was expected to affect the two most closely linked loci, but expected reduction in gene diversity was not greater than that due to genetic drift, which was consistent with the observations. Such limited effect was attributed to the low level of linkage disequilibrium (0.16) estimated after parental intercrosses, together with a relatively high initial frequency of the gene. This situation is favourable to candidate gene approaches where small linkage disequilibrium around selected genes is expected.

5.

Background

Haemoglobin S (HbS) and C (HbC) are variants of the HBB gene which both protect against malaria. It is not clear, however, how these two alleles have evolved in the West African countries where they co-exist at high frequencies. Here we use haplotypic signatures of selection to investigate the evolutionary history of the malaria-protective alleles HbS and HbC in the Kassena-Nankana District (KND) of Ghana.

Methodology/Principal Findings

The haplotypic structure of HbS and HbC alleles was investigated by genotyping 56 SNPs around the HBB locus. We found that, in the KND population, both alleles reside on extended haplotypes (approximately 1.5 Mb for HbS and 650 kb for HbC) that are significantly less diverse than those of the ancestral HbA allele. The extended haplotypes span a recombination hotspot that is known to exist in this region of the genome.

Significance

Our findings show strong support for recent positive selection of both the HbS and HbC alleles and provide insights into how these two alleles have both evolved in the population of northern Ghana.

6.
The extent and pattern of linkage disequilibrium (LD) between closely spaced markers contain information about population history, including past population size and selection history. Selection signatures can be identified by comparing the LD surrounding a putative selected allele at a locus to the putative non-selected allele. In livestock populations, locations of selection signatures identified in this way should be correlated with QTL affecting production traits, as the populations have been under strong artificial selection for these traits. We used a dense SNP map of bovine chromosome 6 to characterize the pattern of LD on this chromosome in Norwegian Red cattle, a breed which has been strongly selected for milk production. The pattern of LD was generally consistent with strong selection in regions containing QTL affecting milk production traits, including a strong selection signature in a region containing a mutation known to affect milk production. The results demonstrate that in livestock populations, the origin of selection signatures will often be QTL for livestock production traits, and illustrate the value of selection signatures in uncovering new mutations with potential effects on quantitative traits.

7.
Variability in cystic fibrosis (CF) lung disease is partially due to non-CFTR genetic modifiers. Mucin genes are very polymorphic, and mucins play a key role in the pathogenesis of CF lung disease; therefore, mucin genes are strong candidates as genetic modifiers. DNA from CF patients recruited for extremes of lung phenotype was analyzed by Southern blot or PCR to define variable number tandem repeat (VNTR) length polymorphisms for MUC1, MUC2, MUC5AC, and MUC7. VNTR length polymorphisms were tested for association with lung disease severity and for linkage disequilibrium (LD) with flanking single nucleotide polymorphisms (SNPs). No strong associations were found for MUC1, MUC2, or MUC7. A significant association was found between the overall distribution of MUC5AC VNTR length and CF lung disease severity (p = 0.025; n = 468 patients); in addition, there was a robust association of the specific 6.4 kb HinfI VNTR fragment with severity of lung disease (p = 6.2×10^-4 after Bonferroni correction). There was strong LD between MUC5AC VNTR length modes and flanking SNPs. The severity-associated 6.4 kb VNTR allele of MUC5AC was confirmed to be genetically distinct from the 6.3 kb allele, as it showed significantly stronger association with nearby SNPs. These data provide detailed respiratory mucin gene VNTR allele distributions in CF patients. Our data also show a novel link between the MUC5AC 6.4 kb VNTR allele and severity of CF lung disease. The LD pattern with surrounding SNPs suggests that the 6.4 kb allele contains, or is linked to, important functional genetic variation.

8.
The hemoglobin E variant (HbE; β26 Glu→Lys) is concentrated in parts of Southeast Asia where malaria is endemic, and HbE carrier status has been shown to confer some protection against Plasmodium falciparum malaria. To examine the effect of natural selection on the pattern of linkage disequilibrium (LD) and to infer the evolutionary history of the HbE variant, we analyzed biallelic markers surrounding the HbE variant in a Thai population. Pairwise LD analysis of HbE and 43 surrounding biallelic markers revealed LD of HbE extending beyond 100 kb, whereas no LD was observed between non-HbE variants and the same markers. The inferred haplotype network suggests a single origin of the HbE variant in the Thai population. Forward-in-time computer simulations under a variety of selection models indicate that the HbE variant arose 1,240-4,440 years ago. These results support the conjecture that the HbE mutation occurred recently, and the allele frequency has increased rapidly. Our study provides another clear demonstration that a high-resolution LD map across the human genome can detect recent variants that have been subjected to positive selection.

9.
The gene coding for glucose-6-phosphate dehydrogenase (G6PD) is subject to positive selection by malaria in some human populations. The G6PD A- allele, which is common in sub-Saharan Africa, is associated with deficient enzyme activity and protection from severe malaria. To delimit the impact of selection on patterns of linkage disequilibrium (LD) and nucleotide diversity, we resequenced 5.1 kb at G6PD and approximately 2-3 kb at each of eight loci in a 2.5-Mb region roughly centered on G6PD in a diverse sub-Saharan African panel of 51 unrelated men (including 20 G6PD A-, 11 G6PD A+, and 20 G6PD B chromosomes). The signature of selection is evident in the absence of genetic variation at G6PD and at three neighboring loci within 0.9 Mb from G6PD among all individuals bearing G6PD A- alleles. A genomic region of approximately 1.6 Mb around G6PD was characterized by long-range LD associated with the A- alleles. These patterns of nucleotide variability and LD suggest that G6PD A- is younger than previous age estimates and has increased in frequency in sub-Saharan Africa due to strong selection (0.1 < s < 0.2). These results also show that selection can lead to nonrandom associations among SNPs over great physical and genetic distances, even in African populations.

10.
Storz JF  Kelly JK 《Genetics》2008,180(1):367-379
An important goal of population genetics is to elucidate the effects of natural selection on patterns of DNA sequence variation. Here we report results of a study to assess the joint effects of selection, recombination, and gene flow in shaping patterns of nucleotide variation at genes involved in local adaptation. We first describe a new summary statistic, Z(g), that measures the between-sample component of linkage disequilibrium (LD). We then report results of a multilocus survey of nucleotide diversity and LD between high- and low-altitude populations of deer mice, Peromyscus maniculatus. The multilocus survey included two closely linked alpha-globin genes, HBA-T1 and HBA-T2, that underlie adaptation to different elevational zones. The primary goals were to assess whether the alpha-globin genes exhibit the hallmarks of spatially varying selection that are predicted by theory (i.e., sharply defined peaks in the between-population components of nucleotide diversity and LD) and to assess whether peaks in diversity and LD may be useful for identifying specific sites that distinguish selectively maintained alleles. Consistent with theoretical expectations, HBA-T1 and HBA-T2 were characterized by highly elevated levels of diversity between populations and between allele classes. Simulation and empirical results indicate that sliding-window analyses of Z(g) between allele classes may provide an effective means of pinpointing causal substitutions.

11.
Xiong M  Fan R  Jin L 《Human heredity》2002,53(3):158-172
As dense maps of single nucleotide polymorphism (SNP) markers become available, population-based linkage disequilibrium (LD) mapping or association study is becoming one of the major tools for identifying quantitative trait loci (QTL) and for fine gene mapping. However, in many cases, LD between the marker and trait locus is not very strong. Approaches that maximize the potential of detecting LD will be essential for the success of LD mapping of QTL. In this paper, we propose two strategies for increasing the probability of detecting LD: (1) phenotypic selection and (2) haplotype LD mapping. To provide the foundations for LD mapping of QTL under selection, we develop analytic tools for assessing the impact of phenotypic selection on allele and haplotype frequencies, and LD under three trait models: single trait locus, two unlinked trait loci, and two linked trait loci with or without epistasis. In addition to a traditional χ² test, which compares the difference in allele or haplotype frequencies in the selected sample and population sample, we present multiple regression methods for LD mapping of QTL, and investigate which methods are effective in employing phenotypic selection for QTL mapping. We also develop a statistical framework for investigating and comparing the power of the single marker and multilocus haplotype test for LD mapping of QTL. Finally, the proposed methods are applied to mapping QTL influencing variation in systolic blood pressure in an isolated Chinese population.

12.
Population subdivision and migration are generally considered to be important causes of linkage disequilibrium (LD). We explore the combined effects of recombination and gene flow on the amount of LD, the maintenance of polymorphism, and the degree of local adaptation in a subdivided population by analyzing a diploid, deterministic continent–island model with genic selection on two linked loci (i.e., no dominance or epistasis). For this simple model, we characterize explicitly all possible equilibrium configurations. Simple and intuitive approximations for many quantities of interest are obtained in limiting cases, such as weak migration, weak selection, weak or strong recombination. For instance, we derive explicit expressions for the measures D and r² (the squared correlation in allelic state) of LD. They depend in qualitatively different ways on the migration rate. Remarkably high values of r² are maintained between weakly linked loci, especially if gene flow is low. We determine how the maximum amount of gene flow that admits preservation of the locally adapted haplotype, hence of polymorphism at both loci, depends on recombination rate and selection coefficients. We also investigate the evolution of differentiation by examining the invasion of beneficial mutants of small effect that are linked to an already present, locally adapted allele. Mutants of much smaller effect can invade successfully than predicted by naive single-locus theory provided they are at least weakly linked. Finally, the influence of linkage on the degree of local adaptation, the migration load, and the effective migration rate at a neutral locus is explored. We discuss possible consequences for the evolution of genetic architecture, in particular, for the emergence of clusters of tightly linked, slightly beneficial mutations and the evolution of recombination and chromosome inversions.
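
The squared correlation in allelic state named in the abstract above is conventionally computed from one haplotype frequency and the two allele frequencies. The following is a minimal sketch under the standard two-locus, biallelic definitions (D = p_AB - p_A p_B, and r² equal to D² divided by the product of the four allele frequencies). The function name `ld_stats` and its parameters are illustrative assumptions and do not reproduce the equilibrium expressions derived in the paper.

```python
def ld_stats(p_ab: float, p_a: float, p_b: float) -> tuple[float, float]:
    """Return (D, r_squared) for two biallelic loci.

    p_ab -- frequency of the haplotype carrying allele A at the first
            locus and allele B at the second locus
    p_a  -- frequency of allele A at the first locus
    p_b  -- frequency of allele B at the second locus
    """
    # D is the deviation of the observed haplotype frequency from the
    # value expected if alleles at the two loci were independent.
    d = p_ab - p_a * p_b
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        raise ValueError("both loci must be polymorphic to compute r^2")
    return d, d * d / denom
```

For example, when only the AB and ab haplotypes are present at equal frequency, `ld_stats(0.5, 0.5, 0.5)` gives D = 0.25 and r² = 1, the maximum possible association.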

13.
To fine map genes, investigators often test for disease-marker association in chromosomal regions with evidence for linkage. Given a marker allele tentatively associated with disease, one would ask if this allele, or one in linkage disequilibrium (LD) with it, could account in part for the observed linkage signal. This question can be addressed by determining if families selected on the basis of the presence of the tentatively associated allele show stronger evidence of linkage as measured by increased allele sharing identical by descent (IBD) by affected family members. However, common selection strategies can be biased for or against linkage in the marker region, even given no disease-marker association. We define unbiased selection schemes and extend the definition to allow weighted selection on the basis of all genotyped family members. For affected-sibship data, we describe three genotype-based weight variables, corresponding to dominant, recessive, and additive models. We then introduce a test for association of a family weight variable with excess IBD sharing. This test allows us to determine if the linkage signal in a region can be attributed in part to the presence of a marker allele, either because of direct involvement in disease etiology or because of LD with a predisposing genetic variant. For samples of 500 affected sib pairs, the tests are powerful in detection of genotype-IBD sharing association, even for disease models with sib relative risk as low as λ_S = 1.1, or when evidence for linkage is absent because of sampling variation. This makes our method a new tool for detecting linkage as well as association, especially in regions harboring a candidate gene. We have implemented these methods in the software package GIST (Genotype-IBD Sharing Test).

14.
Mackinnon MJ  Georges MAJ 《Genetics》1992,132(4):1177-1185
The effects of within-sample selection on the outcome of analyses detecting linkage between genetic markers and quantitative traits were studied. It was found that selection by truncation for the trait of interest significantly reduces the differences between marker genotype means thus reducing the power to detect linked quantitative trait loci (QTL). The size of this reduction is a function of proportion selected, the magnitude of the QTL effect, recombination rate between the marker locus and the QTL, and the allele frequency of the QTL. Proportion selected was the most influential of these factors on bias, e.g., for an allele substitution effect of one standard deviation unit, selecting the top 80%, 50% or 20% of the population required 2, 6 or 24 times the number of progeny, respectively, to offset the loss of power caused by this selection. The effect on power was approximately linear with respect to the size of gene effect, almost invariant to recombination rate, and a complex function of QTL allele frequency. It was concluded that experimental samples from animal populations which have been subjected to even minor amounts of selection will be inefficient in yielding information on linkage between markers and loci influencing the quantitative trait under selection.

15.
Hasselmann M  Beye M 《Genetics》2006,174(3):1469-1480
Recombination decreases the association of linked nucleotide sites and can influence levels of polymorphism in natural populations. When coupled with selection, recombination may relax potential conflict among linked genes, a concept that has played a central role in research on the evolution of recombination. The sex determination locus (SDL) of the honeybee is an informative example for exploring the combined forces of recombination, selection, and linkage on sequence evolution. Balancing selection at SDL is very strong and homozygous individuals at SDL are eliminated by worker bees. The recombination rate is increased up to four times that of the genomewide average in the region surrounding SDL. Analysis of nucleotide diversity (π) reveals a sevenfold increase of polymorphism within the sex determination gene complementary sex determiner (csd) that rapidly declines within 45 kb to levels of genomewide estimates. Although no recombination was observed within SDL, which contains csd, analyses of heterogeneity, shared polymorphic sites, and linkage disequilibrium (LD) show that recombination has contributed to the evolution of the 5' part of some csd sequences. Gene conversion, however, has not obviously contributed to the evolution of csd sequences. The local control of recombination appears to be related to SDL function and mode of selection. The homogenizing force of recombination is reduced within SDL, which preserves allelic differences and specificity, while the increase of recombination activity around SDL relaxes conflict between SDL and linked genes.
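
Nucleotide diversity, as used in the abstract above, is commonly defined as the average number of pairwise differences per site among sampled sequences. The following is a minimal sketch of that definition for a set of aligned, equal-length sequences. The function name `nucleotide_diversity` is an illustrative assumption, and the sketch ignores gaps, missing data, and the sample-size corrections that dedicated population-genetics software applies.

```python
from itertools import combinations


def nucleotide_diversity(seqs: list[str]) -> float:
    """Average pairwise differences per site among aligned sequences."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned and non-empty")
    # Count mismatching positions over every unordered pair of sequences.
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2))
        for s1, s2 in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) // 2
    return total_diffs / (n_pairs * length)
```

A sliding-window version of the same calculation is one way to visualise a local peak in diversity that declines with distance from a selected gene.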

16.
Genomic selection (GS) is a promising strategy for enhancing genetic gain. We investigated the accuracy of genomic estimated breeding values (GEBV) in four inter-related synthetic populations that underwent several cycles of recurrent selection in an upland rice-breeding program. A total of 343 S2:4 lines extracted from those populations were phenotyped for flowering time, plant height, grain yield and panicle weight, and genotyped with an average density of one marker per 44.8 kb. The relative effect of the linkage disequilibrium (LD) and minor allele frequency (MAF) thresholds for selecting markers, the relative size of the training population (TP) and of the validation population (VP), the selected trait and the genomic prediction models (frequentist and Bayesian) on the accuracy of GEBVs was investigated in 540 cross validation experiments with 100 replicates. The effect of kinship between the training and validation populations was tested in an additional set of 840 cross validation experiments with a single genomic prediction model. LD was high (average r² = 0.59 at 25 kb) and decreased slowly, distribution of allele frequencies at individual loci was markedly skewed toward unbalanced frequencies (MAF average value 15.2% and median 9.6%), and differentiation between the four synthetic populations was low (FST ≤ 0.06). The accuracy of GEBV across all cross validation experiments ranged from 0.12 to 0.54 with an average of 0.30. Significant differences in accuracy were observed among the different levels of each factor investigated. Phenotypic traits had the biggest effect, and the size of the incidence matrix had the smallest. Significant first degree interaction was observed for GEBV accuracy between traits and all the other factors studied, and between prediction models and LD, MAF and composition of the TP. The potential of GS to accelerate genetic gain and breeding options to increase the accuracy of predictions are discussed.

17.
18.
Saunders MA  Hammer MF  Nachman MW 《Genetics》2002,162(4):1849-1861
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is the most common enzymopathy in humans. Deficiency alleles for this X-linked disorder are geographically correlated with historical patterns of malaria, and the most common deficiency allele in Africa (G6PD A-) has been shown to confer some resistance to malaria in both hemizygous males and heterozygous females. We studied DNA sequence variation in 5.1 kb of G6pd from 47 individuals representing a worldwide sample to examine the impact of selection on patterns of human nucleotide diversity and to infer the evolutionary history of the G6PD A- allele. We also sequenced 3.7 kb of a neighboring locus, L1cam, from the same set of individuals to study the effect of selection on patterns of linkage disequilibrium. Despite strong clinical evidence for malarial selection maintaining G6PD deficiency alleles in human populations, the overall level of nucleotide heterozygosity at G6pd is typical of other genes on the X chromosome. However, the signature of selection is evident in the absence of genetic variation among A- alleles from different parts of Africa and in the unusually high levels of linkage disequilibrium over a considerable distance of the X chromosome. In spite of a long-term association between Plasmodium falciparum and the ancestors of modern humans, patterns of nucleotide variability and linkage disequilibrium suggest that the A- allele arose in Africa only within the last 10,000 years and spread due to selection.

19.
Jones DA  Wakeley J 《Genetics》2008,180(2):1251-1259
In a 2007 article, McVean studied the effect of recombination on linkage disequilibrium (LD) between two neutral loci located near a third locus that has undergone a selective sweep. The results demonstrated that two loci on the same side of a selected locus might show substantial LD, whereas the expected LD for two loci on opposite sides of a selected locus is zero. In this article, we extend McVean's model to include gene conversion. We show that one of the conclusions is strongly affected by gene conversion: when gene conversion is present, there may be substantial LD between two loci on opposite sides of a selective sweep.

20.
Ayala FJ  Balakirev ES  Sáez AG 《Gene》2002,300(1-2):19-29
We have examined the patterns of polymorphism at two linked loci, Sod and Est-6, separated by nearly 1000 kb on the left arm of chromosome 3 of Drosophila melanogaster. The evidence suggests that natural selection has been involved in shaping the polymorphisms. At the Sod locus, a fairly strong (s>0.01) selective sweep, started ≥2600 years ago, increased the frequency of a rare haplotype, F(A), to about 50% frequency in populations of Europe, Asia, and the Americas. More recently, an F(A) allele mutated to an S allele, which has increased to frequencies 5–15% in populations of Europe, Asia and North America. All S alleles are identical (or very nearly) in sequence and differ by one nucleotide substitution (which accounts for the F→S electrophoretic difference) from F(A) alleles. At the Est-6 locus, the evidence indicates both directional and balancing selection impacting separately the promoter and the coding regions of the gene, with linkage disequilibrium occurring within each region. Some linkage disequilibrium also exists between the two genes.
