Similar articles (20 found)
1.
Genome-wide association studies (GWAS) have in recent years discovered thousands of associated markers for hundreds of phenotypes. However, associated loci often only explain a relatively small fraction of heritability and the link between association and causality has yet to be uncovered for most loci. Rare causal variants have been suggested as one scenario that may partially explain these shortcomings. Specifically, Dickson et al. recently reported simulations of rare causal variants that lead to association signals of common, tag single nucleotide polymorphisms, dubbed "synthetic associations". However, an open question is what practical implications synthetic associations have for GWAS. Here, we explore the signatures exhibited by such "synthetic associations" and their implications based on patterns of genetic variation observed in human populations, thus accounting for human evolutionary history, a force disregarded in previous simulation studies. This is made possible by human population genetic data from HapMap 3 consisting of both resequencing and array-based genotyping data for the same set of individuals from multiple populations. We report that synthetic associations tend to be further away from the underlying risk alleles compared to "natural associations" (i.e. associations due to underlying common causal variants), but to a much lesser extent than previously predicted, with both the age and the effect size of the risk allele playing a part in this phenomenon. We find that while a synthetic association has a lower probability of capturing causal variants within its linkage disequilibrium block, sequencing around the associated variant need not extend substantially to have a high probability of capturing at least one causal variant. We also show that the minor allele frequency of synthetic associations is lower than that of natural associations for most, but not all, loci that we explored. Finally, we find the variance in associated allele frequency to be a potential indicator of synthetic associations.

2.
The characterization and analysis of genetic variation at the HLA loci provides important insight for population geneticists trying to understand the evolutionary forces that have shaped human populations. This study describes the HLA-A and HLA-B loci serotyping and statistical analysis on an isolated Native American population, the Havasupai of Arizona. Four alleles at the HLA-A locus were identified, while eight alleles were found at the HLA-B locus. These variants were present as 20 of 32 potential two-locus haplotypes, with five of the six most common haplotypes exhibiting high positive linkage disequilibrium. Significant homozygote deficiency (heterozygosity excess) was detected both at HLA-A and at HLA-B. This deviation from Hardy-Weinberg proportions was not attributable to nonselective causes such as different allele frequencies in males and females or avoidance of consanguineous matings. In addition, the distribution of alleles at both HLA-A and HLA-B was more even than expected from neutrality theory; that is, the observed Hardy-Weinberg homozygosity was only 62.4% of that expected under neutrality. These observations suggest that balancing selection is of major importance in maintaining genetic variation at HLA-A and HLA-B.
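The homozygote deficiency described above can be illustrated with a minimal sketch that compares observed homozygosity with the value expected under Hardy-Weinberg proportions (the sum of squared allele frequencies). The function names and the toy genotypes are hypothetical and are not taken from the study. This is not the neutrality-based comparison behind the 62.4% figure, which needs a separate test.

```python
from collections import Counter


def allele_frequencies(genotypes):
    """Map each allele to its frequency in a list of diploid genotypes."""
    counts = Counter(allele for genotype in genotypes for allele in genotype)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}


def expected_homozygosity(genotypes):
    """Homozygosity expected under Hardy-Weinberg proportions: sum of p_i^2."""
    return sum(p * p for p in allele_frequencies(genotypes).values())


def observed_homozygosity(genotypes):
    """Fraction of individuals carrying two copies of the same allele."""
    return sum(1 for a, b in genotypes if a == b) / len(genotypes)


# Hypothetical toy sample, not data from the study.
sample = [("A1", "A2"), ("A1", "A2"), ("A1", "A2"), ("A1", "A1")]
expected = expected_homozygosity(sample)
observed = observed_homozygosity(sample)
print(f"expected={expected:.4f} observed={observed:.4f}")
```

An observed value below the expected one, as in this toy sample, corresponds to the heterozygosity excess the abstract reports. A real analysis would also need a significance test.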

3.
This study addresses the question of how purifying selection operates during recent rapid population growth such as has been experienced by human populations. This is not a straightforward problem because the human population is not at equilibrium: population genetics predicts that, on the one hand, the efficacy of natural selection increases as population size increases, eliminating ever more weakly deleterious variants; on the other hand, a larger number of deleterious mutations will be introduced into the population and will be more likely to increase in their number of copies as the population grows. To understand how patterns of human genetic variation have been shaped by the interaction of natural selection and population growth, we examined the trajectories of mutations with varying selection coefficients, using computer simulations. We observed that while population growth dramatically increases the number of deleterious segregating sites in the population, it only mildly increases the number carried by each individual. Our simulations also show an increased efficacy of natural selection, reflected in a higher fraction of deleterious mutations eliminated at each generation and a more efficient elimination of the most deleterious ones. As a consequence, while each individual carries a larger number of deleterious alleles than expected in the absence of growth, the average selection coefficient of each segregating allele is less deleterious. Combined, our results suggest that the genetic risk of complex diseases in growing populations might be distributed across a larger number of more weakly deleterious rare variants.

4.
The interpretation of data on genetic variation with regard to the relative roles of different evolutionary factors that produce and maintain genetic variation depends critically on our assumptions concerning effective population size and the level of migration between neighboring populations. In humans, recent population growth and movements of specific ethnic groups across wide geographic areas mean that any theory based on assumptions of constant population size and absence of substructure is generally untenable. We examine the effects of population subdivision on the pattern of protein genetic variation in a total sample drawn from an artificial agglomerate of 12 tribal populations of Central and South America, analyzing the pooled sample as though it were a single population. Several striking findings emerge. (1) Mean heterozygosity is not sensitive to agglomeration, but the number of different alleles (allele count) is inflated, relative to neutral mutation/drift/equilibrium expectation. (2) The inflation is most serious for rare alleles, especially those which originally occurred as tribally restricted "private" polymorphisms. (3) The degree of inflation is an increasing function of both the number of populations encompassed by the sample and of the genetic divergence among them. (4) Treating an agglomerated population as though it were a panmictic unit of long standing can lead to serious biases in estimates of mutation rates, selection pressures, and effective population sizes. Current DNA studies indicate the presence of numerous genetic variants in human populations. The findings and conclusions of this paper are all fully applicable to the study of genetic variation at the DNA level as well.

5.
Innan H, Stephan W. Genetics, 2001, 159(1): 389-399
A two-locus model of reversible mutations with compensatory fitness interactions is presented; single mutations are assumed to be deleterious but neutral in appropriate combinations. The expectation of the time of compensatory nucleotide substitutions is calculated analytically for the case of tight linkage between sites. It is shown that selection increases the substitution time dramatically when selection intensity Ns > 1, where N is the diploid population size and s the selection coefficient. Computer simulations demonstrate that recombination increases the substitution time, but the effect of recombination is small when selection is weak. The amount of linkage disequilibrium generated in the process of compensatory substitution is also investigated. It is shown that significant linkage disequilibrium is expected to be rare in natural populations. The model is applied to the mRNA secondary structure of the bicoid 3' untranslated region of Drosophila. It is concluded that average selection intensity Ns against single deleterious mutations is not likely to be much larger than 1.

6.
A 3.5-kb segment of the alcohol dehydrogenase (Adh) region that includes the Adh and Adh-related genes was sequenced in 139 Drosophila pseudoobscura strains collected from 13 populations. The Adh gene encodes four protein alleles and rejects a neutral model of protein evolution with the McDonald-Kreitman test, although the number of segregating synonymous sites is too high to conclude that adaptive selection has operated. The Adh-related gene encodes 18 protein haplotypes and fails to reject an equilibrium neutral model. The populations fail to show significant geographic differentiation of the Adh-related haplotypes. Eight of 404 single nucleotide polymorphisms (SNPs) in the Adh region were in significant linkage disequilibrium with three ADHR protein alleles. Coalescent simulations with and without recombination were used to derive the expected levels of significant linkage disequilibrium between SNPs and 18 protein haplotypes. Maximum levels of linkage disequilibrium are expected for protein alleles at moderate frequencies. In coalescent models without recombination, linkage disequilibrium decays between SNPs and high frequency haplotypes because common alleles mutate to haplotypes that are rare or that reach moderate frequency. The implication of this study is that linkage disequilibrium mapping has the highest probability of success with disease-causing alleles at frequencies of 10%.

7.
Current human sequencing projects observe an abundance of extremely rare genetic variation, suggesting recent acceleration of population growth. To better understand the impact of such accelerating growth on the quantity and nature of genetic variation, we present a new class of models capable of incorporating faster than exponential growth in a coalescent framework. Our work shows that such accelerated growth affects only the population size in the recent past and thus large samples are required to detect the models’ effects on patterns of variation. When we compare models with fixed initial growth rate, models with accelerating growth achieve very large current population sizes and large samples from these populations contain more variation than samples from populations with constant growth. This increase is driven almost entirely by an increase in singleton variation. Moreover, linkage disequilibrium decays faster in populations with accelerating growth. When we instead condition on current population size, models with accelerating growth result in less overall variation and slower linkage disequilibrium decay compared to models with exponential growth. We also find that pairwise linkage disequilibrium of very rare variants contains information about growth rates in the recent past. Finally, we demonstrate that models of accelerating growth may substantially change estimates of present-day effective population sizes and growth times.

8.

Background

Both common and rare genetic variants have been shown to contribute to the etiology of complex diseases. Recent genome-wide association studies (GWAS) have successfully investigated how common variants contribute to the genetic factors associated with common human diseases. However, understanding the impact of rare variants, which are abundant in the human population (one in every 17 bases), remains challenging. A number of statistical tests have been developed to analyze collapsed rare variants identified by association tests. Here, we propose a haplotype-based approach. This work is inspired by an existing statistical framework, the pedigree disequilibrium test (PDT), which uses genetic data to assess the effects of variants in general pedigrees. We aim to compare the performance of the haplotype-based approach and the rare variant-based approach for detecting rare causal variants in pedigrees.

Results

Extensive simulations in the sequencing setting were carried out to evaluate and compare the haplotype-based approach with the rare variant methods that drew on a more conventional collapsing strategy. As assessed through a variety of scenarios, the haplotype-based pedigree tests had enhanced statistical power compared with the rare variant-based pedigree tests when the disease of interest was mainly caused by rare haplotypes (with multiple rare alleles), and vice versa when disease was caused by rare variants acting independently. In most other situations, when disease was caused both by haplotypes with multiple rare alleles and by rare variants with similar effects, the two approaches provided similar power in testing for association.

Conclusions

The haplotype-based approach was designed to assess the role of rare and potentially causal haplotypes. The proposed rare variant-based pedigree tests were designed to assess the role of rare and potentially causal variants. This study clearly documented the situations under which either method performs better than the other. All tests have been implemented in software submitted to the Comprehensive R Archive Network (CRAN) for general use under the name rvHPDT.

9.
Martin G, Otto SP, Lenormand T. Genetics, 2006, 172(1): 593-609
In finite populations, linkage disequilibria generated by the interaction of drift and directional selection (Hill-Robertson effect) can select for sex and recombination, even in the absence of epistasis. Previous models of this process predict very little advantage to recombination in large panmictic populations. In this article we demonstrate that substantial levels of linkage disequilibria can accumulate by drift in the presence of selection in populations of any size, provided that the population is subdivided. We quantify (i) the linkage disequilibrium produced by the interaction of drift and selection during the selective sweep of beneficial alleles at two loci in a subdivided population and (ii) the selection for recombination generated by these disequilibria. We show that, in a population subdivided into n demes of large size N, both the disequilibrium and the selection for recombination are equivalent to that expected in a single population of a size intermediate between the size of each deme (N) and the total size (nN), depending on the rate of migration among demes, m. We also show by simulations that, with small demes, the selection for recombination is stronger than both that expected in an unstructured population (m = 1 - 1/n) and that expected in a set of isolated demes (m = 0). Indeed, migration maintains polymorphisms that would otherwise be lost rapidly from small demes, while population structure maintains enough local stochasticity to generate linkage disequilibria. These effects are also strong enough to overcome the twofold cost of sex under strong selection when sex is initially rare. Overall, our results show that the stochastic theories of the evolution of sex apply to a much broader range of conditions than previously expected.

10.
The Major Histocompatibility Complex (MHC) is a large multigene family coding for glycoproteins that play a key role in the initiation of immune responses in vertebrates. The exon 2 region of the MHC DQB locus was analyzed using 160 finless porpoises from 5 populations in Japanese waters. The 5 populations were based on a previous mitochondrial DNA control region analysis, which showed distinct geographical separation. Eight DQB alleles were detected, and the geographical distribution of the alleles indicated that most of them are shared among the populations. Heterozygosity of the DQB alleles in each population ranged from 0.55 to 0.78, and for all 5 populations was 0.78. Low MHC variability is not a common feature in marine mammals, but the finless porpoise populations inhabiting coastal waters had a relatively high MHC heterozygosity. Balancing selection in the MHC DQB alleles of the finless porpoise was indicated by the higher rate of nonsynonymous than synonymous substitutions for PBR; however, an excess of heterozygotes compared to expectation was not observed. This suggests that the MHC DQB locus in the finless porpoise may have been under balancing selection for a long evolutionary time period, and is influenced by genetic drift beyond the effect of balancing selection for short time periods in small local populations.

11.
Large whole-genome sequencing projects have provided access to much rare variation in human populations, which is highly informative about population structure and recent demography. Here, we show how the age of rare variants can be estimated from patterns of haplotype sharing and how these ages can be related to historical relationships between populations. We investigate the distribution of the age of variants occurring exactly twice in a worldwide sample sequenced by the 1000 Genomes Project, revealing enormous variation across populations. The median age of haplotypes carrying such variants is 50 to 160 generations across populations within Europe or Asia, and 170 to 320 generations within Africa. Haplotypes shared between continents are much older with median ages for haplotypes shared between Europe and Asia ranging from 320 to 670 generations. The distribution of the ages of haplotypes is informative about their demography, revealing recent bottlenecks, ancient splits, and more modern connections between populations. We see the effect of selection in the observation that functional variants are significantly younger than nonfunctional variants of the same frequency. This approach is relatively insensitive to mutation rate and complements other nonparametric methods for demographic inference.

12.

Background

It is possible to infer the past of populations by comparing genomes between individuals. In general, older populations have more genomic diversity than younger populations. The force of selection can also be inferred from population diversity. If selection is strong and frequently eliminates less fit variants, diversity will be limited because new, initially homogeneous populations constantly emerge.

Methodology and Results

Here we translate a population genetics approach to human somatic cancer cell populations by measuring genomic diversity within and between small colorectal cancer (CRC) glands. Control tissue culture and xenograft experiments demonstrate that the population diversity of certain passenger DNA methylation patterns is reduced after cloning but subsequently increases with time. When measured in CRC gland populations, passenger methylation diversity from different parts of nine CRCs was relatively high and uniform, consistent with older, stable lineages rather than mixtures of younger homogeneous populations arising from frequent cycles of selection. The diversity of six metastases was also high, suggesting dissemination early after transformation. Diversity was lower in DNA mismatch repair deficient CRC glands, possibly suggesting more selection and the elimination of less fit variants when mutation rates are elevated.

Conclusion/Significance

The many hitchhiking passenger variants observed in primary and metastatic CRC cell populations are consistent with relatively old populations, suggesting that clonal evolution leading to selective sweeps may be rare after transformation. Selection in human cancers appears to be a weaker force after transformation than presumed, consistent with the observed rarity of driver mutations in cancer genomes. Phenotypic plasticity rather than the stepwise acquisition of new driver mutations may better account for the many different phenotypes within human tumors.

13.
Methods for high-density admixture mapping of disease genes
Admixture mapping (also known as "mapping by admixture linkage disequilibrium," or MALD) has been proposed as an efficient approach to localizing disease-causing variants that differ in frequency (because of either drift or selection) between two historically separated populations. Near a disease gene, patient populations descended from the recent mixing of two or more ethnic groups should have an increased probability of inheriting the alleles derived from the ethnic group that carries more disease-susceptibility alleles. The central attraction of admixture mapping is that, since gene flow has occurred recently in modern populations (e.g., in African and Hispanic Americans in the past 20 generations), it is expected that admixture-generated linkage disequilibrium should extend for many centimorgans. High-resolution marker sets are now becoming available to test this approach, but progress will require (a) computational methods to infer ancestral origin at each point in the genome and (b) empirical characterization of the general properties of linkage disequilibrium due to admixture. Here we describe statistical methods to estimate the ancestral origin of a locus on the basis of the composite genotypes of linked markers, and we show that this approach accurately estimates states of ancestral origin along the genome. We apply this approach to show that strong admixture linkage disequilibrium extends, on average, for 17 cM in African Americans. Finally, we present power calculations under varying models of disease risk, sample size, and proportions of ancestry. Studying approximately 2500 markers in approximately 2500 patients should provide power to detect many regions contributing to common disease. A particularly important result is that the power of an admixture mapping study to detect a locus will be nearly the same for a wide range of mixture scenarios: the mixture proportion should be 10%-90% from both ancestral populations.

14.
Evolutionary aspects of the genetic architecture of common human diseases remain enigmatic. The results of more than 200 genome-wide association studies published to date were compiled in a catalog. We used cataloged data to determine whether derived (mutant) alleles are associated with higher risk of human disease more frequently than ancestral alleles. We placed all allelic variants into ten categories of population frequency (0%-100%) in 10% increments. We then analyzed the relationship between allelic frequency, evolutionary status of the polymorphic site (ancestral versus derived), and disease risk status (risk versus protection). Given the same population frequency, derived alleles are more likely to be risk associated than ancestral alleles, as are rarer alleles. The common interpretation of this association is that negative selection prevents fixation of the risk variants. However, disease stratification as early or late onset suggests that weak selection against risk-associated alleles is unlikely to be a major factor shaping the genetic architecture of common diseases. Our results clearly suggest that the duration of existence of an allele in a population is more important. Alleles existing longer tend to show weaker linkage disequilibrium with neighboring alleles, including the causal alleles, and are less likely to tag a SNP-disease association.

15.
Although human bitter taste perception is hypothesized to be a dietary adaptation, little is known about genetic signatures of selection and patterns of bitter taste perception variability in ethnically diverse populations with different diets, particularly from Africa. To better understand the genetic basis and evolutionary history of bitter taste sensitivity, we sequenced a 2,975 bp region encompassing TAS2R38, a bitter taste receptor gene, in 611 Africans from 57 populations in West Central and East Africa with diverse subsistence patterns, as well as in a comparative sample of 132 non-Africans. We also examined the association between genetic variability at this locus and threshold levels of phenylthiocarbamide (PTC) bitterness in 463 Africans from the above populations to determine how variation influences bitter taste perception. Here, we report striking patterns of variation at TAS2R38, including a significant excess of novel rare nonsynonymous polymorphisms that recently arose only in Africa, high frequencies of haplotypes in Africa associated with intermediate bitter taste sensitivity, a remarkably similar frequency of common haplotypes across genetically and culturally distinct Africans, and an ancient coalescence time of common variation in global populations. Additionally, several of the rare nonsynonymous substitutions significantly modified levels of PTC bitter taste sensitivity in diverse Africans. While ancient balancing selection likely maintained common haplotype variation across global populations, we suggest that recent selection pressures may have also resulted in the unusually high level of rare nonsynonymous variants in Africa, implying a complex model of selection at the TAS2R38 locus in African populations. Furthermore, the distribution of common haplotypes in Africa is not correlated with diet, raising the possibility that common variation may be under selection due to their role in nondietary biological processes. 
In addition, our data indicate that novel rare mutations contribute to the phenotypic variance of PTC sensitivity, illustrating the influence of rare variation on a common trait, as well as the relatively recent evolution of functionally diverse alleles at this locus.

16.
Isoallele Frequencies in Very Large Populations
Jack Lester King. Genetics, 1974, 76(3): 607-613
The frequencies of electrophoretically distinguishable allelic forms of enzymes may be very different from the corresponding frequencies of structurally distinct forms, because many sequence variants may have identical electrophoretic charge. In large populations such frequencies will be determined largely by the number of amino acid sites that are free to vary. The number of distinguishable electrophoretic variants will remain fairly small. Beyond some limiting size, no further effect of population size on allele frequencies is expected, so isolated large populations will have closely similar allele frequencies if polymorphism is due largely to mutation and drift. The most common electrophoretic alleles are expected to be flanked by the next most common, with the rarer alleles increasingly distal. Neither strong selection nor mutation/drift interpretations of enzyme polymorphism are yet disproven, nor is any point between these extremes.

17.
Linkage disequilibrium and the mapping of complex human traits.
The potential value of haplotypes defined by several single nucleotide polymorphisms has attracted recent interest. With sufficient linkage disequilibrium (LD), haplotypes could be used in association studies to map common alleles that might influence the susceptibility to common diseases, as well as for reconstructing the evolution of the genome. It has been proposed that a globally useful resource need only be based on high frequency variants, identified from a few modest samples. Rapid progress has been made in quantifying the pattern of human LD and haplotypes defined by such common variants within and among populations. However, the quality and utility of the proposed LD-based resource could be seriously compromised if important sampling and analytical factors are overlooked in its design. The LD map should be based on adequately justified criteria defined by sound population genetic principles.
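As a rough illustration of what quantifying pairwise LD involves, the sketch below computes the disequilibrium coefficient D and the squared correlation r² for two biallelic loci from phased haplotypes. The function name `ld_stats`, the 0/1 allele coding, and the toy haplotypes are assumptions made for this example and do not come from the article.

```python
def ld_stats(haplotypes):
    """Return (D, r_squared) for two biallelic loci.

    Each haplotype is a pair (a, b), where 1 marks the tracked allele
    at the first and second locus respectively and 0 marks the other allele.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n   # allele frequency at locus 1
    p_b = sum(b for _, b in haplotypes) / n   # allele frequency at locus 2
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                      # departure from independence
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    return d, d * d / denom


# Hypothetical toy haplotypes, for illustration only.
toy = [(1, 1), (1, 1), (0, 0), (0, 0), (1, 0), (0, 1), (1, 1), (0, 0)]
d, r2 = ld_stats(toy)
print(f"D={d:.3f} r2={r2:.3f}")
```

Estimates like these depend on sample size and on the allele frequencies at both loci, which is one reason the sampling concerns raised in the abstract matter.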

18.

Background

Recent genome-wide association (GWA) studies have provided compelling evidence of association between genetic variants and common complex diseases. These studies have made use of cases and controls almost exclusively from populations of European ancestry and little is known about the frequency of risk alleles in other populations. The present study addresses the transferability of disease associations across human populations by examining levels of population differentiation at disease-associated single nucleotide polymorphisms (SNPs).

Methods

We genotyped ~1000 individuals from 53 populations worldwide at 25 SNPs which show robust association with 6 complex human diseases (Crohn's disease, type 1 diabetes, type 2 diabetes, rheumatoid arthritis, coronary artery disease and obesity). Allele frequency differences between populations for these SNPs were measured using Fst. The Fst values for the disease-associated SNPs were compared to Fst values from 2750 random SNPs typed in the same set of individuals.

Results

On average, disease SNPs are not significantly more differentiated between populations than random SNPs in the genome. Risk allele frequencies, however, do show substantial variation across human populations and may contribute to differences in disease prevalence between populations. We demonstrate that, in some cases, risk allele frequency differences are unusually high compared to random SNPs and may be due to the action of local (i.e. geographically-restricted) positive natural selection. Moreover, some risk alleles were absent or fixed in a population, which implies that risk alleles identified in one population do not necessarily account for disease prevalence in all human populations.

Conclusion

Although differences in risk allele frequencies between human populations are not unusually large and are thus likely not due to positive local selection, there is substantial variation in risk allele frequencies between populations which may account for differences in disease prevalence between human populations.
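The abstract does not say which Fst estimator was used. As a minimal sketch, the code below uses the textbook heterozygosity-based definition, Fst = (H_T - H_S) / H_T, for a single biallelic SNP with equally weighted populations. The function name `fst` and the example frequencies are hypothetical.

```python
def fst(subpop_freqs):
    """Wright's Fst for one biallelic SNP.

    subpop_freqs holds the frequency of one allele in each population.
    Populations are weighted equally, which is a simplifying assumption.
    """
    if not subpop_freqs:
        raise ValueError("at least one population is required")
    k = len(subpop_freqs)
    mean_p = sum(subpop_freqs) / k
    h_total = 2 * mean_p * (1 - mean_p)  # expected heterozygosity, pooled
    if h_total == 0:
        raise ValueError("Fst is undefined when the allele is fixed or absent everywhere")
    h_within = sum(2 * p * (1 - p) for p in subpop_freqs) / k  # mean within-population value
    return (h_total - h_within) / h_total


# Hypothetical risk-allele frequencies in two populations.
print(round(fst([0.2, 0.8]), 4))
```

Identical frequencies in every population give an Fst of 0, and an allele fixed in one population but absent from the other gives 1. Comparing disease SNPs against random SNPs, as the study did, would mean applying such a statistic to both sets and comparing the distributions.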

19.
The signature of positive selection at randomly chosen loci
Przeworski M. Genetics, 2002, 160(3): 1179-1189
In Drosophila and humans, there are accumulating examples of loci with a significant excess of high-frequency-derived alleles or high levels of linkage disequilibrium, relative to a neutral model of a random-mating population of constant size. These are features expected after a recent selective sweep. Their prevalence suggests that positive directional selection may be widespread in both species. However, as I show here, these features do not persist long after the sweep ends: The high-frequency alleles drift to fixation and no longer contribute to polymorphism, while linkage disequilibrium is broken down by recombination. As a result, loci chosen without independent evidence of recent selection are not expected to exhibit either of these features, even if they have been affected by numerous sweeps in their genealogical history. How then can we explain the patterns in the data? One possibility is population structure, with unequal sampling from different subpopulations. Alternatively, positive selection may not operate as is commonly modeled. In particular, the rate of fixation of advantageous mutations may have increased in the recent past.

20.