首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 359 毫秒
1.
Candidate gene association studies have met with mixed success due to many reasons including incomplete surveys of genetic variation and differences in patterns of genetic variation among study populations. We present the results of comprehensive variant discovery for the corticotropin releasing hormone gene (CRH on chromosome 8) encoding a neuropeptide that is central to many physiologic pathways. Mouse-human hybrid cell lines were constructed that are monosomic for human chromosome 8 for resequencing of separated CRH alleles to identify variants and directly determine their chromosomal phase for three major ethnic groups including African Americans (AA), Mexican Americans (MA) and European Americans (EA). We also resequenced diploid individuals to evaluate single nucleotide polymorphism (SNP) discovery in the limited numbers of monosomic hybrid cell lines. Our results show that CRH variation is very different in AA, yielding larger numbers of variants and haplotypes compared to MA and EA. Analysis of LD structure found three haplotype blocks in AA and two blocks in EA. Comparisons between AA and EA groups yielded extremely high measures of genetic differentiation (Wright's F(ST)>0.6), likely reflecting disruptive selection in CRH evolution. Network analysis showed that AA have retained an ancestral CRH haplotype, while the most common EA haplotype is derived from a single recombination event.  相似文献   

2.
Demographic history plays a major role in shaping the distribution of genomic variation. Yet the interaction between different demographic forces and their effects in the genomes is not fully resolved in human populations. Here, we focus on the Roma population, the largest transnational ethnic minority in Europe. They have a South Asian origin and their demographic history is characterized by recent dispersals, multiple founder events, and extensive gene flow from non-Roma groups. Through the analyses of new high-coverage whole exome sequences and genome-wide array data for 89 Iberian Roma individuals together with forward simulations, we show that founder effects have reduced their genetic diversity and proportion of rare variants, gene flow has counteracted the increase in mutational load, runs of homozygosity show ancestry-specific patterns of accumulation of deleterious homozygotes, and selection signals primarily derive from preadmixture adaptation in the Roma population sources. The present study shows how two demographic forces, bottlenecks and admixture, act in opposite directions and have long-term balancing effects on the Roma genomes. Understanding how demography and gene flow shape the genome of an admixed population provides an opportunity to elucidate how genomic variation is modeled in human populations.  相似文献   

3.
Whole genome sequences (WGS) greatly increase our ability to precisely infer population genetic parameters, demographic processes, and selection signatures. However, WGS may still be not affordable for a representative number of individuals/populations. In this context, our goal was to assess the efficiency of several SNP genotyping strategies by testing their ability to accurately estimate parameters describing neutral diversity and to detect signatures of selection. We analysed 110 WGS at 12× coverage for four different species, i.e., sheep, goats and their wild counterparts. From these data we generated 946 data sets corresponding to random panels of 1K to 5M variants, commercial SNP chips and exome capture, for sample sizes of five to 48 individuals. We also extracted low‐coverage genome resequencing of 1×, 2× and 5× by randomly subsampling reads from the 12× resequencing data. Globally, 5K to 10K random variants were enough for an accurate estimation of genome diversity. Conversely, commercial panels and exome capture displayed strong ascertainment biases. Besides the characterization of neutral diversity, the detection of the signature of selection and the accurate estimation of linkage disequilibrium (LD) required high‐density panels of at least 1M variants. Finally, genotype likelihoods increased the quality of variant calling from low coverage resequencing but proportions of incorrect genotypes remained substantial, especially for heterozygote sites. Whole genome resequencing coverage of at least 5× appeared to be necessary for accurate assessment of genomic variations. These results have implications for studies seeking to deploy low‐density SNP collections or genome scans across genetically diverse populations/species showing similar genetic characteristics and patterns of LD decay for a wide variety of purposes.  相似文献   

4.
Mapping by admixture linkage disequilibrium (MALD) is a potentially powerful technique for the mapping of complex genetic diseases. The practical requirements of this method include (a) a set of markers spanning the genome that have large allele-frequency differences between the parental ethnicities contributing to the admixed population and (b) an understanding of the extent of admixture in the study population. To this end, a DNA-pooling technique was used to screen microsatellite and diallelic insertion/deletion markers for allele-frequency differences between putative representatives of the parental populations of the admixed Mexican American (MA) and African American (AA) populations. Markers with promising pooled differences were then confirmed by individual genotyping in both the parental and admixed populations. For the MA population, screening of >600 markers identified 151 ethnic-difference markers (EDMs) with delta>0.30 (where delta is the absolute value of each allele-frequency difference between two populations, summed over all marker alleles and divided by two) that are likely to be useful for MALD analysis. For the AA population, analysis of >400 markers identified 97 EDMs. In addition, individual genotyping of these markers in Pima Amerindians, Yavapai Amerindians, European American (EA) individuals, Africans from Zimbabwe, MA individuals, and AA individuals, as well as comparison to the CEPH genotyping set, suggests that the differences between subpopulations of an ethnicity are small for many markers with large interethnic differences. Estimates of admixture that are based on individual genotyping of these markers are consistent with a 60% EA:40% Amerindian contribution to MA populations and with a 20% EA:80% African contribution to AA populations. Taken together, these data suggest that EDMs with large interpopulation and small intrapopulation differences can be readily identified for MALD studies in both AA and MA populations.  相似文献   

5.
Evolutionary forces like Hill-Robertson interference and negative epistasis can lead to deleterious mutations being found on distinct haplotypes. However, the extent to which these forces depend on the selection and dominance coefficients of deleterious mutations and shape genome-wide patterns of linkage disequilibrium (LD) in natural populations with complex demographic histories has not been tested. In this study, we first used forward-in-time simulations to predict how negative selection impacts LD. Under models where deleterious mutations have additive effects on fitness, deleterious variants less than 10 kb apart tend to be carried on different haplotypes relative to pairs of synonymous SNPs. In contrast, for recessive mutations, there is no consistent ordering of how selection coefficients affect LD decay, due to the complex interplay of different evolutionary effects. We then examined empirical data of modern humans from the 1000 Genomes Project. LD between derived alleles at nonsynonymous SNPs is lower compared to pairs of derived synonymous variants, suggesting that nonsynonymous derived alleles tend to occur on different haplotypes more than synonymous variants. This result holds when controlling for potential confounding factors by matching SNPs for frequency in the sample (allele count), physical distance, magnitude of background selection, and genetic distance between pairs of variants. Lastly, we introduce a new statistic HR(j) which allows us to detect interference using unphased genotypes. Application of this approach to high-coverage human genome sequences confirms our finding that nonsynonymous derived alleles tend to be located on different haplotypes more often than are synonymous derived alleles. Our findings suggest that interference may play a pervasive role in shaping patterns of LD between deleterious variants in the human genome, and consequently influences genome-wide patterns of LD.  相似文献   

6.
Exome sequencing offers the potential to study the population-genomic variables that underlie patterns of deleterious variation. Runs of homozygosity (ROH) are long stretches of consecutive homozygous genotypes probably reflecting segments shared identically by descent as the result of processes such as consanguinity, population size reduction, and natural selection. The relationship between ROH and patterns of predicted deleterious variation can provide insight into the way in which these processes contribute to the maintenance of deleterious variants. Here, we use exome sequencing to examine ROH in relation to the distribution of deleterious variation in 27 individuals of varying levels of apparent inbreeding from 6 human populations. A significantly greater fraction of all genome-wide predicted damaging homozygotes fall in ROH than would be expected from the corresponding fraction of nondamaging homozygotes in ROH (p < 0.001). This pattern is strongest for long ROH (p < 0.05). ROH, and especially long ROH, harbor disproportionately more deleterious homozygotes than would be expected on the basis of the total ROH coverage of the genome and the genomic distribution of nondamaging homozygotes. The results accord with a hypothesis that recent inbreeding, which generates long ROH, enables rare deleterious variants to exist in homozygous form. Thus, just as inbreeding can elevate the occurrence of rare recessive diseases that represent homozygotes for strongly deleterious mutations, inbreeding magnifies the occurrence of mildly deleterious variants as well.  相似文献   

7.
Deleterious mutations are found in all populations. Their existence at low frequencies is easily understood, but explaining how they reach high frequencies has long been a challenging problem for population geneticists and evolutionary biologists. Some cases of apparently deleterious alleles are explained by pleiotropy or environmental context dependence, but for universally deleterious alleles, two mechanisms are generally invoked to explain how they can reach high frequencies: (i) genetic drift in small populations and (ii) ‘hitchhiking’ (sensu Maynard Smith J, Haigh J, Genetical Research, 1974, 23, 23–35) involving tight linkage to beneficial mutations. However, these oft‐cited explanations do not immediately resolve the problem because many real populations of interest have population sizes and recombination rates that are large enough to render it nearly impossible for all but the most weakly deleterious (i.e. nearly neutral) mutations to establish and persist. Furthermore, both mechanisms are usually silent about patterns of intraspecific variation in mutation load. In this issue, Peischl S, Dupanloup I, Kirkpatrick M, Excoffier L (Molecular Ecology, 2013) develop and explore a mechanism that puts drift and hitchhiking of deleterious mutations into a specific spatial and demographic context: range expansions. Importantly, their findings provide a plausible explanation for puzzling empirical patterns, such as the paradoxical observation that genotypes at the leading edge of a range expansion are sometimes less fit than those in the ancestral range (when fitness is assessed in a common environment).  相似文献   

8.
Exome sequencing studies in complex diseases are challenged by the allelic heterogeneity, large number and modest effect sizes of associated variants on disease risk and the presence of large numbers of neutral variants, even in phenotypically relevant genes. Isolated populations with recent bottlenecks offer advantages for studying rare variants in complex diseases as they have deleterious variants that are present at higher frequencies as well as a substantial reduction in rare neutral variation. To explore the potential of the Finnish founder population for studying low-frequency (0.5–5%) variants in complex diseases, we compared exome sequence data on 3,000 Finns to the same number of non-Finnish Europeans and discovered that, despite having fewer variable sites overall, the average Finn has more low-frequency loss-of-function variants and complete gene knockouts. We then used several well-characterized Finnish population cohorts to study the phenotypic effects of 83 enriched loss-of-function variants across 60 phenotypes in 36,262 Finns. Using a deep set of quantitative traits collected on these cohorts, we show 5 associations (p<5×10−8) including splice variants in LPA that lowered plasma lipoprotein(a) levels (P = 1.5×10−117). Through accessing the national medical records of these participants, we evaluate the LPA finding via Mendelian randomization and confirm that these splice variants confer protection from cardiovascular disease (OR = 0.84, P = 3×10−4), demonstrating for the first time the correlation between very low levels of LPA in humans with potential therapeutic implications for cardiovascular diseases. More generally, this study articulates substantial advantages for studying the role of rare variation in complex phenotypes in founder populations like the Finns and by combining a unique population genetic history with data from large population cohorts and centralized research access to National Health Registers.  相似文献   

9.
Genes underlying repeated adaptive evolution in natural populations are still largely unknown. Stickleback fish (Gasterosteus aculeatus) have undergone a recent dramatic evolutionary radiation, generating numerous examples of marine-freshwater species pairs and a small number of benthic-limnetic species pairs found within single lakes [1]. We have developed a new genome-wide SNP genotyping array to study patterns of genetic variation in sticklebacks over a wide geographic range, and to scan the genome for regions that contribute to repeated evolution of marine-freshwater or benthic-limnetic species pairs. Surveying 34 global populations with 1,159 informative markers revealed substantial genetic variation, with predominant patterns reflecting demographic history and geographic structure. After correcting for geographic structure and filtering for neutral markers, we detected large repeated shifts in allele frequency at some loci, identifying both known and novel loci likely contributing to marine-freshwater and benthic-limnetic divergence. Several novel loci fall close to genes implicated in epithelial barrier or immune functions, which have likely changed as sticklebacks adapt to contrasting environments. Specific alleles differentiating sympatric benthic-limnetic species pairs are shared in nearby solitary populations, suggesting an allopatric origin for adaptive variants and selection pressures unrelated to sympatry in the initial formation of these classic vertebrate species pairs.  相似文献   

10.
Andolfatto P  Kreitman M 《Genetics》2000,154(4):1681-1691
A previous study of nucleotide polymorphism in a Costa Rican population of Drosophila melanogaster found evidence for a nonneutral deficiency in the number of haplotypes near the proximal breakpoint of In(2L)t, a common inversion polymorphism in this species. Another striking feature of the data was a window of unusually high nucleotide diversity spanning the breakpoint site. To distinguish between selective and neutral demographic explanations for the observed patterns in the data, we sample alleles from three additional populations of D. melanogaster and one population of D. simulans. We find that the strength of associations among sites found at the breakpoint varies between populations of D. melanogaster. In D. simulans, analysis of the homologous region reveals unusually elevated levels of nucleotide polymorphism spanning the breakpoint site. As with American populations of D. melanogaster, our D. simulans sample shows a marked reduction in the number of haplotypes but not in nucleotide diversity. Haplotype tests reveal a significant deficiency in the number of haplotypes relative to the neutral expectation in the D. simulans sample and some populations of D. melanogaster. At the breakpoint site, the level of divergence between haplotype classes is comparable to interspecific divergence. The observation of interspecific polymorphisms that differentiate major haplotype classes in both species suggests that haplotype classes at this locus are considerably old. When considered in the context of other studies on patterns of variation within and between populations of D. melanogaster and D. simulans, our data appear more consistent with the operation of selection than with simple demographic explanations.  相似文献   

11.
Quantifying the distribution of fitness effects among newly arising mutations in the human genome is key to resolving important debates in medical and evolutionary genetics. Here, we present a method for inferring this distribution using Single Nucleotide Polymorphism (SNP) data from a population with non-stationary demographic history (such as that of modern humans). Application of our method to 47,576 coding SNPs found by direct resequencing of 11,404 protein coding-genes in 35 individuals (20 European Americans and 15 African Americans) allows us to assess the relative contribution of demographic and selective effects to patterning amino acid variation in the human genome. We find evidence of an ancient population expansion in the sample with African ancestry and a relatively recent bottleneck in the sample with European ancestry. After accounting for these demographic effects, we find strong evidence for great variability in the selective effects of new amino acid replacing mutations. In both populations, the patterns of variation are consistent with a leptokurtic distribution of selection coefficients (e.g., gamma or log-normal) peaked near neutrality. Specifically, we predict 27–29% of amino acid changing (nonsynonymous) mutations are neutral or nearly neutral (|s|<0.01%), 30–42% are moderately deleterious (0.01%<|s|<1%), and nearly all the remainder are highly deleterious or lethal (|s|>1%). Our results are consistent with 10–20% of amino acid differences between humans and chimpanzees having been fixed by positive selection with the remainder of differences being neutral or nearly neutral. Our analysis also predicts that many of the alleles identified via whole-genome association mapping may be selectively neutral or (formerly) positively selected, implying that deleterious genetic variation affecting disease phenotype may be missed by this widely used approach for mapping genes underlying complex traits.  相似文献   

12.
Although much work has been conducted on coastal populations of the American alligator (Alligator mississippiensis), less is known about the population dynamics and genetic structure of populations of alligators confined to inland habitats. DNA microsatellite loci, derived from the American alligator, were used to investigate patterns of genetic variation within and between populations of alligators distributed at coastal and inland localities in Texas. These data were used to evaluate the genetic discreteness of different alligator stocks relative to their basic ecology at these sites. Observed mean heterozygosities across seven loci for both coastal and inland populations ranged from 0.50-0.61, with both inland and coastal populations revealing similar patterns of variation. Measures of F(st) revealed significant population differentiation among all populations; however, analyses of molecular variance (AMOVAs) failed to demonstrate any apparent geographic pattern relative to the population differentiation indicated by F(st) values. Each population contained unique alleles for at least one locus. Additionally, assignment tests based on the distribution of genotypes placed 76% of individuals to their source population. These genetic data suggest considerable subdivision among alligator populations, possibly influenced by demographic and life history differences as well as barriers to dispersal. These results have clear implications for management. Rather than managing alligators in Texas as a single panmictic population, translocation programs and harvest quotas should consider the ecological and genetic distinctiveness of local alligator populations.  相似文献   

13.
A critically important challenge in empirical population genetics is distinguishing neutral nonequilibrium processes from selective forces that produce similar patterns of variation. We here examine the extent to which linkage disequilibrium (i.e., nonrandom associations between markers) improves this discrimination. We show that patterns of linkage disequilibrium recently proposed to be unique to hitchhiking models are replicated under nonequilibrium neutral models. We also demonstrate that jointly considering spatial patterns of association among variants alongside the site-frequency spectrum is nonetheless of value. Through a comparison of models of equilibrium neutrality, nonequilibrium neutrality, equilibrium hitchhiking, nonequilibrium hitchhiking, and recurrent hitchhiking, we evaluate a linkage disequilibrium (LD) statistic (omega(max)) that appears to have power to identify regions recently shaped by positive selection. Most notably, for demographic parameters relevant to non-African populations of Drosophila melanogaster, we demonstrate that selected loci are distinguishable from neutral loci using this statistic.  相似文献   

14.
Detailed information about the geographic distribution of genetic and genomic variation is necessary to better understand the organization and structure of biological diversity. In particular, spatial isolation within species and hybridization between them can blur species boundaries and create evolutionary relationships that are inconsistent with a strictly bifurcating tree model. Here, we analyse genome‐wide DNA sequence and genetic ancestry variation in Lycaeides butterflies to quantify the effects of admixture and spatial isolation on how biological diversity is organized in this group. We document geographically widespread and pervasive historical admixture, with more restricted recent hybridization. This includes evidence supporting previously known and unknown instances of admixture. The genome composition of admixed individuals varies much more among than within populations, and tree‐ and genetic ancestry‐based analyses indicate that multiple distinct admixed lineages or populations exist. We find that most genetic variants in Lycaeides are rare (minor allele frequency <0.5%). Because the spatial and taxonomic distributions of alleles reflect demographic and selective processes since mutation, rare alleles, which are presumably younger than common alleles, were spatially and taxonomically restricted compared with common variants. Thus, we show patterns of genetic variation in this group are multifaceted, and we argue that this complexity challenges simplistic notions concerning the organization of biological diversity into discrete, easily delineated and hierarchically structured entities.  相似文献   

15.
Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas—70% of the European ancestry in today’s African Americans dates back to European gene flow happening only 7–8 generations ago.  相似文献   

16.
Although many studies confirm long-term small isolated populations (e.g. island endemics) commonly sustain low neutral genetic variation as a result of genetic drift, it is less clear how selection on adaptive or detrimental genes interplay with random forces. We investigated sequence variation at two major histocompatibility complex (Mhc) class II loci on a porpoise endemic to the upper Gulf of California, México (Phocoena sinus, or vaquita). Its unique declining population is estimated around 500 individuals. Single-strand conformation polymorphism analysis revealed one putative functional allele fixed at the locus DQB (n = 25). At the DRB locus, we found two presumed functional alleles (n = 29), differing by a single nonsynonymous nucleotide substitution that could increase the stability at the dimer interface of alphabeta-heterodimers on heterozygous individuals. Identical trans-specific DQB1 and DRB1 alleles were identified between P. sinus and its closest relative, the Burmeister's porpoise (Phocoena spinipinnis). Comparison with studies on four island endemic mammals suggests fixation of one allele, due to genetic drift, commonly occurs at the DQA or DQB loci (effectively neutral). Similarly, deleterious alleles of small effect are also effectively neutral and can become fixed; a high frequency of anatomical malformations on vaquita gave empirical support to this prediction. In contrast, retention of low but functional polymorphism at the DRB locus was consistent with higher selection intensity. These observations indicated natural selection could maintain (and likely also purge) some crucial alleles even in the face of strong and prolonged genetic drift and inbreeding, suggesting long-term small populations should display low inbreeding depression. Low levels of Mhc variation warn about a high susceptibility to novel pathogens and diseases in vaquita.  相似文献   

17.
A fundamental challenge to contemporary genetics is to distinguish rare missense alleles that disrupt protein functions from the majority of alleles neutral on protein activities. High-throughput experimental tools to securely discriminate between disruptive and non-disruptive missense alleles are currently missing. Here we establish a scalable cell-based strategy to profile the biological effects and likely disease relevance of rare missense variants in vitro. We apply this strategy to systematically characterize missense alleles in the low-density lipoprotein receptor (LDLR) gene identified through exome sequencing of 3,235 individuals and exome-chip profiling of 39,186 individuals. Our strategy reliably identifies disruptive missense alleles, and disruptive-allele carriers have higher plasma LDL-cholesterol (LDL-C). Importantly, considering experimental data refined the risk of rare LDLR allele carriers from 4.5- to 25.3-fold for high LDL-C, and from 2.1- to 20-fold for early-onset myocardial infarction. Our study generates proof-of-concept that systematic functional variant profiling may empower rare variant-association studies by orders of magnitude.  相似文献   

18.
A major question in evolutionary biology is how natural selection has shaped patterns of genetic variation across the human genome. Previous work has documented a reduction in genetic diversity in regions of the genome with low recombination rates. However, it is unclear whether other summaries of genetic variation, like allele frequencies, are also correlated with recombination rate and whether these correlations can be explained solely by negative selection against deleterious mutations or whether positive selection acting on favorable alleles is also required. Here we attempt to address these questions by analyzing three different genome-wide resequencing datasets from European individuals. We document several significant correlations between different genomic features. In particular, we find that average minor allele frequency and diversity are reduced in regions of low recombination and that human diversity, human-chimp divergence, and average minor allele frequency are reduced near genes. Population genetic simulations show that either positive natural selection acting on favorable mutations or negative natural selection acting against deleterious mutations can explain these correlations. However, models with strong positive selection on nonsynonymous mutations and little negative selection predict a stronger negative correlation between neutral diversity and nonsynonymous divergence than observed in the actual data, supporting the importance of negative, rather than positive, selection throughout the genome. Further, we show that the widespread presence of weakly deleterious alleles, rather than a small number of strongly positively selected mutations, is responsible for the correlation between neutral genetic diversity and recombination rate. This work suggests that natural selection has affected multiple aspects of linked neutral variation throughout the human genome and that positive selection is not required to explain these observations.  相似文献   

19.
There is extensive variation in DNA methylation between individuals and ethnic groups. These differences arise from a combination of genetic and non-genetic influences and potential modifiers include nutritional cues, early life experience, and social and physical environments. Here we compare genome-wide DNA methylation in neonatal cord blood from African American (AA; N = 112) and European American (EA; N = 91) participants of the CANDLE Study (Conditions Affecting Neurocognitive Development and Learning in Early Childhood). Our goal is to determine if there are replicable ancestry-specific methylation patterns that may implicate risk factors for diseases that have differential prevalence between populations. To identify the most robust ancestry-specific CpG sites, we replicate our results in lymphoblastoid cell lines from Yoruba African and CEPH European panels of HapMap. We also evaluate the influence of maternal nutrition—specifically, plasma levels of vitamin D and folate during pregnancy—on methylation in newborns. We define stable ancestry-dependent methylation of genes that include tumor suppressors and cell cycle regulators (e.g., APC, BRCA1, MCC). Overall, there is lower global methylation in African ancestral groups. Plasma levels of 25-hydroxy vitamin D are also considerably lower among AA mothers and about 60% of AA and 40% of EA mothers have concentrations below 20 ng/ml. Using a weighted correlation analysis, we define a network of CpG sites that is jointly modulated by ancestry and maternal vitamin D. Our results show that differences in DNA methylation patterns are remarkably stable and maternal micronutrients can exert an influence on the child epigenome.  相似文献   

20.
Genome-wide associations have shown a lot of promise in dissecting the genetics of complex traits in humans with single variants, yet a large fraction of the genetic effects is still unaccounted for. Analyzing genetic interactions between variants (epistasis) is one of the potential ways forward. We investigated the abundance and functional impact of a specific type of epistasis, namely the interaction between regulatory and protein-coding variants. Using genotype and gene expression data from the 210 unrelated individuals of the original four HapMap populations, we have explored the combined effects of regulatory and protein-coding single nucleotide polymorphisms (SNPs). We predict that about 18% (1,502 out of 8,233 nsSNPs) of protein-coding variants are differentially expressed among individuals and demonstrate that regulatory variants can modify the functional effect of a coding variant in cis. Furthermore, we show that such interactions in cis can affect the expression of downstream targets of the gene containing the protein-coding SNP. In this way, a cis interaction between regulatory and protein-coding variants has a trans impact on gene expression. Given the abundance of both types of variants in human populations, we propose that joint consideration of regulatory and protein-coding variants may reveal additional genetic effects underlying complex traits and disease and may shed light on causes of differential penetrance of known disease variants.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号