首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 593 毫秒
1.
T cell receptors (TR), through their interaction with the major histocompatibility complex, play a central role in immune responsiveness and potentially immune-related disorders. We resequenced all 57 variable (V) genes in the human T cell receptor alpha and delta (TRA/TRD) locus in 40 individuals of Northern European, Mexican, African-American and Chinese descent. Two hundred and eighty-four single nucleotide polymorphisms (SNPs) were identified. The distribution of SNPs between V genes was heterogeneous, with an average of five SNPs per gene and a range of zero to 15. We describe the patterns of linkage disequilibrium for these newly discovered SNPs and compare these patterns with other emerging large-scale datasets (e.g. Perlegen and HapMap projects) to place our findings into a framework for future analysis of genotype–phenotype associations across this locus. Furthermore, we explore signatures of natural selection across V genes. We find evidence of strong directional selection at this locus as evidenced by unusually high values of F st Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users.  相似文献   

2.
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users.  相似文献   

3.
4.
We compared the single-nucleotide polymorphisms (SNPs) in humans in 182 housekeeping and 148 tissue-specific genes. SNPs were divided into rare and common polymorphisms based on their frequencies. We found that housekeeping genes tend to be less polymorphic than tissue-specific genes for both rare and common SNPs. Using mouse as a second species for computing sequence divergences, we found no evidence of positive selection: for both housekeeping and tissue-specific genes, the ratio of nonsynonymous to synonymous common SNPs per site showed no significant difference from that of divergence. Similarly, we observed no evidence of positive selection for the 289 and 149 genes that have orthologs available for divergence calculation between humans and chimpanzees and between humans and Old World monkeys, respectively. A comparison with previous SNP studies suggests that approximately 20% of the nonsynonymous SNPs in the human population are nearly neutral and that positive selection in the human genome might not be as frequent as previously thought.  相似文献   

5.
Casto AM  Feldman MW 《PLoS genetics》2011,7(1):e1001266
Genome-wide association studies (GWAS) have identified more than 2,000 trait-SNP associations, and the number continues to increase. GWAS have focused on traits with potential consequences for human fitness, including many immunological, metabolic, cardiovascular, and behavioral phenotypes. Given the polygenic nature of complex traits, selection may exert its influence on them by altering allele frequencies at many associated loci, a possibility which has yet to be explored empirically. Here we use 38 different measures of allele frequency variation and 8 iHS scores to characterize over 1,300 GWAS SNPs in 53 globally distributed human populations. We apply these same techniques to evaluate SNPs grouped by trait association. We find that groups of SNPs associated with pigmentation, blood pressure, infectious disease, and autoimmune disease traits exhibit unusual allele frequency patterns and elevated iHS scores in certain geographical locations. We also find that GWAS SNPs have generally elevated scores for measures of allele frequency variation and for iHS in Eurasia and East Asia. Overall, we believe that our results provide evidence for selection on several complex traits that has caused changes in allele frequencies and/or elevated iHS scores at a number of associated loci. Since GWAS SNPs collectively exhibit elevated allele frequency measures and iHS scores, selection on complex traits may be quite widespread. Our findings are most consistent with this selection being either positive or negative, although the relative contributions of the two are difficult to discern. Our results also suggest that trait-SNP associations identified in Eurasian samples may not be present in Africa, Oceania, and the Americas, possibly due to differences in linkage disequilibrium patterns. This observation suggests that non-Eurasian and non-East Asian sample populations should be included in future GWAS.  相似文献   

6.
Several studies have shown that immune system proteins have on average a higher rate of amino acid evolution between different species of mammals than do most other proteins. To test whether immune-system-expressed loci show a correspondingly elevated rate of within-species nonsynonymous (amino acid altering) polymorphism, we examined gene diversity (heterozygosity) at 4,911 single nucleotide polymorphism (SNP) sites at 481 protein-coding loci. At loci with nonimmune functions, gene diversity at nonsynonymous SNP sites was typically lower than that at silent SNP sites (those not altering the amino acid sequence) in the same gene, a pattern that is an evidence of purifying selection acting to eliminate slightly deleterious variants. However, this pattern was not seen at nonsynonymous SNPs causing conservative amino acid replacements in immune system proteins, indicating that the latter are subject to a reduced level of functional constraint. Similarly, immune system genes showed higher gene diversities in their 5′ noncoding regions than did other proteins. These results identified certain immune system loci that are likely to be subject to balancing selection that acts to maintain polymorphism in either coding or regulatory regions. Electronic Supplementary Material Supplementary material is available for this article at .  相似文献   

7.
High-throughput sequencing opens avenues to find genetic variations that may be indicative of an increased risk for certain diseases. Linking these genomic data to other "omics" approaches bears the potential to deepen our understanding of pathogenic processes at the molecular level. To detect novel single nucleotide polymorphisms (SNPs) for glioblastoma multiforme (GBM), we used a combination of specific target selection and next generation sequencing (NGS). We generated a microarray covering the exonic regions of 132 GBM associated genes to enrich target sequences in two GBM tissues and corresponding leukocytes of the patients. Enriched target genes were sequenced with Illumina and the resulting reads were mapped to the human genome. With this approach we identified over 6000 SNPs, including over 1300 SNPs located in the targeted genes. Integrating the genome-wide association study (GWAS) catalog and known disease associated SNPs, we found that several of the detected SNPs were previously associated with smoking behavior, body mass index, breast cancer and high-grade glioma. Particularly, the breast cancer associated allele of rs660118 SNP in the gene SART1 showed a near doubled frequency in glioblastoma patients, as verified in an independent control cohort by Sanger sequencing. In addition, we identified SNPs in 20 of 21 GBM associated antigens providing further evidence that genetic variations are significantly associated with the immunogenicity of antigens.  相似文献   

8.
Traditional fine-mapping approaches in mouse genetics that go from a linkage region to a candidate gene are very costly and time consuming. Shared ancestry regions, along with the combination of genetics and genomics approaches, provide a powerful tool to shorten the time and effort required to identify a causative gene. In this article we present a novel methodology that predicts IBD (identical by descent) regions between pairs of inbred strains using single nucleotide polymorphism (SNP) maps. We have validated this approach by comparing the IBD regions, estimated using different algorithms, to the results derived using the sequence information in the strains present in the Celera Mouse Database. We showed that based on the current publicly available SNP genotypes, large IBD regions (>1 Mb) can be identified successfully. By assembling a list of 21,514 SNPs in 61 common inbred strains, we inferred IBD regions between all pairs of strains and confirmed, for the first time, that existing quantitative trait genes (QTG) and susceptibility genes all lie outside of IBD regions. We also illustrated how knowledge of IBD structures can be applied to strain selection for future crosses. We have made our results available for data mining and download through a public website ( ). Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users.  相似文献   

9.
Qualitative and quantitative defects in human germ cell production that result in infertility are common and determined at least in part by genetic factors [Matzuk and Lamb, Nat Cell Biol 4(Suppl):s41–s49, 2002]. Yet, very few genes that are associated with germ cell defects in humans have been identified. In this study, we examined whether variants of the Deleted in AZoospermia-Like (DAZL) gene are associated with measures of germ cell production in three distinct populations of men and women. We identified 95 sequence variants in DAZL and further analyzed twelve single nucleotide polymorphisms (SNPs) that were present across ethnicities. We found that seven of the twelve SNPs were associated with at least one of the parameters studied (age at premature ovarian failure or menopause, total sperm count, or total motile sperm count). Surprisingly, many alleles exhibited opposing effects in men and women, which may be a result of different genetic requirements in male and female germ cells. Single SNP and haplotype analysis suggested that SNPs in the DAZL gene may act jointly to affect common reproductive characteristics in the human population. Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users. Joyce Y. Tung and Mitchell P. Rosen have contributed equally  相似文献   

10.
Pathogens have played a substantial role in human evolution, with past infections shaping genetic variation at loci influencing immune function. We selected 168 genes known to be involved in the immune response, genotyped common single nucleotide polymorphisms across each gene in three population samples (CEPH Europeans from Utah, Han Chinese from Guangxi, and Yoruba Nigerians from Southwest Nigeria) and searched for evidence of selection based on four tests for non-neutral evolution: minor allele frequency (MAF), derived allele frequency (DAF), Fst versus heterozygosity and extended haplotype homozygosity (EHH). Six of the 168 genes show some evidence for non-neutral evolution in this initial screen, with two showing similar signals in independent data from the International HapMap Project. These analyses identify two loci involved in immune function that are candidates for having been subject to evolutionary selection, and highlight a number of analytical challenges in searching for selection in genome-wide polymorphism data. Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users. Emily C. Walsh, Pardis Sabeti, Holli B. Hutcheson, and Ben Fry have contributed equally to this work and Stephen O’Brien and David Altshuler have jointly supervised this project  相似文献   

11.
Recent studies have shown that the human genome has a haplotype block structure such that it can be decomposed into large blocks with high linkage disequilibrium (LD) and relatively limited haplotype diversity, separated by short regions of low LD. One of the practical implications of this observation is that only a small fraction of all the single-nucleotide polymorphisms (SNPs) (referred as "tag SNPs") can be chosen for mapping genes responsible for human complex diseases, which can significantly reduce genotyping effort, without much loss of power. Algorithms have been developed to partition haplotypes into blocks with the minimum number of tag SNPs for an entire chromosome. In practice, investigators may have limited resources, and only a certain number of SNPs can be genotyped. In the present article, we first formulate this problem as finding a block partition with a fixed number of tag SNPs that can cover the maximal percentage of the whole genome, and we then develop two dynamic programming algorithms to solve this problem. The algorithms are sufficiently flexible to permit knowledge of functional polymorphisms to be considered. We apply the algorithms to a data set of SNPs on human chromosome 21, combining the information of coding and noncoding regions. We study the density of SNPs in intergenic regions, introns, and exons, and we find that the SNP density in intergenic regions is similar to that in introns and is higher than that in exons, results that are consistent with previous studies. We also calculate the distribution of block break points in intergenic regions, genes, exons, and coding regions and do not find any significant differences.  相似文献   

12.
Identifying the signature and targets of local adaptation is an increasingly important goal in empirical population genetics. Using data from 443 balsam poplar Populus balsamifera trees sampled from 31 populations, we tested for evidence of geographically variable selection shaping diversity at 27 homologues of the Arabidopsis flowering-time network. These genes are implicated in the control of seasonal phenology, an important determinant of fitness. Using 335 candidate and 412 reference single nucleotide polymorphisms (SNPs), we tested for evidence of local adaptation by searching for elevated population differentiation using F(ST)-based outlier analyses implemented in BayeScan or a Hierarchical Model in Arelquin and by testing for significant associations between allele frequency and environmental variables using BAYENV. A total of 46 SNPs from 14 candidate genes had signatures of local adaptation-either significantly greater population differentiation or significant covariance with one or more environmental variable relative to reference SNP distributions. Only 11 SNPs from two genes exhibited both elevated population differentiation and covariance with one or more environmental variables. Several genes including the abscisic acid gene ABI1B and the circadian clock genes ELF3 and GI5 harbored a large number of SNPs with signatures of local adaptation-with SNPs in GI5 strongly covarying with both latitude and precipitation and SNPs in ABI1B strongly covarying with temperature. In contrast to several other systems, we find little evidence that photoreceptors, including phytochromes, play an important role in local adaptation. Our results additionally show that detecting local adaptation is sensitive to the analytical approaches used and that model-based significance thresholds should be viewed with caution.  相似文献   

13.
The "thrifty" genotype and phenotype that save energy are detrimental to the health of people living in affluent societies. Individual differences in energy metabolism are caused primarily by single nucleotide polymorphisms (SNPs), some of which promote the development of obesity/type 2 diabetes mellitus. In this review, four major questions are addressed: (1) Why did regional differences in energy metabolism develop during evolution? (2) How do genes respond to starvation and affluence? (3) Which SNPs correspond to the hypothetical "thrifty genes"? (4) How can we cope with disease susceptibility caused by the "thrifty" SNPs? We examined mtDNA and genes for energy metabolism in people who live in several parts of Asia and the Pacific islands. We included 14 genes, and the SNP frequencies of PPAR gamma 2, LEPR, and UCP3-p and some other genes differ significantly between Mongoloids and Caucasoids. These differences in SNPs may have been caused by natural selection depending on the types of agriculture practiced in different regions. Interventions to counteract the adverse effects of "thrifty" SNPs have been partially effective.  相似文献   

14.
Recent technological developments have facilitated an increased focus on identifying genomic regions underlying adaptive trait variation in natural populations, and it has been advocated that this information should be important for designating population units for conservation. In marine fishes, phenotypic studies have suggested adaptation through divergence of life-history traits among natural populations, but the distribution of adaptive genetic variation in these species is still relatively poorly known. In this study, we extract information about the geographical distribution of genetic variation for 33 single nucleotide polymorphisms (SNPs) associated with life-history trait candidate genes, and compare this to variation in 70 putatively neutral SNPs in Atlantic cod (Gadus morhua). We analyse samples covering the major population complexes in the eastern Atlantic and find strong evidence for non-neutral levels and patterns of population structuring for several of the candidate gene-associated markers, including two SNPs in the growth hormone 1 gene. Thus, this study aligns with findings from phenotypic studies, providing molecular data strongly suggesting that these or closely linked genes are under selection in natural populations of Atlantic cod. Furthermore, we find that patterns of variation in outlier markers do not align with those observed at selectively neutral markers, and that outlier markers identify conservation units on finer geographical scales than those revealed when analysing only neutral markers. Accordingly, results also suggest that information about adaptive genetic variation will be useful for targeted conservation and management in this and other marine species.  相似文献   

15.
Positive selection is known to occur when the environment that an organism inhabits is suddenly altered, as is the case across recent human history. Genome-wide association studies (GWASs) have successfully illuminated disease-associated variation. However, whether human evolution is heading towards or away from disease susceptibility in general remains an open question. The genetic-basis of common complex disease may partially be caused by positive selection events, which simultaneously increased fitness and susceptibility to disease. We analyze seven diseases studied by the Wellcome Trust Case Control Consortium to compare evidence for selection at every locus associated with disease. We take a large set of the most strongly associated SNPs in each GWA study in order to capture more hidden associations at the cost of introducing false positives into our analysis. We then search for signs of positive selection in this inclusive set of SNPs. There are striking differences between the seven studied diseases. We find alleles increasing susceptibility to Type 1 Diabetes (T1D), Rheumatoid Arthritis (RA), and Crohn''s Disease (CD) underwent recent positive selection. There is more selection in alleles increasing, rather than decreasing, susceptibility to T1D. In the 80 SNPs most associated with T1D (p-value <7.01×10−5) showing strong signs of positive selection, 58 alleles associated with disease susceptibility show signs of positive selection, while only 22 associated with disease protection show signs of positive selection. Alleles increasing susceptibility to RA are under selection as well. In contrast, selection in SNPs associated with CD favors protective alleles. These results inform the current understanding of disease etiology, shed light on potential benefits associated with the genetic-basis of disease, and aid in the efforts to identify causal genetic factors underlying complex disease.  相似文献   

16.
Understanding the genetic mechanisms that facilitate adaptive radiation is an important component of evolutionary biology. Here, we genotyped 82 neutral SNPs, seven SNPs in islands of divergence identified in a previous study (island SNPs), and a region of the major histocompatibility complex (MHC) in 32 populations of sockeye salmon to investigate whether conserved genes and genomic regions are involved in adaptive radiation. Populations representing three ecotypes were sampled from seven drainages with differing habitats and colonization histories spanning a range of 2,000 km. We found strong signatures of parallel selection across drainages at the island SNPs and MHC, suggesting that the same loci undergo divergent selection during adaptive radiation. However, patterns of differentiation at most island SNPs and the MHC were not associated with ecotypes, suggesting that these loci are responding differently to a mosaic of selective pressures. Our study provides some of the first evidence that conserved genomic islands may be involved in adaptive divergence of salmon populations. Additionally, our data provide further support for the hypothesis that sockeye salmon inhabiting rivers unconnected to lakes harbour similar genetic diversity across large distances, are likely the ancestral form of the species, and have repeatedly recolonized lake systems as they have become available after glacial recession. Finally, our results highlight the value and importance of validating outlier loci by screening additional populations and regions, a practice that will hopefully become more common in the future.  相似文献   

17.
18.
Human driven selection during domestication and subsequent breed formation has likely left detectable signatures within the genome of modern cattle. The elucidation of these signatures of selection is of interest from the perspective of evolutionary biology, and for identifying domestication-related genes that ultimately may help to further genetically improve this economically important animal. To this end, we employed a panel of more than 15 million autosomal SNPs identified from re-sequencing of 43 Fleckvieh animals. We mainly applied two somewhat complementary statistics, the integrated Haplotype Homozygosity Score (iHS) reflecting primarily ongoing selection, and the Composite of Likelihood Ratio (CLR) having the most power to detect completed selection after fixation of the advantageous allele. We find 106 candidate selection regions, many of which are harboring genes related to phenotypes relevant in domestication, such as coat coloring pattern, neurobehavioral functioning and sensory perception including KIT, MITF, MC1R, NRG4, Erbb4, TMEM132D and TAS2R16, among others. To further investigate the relationship between genes with signatures of selection and genes identified in QTL mapping studies, we use a sample of 3062 animals to perform four genome-wide association analyses using appearance traits, body size and somatic cell count. We show that regions associated with coat coloring significantly (P<0.0001) overlap with the candidate selection regions, suggesting that the selection signals we identify are associated with traits known to be affected by selection during domestication. Results also provide further evidence regarding the complexity of the genetics underlying coat coloring in cattle. This study illustrates the potential of population genetic approaches for identifying genomic regions affecting domestication-related phenotypes and further helps to identify specific regions targeted by selection during speciation, domestication and breed formation of cattle. We also show that Linkage Disequilibrium (LD) decays in cattle at a much faster rate than previously thought.  相似文献   

19.
Single nucleotide polymorphisms (SNPs), due to their abundance and low mutation rate, are very useful genetic markers for genetic association studies. However, the current genotyping technology cannot afford to genotype all common SNPs in all the genes. By making use of linkage disequilibrium, we can reduce the experiment cost by genotyping a subset of SNPs, called Tag SNPs, which have a strong association with the ungenotyped SNPs, while are as independent from each other as possible. The problem of selecting Tag SNPs is NP-complete; when there are large number of SNPs, in order to avoid extremely long computational time, most of the existing Tag SNP selection methods first partition the SNPs into blocks based on certain block definitions, then Tag SNPs are selected in each block by brute-force search. The size of the Tag SNP set obtained in this way may usually be reduced further due to the inter-dependency among blocks. This paper proposes two algorithms, TSSA and TSSD, to tackle the block-independent Tag SNP selection problem. TSSA is based on A* search algorithm, and TSSD is a heuristic algorithm. Experiments show that TSSA can find the optimal solutions for medium-sized problems in reasonable time, while TSSD can handle very large problems and report approximate solutions very close to the optimal ones.  相似文献   

20.
Polyploidization is a prevalent mode of genome diversification within plants. Most gene duplicates arising from polyploidization (paralogs) are typically lost, although a subset may be maintained under selection due to dosage, partitioning of gene function, or acquisition of novel functions. Because they experience selection in the presence of other duplicate loci across the genome, interactions among genes may also play a significant role in the maintenance of paralogs resulting from polyploidization. Previously, we identified duplicates of the genes LFY/FLO and AP3/DEF that directly interact in a floral regulatory pathway and are thought to be the result of ancient polyploidization in the Lamiales (> 50 mya). Although duplicates of MADS box genes including AP3/DEF are common throughout the angiosperm lineage, LFY/FLO duplicates in Lamiales are the first reported outside of tetraploid taxa. In order to explore hypotheses for the joint preservation of these interacting floral regulatory genes including novel LFY/FLO paralogs, here we clone FLO and DEF duplicates from additional Lamiales taxa and apply codon substitution models to test how selection acts on both genes following duplication. We find acceleration in the ratio of nonsynonymous-to-synonymous nucleotide substitutions for one (FLO) or both (DEF) paralogs that appears to be due to relaxed purifying selection as opposed to positive selection and shows a different pattern among functional domains of these genes. Several mechanisms are discussed that might be responsible for preservation of co-orthologs of FLO and DEF in Lamiales, including interactions among the genes of this regulatory pathway. Electronic Supplementary Material Electronic Supplementary material is available for this article at and accessible for authorised users. [Reviewing Editor: Dr. Yves Van de Peer]  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号