首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Kim KJ  Lee HJ  Park MH  Cha SH  Kim KS  Kim HT  Kimm K  Oh B  Lee JY 《Genomics》2006,88(5):535-540
Understanding patterns of linkage disequilibrium (LD) across genomes may facilitate association mapping studies to localize genetic variants influencing complex diseases, a recognition that led to the International Haplotype Mapping Project (HapMap). Divergent patterns of haplotype frequency and LD across global populations require that the HapMap database be supplemented with haplotype and LD data from additional populations. We conducted a pilot study of the LD and haplotype structure of a genomic region in a Korean population. A total of 165 SNPs were identified in a 200-kb region of 22q13.2 by direct sequencing. Unphased genotype data were generated for 76 SNPs in 90 unrelated Korean individuals. LD, haplotype diversity, and recombination rates were assessed in this region and compared with the HapMap database. The pattern of LD and haplotype frequencies of Korean samples showed a high degree of similarity with Japanese data. There was a strong correlation between high LD and low recombination frequency in this region. We found considerable similarities in local LD patterns between three Asian populations (Han Chinese, Japanese, and Korean) and the CEPH population. Haplotype frequencies were, however, significantly different between them. Our results should further the understanding of distinctive Korean genomic features and assist in designing appropriate association studies.  相似文献   

2.
Schlenke TA  Begun DJ 《Genetics》2003,164(4):1471-1480
Evidence from disparate sources suggests that natural selection may often play a role in the evolution of host immune system proteins. However, there have been few attempts to make general population genetic inferences on the basis of analysis of several immune-system-related genes from a single species. Here we present DNA polymorphism and divergence data from 34 genes thought to function in the innate immune system of Drosophila simulans and compare these data to those from 28 nonimmunity genes sequenced from the same lines. Several statistics, including average K(A)/K(S) ratio, average silent heterozygosity, and average haplotype diversity, significantly differ between the immunity and nonimmunity genes, suggesting an important role for directional selection in immune system protein evolution. In contrast to data from mammalian immunoglobulins and other proteins, we find no strong evidence for the selective maintenance of protein diversity in Drosophila immune system proteins. This may be a consequence of Drosophila's generalized innate immune response.  相似文献   

3.
Genome-wide SNP arrays have generated unprecedented quantities of data allow the detection of human evolutionary history and dense genome-wide data also enable the identification of distance ancestry among individuals or ethnic groups. To explain wider aspects of the genetic structure of Koreans and the East Asian population, we analyzed 79 individuals from the Korean HapMap project at 555,352 common single-nucleotide polymorphism loci, and compared this data with the worldwide population groups with the 53 ethnic groups from Human Genome Diversity Panel (HGDP-CEPH). Population differentiation (FST), Principal Component Analyses, STRUCTURE and ADMIXTURE are examined. In general, all the individual samples studies here were classified into subset of ethnic groups according to their geographical origins. Korean HapMap individuals were grouped together with East Asian populations from HGDP panel. Recently, a sub-population structure within Korean population has been reported. Our result, however, revealed the genetic homogeneity of Korean population. The ADMIXTURE analysis showed that, overall the Korean populations derive 79 % of their genomic ancestry from southern Asia and have relatively little northern Asian ancestry (21 %). The present work, therefore, provide the evidence that the male-biased southern-to-northern migration influenced not only for the genetic make up of the Y chromosome in the Korean population but also, its autosomal composition.  相似文献   

4.
The HUGO Pan-Asian SNP consortium conducted the largest survey to date of human genetic diversity among Asians by sampling 1,719 unrelated individuals among 71 populations from China, India, Indonesia, Japan, Malaysia, the Philippines, Singapore, South Korea, Taiwan, and Thailand. We have constructed a database (PanSNPdb), which contains these data and various new analyses of them. PanSNPdb is a research resource in the analysis of the population structure of Asian peoples, including linkage disequilibrium patterns, haplotype distributions, and copy number variations. Furthermore, PanSNPdb provides an interactive comparison with other SNP and CNV databases, including HapMap3, JSNP, dbSNP and DGV and thus provides a comprehensive resource of human genetic diversity. The information is accessible via a widely accepted graphical interface used in many genetic variation databases. Unrestricted access to PanSNPdb and any associated files is available at: http://www4a.biotec.or.th/PASNP.  相似文献   

5.
Phenotypic divergences between modern human populations have developed as a result of genetic adaptation to local environments over the past 100,000 years. To identify genes involved in population-specific phenotypes, it is necessary to detect signatures of recent positive selection in the human genome. Although detection of elongated linkage disequilibrium (LD) has been a powerful tool in the field of evolutionary genetics, current LD-based approaches are not applicable to already fixed loci. Here, we report a method of scanning for population-specific strong selective sweeps that have reached fixation. In this method, genome-wide SNP data is used to analyze differences in the haplotype frequency, nucleotide diversity, and LD between populations, using the ratio of haplotype homozygosity between populations. To estimate the detection power of the statistics used in this study, we performed computer simulations and found that these tests are relatively robust against the density of typed SNPs and demographic parameters if the advantageous allele has reached fixation. Therefore, we could determine the threshold for maintaining high detection power, regardless of SNP density and demographic history. When this method was applied to the HapMap data, it was able to identify the candidates of population-specific strong selective sweeps more efficiently than the outlier approach that depends on the empirical distribution. This study, confirming strong positive selection on genes previously reported to be associated with specific phenotypes, also identifies other candidates that are likely to contribute to phenotypic differences between human populations.  相似文献   

6.
The evolutionarily recent geographic expansion of humans, and the even more recent development of large, relatively dense human settlements, has exposed our species to new pathogenic environments. Potentially lethal pathogens are likely to have exerted important selective pressures on our genome, so immunity genes can be expected to show molecular signatures of the adaptation of human populations to these recent conditions. While genes related to the acquired immunity system have indeed been reported to show traces of local adaptation, little is known about the response of the innate immunity system. In this study, we analyze the variability patterns in different human populations of fifteen genes related to innate immunity. We have used both single nucleotide polymorphism and sequence data, and through the analysis of interpopulation differentiation, the linkage disequilibrium pattern, and intrapopulation diversity, we have discovered some signatures of positive and especially balancing selection in these genes, thus confirming the importance of the immune system genetic plasticity in the evolutionary adaptive process. Interestingly, the strongest evidence is found in three TLR genes and CD14. These innate immunity genes play a pivotal role, being involved in the primary recognition of pathogens. In general, more evidences of selection appear in the European populations, in some case possibly related to severe population specific pressures. However, we also describe evidence from African populations, which may reflect parallel or long-term selective forces acting in different geographic areas.  相似文献   

7.
Parasite‐mediated selection may contribute to the maintenance of genetic variation at host immune genes over long time scales. To date, the best evidence for the long‐term maintenance of immunogenetic variation in natural populations comes from studies on the major histocompatibility complex (MHC) genes, whereas evidence for such processes from other immune genes remains scarce. In the present study, we show that, despite pronounced population differentiation and the occurrence of numerous private alleles within populations, the innate immune gene Toll‐like receptor 2 (TLR2) displays a distinct haplotype structure in 21 bank vole (Myodes glareolus) populations across Europe. Haplotypes from all populations grouped in four clearly differentiated clusters, with the three main clusters co‐occurring in at least three previously described mitochondrial lineages. This pattern indicates that the distinct TLR2 haplotype structure may precede the split of the mitochondrial lineages 0.19–0.56 Mya and suggests that haplotype clusters at this innate immune receptor are maintained over prolonged time in wild bank vole populations.  相似文献   

8.
Patterns of selection acting on immune defence genes have recently been the focus of considerable interest. Yet, when it comes to vertebrates, studies have mainly focused on the acquired branch of the immune system. Consequently, the direction and strength of selection acting on genes of the vertebrate innate immune defence remain poorly understood. Here, we present a molecular analysis of selection on an important receptor of the innate immune system of vertebrates, the Toll-like receptor 2 (TLR2), across 17 rodent species. Although purifying selection was the prevalent evolutionary force acting on most parts of the rodent TLR2, we found that codons in close proximity to pathogen-binding and TLR2-TLR1 heterodimerization sites have been subject to positive selection. This indicates that parasite-mediated selection is not restricted to acquired immune system genes like the major histocompatibility complex, but also affects innate defence genes. To obtain a comprehensive understanding of evolutionary processes in host-parasite systems, both innate and acquired immunity thus need to be considered.  相似文献   

9.
A fundamental goal of evolutionary biology is to understand how ecological diversity arises and is maintained in natural populations. We have investigated the contributions of gene flow and divergent selection to the distribution of genetic variation in an ecologically differentiated population of a thermophilic cyanobacterium (Mastigocladus laminosus) found along the temperature gradient of a nitrogen‐limited stream in Yellowstone National Park. For most loci sampled, gene flow appears to be sufficient to prevent substantial genetic divergence. However, one locus (rfbC) exhibited a comparatively low migration rate as well as other signatures expected for a gene experiencing spatially varying selection, including an excess of common variants, an elevated level of polymorphism and extreme genetic differentiation along the gradient. rfbC is part of an expression island involved in the production of the polysaccharide component of the protective envelope of the heterocyst, the specialized nitrogen‐fixing cell of these bacteria. SNP genotyping in the vicinity of rfbC revealed a ~5‐kbp region including a gene content polymorphism that is tightly associated with environmental temperature and therefore likely contains the target of selection. Two genes have been deleted both in the predominant haplotype found in the downstream region of White Creek and in strains from other Yellowstone populations of M. laminosus, which may result in the production of heterocysts with different envelope properties. This study implicates spatially varying selection in the maintenance of variation related to thermal performance at White Creek despite on‐going or recent gene flow.  相似文献   

10.
Lim J  Kim YJ  Yoon Y  Kim SO  Kang H  Park J  Han AR  Han B  Oh B  Kimm K  Yoon B  Song K 《Genomics》2006,87(3):392-398
The extent and pattern of linkage disequilibrium (LD) in the human genome provide important information for disease gene mapping. Previous studies have shown that LDs vary depending on chromosomal regions and populations. As the Asian samples of the International HapMap Project consisted of Japanese and Chinese populations, it was of interest whether we could use the HapMap data as a reference to carry out association studies of common complex diseases in a closely related population, such as Koreans. We have compared the LD and recombination patterns defined by single-nucleotide polymorphisms (SNPs) in ENCODE region ENm010, chromosome 7p15.2, in Korean, Japanese, and Chinese samples and further tested the robustness of tagSNPs among the Asian samples. We genotyped 792 SNPs in 500 kb (chromosome 7: 26699793-27199792, NCBI build 34) from 90 unrelated Koreans by fluorescence polarization detection and compared the data with Asian data from the HapMap project. Despite some differences in the position of high LD region boundaries, the overall patterns of LD were remarkably similar across the three samples, reflecting strong genetic affinities among them. Furthermore, the haplotype tag SNP transferability across the three samples was greater than 90%. Our results support the initial suggestion that the populations genotyped in the HapMap project might serve as reference populations for the selection of tagSNPs in association studies.  相似文献   

11.
Genetic epidemiological studies of complex diseases often rely on data from the International HapMap Consortium for identification of single nucleotide polymorphisms (SNPs), particularly those that tag haplotypes. However, little is known about the relevance of the African populations used to collect HapMap data for study populations conducted elsewhere in Africa. Toll-like receptor (TLR) genes play a key role in susceptibility to various infectious diseases, including tuberculosis. We conducted full-exon sequencing in samples obtained from Uganda (n = 48) and South Africa (n = 48), in four genes in the TLR pathway: TLR2, TLR4, TLR6, and TIRAP. We identified one novel TIRAP SNP (with minor allele frequency [MAF] 3.2%) and a novel TLR6 SNP (MAF 8%) in the Ugandan population, and a TLR6 SNP that is unique to the South African population (MAF 14%). These SNPs were also not present in the 1000 Genomes data. Genotype and haplotype frequencies and linkage disequilibrium patterns in Uganda and South Africa were similar to African populations in the HapMap datasets. Multidimensional scaling analysis of polymorphisms in all four genes suggested broad overlap of all of the examined African populations. Based on these data, we propose that there is enough similarity among African populations represented in the HapMap database to justify initial SNP selection for genetic epidemiological studies in Uganda and South Africa. We also discovered three novel polymorphisms that appear to be population-specific and would only be detected by sequencing efforts.  相似文献   

12.
Knowledge of haplotype phase is valuable for many analysis methods in the study of disease, population, and evolutionary genetics. Considerable research effort has been devoted to the development of statistical and computational methods that infer haplotype phase from genotype data. Although a substantial number of such methods have been developed, they have focused principally on inference from unrelated individuals, and comparisons between methods have been rather limited. Here, we describe the extension of five leading algorithms for phase inference for handling father-mother-child trios. We performed a comprehensive assessment of the methods applied to both trios and to unrelated individuals, with a focus on genomic-scale problems, using both simulated data and data from the HapMap project. The most accurate algorithm was PHASE (v2.1). For this method, the percentages of genotypes whose phase was incorrectly inferred were 0.12%, 0.05%, and 0.16% for trios from simulated data, HapMap Centre d'Etude du Polymorphisme Humain (CEPH) trios, and HapMap Yoruban trios, respectively, and 5.2% and 5.9% for unrelated individuals in simulated data and the HapMap CEPH data, respectively. The other methods considered in this work had comparable but slightly worse error rates. The error rates for trios are similar to the levels of genotyping error and missing data expected. We thus conclude that all the methods considered will provide highly accurate estimates of haplotypes when applied to trio data sets. Running times differ substantially between methods. Although it is one of the slowest methods, PHASE (v2.1) was used to infer haplotypes for the 1 million-SNP HapMap data set. Finally, we evaluated methods of estimating the value of r(2) between a pair of SNPs and concluded that all methods estimated r(2) well when the estimated value was >or=0.8.  相似文献   

13.
Comparing patterns of diversity and divergence between populations at immune genes and neutral markers can give insights into the nature and geographic scale of parasite-mediated selection. To date, studies investigating such patterns of selection in vertebrates have primarily focused on the acquired branch of the immune system, whereas it remains largely unknown how parasite-mediated selection shapes innate immune genes both within and across vertebrate populations. Here, we present a study on the diversity and population differentiation at the innate immune gene Toll-like receptor 2 (TLR2) across nine populations of yellow-necked mice (Apodemus flavicollis) and bank voles (Myodes glareolus) in southern Sweden. In yellow-necked mice, TLR2 diversity was very low, as was TLR2 population differentiation compared to neutral loci. In contrast, several TLR2 haplotypes co-occurred at intermediate frequencies within and across bank vole populations, and pronounced isolation by distance between populations was observed. The diversity and differentiation at neutral loci was similar in the two species. These results indicate that parasite-mediated selection has been acting in dramatically different ways on a given immune gene in ecologically similar and sympatric species. Furthermore, the finding of TLR2 population differentiation at a small geographical scale in bank voles highlights that vertebrate innate immune defense may be evolutionarily more dynamic than has previously been appreciated.  相似文献   

14.
Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.  相似文献   

15.
The International Haplotype Map Project (HapMap) has provided an essential database for studies of human population genetics and genome-wide association. Phases I and II of the HapMap project generated genotype data across ∼3 million SNP loci in 270 individuals representing four populations. Phase III provides dense genotype data on ∼1.5 million SNPs, generated by Illumina and Affymetrix platforms in a larger set of individuals. Release 3 of phase III of the HapMap contains 1397 individuals from 11 populations, including 250 of the original 270 phase I and phase II individuals and 1147 additional individuals. Although some known relationships among the phase III individuals have been described in the data release, the genotype data that are currently available provide an opportunity to empirically ascertain previously unknown relationships. We performed a systematic analysis of genetic relatedness and were able not only to confirm the reported relationships, but also to detect numerous additional, previously unidentified pairs of close relatives in the HapMap sample. The inferred relative pairs make it possible to propose standardized subsets of unrelated individuals for use in future studies in which relatedness needs to be clearly defined.  相似文献   

16.
Loss-of-function variants in innate immunity genes are associated with Mendelian disorders in the form of primary immunodeficiencies. Recent resequencing projects report that stop-gains and frameshifts are collectively prevalent in humans and could be responsible for some of the inter-individual variability in innate immune response. Current computational approaches evaluating loss-of-function in genes carrying these variants rely on gene-level characteristics such as evolutionary conservation and functional redundancy across the genome. However, innate immunity genes represent a particular case because they are more likely to be under positive selection and duplicated. To create a ranking of severity that would be applicable to innate immunity genes we evaluated 17,764 stop-gain and 13,915 frameshift variants from the NHLBI Exome Sequencing Project and 1,000 Genomes Project. Sequence-based features such as loss of functional domains, isoform-specific truncation and nonsense-mediated decay were found to correlate with variant allele frequency and validated with gene expression data. We integrated these features in a Bayesian classification scheme and benchmarked its use in predicting pathogenic variants against Online Mendelian Inheritance in Man (OMIM) disease stop-gains and frameshifts. The classification scheme was applied in the assessment of 335 stop-gains and 236 frameshifts affecting 227 interferon-stimulated genes. The sequence-based score ranks variants in innate immunity genes according to their potential to cause disease, and complements existing gene-based pathogenicity scores. Specifically, the sequence-based score improves measurement of functional gene impairment, discriminates across different variants in a given gene and appears particularly useful for analysis of less conserved genes.  相似文献   

17.
Many disease-susceptible SNPs exhibit significant disparity in ancestral and derived allele frequencies across worldwide populations. While previous studies have examined population differentiation of alleles at specific SNPs, global ethnic patterns of ensembles of disease risk alleles across human diseases are unexamined. To examine these patterns, we manually curated ethnic disease association data from 5,065 papers on human genetic studies representing 1,495 diseases, recording the precise risk alleles and their measured population frequencies and estimated effect sizes. We systematically compared the population frequencies of cross-ethnic risk alleles for each disease across 1,397 individuals from 11 HapMap populations, 1,064 individuals from 53 HGDP populations, and 49 individuals with whole-genome sequences from 10 populations. Type 2 diabetes (T2D) demonstrated extreme directional differentiation of risk allele frequencies across human populations, compared with null distributions of European-frequency matched control genomic alleles and risk alleles for other diseases. Most T2D risk alleles share a consistent pattern of decreasing frequencies along human migration into East Asia. Furthermore, we show that these patterns contribute to disparities in predicted genetic risk across 1,397 HapMap individuals, T2D genetic risk being consistently higher for individuals in the African populations and lower in the Asian populations, irrespective of the ethnicity considered in the initial discovery of risk alleles. We observed a similar pattern in the distribution of T2D Genetic Risk Scores, which are associated with an increased risk of developing diabetes in the Diabetes Prevention Program cohort, for the same individuals. This disparity may be attributable to the promotion of energy storage and usage appropriate to environments and inconsistent energy intake. Our results indicate that the differential frequencies of T2D risk alleles may contribute to the observed disparity in T2D incidence rates across ethnic populations.  相似文献   

18.
One of the many potential uses of the HapMap project is its application to the investigation of complex disease aetiology among a wide range of populations. This study aims to assess the transferability of HapMap SNP data to the Spanish population in the context of cancer research. We have carried out a genotyping study in Spanish subjects involving 175 candidate cancer genes using an indirect gene-based approach and compared results with those for HapMap CEU subjects. Allele frequencies were very consistent between the two samples, with a high positive correlation (R) of 0.91 (P<<1×10−6). Linkage disequilibrium patterns and block structures across each gene were also very similar, with disequilibrium coefficient (r 2) highly correlated (R=0.95, P<<1×10−6). We found that of the 21 genes that contained at least one block larger than 60 kb, nine (ATM, ATR, BRCA1, ERCC6, FANCC, RAD17, RAD50, RAD54B and XRCC4) belonged to the GO category “DNA repair”. Haplotype frequencies per gene were also highly correlated (mean R=0.93), as was haplotype diversity (R=0.91, P<<1×10−6). “Yin yang” haplotypes were observed for 43% of the genes analysed and 18% of those were identical to the ancestral haplotype (identified in Chimpazee). Finally, the portability of tagSNPs identified in the HapMap CEU data using pairwise r 2 thresholds of 0.8 and 0.5 was assessed by applying these to the Spanish and current HapMap data for 66 genes. In general, the HapMap tagSNPs performed very well. Our results show generally high concordance with HapMap data in allele frequencies and haplotype distributions and confirm the applicability of HapMap SNP data to the study of complex diseases among the Spanish population. Electronic Supplementary Material Supplementary material is available for this article at and is accessible for authorized users.  相似文献   

19.
We explored single nucleotide polymorphism (SNP) variation in candidate genes for bud burst from Quercus petraea populations sampled along gradients of latitude and altitude in Western Europe. SNP diversity was monitored for 106 candidate genes, in 758 individuals from 32 natural populations. We investigated whether SNP variation reflected the clinal pattern of bud burst observed in common garden experiments. We used different methods to detect imprints of natural selection (FST outlier, clinal variation at allelic frequencies, association tests) and compared the results obtained for the two gradients. FST outlier SNPs were found in 15 genes, 5 of which were common to both gradients. The type of selection differed between the two gradients (directional or balancing) for 3 of these 5. Clinal variations were observed for six SNPs, and one cline was conserved across both gradients. Association tests between the phenotypic or breeding values of trees and SNP genotypes identified 14 significant associations, involving 12 genes. The results of outlier detection on the basis of population differentiation or clinal variation were not very consistent with the results of association tests. The discrepancies between these approaches may reflect the different hierarchical levels of selection considered (inter- and intrapopulation selection). Finally, we obtained evidence for convergent selection (similar for gradients) and clinal variation for a few genes, suggesting that comparisons between parallel gradients could be used to screen for major candidate genes responding to natural selection in trees.  相似文献   

20.
MYH9 was recently identified as renal susceptibility gene (OR 3–8, p<10−8) for major forms of kidney disease disproportionately affecting individuals of African descent. The risk haplotype (E-1) occurs at much higher frequencies in African Americans (≥60%) than in European Americans (<4%), revealing a genetic basis for a major health disparity. The population distributions of MYH9 risk alleles and the E-1 risk haplotype and the demographic and selective forces acting on the MYH9 region are not well explored. We reconstructed MYH9 haplotypes from 4 tagging single nucleotide polymorphisms (SNPs) spanning introns 12–23 using available data from HapMap Phase II, and by genotyping 938 DNAs from the Human Genome Diversity Panel (HGDP). The E-1 risk haplotype followed a cline, being most frequent within sub-Saharan African populations (range 50–80%), less frequent in populations from the Middle East (9–27%) and Europe (0–9%), and rare or absent in Asia, the Americas, and Oceania. The fixation indexes (FST) for pairwise comparisons between the risk haplotypes for continental populations were calculated for MYH9 haplotypes; FST ranged from 0.27–0.40 for Africa compared to other continental populations, possibly due to selection. Uniquely in Africa, the Yoruba population showed high frequency extended haplotype length around the core risk allele (C) compared to the alternative allele (T) at the same locus (rs4821481, iHs = 2.67), as well as high population differentiation (FST(CEU vs. YRI) = 0.51) in HapMap Phase II data, also observable only in the Yoruba population from HGDP (FST = 0.49), pointing to an instance of recent selection in the genomic region. The population-specific divergence in MYH9 risk allele frequencies among the world''s populations may prove important in risk assessment and public health policies to mitigate the burden of kidney disease in vulnerable populations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号