首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Genome-wide data provide a powerful tool for inferring patterns of genetic variation and structure of human populations.

Principal Findings

In this study, we analysed almost 250,000 SNPs from a total of 945 samples from Eastern and Western Finland, Sweden, Northern Germany and Great Britain complemented with HapMap data. Small but statistically significant differences were observed between the European populations (FST = 0.0040, p<10−4), also between Eastern and Western Finland (FST = 0.0032, p<10−3). The latter indicated the existence of a relatively strong autosomal substructure within the country, similar to that observed earlier with smaller numbers of markers. The Germans and British were less differentiated than the Swedes, Western Finns and especially the Eastern Finns who also showed other signs of genetic drift. This is likely caused by the later founding of the northern populations, together with subsequent founder and bottleneck effects, and a smaller population size. Furthermore, our data suggest a small eastern contribution among the Finns, consistent with the historical and linguistic background of the population.

Significance

Our results warn against a priori assumptions of homogeneity among Finns and other seemingly isolated populations. Thus, in association studies in such populations, additional caution for population structure may be necessary. Our results illustrate that population history is often important for patterns of genetic variation, and that the analysis of hundreds of thousands of SNPs provides high resolution also for population genetics.  相似文献   

2.
Patterns of genetic diversity have previously been shown to mirror geography on a global scale and within continents and individual countries. Using genome-wide SNP data on 5174 Swedes with extensive geographical coverage, we analyzed the genetic structure of the Swedish population. We observed strong differences between the far northern counties and the remaining counties. The population of Dalarna county, in north middle Sweden, which borders southern Norway, also appears to differ markedly from other counties, possibly due to this county having more individuals with remote Finnish or Norwegian ancestry than other counties. An analysis of genetic differentiation (based on pairwise F(st)) indicated that the population of Sweden's southernmost counties are genetically closer to the HapMap CEU samples of Northern European ancestry than to the populations of Sweden's northernmost counties. In a comparison of extended homozygous segments, we detected a clear divide between southern and northern Sweden with small differences between the southern counties and considerably more segments in northern Sweden. Both the increased degree of homozygosity in the north and the large genetic differences between the south and the north may have arisen due to a small population in the north and the vast geographical distances between towns and villages in the north, in contrast to the more densely settled southern parts of Sweden. Our findings have implications for future genome-wide association studies (GWAS) with respect to the matching of cases and controls and the need for within-county matching. We have shown that genetic differences within a single country may be substantial, even when viewed on a European scale. Thus, population stratification needs to be accounted for, even within a country like Sweden, which is often perceived to be relatively homogenous and a favourable resource for genetic mapping, otherwise inferences based on genetic data may lead to false conclusions.  相似文献   

3.
In spite of the common belief of Europe as reasonably homogeneous at genetic level, advances in high-throughput genotyping technology have resolved several gradients which define different geographical areas with good precision. When Northern and Southern European groups were considered separately, there were clear genetic distinctions. Intra-country genetic differences were also evident, especially in Finland and, to a lesser extent, within other European populations. Here, we present the first analysis using the 125,799 genome-wide Single Nucleotide Polymorphisms (SNPs) data of 1,014 Italians with wide geographical coverage. We showed by using Principal Component analysis and model-based individual ancestry analysis, that the current population of Sardinia can be clearly differentiated genetically from mainland Italy and Sicily, and that a certain degree of genetic differentiation is detectable within the current Italian peninsula population. Pair-wise FST statistics Northern and Southern Italy amounts approximately to 0.001 between, and around 0.002 between Northern Italy and Utah residents with Northern and Western European ancestry (CEU). The Italian population also revealed a fine genetic substructure underscoring by the genomic inflation (Sardinia vs. Northern Italy = 3.040 and Northern Italy vs. CEU = 1.427), warning against confounding effects of hidden relatedness and population substructure in association studies.  相似文献   

4.
Extensive linkage disequilibrium in small human populations in Eurasia   总被引:11,自引:0,他引:11       下载免费PDF全文
The extent of linkage disequilibrium (LD) was studied in two small food-gathering populations—Evenki and Saami—and two larger food-producing populations—Finns and Swedes—in northern Eurasia. In total, 50 single-nucleotide polymorphisms (SNPs) from five genes were genotyped using real-time pyrophosphate DNA sequencing, whereas 14 microsatellites were genotyped in two X-chromosomal regions. In addition, hypervariable region I of the mtDNA was sequenced to shed light on the demographic history of the populations. The SNP data, as well as the microsatellite data, reveal extensive levels of LD in Evenki and Saami when compared to Finns and Swedes. mtDNA-sequence variation is compatible with constant population size over time in Evenki and Saami but indicates population expansion in Finns and Swedes. Furthermore, the similarity between Finns and Swedes in SNP allele- and haplotype-frequency distributions indicate that these two populations may share a recent common origin. These findings suggest that populations such as the Evenki and the Saami, rather than the Finns, may be particularly suited for the initial coarse mapping of common complex diseases.  相似文献   

5.
Although a few hundred single nucleotide polymorphisms (SNPs) suffice to infer close familial relationships, high density genome-wide SNP data make possible the inference of more distant relationships such as 2nd to 9th cousinships. In order to characterize the relationship between genetic similarity and degree of kinship given a timeframe of 100–300 years, we analyzed the sharing of DNA inferred to be identical by descent (IBD) in a subset of individuals from the 23andMe customer database (n = 22,757) and from the Human Genome Diversity Panel (HGDP-CEPH, n = 952). With data from 121 populations, we show that the average amount of DNA shared IBD in most ethnolinguistically-defined populations, for example Native American groups, Finns and Ashkenazi Jews, differs from continentally-defined populations by several orders of magnitude. Via extensive pedigree-based simulations, we determined bounds for predicted degrees of relationship given the amount of genomic IBD sharing in both endogamous and ‘unrelated’ population samples. Using these bounds as a guide, we detected tens of thousands of 2nd to 9th degree cousin pairs within a heterogenous set of 5,000 Europeans. The ubiquity of distant relatives, detected via IBD segments, in both ethnolinguistic populations and in large ‘unrelated’ populations samples has important implications for genetic genealogy, forensics and genotype/phenotype mapping studies.  相似文献   

6.
Ancient population structure shaping contemporary genetic variation has been recently appreciated and has important implications regarding our understanding of the structure of modern human genomes. We identified a ∼36-kb DNA segment in the human genome that displays an ancient substructure. The variation at this locus exists primarily as two highly divergent haplogroups. One of these haplogroups (the NE1 haplogroup) aligns with the Neandertal haplotype and contains a 4.6-kb deletion polymorphism in perfect linkage disequilibrium with 12 single nucleotide polymorphisms (SNPs) across diverse populations. The other haplogroup, which does not contain the 4.6-kb deletion, aligns with the chimpanzee haplotype and is likely ancestral. Africans have higher overall pairwise differences with the Neandertal haplotype than Eurasians do for this NE1 locus (p<10−15). Moreover, the nucleotide diversity at this locus is higher in Eurasians than in Africans. These results mimic signatures of recent Neandertal admixture contributing to this locus. However, an in-depth assessment of the variation in this region across multiple populations reveals that African NE1 haplotypes, albeit rare, harbor more sequence variation than NE1 haplotypes found in Europeans, indicating an ancient African origin of this haplogroup and refuting recent Neandertal admixture. Population genetic analyses of the SNPs within each of these haplogroups, along with genome-wide comparisons revealed significant FST (p = 0.00003) and positive Tajima''s D (p = 0.00285) statistics, pointing to non-neutral evolution of this locus. The NE1 locus harbors no protein-coding genes, but contains transcribed sequences as well as sequences with putative regulatory function based on bioinformatic predictions and in vitro experiments. We postulate that the variation observed at this locus predates Human–Neandertal divergence and is evolving under balancing selection, especially among European populations.  相似文献   

7.
Using ∼60,000 SNPs selected for minimal linkage disequilibrium, we perform population structure analysis of 1,374 unrelated Hispanic individuals from the Multi-Ethnic Study of Atherosclerosis (MESA), with self-identification corresponding to Central America (n = 93), Cuba (n = 50), the Dominican Republic (n = 203), Mexico (n = 708), Puerto Rico (n = 192), and South America (n = 111). By projection of principal components (PCs) of ancestry to samples from the HapMap phase III and the Human Genome Diversity Panel (HGDP), we show the first two PCs quantify the Caucasian, African, and Native American origins, while the third and fourth PCs bring out an axis that aligns with known South-to-North geographic location of HGDP Native American samples and further separates MESA Mexican versus Central/South American samples along the same axis. Using k-means clustering computed from the first four PCs, we define four subgroups of the MESA Hispanic cohort that show close agreement with self-identification, labeling the clusters as primarily Dominican/Cuban, Mexican, Central/South American, and Puerto Rican. To demonstrate our recommendations for genetic analysis in the MESA Hispanic cohort, we present pooled and stratified association analysis of triglycerides for selected SNPs in the LPL and TRIB1 gene regions, previously reported in GWAS of triglycerides in Caucasians but as yet unconfirmed in Hispanic populations. We report statistically significant evidence for genetic association in both genes, and we further demonstrate the importance of considering population substructure and genetic heterogeneity in genetic association studies performed in the United States Hispanic population.  相似文献   

8.
Frequencies of three different mutant haemochromatosis (HFE) alleles (282Tyr, 63Asp and 65Cys) were studied in three northern European populations, i.e. Finns, Swedes and Swedish Saamis. In Finns and Swedes the allele frequencies were within the range found in other populations from northern and western Europe. The Saamis differed from the Swedes with respect to all mutant alleles. Lower frequencies compared to Swedes were found for the 282Tyr (p = 0.0046) and 63Asp (p = 0.034) alleles, whereas the frequency of the 65Cys allele was higher (p = 0.046) in the Saamis. The total distribution of HFE alleles in Saamis showed a highly significant difference from that in Swedes (chi2 = 16.7, 3 d.f., p = 0.0008). These results further underline the genetic uniqueness of the Saamis.  相似文献   

9.
10.
Li J  Yang D  He Y  Wang M  Wen Z  Liu L  Yao J  Matsuda K  Nakamura Y  Yu J  Jiang X  Sun S  Liu Q  Jiang X  Song Q  Chen M  Yang H  Tang F  Hu X  Wang J  Chang Y  He X  Chen Y  Lin J 《PloS one》2011,6(8):e24221

Background

Human leukocyte antigen DP (HLA-DP) locus has been reported to be associated with hepatitis B virus (HBV) infection in populations of Japan and Thailand. We aimed to examine whether the association can be replicated in Han Chinese populations.

Methodology/Principal Findings

Two HLA-DP variants rs2395309 and rs9277535 (the most strongly associated SNPs from each HLA-DP locus) were genotyped in three independent Han cohorts consisting of 2 805 cases and 1 796 controls. By using logistic regression analysis, these two SNPs in the HLA-DPA1 and HLA-DPB1 genes were significantly associated with HBV infection in Han Chinese populations (P = 0.021∼3.36×10−8 at rs2395309; P = 8.37×10−3∼2.68×10−10 at rs9277535). In addition, the genotype distributions of both sites (rs2395309 and rs9277535) were clearly different between southern and northern Chinese population (P = 8.95×10−5 at rs2395309; P = 1.64×10−9 at rs9277535). By using asymptomatic HBV carrier as control group, our study showed that there were no associations of two HLA-DP variants with HBV progression (P = 0.305∼0.822 and 0.163∼0.881 in southern Chinese population, respectively; P = 0.097∼0.697 and 0.198∼0.615 in northern Chinese population, respectively).

Conclusions

Our results confirmed that two SNPs (rs2395309 and rs9277535) in the HLA-DP loci were strongly associated with HBV infection in southern and northern Han Chinese populations, but not with HBV progression.  相似文献   

11.
Transferrin C subtypes and ethnic heterogeneity in Sweden   总被引:1,自引:0,他引:1  
Transferrin (TF) C subtypes were studied in Swedish Lapps (Saami) and in Swedes from northern, central and southern Sweden, and the allele frequencies were compared with those in other European populations. The Swedish Lapps were found to have the lowest frequency of the TF*C3 allele (1-2%) so far observed in Europe. Most European populations have TF*C3 allele frequencies between 5 and 7%. Finns differ by having high TF*C3 frequencies (13-14%). The relatively high TF*C3 frequencies found in northeastern Sweden (13%) and in central Sweden (9%) are most likely due to eastern influence. Unlike other genetic markers of eastern influence (e.g. TF*DCHI), which are of Asiatic Mongoloid origin, TF*C3 appears to originate from Finno-Ugric populations.  相似文献   

12.

Background

Postoperative ventricular dysfunction (VnD) occurs in 9–20% of coronary artery bypass graft (CABG) surgical patients and is associated with increased postoperative morbidity and mortality. Understanding genetic causes of postoperative VnD should enhance patient risk stratification and improve treatment and prevention strategies. We aimed to determine if genetic variants associate with occurrence of in-hospital VnD after CABG surgery.

Methods

A genome-wide association study identified single nucleotide polymorphisms (SNPs) associated with postoperative VnD in male subjects of European ancestry undergoing isolated primary CABG surgery with cardiopulmonary bypass. VnD was defined as the need for ≥2 inotropes or mechanical ventricular support after CABG surgery. Validated SNPs were assessed further in two replication CABG cohorts and meta-analysis was performed.

Results

Over 100 SNPs were associated with VnD (P<10−4), with one SNP (rs17691914) encoded at 3p22.3 reaching genome-wide significance (Padditive model = 2.14×10−8). Meta-analysis of validation and replication study data for 17 SNPs identified three SNPs associated with increased risk for developing postoperative VnD after adjusting for clinical risk factors. These SNPs are located at 3p22.3 (rs17691914, ORadditive model = 2.01, P = 0.0002), 3p14.2 (rs17061085, ORadditive model = 1.70, P = 0.0001) and 11q23.2 (rs12279572, ORrecessive model = 2.19, P = 0.001).

Conclusions

No SNPs were consistently associated with strong risk (ORadditive model>2.1) of developing in-hospital VnD after CABG surgery. However, three genetic loci identified by meta-analysis were more modestly associated with development of postoperative VnD. Studies of larger cohorts to assess these loci as well as to define other genetic mechanisms and related biology that link genetic variants to postoperative ventricular dysfunction are warranted.  相似文献   

13.
To date, most genome-wide association studies (GWAS) and studies of fine-scale population structure have been conducted primarily on Europeans. Han Chinese, the largest ethnic group in the world, composing 20% of the entire global human population, is largely underrepresented in such studies. A well-recognized challenge is the fact that population structure can cause spurious associations in GWAS. In this study, we examined population substructures in a diverse set of over 1700 Han Chinese samples collected from 26 regions across China, each genotyped at ∼160K single-nucleotide polymorphisms (SNPs). Our results showed that the Han Chinese population is intricately substructured, with the main observed clusters corresponding roughly to northern Han, central Han, and southern Han. However, simulated case-control studies showed that genetic differentiation among these clusters, although very small (FST = 0.0002 ∼0.0009), is sufficient to lead to an inflated rate of false-positive results even when the sample size is moderate. The top two SNPs with the greatest frequency differences between the northern Han and southern Han clusters (FST > 0.06) were found in the FADS2 gene, which associates with the fatty acid composition in phospholipids, and in the HLA complex P5 gene (HCP5), which associates with HIV infection, psoriasis, and psoriatic arthritis. Ingenuity Pathway Analysis (IPA) showed that most differentiated genes among clusters are involved in cardiac arteriopathy (p < 10−101). These signals indicating significant differences among Han Chinese subpopulations should be carefully explained in case they are also detected in association studies, especially when sample sources are diverse.  相似文献   

14.
Global climate change is expected to trigger northward shifts in the ranges of natural populations of plants and animals, with subsequent effects on intraspecific genetic diversity. Investigating how genetic diversity is patterned among populations that arose following the last Ice Age is a promising method for understanding the potential future effects of climate change. Theoretical and empirical work has suggested that overall genetic diversity can decrease in colonial populations following rapid expansion into postglacial landscapes, with potential negative effects on the ability of populations to adapt to new environmental regimes. The crucial measure of this genetic variation and a population''s overall adaptability is the heritable variation in phenotypic traits, as it is this variation that mediates the rate and direction of a population''s multigenerational response to selection. Using two large full-sib quantitative genetic studies (NManitoba = 144; NSouth Dakota = 653) and a smaller phenotypic analysis from Kansas (NKansas = 44), we compared mean levels of pigmentation, genetic variation and heritability in three pigmentation traits among populations of the common garter snake, Thamnophis sirtalis, along a north-south gradient, including a postglacial northern population and a putative southern refuge population. Counter to our expectations, we found that genetic variance and heritability for the three pigmentation traits were the same or higher in the postglacial population than in the southern population.  相似文献   

15.
European population genetic substructure was examined in a diverse set of >1,000 individuals of European descent, each genotyped with >300 K SNPs. Both STRUCTURE and principal component analyses (PCA) showed the largest division/principal component (PC) differentiated northern from southern European ancestry. A second PC further separated Italian, Spanish, and Greek individuals from those of Ashkenazi Jewish ancestry as well as distinguishing among northern European populations. In separate analyses of northern European participants other substructure relationships were discerned showing a west to east gradient. Application of this substructure information was critical in examining a real dataset in whole genome association (WGA) analyses for rheumatoid arthritis in European Americans to reduce false positive signals. In addition, two sets of European substructure ancestry informative markers (ESAIMs) were identified that provide substantial substructure information. The results provide further insight into European population genetic substructure and show that this information can be used for improving error rates in association testing of candidate genes and in replication studies of WGA scans.  相似文献   

16.
17.
Zhou L  Ding H  Zhang X  He M  Huang S  Xu Y  Shi Y  Cui G  Cheng L  Wang QK  Hu FB  Wang D  Wu T 《PloS one》2011,6(11):e27481

Background

Recent genome-wide association studies (GWAS) have mapped several novel loci influencing blood lipid levels in Caucasians. We sought to explore whether the genetic variants at newly identified lipid-associated loci were associated with CHD susceptibility in a Chinese Han population.

Methodology/Principal Findings

We conducted a two-stage case-control study in a Chinese Han population. The first-stage, consisting of 1,376 CHD cases and 1,376 sex and age- frequency matched controls, examined 5 novel lipid-associated single-nucleotide polymorphisms (SNPs) identified from GWAS among Caucasians in relation to CHD risk in Chinese. We then validated significant SNPs in the second-stage, consisting of 1,269 cases and 2,745 controls. We also tested associations between SNPs within the five novel loci and blood lipid levels in 4,121 controls. We identified two novel SNPs (rs599839 in CELSR2-PSRC1-SORT1 and rs16996148 in NCAN-CILP2) that were significantly associated with reduced CHD risk in Chinese (odds ratios (95% confidence intervals) in the dominant model 0.76 (0.61-0.90; P = 0.001), 0.67 (0.57-0.77; P = 3.4×10−8), respectively). Multiple linear regression analyses using dominant model showed that rs599839 was significantly associated with decreased LDL levels (P = 0.022) and rs16996148 was significantly associated with increased LDL and HDL levels (P = 2.9×10−4 and 0.001, respectively).

Conclusions/Significance

We identified two novel SNPs (rs599839 and rs16996148) at newly identified lipid-associated loci that were significantly associated with CHD susceptibility in a Chinese Han population.  相似文献   

18.
Telomere function is essential to maintaining the physical integrity of linear chromosomes and healthy human aging. The probability of forming proper telomere structures depends on the length of the telomeric DNA tract. We attempted to identify common genetic variants associated with log relative telomere length using genome-wide genotyping data on 3,554 individuals from the Nurses'' Health Study and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial that took part in the National Cancer Institute Cancer Genetic Markers of Susceptibility initiative for breast and prostate cancer. After genotyping 64 independent SNPs selected for replication in additional Nurses'' Health Study and Women''s Genome Health Study participants, we did not identify genome-wide significant loci; however, we replicated the inverse association of log relative telomere length with the minor allele variant [C] of rs16847897 at the TERC locus (per allele β = −0.03, P = 0.003) identified by a previous genome-wide association study. We did not find evidence for an association with variants at the OBFC1 locus or other loci reported to be associated with telomere length. With this sample size we had >80% power to detect β estimates as small as ±0.10 for SNPs with minor allele frequencies of ≥0.15 at genome-wide significance. However, power is greatly reduced for β estimates smaller than ±0.10, such as those for variants at the TERC locus. In general, common genetic variants associated with telomere length homeostasis have been difficult to detect. Potential biological and technical issues are discussed.  相似文献   

19.
African Americans are disproportionately affected by type 2 diabetes (T2DM) yet few studies have examined T2DM using genome-wide association approaches in this ethnicity. The aim of this study was to identify genes associated with T2DM in the African American population. We performed a Genome Wide Association Study (GWAS) using the Affymetrix 6.0 array in 965 African-American cases with T2DM and end-stage renal disease (T2DM-ESRD) and 1029 population-based controls. The most significant SNPs (n = 550 independent loci) were genotyped in a replication cohort and 122 SNPs (n = 98 independent loci) were further tested through genotyping three additional validation cohorts followed by meta-analysis in all five cohorts totaling 3,132 cases and 3,317 controls. Twelve SNPs had evidence of association in the GWAS (P<0.0071), were directionally consistent in the Replication cohort and were associated with T2DM in subjects without nephropathy (P<0.05). Meta-analysis in all cases and controls revealed a single SNP reaching genome-wide significance (P<2.5×10−8). SNP rs7560163 (P = 7.0×10−9, OR (95% CI) = 0.75 (0.67–0.84)) is located intergenically between RND3 and RBM43. Four additional loci (rs7542900, rs4659485, rs2722769 and rs7107217) were associated with T2DM (P<0.05) and reached more nominal levels of significance (P<2.5×10−5) in the overall analysis and may represent novel loci that contribute to T2DM. We have identified novel T2DM-susceptibility variants in the African-American population. Notably, T2DM risk was associated with the major allele and implies an interesting genetic architecture in this population. These results suggest that multiple loci underlie T2DM susceptibility in the African-American population and that these loci are distinct from those identified in other ethnic populations.  相似文献   

20.
Allergic rhinitis (AR) is an atopic disease which affects about 600 million people worldwide and results from a complex interplay between genetic and environmental factors. However genetic association studies on known candidate genes yielded variable results. The aim of this study is to identify the genetic variants that influence predisposition towards allergic rhinitis in an ethnic Chinese population in Singapore using a genome-wide association study (GWAS) approach. A total of 4461 ethnic Chinese volunteers were recruited in Singapore and classified according to their allergic disease status. The GWAS included a discovery stage comparing 515 atopic cases (including 456 AR cases) and 486 non-allergic non-rhinitis (NANR) controls. The top SNPs were then validated in a replication cohort consisting of a separate 2323 atopic cases (including 676 AR cases) and 511 NANR controls. Two SNPs showed consistent association in both discovery and replication phases; MRPL4 SNP rs8111930 on 19q13.2 (OR = 0.69, Pcombined = 4.46×10−05) and BCAP SNP rs505010 on chromosome 10q24.1 (OR = 0.64, Pcombined = 1.10×10−04). In addition, we also replicated multiple associations within known candidates regions such as HLA-DQ and NPSR1 locus in the discovery phase. Our study suggests that MRPL4 and BCAP, key components of the HIF-1α and PI3K/Akt signaling pathways respectively, are two novel candidate genes for atopy and allergic rhinitis. Further study on these molecules and their signaling pathways would help in understanding of the pathogenesis of allergic rhinitis and identification of targets for new therapeutic intervention.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号