首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

The Chinese Hui population, as the second largest minority ethnic group in China, may have a different genetic background from Han people because of its unique demographic history. In this study, we aimed to identify genetic differences between Han and Hui Chinese from the Ningxia region of China by comparing eighteen single nucleotide polymorphisms in cancer-related genes.

Methods

DNA samples were collected from 99 Hui and 145 Han people from the Ningxia Hui Autonomous Region in China, and SNPs were detected using an improved multiplex ligase detection reaction method. Genotyping data from six 1000 Genomes Project population samples (99 Utah residents with northern and western European ancestry (CEU), 107 Toscani in Italy (TSI), 108 Yoruba in Ibadan (YRI), 61 of African ancestry in the southwestern US (ASW), 103 Han Chinese in Beijing (CHB), and 104 Japanese in Tokyo (JPT)) were also included in this study. Differences in the distribution of alleles among the populations were assessed using χ2 tests, and FST was used to measure the degree of population differentiation.

Results

We found that the genetic diversity of many SNPs in cancer-related genes in the Hui Chinese in Ningxia was different from that in the Han Chinese in Ningxia. For example, the allele frequencies of four SNPs (rs13361707, rs2274223, rs465498, and rs753955) showed different genetic distributions (p<0.05) between Chinese Ningxia Han and Chinese Ningxia Hui. Five SNPs (rs730506, rs13361707, rs2274223, rs465498 and rs753955) had different FST values (FST >0.000) between the Hui and Han populations.

Conclusions

These results suggest that some SNPs associated with cancer-related genes vary among different Chinese ethnic groups. We suggest that population differences should be carefully considered in evaluating cancer risk and prognosis as well as the efficacy of cancer therapy.  相似文献   

2.
Li J  Yang D  He Y  Wang M  Wen Z  Liu L  Yao J  Matsuda K  Nakamura Y  Yu J  Jiang X  Sun S  Liu Q  Jiang X  Song Q  Chen M  Yang H  Tang F  Hu X  Wang J  Chang Y  He X  Chen Y  Lin J 《PloS one》2011,6(8):e24221

Background

Human leukocyte antigen DP (HLA-DP) locus has been reported to be associated with hepatitis B virus (HBV) infection in populations of Japan and Thailand. We aimed to examine whether the association can be replicated in Han Chinese populations.

Methodology/Principal Findings

Two HLA-DP variants rs2395309 and rs9277535 (the most strongly associated SNPs from each HLA-DP locus) were genotyped in three independent Han cohorts consisting of 2 805 cases and 1 796 controls. By using logistic regression analysis, these two SNPs in the HLA-DPA1 and HLA-DPB1 genes were significantly associated with HBV infection in Han Chinese populations (P = 0.021∼3.36×10−8 at rs2395309; P = 8.37×10−3∼2.68×10−10 at rs9277535). In addition, the genotype distributions of both sites (rs2395309 and rs9277535) were clearly different between southern and northern Chinese population (P = 8.95×10−5 at rs2395309; P = 1.64×10−9 at rs9277535). By using asymptomatic HBV carrier as control group, our study showed that there were no associations of two HLA-DP variants with HBV progression (P = 0.305∼0.822 and 0.163∼0.881 in southern Chinese population, respectively; P = 0.097∼0.697 and 0.198∼0.615 in northern Chinese population, respectively).

Conclusions

Our results confirmed that two SNPs (rs2395309 and rs9277535) in the HLA-DP loci were strongly associated with HBV infection in southern and northern Han Chinese populations, but not with HBV progression.  相似文献   

3.

Background

Genome-wide data provide a powerful tool for inferring patterns of genetic variation and structure of human populations.

Principal Findings

In this study, we analysed almost 250,000 SNPs from a total of 945 samples from Eastern and Western Finland, Sweden, Northern Germany and Great Britain complemented with HapMap data. Small but statistically significant differences were observed between the European populations (FST = 0.0040, p<10−4), also between Eastern and Western Finland (FST = 0.0032, p<10−3). The latter indicated the existence of a relatively strong autosomal substructure within the country, similar to that observed earlier with smaller numbers of markers. The Germans and British were less differentiated than the Swedes, Western Finns and especially the Eastern Finns who also showed other signs of genetic drift. This is likely caused by the later founding of the northern populations, together with subsequent founder and bottleneck effects, and a smaller population size. Furthermore, our data suggest a small eastern contribution among the Finns, consistent with the historical and linguistic background of the population.

Significance

Our results warn against a priori assumptions of homogeneity among Finns and other seemingly isolated populations. Thus, in association studies in such populations, additional caution for population structure may be necessary. Our results illustrate that population history is often important for patterns of genetic variation, and that the analysis of hundreds of thousands of SNPs provides high resolution also for population genetics.  相似文献   

4.
The Hui people are unique among Chinese ethnic minorities in that they speak the same language as Han Chinese (HAN) but practice Islam. However, as the second-largest minority group in China numbering well over 10 million, the Huis are under-represented in both global and regional genomic studies. Here, we present the first whole-genome sequencing effort of 234 Hui individuals (NXH) aged over 60 who have been living in Ningxia, where the Huis are mostly concentrated. NXH are genetically more similar to East Asian than to any other global populations. In particular, the genetic differentiation between NXH and HAN (FST = 0.0015) is only slightly larger than that between northern and southern HAN (FST = 0.0010), largely attributed to the western ancestry in NXH (∼10%). Highly differentiated functional variants between NXH and HAN were identified in genes associated with skin pigmentation (e.g., SLC24A5), facial morphology (e.g., EDAR), and lipid metabolism (e.g., ABCG8). The Huis are also distinct from other Muslim groups such as the Uyghurs (FST = 0.0187), especially, NXH derived much less western ancestry (∼10%) compared with the Uyghurs (∼50%). Modeling admixture history indicated that NXH experienced an episode of two-wave admixture. An ancient admixture occurred ∼1,025 years ago, reflecting the intensive west–east contacts during the late Tang Dynasty, and the Five Dynasties and Ten Kingdoms period. A recent admixture occurred ∼500 years ago, corresponding to the Ming Dynasty. Notably, we identified considerable sex-biased admixture, that is, excess of western males and eastern females contributing to the NXH gene pool. The origins and the genomic diversity of the Hui people imply the complex history of contacts between western and eastern Eurasians.  相似文献   

5.
Zhou L  Ding H  Zhang X  He M  Huang S  Xu Y  Shi Y  Cui G  Cheng L  Wang QK  Hu FB  Wang D  Wu T 《PloS one》2011,6(11):e27481

Background

Recent genome-wide association studies (GWAS) have mapped several novel loci influencing blood lipid levels in Caucasians. We sought to explore whether the genetic variants at newly identified lipid-associated loci were associated with CHD susceptibility in a Chinese Han population.

Methodology/Principal Findings

We conducted a two-stage case-control study in a Chinese Han population. The first-stage, consisting of 1,376 CHD cases and 1,376 sex and age- frequency matched controls, examined 5 novel lipid-associated single-nucleotide polymorphisms (SNPs) identified from GWAS among Caucasians in relation to CHD risk in Chinese. We then validated significant SNPs in the second-stage, consisting of 1,269 cases and 2,745 controls. We also tested associations between SNPs within the five novel loci and blood lipid levels in 4,121 controls. We identified two novel SNPs (rs599839 in CELSR2-PSRC1-SORT1 and rs16996148 in NCAN-CILP2) that were significantly associated with reduced CHD risk in Chinese (odds ratios (95% confidence intervals) in the dominant model 0.76 (0.61-0.90; P = 0.001), 0.67 (0.57-0.77; P = 3.4×10−8), respectively). Multiple linear regression analyses using dominant model showed that rs599839 was significantly associated with decreased LDL levels (P = 0.022) and rs16996148 was significantly associated with increased LDL and HDL levels (P = 2.9×10−4 and 0.001, respectively).

Conclusions/Significance

We identified two novel SNPs (rs599839 and rs16996148) at newly identified lipid-associated loci that were significantly associated with CHD susceptibility in a Chinese Han population.  相似文献   

6.
Population stratification is a potential problem for genome-wide association studies (GWAS), confounding results and causing spurious associations. Hence, understanding how allele frequencies vary across geographic regions or among subpopulations is an important prelude to analyzing GWAS data. Using over 350,000 genome-wide autosomal SNPs in over 6000 Han Chinese samples from ten provinces of China, our study revealed a one-dimensional “north-south” population structure and a close correlation between geography and the genetic structure of the Han Chinese. The north-south population structure is consistent with the historical migration pattern of the Han Chinese population. Metropolitan cities in China were, however, more diffused “outliers,” probably because of the impact of modern migration of peoples. At a very local scale within the Guangdong province, we observed evidence of population structure among dialect groups, probably on account of endogamy within these dialects. Via simulation, we show that empirical levels of population structure observed across modern China can cause spurious associations in GWAS if not properly handled. In the Han Chinese, geographic matching is a good proxy for genetic matching, particularly in validation and candidate-gene studies in which population stratification cannot be directly accessed and accounted for because of the lack of genome-wide data, with the exception of the metropolitan cities, where geographical location is no longer a good indicator of ancestral origin. Our findings are important for designing GWAS in the Chinese population, an activity that is expected to intensify greatly in the near future.  相似文献   

7.
Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026). The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology.  相似文献   

8.
Population genetic studies provide insights into the evolutionary processes that influence the distribution of sequence variants within and among wild populations. FST is among the most widely used measures for genetic differentiation and plays a central role in ecological and evolutionary genetic studies. It is commonly thought that large sample sizes are required in order to precisely infer FST and that small sample sizes lead to overestimation of genetic differentiation. Until recently, studies in ecological model organisms incorporated a limited number of genetic markers, but since the emergence of next generation sequencing, the panel size of genetic markers available even in non-reference organisms has rapidly increased. In this study we examine whether a large number of genetic markers can substitute for small sample sizes when estimating FST. We tested the behavior of three different estimators that infer FST and that are commonly used in population genetic studies. By simulating populations, we assessed the effects of sample size and the number of markers on the various estimates of genetic differentiation. Furthermore, we tested the effect of ascertainment bias on these estimates. We show that the population sample size can be significantly reduced (as small as n = 4–6) when using an appropriate estimator and a large number of bi-allelic genetic markers (k>1,000). Therefore, conservation genetic studies can now obtain almost the same statistical power as studies performed on model organisms using markers developed with next-generation sequencing.  相似文献   

9.
Aphis gossypii Glover (Hemiptera: Aphididae) is a serious pest of cotton in northern China. A microsatellite analysis was used to characterize the genetic structure of A. gossypii populations from different geographic, host plant, and seasonal populations in 2014. Among 906 individuals, 507 multilocus genotypes were identified, with genotypic richness values of 0.07–1.00 for the populations. We observed moderate levels of genetic differentiation among geographic populations (FST = 0.103; 95% confidence interval: 0.065–0.145) and host plant populations (FST = 0.237; 95% confidence interval: 0.187–0.296). A Mantel test of isolation by distance revealed no significant correlations between Slatkin’s linearized FST and the natural logarithm of geographic distance. A Bayesian analysis of population genetic structures identified three clusters. An analysis of molecular variance revealed significant differences among the three clusters (F = 0.26596, P < 0.0001), among seasons (F = 0.04244, P = 0.00381), and among host populations (F = 0.12975, P = 0.0029). Thus, the A. gossypii populations in northern China exhibit considerable genotypic diversity. Additionally, our findings indicated that the 31 analyzed populations could be classified as one of three host biotypes (i.e., cotton, cucumber, and pomegranate biotypes). There were also clear seasonal effects on population genetic structure diversity among aphids collected from Anyang.  相似文献   

10.
Forest tree species of temperate and boreal regions have undergone a long history of demographic changes and evolutionary adaptations. The main objective of this study was to detect signals of selection in Norway spruce (Picea abies [L.] Karst), at different sampling-scales and to investigate, accounting for population structure, the effect of environment on species genetic diversity. A total of 384 single nucleotide polymorphisms (SNPs) representing 290 genes were genotyped at two geographic scales: across 12 populations distributed along two altitudinal-transects in the Alps (micro-geographic scale), and across 27 populations belonging to the range of Norway spruce in central and south-east Europe (macro-geographic scale). At the macrogeographic scale, principal component analysis combined with Bayesian clustering revealed three major clusters, corresponding to the main areas of southern spruce occurrence, i.e. the Alps, Carpathians, and Hercynia. The populations along the altitudinal transects were not differentiated. To assess the role of selection in structuring genetic variation, we applied a Bayesian and coalescent-based F ST-outlier method and tested for correlations between allele frequencies and climatic variables using regression analyses. At the macro-geographic scale, the F ST-outlier methods detected together 11 F ST-outliers. Six outliers were detected when the same analyses were carried out taking into account the genetic structure. Regression analyses with population structure correction resulted in the identification of two (micro-geographic scale) and 38 SNPs (macro-geographic scale) significantly correlated with temperature and/or precipitation. Six of these loci overlapped with F ST-outliers, among them two loci encoding an enzyme involved in riboflavin biosynthesis and a sucrose synthase. The results of this study indicate a strong relationship between genetic and environmental variation at both geographic scales. It also suggests that an integrative approach combining different outlier detection methods and population sampling at different geographic scales is useful to identify loci potentially involved in adaptation.  相似文献   

11.
Global climate change and increases in sea levels will affect coastal marine communities. The conservation of these ecologically important areas will be a challenge because of their wide geographic distribution, ecological diversity and species richness. To address this problem, we need to better understand how the genetic variation of the species in these communities is distributed within local populations, among populations and between distant regions. In this study we apply genotyping by sequencing (GBS) and examine 955 SNPs to determine Sailfin molly (Poecilia latipinna) genetic diversity among three geographically close mangrove salt marsh flats in the Florida Keys compared to populations in southern and northern Florida. The questions we are asking are whether there is sufficient genetic variation among isolated estuarine fish within populations and whether there are significant divergences among populations. Additionally, we want to know if GBS approaches agree with previous studies using more traditional molecular approaches. We are able to identify large genetic diversity within each saltmarsh community (π ≈ 36%). Additionally, among the Florida Key populations and the mainland or between southern and northern Florida regions, there are significant differences in allele frequencies seen in population structure and evolutionary relationships among individuals. Surprisingly, even though the cumulative FST value using all 955 SNPs within the three Florida Key populations is small, there are 29 loci with significant FST values, and 11 of these were outliers suggestive of adaptive divergence. These data suggest that among the salt marsh flats surveyed here, there is significant genetic diversity within each population and small but significant differences among populations. Much of the genetic variation within and among populations found here with GBS is very similar to previous studies using allozymes and microsatellites. However, the meaningful difference between GBS and these previous measures of genetic diversity is the number of loci examined, which allows more precise delineations of population structure as well as facilitates identifying loci with excessive FST values that could indicate adaptive divergence.  相似文献   

12.
Yuan JH  Cheng FY  Zhou SL 《PloS one》2012,7(4):e34955

Background

Tree peonies are great ornamental plants associated with a rich ethnobotanical history in Chinese culture and have recently been used as an evolutionary model. The Qinling Mountains represent a significant geographic barrier in Asia, dividing mainland China into northern (temperate) and southern (semi–tropical) regions; however, their flora has not been well analyzed. In this study, the genetic differentiation and genetic structure of Paeonia rockii and the role of the Qinling Mountains as a barrier that has driven intraspecific fragmentation were evaluated using 14 microsatellite markers.

Methodology/Principal Findings

Twenty wild populations were sampled from the distributional range of P. rockii. Significant population differentiation was suggested (FST value of 0.302). Moderate genetic diversity at the population level (HS of 0.516) and high population diversity at the species level (HT of 0.749) were detected. Significant excess homozygosity (FIS of 0.076) and recent population bottlenecks were detected in three populations. Bayesian clusters, population genetic trees and principal coordinate analysis all classified the P. rockii populations into three genetic groups and one admixed Wenxian population. An isolation-by-distance model for P. rockii was suggested by Mantel tests (r = 0.6074, P<0.001) and supported by AMOVA (P<0.001), revealing a significant molecular variance among the groups (11.32%) and their populations (21.22%). These data support the five geographic boundaries surrounding the Qinling Mountains and adjacent areas that were detected with Monmonier''s maximum-difference algorithm.

Conclusions/Significance

Our data suggest that the current genetic structure of P. rockii has resulted from the fragmentation of a formerly continuously distributed large population following the restriction of gene flow between populations of this species by the Qinling Mountains. This study provides a fundamental genetic profile for the conservation and responsible exploitation of the extant germplasm of this species and for improving the genetic basis for breeding its cultivars.  相似文献   

13.
Shi Y  Qu J  Zhang D  Zhao P  Zhang Q  Tam PO  Sun L  Zuo X  Zhou X  Xiao X  Hu J  Li Y  Cai L  Liu X  Lu F  Liao S  Chen B  He F  Gong B  Lin H  Ma S  Cheng J  Zhang J  Chen Y  Zhao F  Yang X  Chen Y  Yang C  Lam DS  Li X  Shi F  Wu Z  Lin Y  Yang J  Li S  Ren Y  Xue A  Fan Y  Li D  Pang CP  Zhang X  Yang Z 《American journal of human genetics》2011,(6):438-813
High myopia, which is extremely prevalent in the Chinese population, is one of the leading causes of blindness in the world. Genetic factors play a critical role in the development of the condition. To identify the genetic variants associated with high myopia in the Han Chinese, we conducted a genome-wide association study (GWAS) of 493,947 SNPs in 1088 individuals (419 cases and 669 controls) from a Han Chinese cohort and followed up on signals that were associated with p < 1.0 × 10−4 in three independent cohorts (combined, 2803 cases and 5642 controls). We identified a significant association between high myopia and a variant at 13q12.12 (rs9318086, combined p = 1.91 × 10−16, heterozygous odds ratio = 1.32, and homozygous odds ratio = 1.64). Furthermore, five additional SNPs (rs9510902, rs3794338, rs1886970, rs7325450, and rs7331047) in the same linkage disequilibrium (LD) block with rs9318086 also proved to be significantly associated with high myopia in the Han Chinese population; p values ranged from 5.46 × 10−11 to 6.16 × 10−16. This associated locus contains three genes—MIPEP, C1QTNF9B-AS1, and C1QTNF9B. MIPEP and C1QTNF9B were found to be expressed in the retina and retinal pigment epithelium (RPE) and are more likely than C1QTNF9B-AS1 to be associated with high myopia given the evidence of retinal signaling that controls eye growth. Our results suggest that the variants at 13q12.12 are associated with high myopia.  相似文献   

14.
Ma Y  Yang M  Fan Y  Wu J  Ma Y  Xu J 《PloS one》2011,6(7):e22219

Background

Anopheles sinensis is a competent malaria vector in China. An understanding of vector population structure is important to the vector-based malaria control programs. However, there is no adequate data of A. sinensis population genetics available yet.

Methodology/Principal Findings

This study used 5 microsatellite loci to estimate population genetic diversity, genetic differentiation and demographic history of A. sinensis from 14 representative localities in China. All 5 microsatellite loci were highly polymorphic across populations, with high allelic richness and heterozygosity. Hardy–Weinberg disequilibrium was found in 12 populations associated with heterozygote deficits, which was likely caused by the presence of null allele and the Wahlund effect. Bayesian clustering analysis revealed two gene pools, grouping samples into two population clusters; one includes six and the other includes eight populations. Out of 14 samples, six samples were mixed with individuals from both gene pools, indicating the coexistence of two genetic units in the areas sampled. The overall differentiation between two genetic pools was moderate (F ST = 0.156). Pairwise differentiation between populations were lower within clusters (F ST = 0.008–0.028 in cluster I and F ST = 0.004–0.048 in cluster II) than between clusters (F ST = 0.120–0.201). A reduced gene flow (Nm = 1–1.7) was detected between clusters. No evidence of isolation by distance was detected among populations neither within nor between the two clusters. There are differences in effective population size (Ne = 14.3-infinite) across sampled populations.

Conclusions/Significance

Two genetic pools with moderate genetic differentiation were identified in the A. sinensis populations in China. The population divergence was not correlated with geographic distance or barrier in the range. Variable effective population size and other demographic effects of historical population perturbations could be the factors affecting the population differentiation. The structured populations may limit the migration of genes under pressures/selections, such as insecticides and immune genes against malaria.  相似文献   

15.
Kawasaki disease (KD) is an acute systemic vasculitis syndrome that primarily affects infants and young children. Its etiology is unknown; however, epidemiological findings suggest that genetic predisposition underlies disease susceptibility. Taiwan has the third-highest incidence of KD in the world, after Japan and Korea. To investigate novel mechanisms that might predispose individuals to KD, we conducted a genome-wide association study (GWAS) in 250 KD patients and 446 controls in a Han Chinese population residing in Taiwan, and further validated our findings in an independent Han Chinese cohort of 208 cases and 366 controls. The most strongly associated single-nucleotide polymorphisms (SNPs) detected in the joint analysis corresponded to three novel loci. Among these KD-associated SNPs three were close to the COPB2 (coatomer protein complex beta-2 subunit) gene: rs1873668 (p = 9.52×10−5), rs4243399 (p = 9.93×10−5), and rs16849083 (p = 9.93×10−5). We also identified a SNP in the intronic region of the ERAP1 (endoplasmic reticulum amino peptidase 1) gene (rs149481, pbest = 4.61×10−5). Six SNPs (rs17113284, rs8005468, rs10129255, rs2007467, rs10150241, and rs12590667) clustered in an area containing immunoglobulin heavy chain variable regions genes, with pbest-values between 2.08×10−5 and 8.93×10−6, were also identified. This is the first KD GWAS performed in a Han Chinese population. The novel KD candidates we identified have been implicated in T cell receptor signaling, regulation of proinflammatory cytokines, as well as antibody-mediated immune responses. These findings may lead to a better understanding of the underlying molecular pathogenesis of KD.  相似文献   

16.
FST is frequently used as a summary of genetic differentiation among groups. It has been suggested that FST depends on the allele frequencies at a locus, as it exhibits a variety of peculiar properties related to genetic diversity: higher values for biallelic single-nucleotide polymorphisms (SNPs) than for multiallelic microsatellites, low values among high-diversity populations viewed as substantially distinct, and low values for populations that differ primarily in their profiles of rare alleles. A full mathematical understanding of the dependence of FST on allele frequencies, however, has been elusive. Here, we examine the relationship between FST and the frequency of the most frequent allele, demonstrating that the range of values that FST can take is restricted considerably by the allele-frequency distribution. For a two-population model, we derive strict bounds on FST as a function of the frequency M of the allele with highest mean frequency between the pair of populations. Using these bounds, we show that for a value of M chosen uniformly between 0 and 1 at a multiallelic locus whose number of alleles is left unspecified, the mean maximum FST is ∼0.3585. Further, FST is restricted to values much less than 1 when M is low or high, and the contribution to the maximum FST made by the most frequent allele is on average ∼0.4485. Using bounds on homozygosity that we have previously derived as functions of M, we describe strict bounds on FST in terms of the homozygosity of the total population, finding that the mean maximum FST given this homozygosity is 1 − ln 2 ≈ 0.3069. Our results provide a conceptual basis for understanding the dependence of FST on allele frequencies and genetic diversity and for interpreting the roles of these quantities in computations of FST from population-genetic data. Further, our analysis suggests that many unusual observations of FST, including the relatively low FST values in high-diversity human populations from Africa and the relatively low estimates of FST for microsatellites compared to SNPs, can be understood not as biological phenomena associated with different groups of populations or classes of markers but rather as consequences of the intrinsic mathematical dependence of FST on the properties of allele-frequency distributions.DIFFERENTIATION among groups is one of the fundamental subjects of the field of population genetics. Comparisons of the level of variation among subpopulations with the level of variation in the total population have been employed frequently in population-genetic theory, in statistical methods for data analysis, and in empirical studies of distributions of genetic variation. Wright’s (Wright 1951) fixation indices, and FST in particular, have been central to this effort.Wright’s FST was originally defined as the correlation between two randomly sampled gametes from the same subpopulation when the correlation of two randomly sampled gametes from the total population is set to zero. Several definitions of FST or FST-like quantities are now available, relying on a variety of different conceptual formulations but all measuring some aspect of population differentiation (e.g., Charlesworth 1998; Holsinger and Weir 2009). Many authors have claimed that one or another formulation of FST is affected by levels of genetic diversity or by allele frequencies, either because the range of FST is restricted by these quantities or because these quantities affect the degree to which FST reflects population differentiation (e.g., Charlesworth 1998; Nagylaki 1998; Hedrick 1999, 2005; Long and Kittles 2003; Jost 2008; Ryman and Leimar 2008; Long 2009; Meirmans and Hedrick 2011). For example, Nagylaki (1998) and Hedrick (1999) argued that measures of FST may be poor measures of genetic differentiation when the level of diversity is high. Charlesworth (1998) suggested that FST can be inflated when diversity is low, arguing that FST might not be appropriate for comparing loci with substantially different levels of variation. In a provocative recent article, Jost (2008) used the diversity dependence of forms of FST to question their utility as differentiation measures at all.One definition that is convenient for mathematical assessment of the relationship of an FST-like quantity and allele frequencies is the quantity labeled GST by Nei (1973), which for a given locus measures the difference between the heterozygosity of the total (pooled) population, hT, and the mean heterozygosity across subpopulations, hS, divided by the heterozygosity of the total population:GST=hThShT.(1)In terms of the homozygosity of the total population, HT = 1 − hT, and the mean homozygosity across subpopulations, HS = 1 − hS, we can writeGST=HSHT1HT.(2)The Wahlund (1928) principle guarantees that HSHT and, therefore, because HS ≤ 1 and for a polymorphic locus with finitely many alleles, 0 < HT < 1, GST lies in the interval [0,1].Using GST for their definition of FST, Hedrick (1999, 2005) and Long and Kittles (2003) pointed out that because hT < 1, FST cannot exceed the mean homozygosity across subpopulations, HS:FST = 1 ? hS/hT < 1 ? hSHS.(3)Hedrick (2005) obtained this result by considering a set of K equal-sized subpopulations, in which each allele is private to a single subpopulation. In the limit as K → ∞, a stronger upper bound on FST as a function of HS and K reduces to Equation 3 (see also Jin and Chakraborty 1995 and Long and Kittles 2003).While Hedrick (1999, 2005) and Long and Kittles (2003) have clarified the relationship between FST and the mean homozygosity HS across subpopulations, their approaches do not easily illuminate the connection between FST and allele frequencies themselves. A formal understanding of the relationship between FST and allele frequencies would make it possible to more fully understand the behavior of FST in situations where markers of interest differ substantially in allele frequencies or levels of genetic diversity. Our recent work on the relationship between homozygosity and the frequency of the most frequent allele (Rosenberg and Jakobsson 2008; Reddy and Rosenberg 2012) provides a mathematical approach for formal investigation of bounds on population-genetic statistics in terms of allele frequencies. In this article, we therefore seek to thoroughly examine the dependence of FST on allele frequencies by investigating the upper bound on FST in terms of the frequency M of the most frequent allele across a pair of populations. We derive bounds on FST given the frequency of the most frequent allele and bounds on the frequency of the most frequent allele given FST. We consider loci with arbitrarily many alleles in a pair of subpopulations. Using theory for the bounds on homozygosity given the frequency of the most frequent allele, we obtain strict bounds on FST given the homozygosity of the total population. Our analysis clarifies the relationships among FST, allele frequencies, and homozygosity, providing explanations for peculiar observations of FST that can be attributed to allele-frequency dependence.  相似文献   

17.
Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s F ST that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-F ST may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchically F ST analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even a lesser genetic variation (1%) is found between intra-continental populations. Second, the global F ST distribution closely follows an exponential distribution. Third, although the overall F ST distribution is similarly shaped (inverse J), F ST distributions varies markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-F ST of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F ST distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection.  相似文献   

18.
Genome-wide SNP data provide a powerful tool to estimate pairwise relatedness among individuals and individual inbreeding coefficient. The aim of this study was to compare methods for estimating the two parameters in a Finnsheep population based on genome-wide SNPs and genealogies, separately. This study included ninety-nine Finnsheep in Finland that differed in coat colours (white, black, brown, grey, and black/white spotted) and were from a large pedigree comprising 319 119 animals. All the individuals were genotyped with the Illumina Ovine SNP50K BeadChip by the International Sheep Genomics Consortium. We identified three genetic subpopulations that corresponded approximately with the coat colours (grey, white, and black and brown) of the sheep. We detected a significant subdivision among the colour types (F ST = 5.4%, P<0.05). We applied robust algorithms for the genomic estimation of individual inbreeding (F SNP) and pairwise relatedness (Φ SNP) as implemented in the programs KING and PLINK, respectively. Estimates of the two parameters from pedigrees (F PED and Φ PED) were computed using the RelaX2 program. Values of the two parameters estimated from genomic and genealogical data were mostly consistent, in particular for the highly inbred animals (e.g. inbreeding coefficient F>0.0625) and pairs of closely related animals (e.g. the full- or half-sibs). Nevertheless, we also detected differences in the two parameters between the approaches, particularly with respect to the grey Finnsheep. This could be due to the smaller sample size and relative incompleteness of the pedigree for them.We conclude that the genome-wide genomic data will provide useful information on a per sample or pairwise-samples basis in cases of complex genealogies or in the absence of genealogical data.  相似文献   

19.

Background

For Chagas disease, the most serious infectious disease in the Americas, effective disease control depends on elimination of vectors through spraying with insecticides. Molecular genetic research can help vector control programs by identifying and characterizing vector populations and then developing effective intervention strategies.

Methods and Findings

The population genetic structure of Triatoma infestans (Hemiptera: Reduviidae), the main vector of Chagas disease in Bolivia, was investigated using a hierarchical sampling strategy. A total of 230 adults and nymphs from 23 localities throughout the department of Chuquisaca in Southern Bolivia were analyzed at ten microsatellite loci. Population structure, estimated using analysis of molecular variance (AMOVA) to estimate FST (infinite alleles model) and RST (stepwise mutation model), was significant between western and eastern regions within Chuquisaca and between insects collected in domestic and peri-domestic habitats. Genetic differentiation at three different hierarchical geographic levels was significant, even in the case of adjacent households within a single locality (R ST = 0.14, F ST = 0.07). On the largest geographic scale, among five communities up to 100 km apart, R ST = 0.12 and F ST = 0.06. Cluster analysis combined with assignment tests identified five clusters within the five communities.

Conclusions

Some houses are colonized by insects from several genetic clusters after spraying, whereas other households are colonized predominately by insects from a single cluster. Significant population structure, measured by both R ST and F ST, supports the hypothesis of poor dispersal ability and/or reduced migration of T. infestans. The high degree of genetic structure at small geographic scales, inferences from cluster analysis and assignment tests, and demographic data suggest reinfesting vectors are coming from nearby and from recrudescence (hatching of eggs that were laid before insecticide spraying). Suggestions for using these results in vector control strategies are made.  相似文献   

20.
The Antarctic blue whale (Balaenoptera musculus intermedia) was hunted to near extinction between 1904 and 1972, declining from an estimated initial abundance of more than 250,000 to fewer than 400. Here, we describe mtDNA control region diversity and geographic differentiation in the surviving population of the Antarctic blue whale, using 218 biopsy samples collected under the auspices of the International Whaling Commission (IWC) during research cruises from 1990–2009. Microsatellite genotypes and mtDNA sequences identified 166 individuals among the 218 samples and documented movement of a small number of individuals, including a female that traveled at least 6,650 km or 131° longitude over four years. mtDNA sequences from the 166 individuals were aligned with published sequences from 17 additional individuals, resolving 52 unique haplotypes from a consensus length of 410 bp. From this minimum census, a rarefaction analysis predicted that only 72 haplotypes (95% CL, 64, 86) have survived in the contemporary population of Antarctic blue whales. However, haplotype diversity was relatively high (0.968±0.004), perhaps as a result of the longevity of blue whales and the relatively recent timing of the bottleneck. Despite the potential for circumpolar dispersal, we found significant differentiation in mtDNA diversity (FST = 0.032, p<0.005) and microsatellite alleles (FST = 0.005, p<0.05) among the six Antarctic Areas historically used by the IWC for management of blue whales.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号