首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Single-nucleotide polymorphisms (SNPs) may be extremely important for deciphering the impact of genetic variation on complex human diseases. The ultimate value of SNPs for linkage and association mapping studies depends in part on the distribution of SNP allele frequencies and intermarker linkage disequilibrium (LD) across populations. Limited information is available about these distributions on a genomewide scale, particularly for LD. Using 114 SNPs from 33 genes, we compared these distributions in five American populations (727 individuals) of African, European, Chinese, Hispanic, and Japanese descent. The allele frequencies were highly correlated across populations but differed by >20% for at least one pair of populations in 35% of SNPs. The correlation in LD was high for some pairs of populations but not for others (e.g., Chinese American or Japanese American vs. any other population). Regardless of population, average minor-allele frequencies were significantly higher for SNPs in noncoding regions (20%-25%) than for SNPs in coding regions (12%-16%). Interestingly, we found that intermarker LD may be strongest with pairs of SNPs in which both markers are nonconservative substitutions, compared to pairs of SNPs where at least one marker is a conservative substitution. These results suggest that population differences and marker location within the gene may be important factors in the selection of SNPs for use in the study of complex disease with linkage or association mapping methods.  相似文献   

2.
Recent genome-wide association studies (GWAS) have identified multiple novel loci associated with obesity in Europeans but results in other ethnicities are less convincing. Here, we report a two-stage GWAS of BMI in African Americans. The GWAS was performed using the Affymetrix 6.0 platform in 816 nondiabetic and 899 diabetic nephropathy subjects. 746,626 single-nucleotide polymorphisms (SNPs) were tested for association with BMI after adjustment for age, gender, disease status, and population structure. Sixty high scoring SNPs that showed nominal association in both GWAS cohorts were further replicated in 3,274 additional subjects in four replication cohorts and a meta-analysis was computed. Meta-analysis of 4,989 subjects revealed five SNPs (rs6794092, rs268972, rs2033195, rs815611, and rs6088887) at four loci showing consistent associations in both GWAS (P < 0.0001) and replication cohorts (P < 0.05) with combined P values range from 2.4 × 10(-6) to 5 × 10(-5). These loci are located near PP13439-TMEM212, CDH12, MFAP3-GALNT10, and FER1L4 and had effect sizes between 0.091 and 0.167 s.d. unit (or 0.67-1.24 kg/m(2)) of BMI for each copy of the effect allele. Our findings suggest the presence of novel loci potentially associated with adiposity in African Americans. Further replication and meta-analysis in African Americans and other populations will shed light on the role of these loci in different ethnic populations.  相似文献   

3.
Kuo CL  Zaykin DV 《Genetics》2011,189(1):329-340
In recent years, genome-wide association studies (GWAS) have uncovered a large number of susceptibility variants. Nevertheless, GWAS findings provide only tentative evidence of association, and replication studies are required to establish their validity. Due to this uncertainty, researchers often focus on top-ranking SNPs, instead of considering strict significance thresholds to guide replication efforts. The number of SNPs for replication is often determined ad hoc. We show how the rank-based approach can be used for sample size allocation in GWAS as well as for deciding on a number of SNPs for replication. The basis of this approach is the "ranking probability": chances that at least j true associations will rank among top u SNPs, when SNPs are sorted by P-value. By employing simple but accurate approximations for ranking probabilities, we accommodate linkage disequilibrium (LD) and evaluate consequences of ignoring LD. Further, we relate ranking probabilities to the proportion of false discoveries among top u SNPs. A study-specific proportion can be estimated from P-values, and its expected value can be predicted for study design applications.  相似文献   

4.
The objective of this work was to integrate findings from functional genomics studies with genome-wide association studies for fertility and production traits in dairy cattle. Association analyses of production and fertility traits with SNPs located within or close to 170 candidate genes derived from two gene expression studies and from the literature were performed. Data from 2294 Holstein bulls genotyped for 39557 SNPs were used. A total of 111 SNPs were located on chromosomal segments covered by a candidate gene. Allele substitution effects for each SNP were estimated using a mixed model with a fixed effect of marker and a random polygenic effect. Assumed covariance was derived either from marker or from pedigree information. Results from the analysis with the kinship matrix built from marker genotypes were more conservative than from the analysis with the pedigree-derived relationship matrix. From sixteen SNPs with significant effects on both classes of traits, ten provided evidence of an antagonistic relationship between productivity and fertility. However, we found four SNPs with favourable effects on fertility and on yield traits, one SNP with favourable effects on fertility and percentage traits, and one SNP with antagonistic effects on two fertility traits. While most quantitative genetic studies have proven genetic antagonisms between yield and functional traits, improvements in both production and functionality may be possible when focusing on a few relevant SNPs. Investigations combining input from quantitative genetics and functional genomics with association analysis may be applied for the identification of such SNPs.  相似文献   

5.
Ma H  Li H  Jin G  Dai J  Dong J  Qin Z  Chen J  Wang S  Wang X  Hu Z  Shen H 《DNA and cell biology》2012,31(6):1114-1120
A single nucleotide polymorphism (SNP) rs999737 at 14q24.1 was identified as a susceptibility marker of breast cancer in a genome-wide association study of the European population, which was also confirmed by some of the following studies in populations of European descent. However, rs999737 is very rare or nonpolymorphic in non-Europeans including Chinese, and the role of other genetic variants at 14q24.1 has not been evaluated in populations of non-European descent. In this study, we first selected 21 common tagging SNPs (minor allele frequency [MAF] >0.05 in the Chinese population) by searching the Hapmap database, covering a linage disequilibrium region of more than 70?Kb at 14q24.1, and then conducted a two-stage study (stage I: 878 cases and 900 controls; stage II: 914 cases and 967 controls) to investigate the associations between these tagging SNPs and risk of breast cancer in a Chinese population. In stage I, two SNPs (rs2842346 and rs17828907) were identified to be significantly associated with breast cancer risk (p=0.030 and 0.027 for genotype distributions, respectively). However, no significant associations were found between these two SNPs and breast cancer risk in either stage II or the combined dataset. These findings suggest that common variants at 14q24.1 might not be associated with the risk of breast cancer in the Chinese population, which will need the replication in additional larger studies.  相似文献   

6.
Fine mapping versus replication in whole-genome association studies   总被引:3,自引:0,他引:3       下载免费PDF全文
Association replication studies have a poor track record and, even when successful, often claim association with different markers, alleles, and phenotypes than those reported in the primary study. It is unknown whether these outcomes reflect genuine associations or false-positive results. A greater understanding of these observations is essential for genomewide association (GWA) studies, since they have the potential to identify multiple new associations that that will require external validation. Theoretically, a repeat association with precisely the same variant in an independent sample is the gold standard for replication, but testing additional variants is commonplace in replication studies. Finding different associated SNPs within the same gene or region as that originally identified is often reported as confirmatory evidence. Here, we compare the probability of replicating a gene or region under two commonly used marker-selection strategies: an "exact" approach that involves only the originally significant markers and a "local" approach that involves both the originally significant markers and others in the same region. When a region of high intermarker linkage disequilibrium is tested to replicate an initial finding that is only weak association with disease, the local approach is a good strategy. Otherwise, the most powerful and efficient strategy for replication involves testing only the initially identified variants. Association with a marker other than that originally identified can occur frequently, even in the presence of real effects in a low-powered replication study, and instances of such association increase as the number of included variants increases. Our results provide a basis for the design and interpretation of GWA replication studies and point to the importance of a clear distinction between fine mapping and replication after GWA.  相似文献   

7.
Johnson T 《Genetics》1999,151(4):1621-1631
Natural selection acts in three ways on heritable variation for mutation rates. A modifier allele that increases the mutation rate is (i) disfavored due to association with deleterious mutations, but is also favored due to (ii) association with beneficial mutations and (iii) the reduced costs of lower fidelity replication. When a unique beneficial mutation arises and sweeps to fixation, genetic hitchhiking may cause a substantial change in the frequency of a modifier of mutation rate. In previous studies of the evolution of mutation rates in sexual populations, this effect has been underestimated. This article models the long-term effect of a series of such hitchhiking events and determines the resulting strength of indirect selection on the modifier. This is compared to the indirect selection due to deleterious mutations, when both types of mutations are randomly scattered over a given genetic map. Relative to an asexual population, increased levels of recombination reduce the effects of beneficial mutations more rapidly than those of deleterious mutations. However, the role of beneficial mutations in determining the evolutionarily stable mutation rate may still be significant if the function describing the cost of high-fidelity replication has a shallow gradient.  相似文献   

8.

Background

Recent epidemiological studies suggest that the maternal genome is an important contributor to spontaneous preterm delivery (PTD). There is also a significant excess of males among preterm born infants, which may imply an X-linked mode of inheritance for a subset of cases. To explore this, we examined the effect of maternal and fetal X-chromosomal single nucleotide polymorphisms (SNPs) on the risk of PTD in two independent genome-wide association studies and one replication study.

Methods

Participants were recruited from the Danish National Birth Cohort and the Norwegian Mother and Child cohort studies. Data from these two populations were first analyzed independently, and then combined in a meta-analysis. Overall, we evaluated 12,211 SNPs in 1,535 case-mother dyads and 1,487 control-mother dyads. Analyses were done using a hybrid design that combines case-mother dyads and control-mother dyads, as implemented in the Haplin statistical software package. A sex-stratified analysis was performed for the fetal SNPs. In the replication study, 10 maternal and 16 fetal SNPs were analyzed using case-parent triads from independent studies of PTD in the United States, Argentina and Denmark.

Results

In the meta-analysis, the G allele at the maternal SNP rs2747022 in the FERM domain containing 7 gene (FRMD7) increased the risk of spontaneous PTD by 1.2 (95% confidence interval (CI): 1.1, 1.4). Although an association with this SNP was confirmed in the replication study, it was no longer statistically significant after a Bonferroni correction for multiple testing.

Conclusion

We did not find strong evidence in our data to implicate X-chromosomal SNPs in the etiology of spontaneous PTD. Although non-significant after correction for multiple testing, the mother’s G allele at rs2747022 in FRMD7 increased the risk of spontaneous PTD across all populations in this study, thus warranting further investigation in other populations.  相似文献   

9.
DTNBP1 was first identified as a putative schizophrenia-susceptibility gene in Irish pedigrees, with a report of association to common genetic variation. Several replication studies have reported confirmation of an association to DTNBP1 in independent European samples; however, reported risk alleles and haplotypes appear to differ between studies, and comparison among studies has been confounded because different marker sets were employed by each group. To facilitate evaluation of existing evidence of association and further work, we supplemented the extensive genotype data, available through the International HapMap Project (HapMap), about DTNBP1 by specifically typing all associated single-nucleotide polymorphisms reported in each of the studies of the Centre d'Etude du Polymorphisme Humain (CEPH)-derived HapMap sample (CEU). Using this high-density reference map, we compared the putative disease-associated haplotype from each study and found that the association studies are inconsistent with regard to the identity of the disease-associated haplotype at DTNBP1. Specifically, all five "replication" studies define a positively associated haplotype that is different from the association originally reported. We further demonstrate that, in all six studies, the European-derived populations studied have haplotype patterns and frequencies that are consistent with HapMap CEU samples (and each other). Thus, it is unlikely that population differences are creating the inconsistency of the association studies. Evidence of association is, at present, equivocal and unsatisfactory. The new dense map of the region may be valuable in more-comprehensive follow-up studies.  相似文献   

10.
Single nucleotide polymorphisms (SNPs) are appealing genetic markers due to several beneficial attributes, but uncertainty remains about how many of these bi-allelic markers are necessary to have sufficient power to differentiate populations, a task now generally accomplished with highly polymorphic microsatellite markers. In this study, we tested the utility of 37 SNPs and 13 microsatellites for differentiating 29 broadly distributed populations of Chinook salmon ( n  = 2783). Information content of all loci was determined by In and     , and the top 12 markers ranked by In were microsatellites, but the 6 highest, and 7 of the top 10     ranked markers, were SNPs. The mean ratio of random SNPs to random microsatellites ranged from 3.9 to 4.1, but this ratio was consistently reduced when only the most informative loci were included. Individual assignment test accuracy was higher for microsatellites (73.1%) than SNPs (66.6%), and pooling all 50 markers provided the highest accuracy (83.2%). When marker types were combined, as few as 15 of the top ranked loci provided higher assignment accuracy than either microsatellites or SNPs alone. Neighbour-joining dendrograms revealed similar clustering patterns and pairwise tests of population differentiation had nearly identical results with each suite of markers. Statistical tests and simulations indicated that closely related populations were better differentiated by microsatellites than SNPs. Our results indicate that both types of markers are likely to be useful in population genetics studies and that, in some cases, a combination of SNPs and microsatellites may be the most effective suite of loci.  相似文献   

11.
Single-nucleotide polymorphisms (SNPs) are commonly used to study genetics for common diseases and predict pharmacological response. The selection of likely informative SNPs in association studies depends on their allele frequencies and on the linkage disequilibrium (LD) between SNPs, both of which may show interethnic differences. Among three populations consisting of 207 Chinese, 858 French, and 395 Spanish, we compared the allele frequency distributions of 64 intragenic SNPs of 35 candidate genes for cardiovascular diseases. Twenty-eight of these SNPs from 12 genes were also examined for intragenic LD. About 20% of SNPs were restricted to Europeans, being monomorphic in Chinese, among them mostly nonsynonymous coding SNPs and noncoding SNPs. Only 1.6% of SNPs were specific in Chinese, commensurate with the detection of these SNPs almost exclusively in Caucasians. Similarly, these SNPs were more often rare (<0.1 minor allele frequency) in Chinese (44.3%) than in Europeans (31.1%). The variant allele frequencies and intermarker LDs in terms of D' and Delta(2) were highly correlated between French and Spanish populations (r = 0.98-0.99, p < 0.001). However, only moderate correlations of allele frequencies and D' were found between the Chinese and the European populations (r = 0.7 and 0.3, respectively) despite a high correlation of Delta(2) values (r = 0.8). These results suggest that ethnic considerations are important in the selection of SNPs for association studies of candidate genes, as this may affect the power of the study as well as the likelihood of asking relevant questions and getting medically meaningful answers.  相似文献   

12.
Biodiversity of 20 chicken breeds assessed by SNPs located in gene regions   总被引:2,自引:0,他引:2  
Twenty-five single nucleotide polymorphisms (SNPs) were analyzed in 20 distinct chicken breeds. The SNPs, each located in a different gene and mostly on different chromosomes, were chosen to examine the use of SNPs in or close to genes (g-SNPs), for biodiversity studies. Phylogenetic trees were constructed from these data. When bootstrap values were used as a criterion for the tree repeatability, doubling the number of SNPs from 12 to 25 improved tree repeatability more than doubling the number of individuals per population, from five to ten. Clustering results of these 20 populations, based on the software STRUCTURE, are in agreement with those previously obtained from the analysis of microsatellites. When the number of clusters was similar to the number of populations, affiliation of birds to their original populations was correct (>95%) only when at least the 22 most polymorphic SNP loci (out of 25) were included. When ten populations were clustered into five groups based on STRUCTURE, we used membership coefficient (Q) of the major cluster at each population as an indicator for clustering success level. This value was used to compare between three marker types; microsatellites, SNPs in or close to genes (g-SNPs) and SNPs in random fragments (r-SNPs). In this comparison, the same individuals were used (five to ten birds per population) and the same number of loci (14) used for each of the marker types. The average membership coefficients (Q) of the major cluster for microsatellites, g-SNPs and r-SNPs were 0.85, 0.7, and 0.64, respectively. Analysis based on microsatellites resulted in significantly higher clustering success due to their multi-allelic nature. Nevertheless, SNPs have obvious advantages, and are an efficient and cost-effective genetic tool, providing broader genome coverage and reliable estimates of genetic relatedness.  相似文献   

13.
Colorectal cancer is the second leading cause of cancer death in developed countries. Genome-wide association studies (GWAS) have successfully identified novel susceptibility loci for colorectal cancer. To follow up on these findings, and try to identify novel colorectal cancer susceptibility loci, we present results for GWAS of colorectal cancer (2,906 cases, 3,416 controls) that have not previously published main associations. Specifically, we calculated odds ratios and 95% confidence intervals using log-additive models for each study. In order to improve our power to detect novel colorectal cancer susceptibility loci, we performed a meta-analysis combining the results across studies. We selected the most statistically significant single nucleotide polymorphisms (SNPs) for replication using ten independent studies (8,161 cases and 9,101 controls). We again used a meta-analysis to summarize results for the replication studies alone, and for a combined analysis of GWAS and replication studies. We measured ten SNPs previously identified in colorectal cancer susceptibility loci and found eight to be associated with colorectal cancer (p value range 0.02 to 1.8?×?10(-8)). When we excluded studies that have previously published on these SNPs, five SNPs remained significant at p?相似文献   

14.
Glycated hemoglobin A1C (HbA1C) level is used as a diagnostic marker for diabetes mellitus and a predictor of diabetes associated complications. Genome-wide association studies have identified genetic variants associated with HbA1C level. Most of these studies have been conducted in populations of European ancestry. Here we report the findings from a meta-analysis of genome-wide association studies of HbA1C levels in 6,682 non-diabetic subjects of Chinese, Malay and South Asian ancestries. We also sought to examine the associations between HbA1C associated SNPs and microvascular complications associated with diabetes mellitus, namely chronic kidney disease and retinopathy. A cluster of 6 SNPs on chromosome 17 showed an association with HbA1C which achieved genome-wide significance in the Malays but not in Chinese and Asian Indians. No other variants achieved genome-wide significance in the individual studies or in the meta-analysis. When we investigated the reproducibility of the findings that emerged from the European studies, six loci out of fifteen were found to be associated with HbA1C with effect sizes similar to those reported in the populations of European ancestry and P-value ≤ 0.05. No convincing associations with chronic kidney disease and retinopathy were identified in this study.  相似文献   

15.
Xiao M  Latif SM  Kwok PY 《BioTechniques》2003,34(1):190-197
Strategies for identifying genetic risk factors in complex diseases by association studies require the comparison of allele frequencies of numerous SNPs between affected and control populations. Theoretically, hundreds of thousands of SNP markers across the genome will have to be genotyped in these studies. Genotyping SNPs one sample at a time is extremely costly and time consuming. To streamline whole genome association studies, some have proposed to screen SNPs by pooling the DNA samples initially for allele frequency determination and perform individual genotyping only when there is a significant discrepancy in allele frequencies between the affected and control populations. Here we describe a new method for determining the allele frequency of SNPs in pooled DNA samples using a two-color primer extension assay with real-time monitoring of fluorescence polarization (named kinetic FP-TDI assay). By comparing the ratio of the rate of incorporation of the two allele-specific dye-terminators, one can calculate the relative amounts of each allele in the pooled sample. The accuracy of allele frequency determination with pooled samples is within 3.3 +/- 0.8% of that determined by genotyping individual samples that make up the pool.  相似文献   

16.
Otosclerosis is a common form of hearing loss characterized by abnormal bone remodeling in the otic capsule. It is considered a complex disease caused by both genetic and environmental factors. In a previous study, we identified a region on chr7q22.1 located in the RELN gene that is associated with otosclerosis in Belgian–Dutch and French populations. Evidence for allelic heterogeneity was found in this chromosomal region in the form of two independent signals. To confirm this finding, we have completed a replication study that includes four additional populations from Europe (1,141 total samples). Several SNPs in this region replicated in these populations separately. While the power to detect significant association in each population is small, when all four populations are combined, six of seven SNPs replicate and show an effect in the same direction as in the previous populations. We also confirmed the presence of allelic heterogeneity in this region. These data further implicate RELN in the pathogenesis of otosclerosis. Functional research is warranted to determine the pathways through which RELN acts in the pathogenesis of otosclerosis.  相似文献   

17.
Recent high-throughput genotyping technologies, such as the Affymetrix 500k array and the Illumina HumanHap 550 beadchip, have driven down the costs of association studies and have enabled the measurement of single-nucleotide polymorphism (SNP) allele frequency differences between case and control populations on a genomewide scale. A key aspect in the efficiency of association studies is the notion of "indirect association," where only a subset of SNPs are collected to serve as proxies for the uncollected SNPs, taking advantage of the correlation structure between SNPs. Recently, a new class of methods for indirect association, multimarker methods, has been proposed. Although the multimarker methods are a considerable advancement, current methods do not fully take advantage of the correlation structure between SNPs and their multimarker proxies. In this article, we propose a novel multimarker indirect-association method, WHAP, that is based on a weighted sum of the haplotype frequency differences. In contrast to traditional indirect-association methods, we show analytically that there is a considerable gain in power achieved by our method compared with both single-marker and multimarker tests, as well as traditional haplotype-based tests. Our results are supported by empirical evaluation across the HapMap reference panel data sets, and a software implementation for the Affymetrix 500k and Illumina HumanHap 550 chips is available for download.  相似文献   

18.
Single nucleotide polymorphism (SNP) markers have become a genetic technology of choice because of their automation and high precision of allele calls. In this study, our goal was to develop 94 SNPs and test them across well-chosen common bean (Phaseolus vulgaris L.) germplasm. We validated and accessed SNP diversity at 84 gene-based and 10 non-genic loci using KASPar technology in a panel of 70 genotypes that have been used as parents of mapping populations and have been previously evaluated for SSRs. SNPs exhibited high levels of genetic diversity, an excess of middle frequency polymorphism, and a within-genepool mismatch distribution as expected for populations affected by sudden demographic expansions after domestication bottlenecks. This set of markers was useful for distinguishing Andean and Mesoamerican genotypes but less useful for distinguishing within each gene pool. In summary, slightly greater polymorphism and race structure was found within the Andean gene pool than within the Mesoamerican gene pool but polymorphism rate between genotypes was consistent with genepool and race identity. Our survey results represent a baseline for the choice of SNP markers for future applications because gene-associated SNPs could themselves be causative SNPs for traits. Finally, we discuss that the ideal genetic marker combination with which to carry out diversity, mapping and association studies in common bean should consider a mix of both SNP and SSR markers.  相似文献   

19.
Genome-wide association studies (GWAS) have detected many disease associations. However, the reported variants tend to explain small fractions of risk, and there are doubts about issues such as the portability of findings over different ethnic groups or the relative roles of rare versus common variants in the genetic architecture of complex disease. Studying the degree of sharing of disease-associated variants across populations can help in solving these issues. We present a comprehensive survey of GWAS replicability across 28 diseases. Most loci and SNPs discovered in Europeans for these conditions have been extensively replicated using peoples of European and East Asian ancestry, while the replication with individuals of African ancestry is much less common. We found a strong and significant correlation of Odds Ratios across Europeans and East Asians, indicating that underlying causal variants are common and shared between the two ancestries. Moreover, SNPs that failed to replicate in East Asians map into genomic regions where Linkage Disequilibrium patterns differ significantly between populations. Finally, we observed that GWAS with larger sample sizes have detected variants with weaker effects rather than with lower frequencies. Our results indicate that most GWAS results are due to common variants. In addition, the sharing of disease alleles and the high correlation in their effect sizes suggest that most of the underlying causal variants are shared between Europeans and East Asians and that they tend to map close to the associated marker SNPs.  相似文献   

20.
As we move forward from the current generation of genome-wide association (GWA) studies, additional cohorts of different ancestries will be studied to increase power, fine map association signals, and generalize association results to additional populations. Knowledge of genetic ancestry as well as population substructure will become increasingly important for GWA studies in populations of unknown ancestry. Here we propose genotyping pooled DNA samples using genome-wide SNP arrays as a viable option to efficiently and inexpensively estimate admixture proportion and identify ancestry informative markers (AIMs) in populations of unknown origin. We constructed DNA pools from African American, Native Hawaiian, Latina, and Jamaican samples and genotyped them using the Affymetrix 6.0 array. Aided by individual genotype data from the African American cohort, we established quality control filters to remove poorly performing SNPs and estimated allele frequencies for the remaining SNPs in each panel. We then applied a regression-based method to estimate the proportion of admixture in each cohort using the allele frequencies estimated from pooling and populations from the International HapMap Consortium as reference panels, and identified AIMs unique to each population. In this study, we demonstrated that genotyping pooled DNA samples yields estimates of admixture proportion that are both consistent with our knowledge of population history and similar to those obtained by genotyping known AIMs. Furthermore, through validation by individual genotyping, we demonstrated that pooling is quite effective for identifying SNPs with large allele frequency differences (i.e., AIMs) and that these AIMs are able to differentiate two closely related populations (HapMap JPT and CHB).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号