Similar literature
20 similar articles found.
1.
PLoS ONE, 2013, 8(6)
The feasibility of using imperfectly phenotyped “silver standard” samples identified from electronic medical record diagnoses is considered in genetic association studies when these samples might be combined with an existing set of samples phenotyped with a gold standard technique. An analytic expression is derived for the power of a chi-square test of independence using either research-quality case/control samples alone, or augmented with silver standard data. The subset of the parameter space where inclusion of silver standard samples increases statistical power is identified. A case study of dementia subjects identified from electronic medical records from the Electronic Medical Records and Genomics (eMERGE) network, combined with subjects from two studies specifically targeting dementia, verifies these results.

2.
Type 2 diabetes (T2D) is more prevalent in African Americans than in Europeans. However, little is known about the genetic risk in African Americans despite the recent identification of more than 70 T2D loci primarily by genome-wide association studies (GWAS) in individuals of European ancestry. In order to investigate the genetic architecture of T2D in African Americans, the MEta-analysis of type 2 DIabetes in African Americans (MEDIA) Consortium examined 17 GWAS on T2D comprising 8,284 cases and 15,543 controls in African Americans in stage 1 analysis. Single nucleotide polymorphism (SNP) association analysis was conducted in each study under the additive model after adjustment for age, sex, study site, and principal components. Meta-analysis of approximately 2.6 million genotyped and imputed SNPs in all studies was conducted using an inverse variance-weighted fixed effect model. Replications were performed to follow up 21 loci in up to 6,061 cases and 5,483 controls in African Americans, and 8,130 cases and 38,987 controls of European ancestry. We identified three known loci (TCF7L2, HMGA2 and KCNQ1) and two novel loci (HLA-B and INS-IGF2) at genome-wide significance (4.15×10⁻⁹⁴ < P < 5×10⁻⁸, odds ratio (OR) = 1.09 to 1.36). Fine-mapping revealed that 88 of 158 previously identified T2D or glucose homeostasis loci demonstrated nominal to highly significant association (2.2×10⁻²³ < locus-wide P < 0.05). These novel and previously identified loci yielded a sibling relative risk of 1.19, explaining 17.5% of the phenotypic variance of T2D on the liability scale in African Americans. Overall, this study identified two novel susceptibility loci for T2D in African Americans. A substantial number of previously reported loci are transferable to African Americans after accounting for linkage disequilibrium, enabling fine mapping of causal variants in trans-ethnic meta-analysis studies.
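
The inverse variance-weighted fixed effect model named in the abstract above can be sketched as follows. This is a minimal illustration of the general technique, not the consortium's actual pipeline; the function name and the example numbers are hypothetical, and the inputs are assumed to be per-study log odds ratios with their standard errors.

```python
import math


def fixed_effect_meta(betas, ses):
    """Pool per-study effect estimates with inverse-variance weights.

    betas: per-study effect estimates (e.g. log odds ratios).
    ses:   matching standard errors.
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in ses]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical numbers for illustration only (not taken from the study).
beta, se, z, p = fixed_effect_meta([0.10, 0.30], [0.10, 0.10])
print(f"pooled beta={beta:.3f}, se={se:.4f}, z={z:.2f}, p={p:.4g}")
```

Because the weights are precisions, a large study with a small standard error dominates the pooled estimate, which is the usual motivation for this model when effects are assumed to be shared across studies.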

3.
4.
To date, genome-wide association studies have focused almost exclusively on populations of European ancestry. These studies continue with the advent of next-generation sequencing, designed to systematically catalog and test low-frequency variation for a role in disease. A complementary approach would be to focus further efforts on cohorts of multiple ethnicities. This leverages the idea that population genetic drift may have elevated some variants to higher allele frequency in different populations, boosting statistical power to detect an association. Based on empirical allele frequency distributions from eleven populations represented in HapMap Phase 3 and the 1000 Genomes Project, we simulate a range of genetic models to quantify the power of association studies in multiple ethnicities relative to studies that exclusively focus on samples of European ancestry. In each of these simulations, a first phase of GWAS in exclusively European samples is followed by a second GWAS phase in any of the other populations (including a multiethnic design). We find that nontrivial power gains can be achieved by conducting future whole-genome studies in worldwide populations, where, in particular, African populations contribute the largest relative power gains for low-frequency alleles (<5%) of moderate effect that suffer from low power in samples of European descent. Our results emphasize the importance of broadening genetic studies to worldwide populations to ensure efficient discovery of genetic loci contributing to phenotypic trait variability, especially for those traits for which large numbers of samples of European ancestry have already been collected and tested.

5.
Over the past 500 years, North America has been the site of ongoing mixing of Native Americans, European settlers, and Africans (brought largely by the trans-Atlantic slave trade), shaping the early history of what became the United States. We studied the genetic ancestry of 5,269 self-described African Americans, 8,663 Latinos, and 148,789 European Americans who are 23andMe customers and show that the legacy of these historical interactions is visible in the genetic ancestry of present-day Americans. We document pervasive mixed ancestry and asymmetrical male and female ancestry contributions in all groups studied. We show that regional ancestry differences reflect historical events, such as early Spanish colonization, waves of immigration from many regions of Europe, and forced relocation of Native Americans within the US. This study sheds light on the fine-scale differences in ancestry within and across the United States and informs our understanding of the relationship between racial and ethnic identities and genetic ancestry.

6.

Background

Family history and African-American race are important risk factors for both prostate cancer (CaP) incidence and aggressiveness. When studying complex diseases such as CaP that have a heritable component, chances of finding true disease susceptibility alleles can be increased by accounting for genetic ancestry within the population investigated. Race, ethnicity and ancestry were studied in a geographically diverse cohort of men with newly diagnosed CaP.

Methods

Individual ancestry (IA) was estimated in the population-based North Carolina and Louisiana Prostate Cancer Project (PCaP), a cohort of 2,106 incident CaP cases (2,063 with complete ethnicity information) comprising roughly equal numbers of research subjects reporting as Black/African American (AA) or European American/Caucasian/Caucasian American/White (EA) from North Carolina or Louisiana. Mean genome-wide individual ancestry estimates of percent African, European and Asian were obtained and tested for differences by state and ethnicity (Cajun and/or Creole and Hispanic/Latino) using multivariate analysis of variance models. Principal components (PC) were compared to assess differences in genetic composition by self-reported race and ethnicity between and within states.

Results

Mean individual ancestries differed by state for self-reporting AA (p = 0.03) and EA (p = 0.001). This geographic difference attenuated for AAs who answered “no” to all ethnicity membership questions (non-ethnic research subjects; p = 0.78) but not for EA research subjects (p = 0.002). Mean ancestry estimates of self-identified AA Louisiana research subjects in each ethnic group (Cajun only, Creole only, and both Cajun and Creole) differed significantly from those of self-identified non-ethnic AA Louisiana research subjects. These ethnicity differences were not seen in those who self-identified as EA.

Conclusions

Mean IA differed by race between states, elucidating a potential contributing factor to these differences in AA research participants: self-reported ethnicity. Accurately accounting for genetic admixture in this cohort is essential for future analyses of the genetic and environmental contributions to CaP.

7.
Genotype-imputation methods provide an essential technique for high-resolution genome-wide association (GWA) studies with millions of single-nucleotide polymorphisms. For optimal design and interpretation of imputation-based GWA studies, it is important to understand the connection between imputation error and power to detect associations at imputed markers. Here, using a 2 × 3 chi-square test, we describe a relationship between genotype-imputation error rates and the sample-size inflation required for achieving statistical power at an imputed marker equal to that obtained if genotypes at the marker were known with certainty. Surprisingly, typical imputation error rates (∼2%–6%) lead to a large increase in the required sample size (∼10%–60%), and in some African populations whose genotypes are particularly difficult to impute, the required sample-size increase is as high as ∼30%–150%. In most populations, each 1% increase in imputation error leads to an increase of ∼5%–13% in the sample size required for maintaining power. These results imply that in GWA sample-size calculations investigators will need to account for a potentially considerable loss of power from even low levels of imputation error and that development of additional genomic resources that decrease imputation error will translate into substantial reduction in the sample sizes needed for imputation-based detection of the variants that underlie complex human diseases.

8.
Summary. To detect association between a genetic marker and a disease in case–control studies, the Cochran–Armitage trend test is typically used. The trend test is locally optimal when the genetic model is correctly specified. However, in practice, the underlying genetic model, and hence the optimal trend test, are usually unknown. In this case, Pearson's chi-squared test, the maximum of three trend test statistics (optimal for the recessive, additive, and dominant models), and the test based on genetic model selection (GMS) are useful. In this article, we first modify the existing GMS method so that it can be used when the risk allele is unknown. Then we propose a new approach by excluding a genetic model that is not supported by the data. Using either the model selection or exclusion, the alternative space is reduced conditional on the observed data, and hence the power to detect a true association can be increased. Simulation results are reported and the proposed methods are applied to the genetic markers identified from the genome-wide association studies conducted by the Wellcome Trust Case–Control Consortium. The results demonstrate that the genetic model exclusion approach usually performs better than existing methods under its worst situation across scientifically plausible genetic models we considered.
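
For readers unfamiliar with the Cochran–Armitage trend test mentioned above, the following is a minimal sketch of the standard statistic for a 2 × 3 case–control genotype table, evaluated under the three usual score sets. It does not implement the article's model selection or model exclusion procedures; the function names are hypothetical, genotype counts are assumed to be ordered by number of risk alleles (0, 1, 2), and the p-value is the ordinary asymptotic normal one for a single pre-specified model.

```python
import math

# Genotype scores ordered by risk-allele count (0, 1, 2 copies).
MODEL_SCORES = {
    "recessive": (0.0, 0.0, 1.0),
    "additive": (0.0, 0.5, 1.0),
    "dominant": (0.0, 1.0, 1.0),
}


def trend_test(cases, controls, scores=(0.0, 1.0, 2.0)):
    """Cochran-Armitage trend statistic for a 2 x 3 table.

    cases, controls: genotype counts for 0, 1 and 2 risk alleles.
    Returns (z, two_sided_p) using the asymptotic normal reference.
    """
    r_total = sum(cases)
    s_total = sum(controls)
    n = r_total + s_total
    col = [r + s for r, s in zip(cases, controls)]
    sum_xr = sum(x * r for x, r in zip(scores, cases))
    sum_xn = sum(x * c for x, c in zip(scores, col))
    sum_x2n = sum(x * x * c for x, c in zip(scores, col))
    # Numerator: score-weighted excess of cases over expectation.
    t = n * sum_xr - r_total * sum_xn
    # Variance of the numerator under the null of no association.
    var = r_total * s_total * (n * sum_x2n - sum_xn ** 2) / n
    if var <= 0:
        raise ValueError("table has no variation in genotype scores")
    z = t / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p


def three_model_statistics(cases, controls):
    """Return the trend z statistic under each of the three score sets."""
    return {m: trend_test(cases, controls, s)[0] for m, s in MODEL_SCORES.items()}


# Hypothetical counts for illustration only.
print(three_model_statistics((10, 20, 30), (30, 20, 10)))
```

The statistic is unchanged by rescaling the scores, so (0, 0.5, 1) and (0, 1, 2) give the same additive test. Taking the maximum of the three statistics requires its own null distribution, which this sketch deliberately leaves out.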

9.
New sources of genetic diversity must be incorporated into plant breeding programs if they are to continue increasing grain yield and quality, and tolerance to abiotic and biotic stresses. Germplasm collections provide a source of genetic and phenotypic diversity, but characterization of these resources is required to increase their utility for breeding programs. We used a barley SNP iSelect platform with 7,842 SNPs to genotype 2,417 barley accessions sampled from the USDA National Small Grains Collection of 33,176 accessions. Most of the accessions in this core collection are categorized as landraces or cultivars/breeding lines and were obtained from more than 100 countries. Both STRUCTURE and principal component analysis identified five major subpopulations within the core collection, mainly differentiated by geographical origin and spike row number (an inflorescence architecture trait). Different patterns of linkage disequilibrium (LD) were found across the barley genome and many regions of high LD contained traits involved in domestication and breeding selection. The genotype data were used to define ‘mini-core’ sets of accessions capturing the majority of the allelic diversity present in the core collection. These ‘mini-core’ sets can be used for evaluating traits that are difficult or expensive to score. Genome-wide association studies (GWAS) of ‘hull cover’, ‘spike row number’, and ‘heading date’ demonstrate the utility of the core collection for locating genetic factors determining important phenotypes. The GWAS results were referenced to a new barley consensus map containing 5,665 SNPs. Our results demonstrate that GWAS and high-density SNP genotyping are effective tools for plant breeders interested in accessing genetic diversity in large germplasm collections.

10.
The evidence for the existence of genetic susceptibility variants for the common form of hypertension (“essential hypertension”) remains weak and inconsistent. We sought genetic variants underlying blood pressure (BP) by conducting a genome-wide association study (GWAS) among African Americans, a population group in the United States that is disproportionately affected by hypertension and associated complications, including stroke and kidney diseases. Using a dense panel of over 800,000 SNPs in a discovery sample of 1,017 African Americans from the Washington, D.C., metropolitan region, we identified multiple SNPs reaching genome-wide significance for systolic BP in or near the genes: PMS1, SLC24A4, YWHA7, IPO7, and CACNA1H. Two of these genes, SLC24A4 (a sodium/potassium/calcium exchanger) and CACNA1H (a voltage-dependent calcium channel), are potential candidate genes for BP regulation and the latter is a drug target for a class of calcium channel blockers. No variant reached genome-wide significance for association with diastolic BP (top scoring SNP rs1867226, p = 5.8×10⁻⁷) or with hypertension as a binary trait (top scoring SNP rs9791170, p = 5.1×10⁻⁷). We replicated some of the significant SNPs in a sample of West Africans. Pathway analysis revealed that genes harboring top-scoring variants cluster in pathways and networks of biologic relevance to hypertension and BP regulation. This is the first GWAS for hypertension and BP in an African American population. The findings suggest that, in addition to or in lieu of relying solely on replicated variants of moderate-to-large effect reaching genome-wide significance, pathway and network approaches may be useful in identifying and prioritizing candidate genes/loci for further experiments.

11.
In response to the Surgeon General's request for more research on racial disparities in mental health care, especially research that includes high-need populations (e.g., the homeless, incarcerated, children in foster care, and substance abusers), we examined racial disparities in the provision of mental health counseling, psychotherapy, and pharmacotherapy in hospital outpatient settings using nationally representative data from the 1997 National Hospital Ambulatory Medical Care Survey (NHAMCS). After controlling for diagnosis and other factors, we found that African Americans were less likely than whites to receive mental health counseling and psychotherapy, but more likely than whites to receive pharmacotherapy. We also found that substance abuse clinics were more likely than primary care and specialty mental health clinics to provide mental health counseling and psychotherapy. However, specialty mental health clinics were the only clinics to provide pharmacotherapy. Future research should examine racial disparities in a variety of settings, controlling for diagnosis as well as other factors.

12.
Genotype imputation, used in genome-wide association studies to expand coverage of single nucleotide polymorphisms (SNPs), has performed poorly in African Americans compared to less admixed populations. Overall, imputation has typically relied on HapMap reference haplotype panels from Africans (YRI), European Americans (CEU), and Asians (CHB/JPT). The 1000 Genomes project offers a wider range of reference populations, such as African Americans (ASW), but their imputation performance has had limited evaluation. Using 595 African Americans genotyped on Illumina’s HumanHap550v3 BeadChip, we compared imputation results from four software programs (IMPUTE2, BEAGLE, MaCH, and MaCH-Admix) and three reference panels consisting of different combinations of 1000 Genomes populations (February 2012 release): (1) 3 specifically selected populations (YRI, CEU, and ASW); (2) 8 populations of diverse African (AFR) or European (EUR) descent; and (3) all 14 available populations (ALL). Based on chromosome 22, we calculated three performance metrics: (1) concordance (percentage of masked genotyped SNPs with imputed and true genotype agreement); (2) imputation quality score (IQS; concordance adjusted for chance agreement, which is particularly informative for low minor allele frequency [MAF] SNPs); and (3) average r2hat (estimated correlation between the imputed and true genotypes, for all imputed SNPs). Across the reference panels, IMPUTE2 and MaCH had the highest concordance (91%–93%), but IMPUTE2 had the highest IQS (81%–83%) and average r2hat (0.68 using YRI+ASW+CEU, 0.62 using AFR+EUR, and 0.55 using ALL). Imputation quality for most programs was reduced by the addition of more distantly related reference populations, due entirely to the introduction of low frequency SNPs (MAF≤2%) that are monomorphic in the more closely related panels. While imputation was optimized by using IMPUTE2 with reference to the ALL panel (average r2hat = 0.86 for SNPs with MAF>2%), use of the ALL panel for African American studies requires careful interpretation of the population specificity and imputation quality of low frequency SNPs.
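
The difference between raw concordance and a chance-adjusted score, as described in the abstract above, can be illustrated with a short sketch. It assumes the chance adjustment takes the Cohen's kappa form, (observed agreement minus chance agreement) divided by (1 minus chance agreement), applied to best-guess genotypes; the function name and the toy data are hypothetical and are not taken from the study.

```python
from collections import Counter


def concordance_and_iqs(true_genotypes, imputed_genotypes):
    """Return (concordance, chance-adjusted score) for one masked SNP.

    Genotypes are coded as minor-allele counts (0, 1, 2), one entry per
    individual. The adjusted score is a kappa-style statistic; it is
    returned as NaN when chance agreement is already 100%.
    """
    if len(true_genotypes) != len(imputed_genotypes) or not true_genotypes:
        raise ValueError("need two non-empty genotype lists of equal length")
    n = len(true_genotypes)
    # Observed agreement: fraction of individuals imputed correctly.
    p_obs = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes)) / n
    true_counts = Counter(true_genotypes)
    imputed_counts = Counter(imputed_genotypes)
    # Agreement expected if imputed calls were independent of the truth.
    p_chance = sum(true_counts[g] * imputed_counts[g] for g in true_counts) / n ** 2
    if p_chance == 1.0:
        return p_obs, float("nan")
    return p_obs, (p_obs - p_chance) / (1.0 - p_chance)


# Toy low-MAF SNP: calling everyone homozygous major looks accurate
# by concordance but carries no information beyond chance.
truth = [0] * 98 + [1] * 2
imputed = [0] * 100
print(concordance_and_iqs(truth, imputed))
```

In the toy case the concordance is high while the adjusted score is zero, which is why a chance-adjusted metric is more informative for low-MAF SNPs.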

13.
A more thorough understanding of the differences in DNA methylation (DNAm) profiles in populations may hold promise for identifying molecular mechanisms through which genetic and environmental factors jointly contribute to human diseases. Inflammation is a key molecular mechanism underlying several chronic diseases including cardiovascular disease, and it affects DNAm profile on both global and locus-specific levels. To understand the impact of inflammation on the DNAm of the human genome, we investigated DNAm profiles of peripheral blood leukocytes from 966 African American participants in the Genetic Epidemiology Network of Arteriopathy (GENOA) study. By testing the association of DNAm sites on CpG islands of over 14,000 genes with C-reactive protein (CRP), an inflammatory biomarker of cardiovascular disease, we identified 257 DNAm sites in 240 genes significantly associated with serum levels of CRP adjusted for age, sex, body mass index and smoking status, and corrected for multiple testing. Of the significantly associated DNAm sites, 80.5% were hypomethylated with higher CRP levels. The most significant Gene Ontology terms enriched in the genes associated with the CRP levels were immune system process, immune response, defense response, response to stimulus, and response to stress, which are all linked to the functions of leukocytes. While the CRP-associated DNAm may be cell-type specific, understanding the DNAm association with CRP in peripheral blood leukocytes of multi-ethnic populations can assist in unveiling the molecular mechanism of how the process of inflammation affects the risks of developing common disease through epigenetic modifications.

14.

Background

The timing of associations between common genetic variants and changes in growth patterns over childhood may provide insight into the development of obesity in later life. To address this question, it is important to define appropriate statistical models to allow for the detection of genetic effects influencing longitudinal childhood growth.

Methods and Results

Children from The Western Australian Pregnancy Cohort (Raine; n = 1,506) Study were genotyped at 17 genetic loci shown to be associated with childhood obesity (FTO, MC4R, TMEM18, GNPDA2, KCTD15, NEGR1, BDNF, ETV5, SEC16B, LYPLAL1, TFAP2B, MTCH2, BCDIN3D, NRXN3, SH2B1, MSRA) and an obesity-risk-allele-score was calculated as the total number of ‘risk alleles’ possessed by each individual. To determine the statistical method that fits these data and has the ability to detect genetic differences in BMI growth profile, four methods were investigated: linear mixed effects model, linear mixed effects model with skew-t random errors, semi-parametric linear mixed models and a non-linear mixed effects model. Of the four methods, the semi-parametric linear mixed model method was the most efficient for modelling childhood growth to detect modest genetic effects in this cohort. Using this method, three of the 17 loci were significantly associated with BMI intercept or trajectory in females and four in males. Additionally, the obesity-risk-allele score was associated with increased average BMI (female: β = 0.0049, P = 0.0181; male: β = 0.0071, P = 0.0001) and rate of growth (female: β = 0.0012, P = 0.0006; male: β = 0.0008, P = 0.0068) throughout childhood.

Conclusions

Using statistical models appropriate to detect genetic variants, variations in adult obesity genes were associated with childhood growth. There were also differences between males and females. This study provides evidence of genetic effects that may identify individuals early in life that are more likely to rapidly increase their BMI through childhood, which provides some insight into the biology of childhood growth.
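
The obesity-risk-allele score described in the Methods and Results above, the total number of risk alleles an individual carries, can be sketched as an unweighted count. The function name, the genotype encoding as two-letter strings, and the allele letters below are placeholders for illustration; they are not the study's actual risk alleles or data format.

```python
def risk_allele_score(genotypes, risk_alleles):
    """Count risk alleles across loci for one individual.

    genotypes:    dict mapping locus name to a two-letter genotype, e.g. "AT".
    risk_alleles: dict mapping locus name to the allele counted as risk.
    Each locus contributes 0, 1 or 2 to the unweighted score.
    """
    return sum(genotypes[locus].count(allele) for locus, allele in risk_alleles.items())


# Placeholder alleles for illustration only; not the study's risk alleles.
risk = {"FTO": "A", "MC4R": "C", "TMEM18": "G"}
child = {"FTO": "AA", "MC4R": "CT", "TMEM18": "TT"}
print(risk_allele_score(child, risk))
```

An unweighted count treats every locus as having the same effect size; a weighted score would multiply each count by a per-locus effect estimate, which the abstract does not describe.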

15.
Variation in gene expression is a fundamental aspect of human phenotypic variation. Several recent studies have analyzed gene expression levels in populations of different continental ancestry and reported population differences at a large number of genes. However, these differences could largely be due to non-genetic (e.g., environmental) effects. Here, we analyze gene expression levels in African American cell lines, which differ from previously analyzed cell lines in that individuals from this population inherit variable proportions of two continental ancestries. We first relate gene expression levels in individual African Americans to their genome-wide proportion of European ancestry. The results provide strong evidence of a genetic contribution to expression differences between European and African populations, validating previous findings. Second, we infer local ancestry (0, 1, or 2 European chromosomes) at each location in the genome and investigate the effects of ancestry proximal to the expressed gene (cis) versus ancestry elsewhere in the genome (trans). Both effects are highly significant, and we estimate that 12±3% of all heritable variation in human gene expression is due to cis variants.

16.
High levels of anxiety have long been reported for African Americans. Recent analyses of Epidemiological Catchment Area (ECA) data have failed to support this, although contemporary ethnographies have discussed important African American folk idioms of anxiety. This study compares ethnographically reported symptoms of anxiety in African Americans to those reported in the ECA data. A multivariate analysis of female African American and European American differences in comparable ECA and ethnographic symptoms was performed. Significant differences were found not in ethnicity but in education levels. Alternative interpretations are discussed. Methodological problems are discussed highlighting limitations of both household survey research, such as the ECA project, and ethnography.

17.
Sarcoidosis, a systemic granulomatous disease, likely results from both environmental agents and genetic susceptibility. Sarcoidosis is more prevalent in women and, in the United States, African Americans are both more commonly and more severely affected than Caucasians. We report a follow up of the first genome scan for sarcoidosis susceptibility genes in African Americans. Both the genome scan and the present study comprise 229 African American nuclear families ascertained through two or more sibs with sarcoidosis. Regions studied included those which reached a significance in the genome scan of 0.01 (2p25, 5q11, 5q35, 9q34, 11p15 and 20q13), 0.05 (3p25 and 5p15–13) or which replicated previous findings (3p14–11). We performed genotyping with additional markers in the same families used in the genome scan. We examined multi-locus models for epistasis and performed model-based linkage analysis on subsets of the most linked families to characterize the underlying genetic model. The strongest signal was at marker D5S407 (P=0.005) on 5q11.2, using both full and half sibling pairs. Our results support, in an African American population, a sarcoidosis susceptibility gene on chromosome 5q11.2, and a gene protective for sarcoidosis on 5p15.2. These fine mapping results further prioritize the importance of candidate regions on chromosomes 2p25, 3p25, 5q35, 9q34, 11p15 and 20q13 for African Americans. Additionally, our results suggest joint action of the effects of putative genes on chromosome 3p14–11 and 5p15.2. We conclude that multiple susceptibility loci for sarcoidosis exist in African Americans and that some may have interdependent effects on disease pathogenesis.

18.
Low vitamin D levels are associated with an increased incidence of colorectal cancer (CRC) and higher mortality from the disease. In the US, African Americans (AAs) have the highest CRC incidence and mortality and the lowest levels of vitamin D. Single nucleotide polymorphisms (SNPs) in the vitamin D receptor (VDR) gene have been previously associated with CRC, but few studies have included AAs. We studied 795 AA CRC cases and 985 AA controls from Chicago and North Carolina as well as 1324 Caucasian cases and 990 Caucasian controls from Chicago and Spain. We genotyped 54 tagSNPs in VDR (46586959 to 46521297 Mb) and tested for association adjusting for West African ancestry, age, gender, and multiple testing. Untyped markers were imputed using MACH1.0. We analyzed associations by gender and anatomic location in the whole study group as well as by vitamin D intake in the North Carolina AA group. In the joint analysis, none of the SNPs tested was significantly associated with CRC. For four previously tested restriction fragment length polymorphisms, only one (referred to as ApaI), tagged by the SNP rs79628898, had a nominally significant p-value in AAs; none of these polymorphisms were associated with CRC in Caucasians. In the North Carolina AAs, for whom we had vitamin D intake data, we found a significant association between an intronic SNP rs11574041 and vitamin D intake, which is evidence for a VDR gene-environment interaction in AAs. In summary, using a systematic tagSNP approach, we have not found evidence for significant associations between VDR and CRC in AAs or Caucasians.

19.
Genome-wide association studies (GWAS) are routinely conducted for both quantitative and binary (disease) traits. We present two analytical tools for use in the experimental design of GWAS. Firstly, we present power calculations quantifying power in a unified framework for a range of scenarios. In this context we consider the utility of quantitative scores (e.g. endophenotypes) that may be available on cases only or both cases and controls. Secondly, we consider the accuracy of prediction of genetic risk from genome-wide SNPs and derive an expression for genomic prediction accuracy using a liability threshold model for disease traits in a case-control design. The expected values based on our derived equations for both power and prediction accuracy agree well with observed estimates from simulations.

20.
Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%–20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions.
