首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Longitudinal data and repeated measurements in epigenome-wide association studies (EWAS) provide a rich resource for understanding epigenetics. We summarize 7 analytical approaches to the GAW20 data sets that addressed challenges and potential applications of phenotypic and epigenetic data. All contributions used the GAW20 real data set and employed either linear mixed effect (LME) models or marginal models through generalized estimating equations (GEE). These contributions were subdivided into 3 categories: (a) quality control (QC) methods for DNA methylation data; (b) heritability estimates pretreatment and posttreatment with fenofibrate; and (c) impact of drug response pretreatment and posttreatment with fenofibrate on DNA methylation and blood lipids.

Results

Two contributions addressed QC and identified large statistical differences with pretreatment and posttreatment DNA methylation, possibly a result of batch effects. Two contributions compared epigenome-wide heritability estimates pretreatment and posttreatment, with one employing a Bayesian LME and the other using a variance-component LME. Density curves comparing these studies indicated these heritability estimates were similar. Another contribution used a variance-component LME to depict the proportion of heritability resulting from a genetic and shared environment. By including environmental exposures as random effects, the authors found heritability estimates became more stable but not significantly different. Two contributions investigated treatment response. One estimated drug-associated methylation effects on triglyceride levels as the response, and identified 11 significant cytosine-phosphate-guanine (CpG) sites with or without adjusting for high-density lipoprotein. The second contribution performed weighted gene coexpression network analysis and identified 6 significant modules of at least 30 CpG sites, including 3 modules with topological differences pretreatment and posttreatment.

Conclusions

Four conclusions from this GAW20 working group are: (a) QC measures are an important consideration for EWAS studies that are investigating multiple time points or repeated measurements; (b) application of heritability estimates between time points for individual CpG sites is a useful QC measure for DNA methylation studies; (c) drug intervention demonstrated strong epigenome-wide DNA methylation patterns across the 2 time points; and (d) new statistical methods are required to account for the environmental contributions of DNA methylation across time. These contributions demonstrate numerous opportunities exist for the analysis of longitudinal data in future epigenetic studies.
  相似文献   

2.
Lent  Samantha  Xu  Hanfei  Wang  Lan  Wang  Zhe  Sarnowski  Chlo&#;  Hivert  Marie-France  Dupuis  Jos&#;e 《BMC genetics》2018,19(1):84-31

Background

Single-probe analyses in epigenome-wide association studies (EWAS) have identified associations between DNA methylation and many phenotypes, but do not take into account information from neighboring probes. Methods to detect differentially methylated regions (DMRs) (clusters of neighboring probes associated with a phenotype) may provide more power to detect associations between DNA methylation and diseases or phenotypes of interest.

Results

We proposed a novel approach, GlobalP, and perform comparisons with 3 methods—DMRcate, Bumphunter, and comb-p—to identify DMRs associated with log triglycerides (TGs) in real GAW20 data before and after fenofibrate treatment. We applied these methods to the summary statistics from an EWAS performed on the methylation data. Comb-p, DMRcate, and GlobalP detected very similar DMRs near the gene CPT1A on chromosome 11 in both the pre- and posttreatment data. In addition, GlobalP detected 2 DMRs before fenofibrate treatment in the genes ETV6 and ABCG1. Bumphunter identified several DMRs on chromosomes 1 and 20, which did not overlap with DMRs detected by other methods.

Conclusions

Our novel method detected the same DMR identified by two existing methods and detected two additional DMRs not identified by any of the existing methods we compared.
  相似文献   

3.

Background

An important feature in many genomic studies is quality control and normalization. This is particularly important when analyzing epigenetic data, where the process of obtaining measurements can be bias prone. The GAW20 data was from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN), a study with multigeneration families, where DNA cytosine-phosphate-guanine (CpG) methylation was measured pre- and posttreatment with fenofibrate. We performed quality control assessment of the GAW20 DNA methylation data, including normalization, assessment of batch effects and detection of sample swaps.

Results

We show that even after normalization, the GOLDN methylation data has systematic differences pre- and posttreatment. Through investigation of (a) CpGs sites containing a single nucleotide polymorphism, (b) the stability of breeding values for methylation across time points, and (c) autosomal gender-associated CpGs, 13 sample swaps were detected, 11 of which were posttreatment.

Conclusions

This paper demonstrates several ways to perform quality control of methylation data in the absence of raw data files and highlights the importance of normalization and quality control of the GAW20 methylation data from the GOLDN study.
  相似文献   

4.

Background

In studies with multi-omics data available, there is an opportunity to investigate interdependent mechanisms of biological causality. The GAW20 data set includes both DNA genotype and methylation measures before and after fenofibrate treatment. Using change in triglyceride (TG) levels pre- to posttreatment as outcome, we present a mediation analysis that incorporates methylation. This approach allows us to simultaneously consider a mediation hypothesis that genotype affects change in TG level by means of its effect on methylation, and an interaction hypothesis that the effect of change in methylation on change in TG levels differs by genotype. We select 322 single-nucleotide polymorphism–cytosine-phosphate-guanine (SNP-CpG) site pairs for mediation analysis on the basis of proximity and marginal genome-wide association study (GWAS) and epigenome-wide association study (EWAS) significance, and present results from the real-data sample of 407 individuals with complete genotype, methylation, TG levels, and covariate data.

Results

We identified 3 SNP-CpG site pairs with significant interaction effects at a Bonferroni-corrected significance threshold of 1.55E-4. None of the analyzed sites showed significant evidence of mediation. Power analysis by simulation showed that a sample size of at least 19,500 is needed to detect nominally significant indirect effects with true effect sizes equal to the point estimates at the locus with strongest evidence of mediation.

Conclusions

These results suggest that there is stronger evidence for interaction between genotype and methylation on change in triglycerides than for methylation mediating the effect of genotype.
  相似文献   

5.
Background

Genome-wide association studies performed on triglycerides (TGs) have not accounted for epigenetic mechanisms that may partially explain trait heritability.

Results

Parent-of-origin (POO) effect association analyses using an agnostic approach or a candidate approach were performed for pretreatment TG levels, posttreatment TG levels, and pre- and posttreatment TG-level differences in the real GAW20 family data set. We detected 22 genetic variants with suggestive POO effects with at least 1 phenotype (P ≤ 10− 5). We evaluated the association of these 22 significant genetic variants showing POO effects with close DNA methylation probes associated with TGs. A total of 18 DNA methylation probes located in the vicinity of the 22 SNPs were associated with at least 1 phenotype and 6 SNP-probe pairs were associated with DNA methylation probes at the nominal level of P < 0.05, among which 1 pair presented evidence of POO effect. Our analyses identified a paternal effect of SNP rs301621 on the difference between pre- and posttreatment TG levels (P = 1.2 × 10− 5). This same SNP showed evidence for a maternal effect on methylation levels of a nearby probe (cg10206250; P = 0.01). Using a causal inference test we established that the observed POO effect of rs301621 was not mediated by DNA methylation at cg10206250.

Conclusions

We performed POO effect association analyses of SNPs with TGs, as well as association analyses of SNPs with DNA methylation probes. These analyses, which were followed by a causal inference test, established that the paternal effect at the SNP rs301621 is induced by treatment and is not mediated by methylation level at cg10206250.

  相似文献   

6.
Ghosh  Saurabh  Fardo  David W. 《BMC genetics》2018,19(1):127-131
Background

The GAW20 group formed on the theme of methods for association analyses of repeated measures comprised 4sets of investigators. The provided “real” data set included genotypes obtained from a human whole-genome association study based on longitudinal measurements of triglycerides (TGs) and high-density lipoprotein in addition to methylation levels before and after administration of fenofibrate. The simulated data set contained 200 replications of methylation levels and posttreatment TGs, mimicking the real data set.

Results

The different investigators in the group focused on the statistical challenges unique to family-based association analyses of phenotypes measured longitudinally and applied a wide spectrum of statistical methods such as linear mixed models, generalized estimating equations, and quasi-likelihood–based regression models. This article discusses the varying strategies explored by the group’s investigators with the common goal of improving the power to detect association with repeated measures of a phenotype.

Conclusions

Although it is difficult to identify a common message emanating from the different contributions because of the diversity in the issues addressed, the unifying theme of the contributions lie in the search for novel analytic strategies to circumvent the limitations of existing methodologies to detect genetic association.

  相似文献   

7.
DNA methylation is a widely studied epigenetic mechanism and alterations in methylation patterns may be involved in the development of common diseases. Unlike inherited changes in genetic sequence, variation in site-specific methylation varies by tissue, developmental stage, and disease status, and may be impacted by aging and exposure to environmental factors, such as diet or smoking. These non-genetic factors are typically included in epigenome-wide association studies (EWAS) because they may be confounding factors to the association between methylation and disease. However, missing values in these variables can lead to reduced sample size and decrease the statistical power of EWAS. We propose a site selection and multiple imputation (MI) method to impute missing covariate values and to perform association tests in EWAS. Then, we compare this method to an alternative projection-based method. Through simulations, we show that the MI-based method is slightly conservative, but provides consistent estimates for effect size. We also illustrate these methods with data from the Atherosclerosis Risk in Communities (ARIC) study to carry out an EWAS between methylation levels and smoking status, in which missing cell type compositions and white blood cell counts are imputed.  相似文献   

8.
DNA methylation plays a fundamental role in the regulation of the genome, but the optimal strategy for analysis of genome-wide DNA methylation data remains to be determined. We developed a comprehensive analysis pipeline for epigenome-wide association studies (EWAS) using the Illumina Infinium HumanMethylation450 BeadChip, based on 2,687 individuals, with 36 samples measured in duplicate. We propose new approaches to quality control, data normalisation and batch correction through control-probe adjustment and establish a null hypothesis for EWAS using permutation testing. Our analysis pipeline outperforms existing approaches, enabling accurate identification of methylation quantitative trait loci for hypothesis driven follow-up experiments.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-015-0600-x) contains supplementary material, which is available to authorized users.  相似文献   

9.
de Andrade  Mariza  Warwick Daw  E.  Kraja  Aldi T.  Fisher  Virginia  Wang  Lan  Hu  Ke  Li  Jing  Romanescu  Razvan  Veenstra  Jenna  Sun  Rui  Weng  Haoyi  Zhou  Wenda 《BMC genetics》2018,19(1):119-125
Background

GAW20 working group 5 brought together researchers who contributed 7 papers with the aim of evaluating methods to detect genetic by epigenetic interactions. GAW20 distributed real data from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) study, including single-nucleotide polymorphism (SNP) markers, methylation (cytosine-phosphate-guanine [CpG]) markers, and phenotype information on up to 995 individuals. In addition, a simulated data set based on the real data was provided.

Results

The 7 contributed papers analyzed these data sets with a number of different statistical methods, including generalized linear mixed models, mediation analysis, machine learning, W-test, and sparsity-inducing regularized regression. These methods generally appeared to perform well. Several papers confirmed a number of causative SNPs in either the large number of simulation sets or the real data on chromosome 11. Findings were also reported for different SNPs, CpG sites, and SNP–CpG site interaction pairs.

Conclusions

In the simulation (200 replications), power appeared generally good for large interaction effects, but smaller effects will require larger studies or consortium collaboration for realizing a sufficient power.

  相似文献   

10.
11.
12.
13.
Epigenetic factors such as DNA methylation are DNA alterations affecting gene expression that can convey environmental information through generations. Only a few studies have demonstrated epigenetic inheritance in humans. Our objective is to quantify genetic and common environmental determinants of familial resemblances in DNA methylation levels, using a family based sample. DNA methylation was measured in 48 French Canadians from 16 families as part of the GENERATION Study. We used the Illumina HumanMethylation450 BeadChip array to measure DNA methylation levels in blood leukocytes on 485,577 CpG sites. Heritability was assessed using the variance components method implemented in the QTDT software, which partitions the variance into polygenic (G), common environmental (C), and non-shared environmental (E) effects. We computed maximal heritability, genetic heritability, and common environmental effect for all probes (12.7%, 8.2%, and 4.5%, respectively) and for statistically significant probes (81.8%, 26.9%, and 54.9%, respectively). Higher maximal heritability was observed in the Major Histocompatibility Complex region on chromosome 6. In conclusion, familial resemblances in DNA methylation levels are mainly attributable to genetic factors when considering the average across the genome, but common environmental effect plays an important role when considering statistically significant probes. Further epigenome-wide studies on larger samples combined with genome-wide genotyping studies are needed to better understand the underlying mechanisms of DNA methylation heritability.  相似文献   

14.
目的 族群地域、体貌特征等表型是基因型与环境共同作用的结果。大量基因组学研究表明,汉族人群具有混合特征,内部存在明显的南北遗传差异。本研究旨在探索研究表观基因组在中国南北方汉族人群之间是否存在差异,并筛选差异遗传位点。方法 使用GLINT软件对483份汉族样本的全基因组甲基化芯片数据进行EWAS分析,使用Lasso回归方法筛选位点。使用多元逻辑回归算法构建南北方汉族人群预测模型,通过十折交叉验证的方法评估。结果 筛选出一组南北方汉族之间差异显著的CpG位点,准确性为99.03%,Kappa系数为0.979 6。结论 本研究表明南北方汉族人群之间存在表观遗传差异,本研究为进一步开展不同地域汉族人群之间的表观遗传差异研究奠定了基础。  相似文献   

15.
Prenatal maternal stress exposure has been associated with neonatal differential DNA methylation. However, the available evidence in humans is largely based on candidate gene methylation studies, where only a few CpG sites were evaluated. The aim of this study was to examine the association between prenatal exposure to maternal stress and offspring genome-wide cord blood methylation using different methods. First, we conducted a meta-analysis and follow-up pathway analyses. Second, we used novel region discovery methods [i.e., differentially methylated regions (DMRs) analyses]. To this end, we used data from two independent population-based studies, the Generation R Study (n = 912) and the Avon Longitudinal Study of Parents and Children (ALSPAC, n = 828), to (i) measure genome-wide DNA methylation in cord blood and (ii) extract a prenatal maternal stress composite. The meta-analysis (ntotal = 1,740) revealed no epigenome-wide (meta P <1.00e-07) associations of prenatal maternal stress exposure with neonatal differential DNA methylation. Follow-up analyses of the top hits derived from our epigenome-wide meta-analysis (meta P <1.00e-04) indicated an over-representation of the methyltransferase activity pathway. We identified no Bonferroni-corrected (P <1.00e-06) DMRs associated with prenatal maternal stress exposure. Combining data from two independent population-based samples in an epigenome-wide meta-analysis, the current study indicates that there are no large effects of prenatal maternal stress exposure on neonatal DNA methylation. Such replication efforts are essential in the search for robust associations, whether derived from candidate gene methylation or epigenome-wide studies.  相似文献   

16.
The Illumina HumanMethylation450 BeadChip is increasingly utilized in epigenome-wide association studies, however, this array-based measurement of DNA methylation is subject to measurement variation. Appropriate data preprocessing to remove background noise is important for detecting the small changes that may be associated with disease. We developed a novel background correction method, ENmix, that uses a mixture of exponential and truncated normal distributions to flexibly model signal intensity and uses a truncated normal distribution to model background noise. Depending on data availability, we employ three approaches to estimate background normal distribution parameters using (i) internal chip negative controls, (ii) out-of-band Infinium I probe intensities or (iii) combined methylated and unmethylated intensities. We evaluate ENmix against other available methods for both reproducibility among duplicate samples and accuracy of methylation measurement among laboratory control samples. ENmix out-performed other background correction methods for both these measures and substantially reduced the probe-design type bias between Infinium I and II probes. In reanalysis of existing EWAS data we show that ENmix can identify additional CpGs, and results in smaller P-value estimates for previously-validated CpGs. We incorporated the method into R package ENmix, which is freely available from Bioconductor website.  相似文献   

17.
Given the tissue-specific nature of epigenetic processes, the assessment of disease-relevant tissue is an important consideration for epigenome-wide association studies (EWAS). Little is known about whether easily accessible tissues, such as whole blood, can be used to address questions about interindividual epigenomic variation in inaccessible tissues, such as the brain. We quantified DNA methylation in matched DNA samples isolated from whole blood and 4 brain regions (prefrontal cortex, entorhinal cortex, superior temporal gyrus, and cerebellum) from 122 individuals. We explored co-variation between tissues and the extent to which methylomic variation in blood is predictive of interindividual variation identified in the brain. For the majority of DNA methylation sites, interindividual variation in whole blood is not a strong predictor of interindividual variation in the brain, although the relationship with cortical regions is stronger than with the cerebellum. Variation at a subset of probes is strongly correlated across tissues, even in instances when the actual level of DNA methylation is significantly different between them. A substantial proportion of this co-variation, however, is likely to result from genetic influences. Our data suggest that for the majority of the genome, a blood-based EWAS for disorders where brain is presumed to be the primary tissue of interest will give limited information relating to underlying pathological processes. These results do not, however, discount the utility of using a blood-based EWAS to identify biomarkers of disease phenotypes manifest in the brain. We have generated a searchable database for the interpretation of data from blood-based EWAS analyses (http://epigenetics.essex.ac.uk/bloodbrain/).  相似文献   

18.

Background

Transgenerational epigenetic inheritance has been posited as a possible contributor to the observed heritability of metabolic syndrome (MetS). Yet the extent to which estimates of epigenetic inheritance for DNA methylation sites are inflated by environmental and genetic covariance within families is still unclear. We applied current methods to quantify the environmental and genetic contributors to the observed heritability and familial correlations of four previously associated MetS methylation sites at three genes (CPT1A, SOCS3 and ABCG1) using real data made available through the GAW20.

Results

Our findings support the role of both shared environment and genetic variation in explaining the heritability of MetS and the four MetS cytosine-phosphate-guanine (CpG) sites, although the resulting heritability estimates were indistinguishable from one another. Familial correlations by type of relative pair generally followed our expectation based on relatedness, but in the case of sister and parent pairs we observed nonsignificant trends toward greater correlation than expected, as would be consistent with the role of shared environmental factors in the inflation of our estimated correlations.

Conclusions

Our work provides an interesting and flexible statistical framework for testing models of epigenetic inheritance in the context of human family studies. Future work should endeavor to replicate our findings and advance these methods to more robustly describe epigenetic inheritance patterns in human populations.
  相似文献   

19.
The potential influence of underlying differences in relative leukocyte distributions in studies involving blood-based profiling of DNA methylation is well recognized and has prompted development of a set of statistical methods for inferring changes in the distribution of white blood cells using DNA methylation signatures. However, the extent to which this methodology can accurately predict cell-type proportions based on blood-derived DNA methylation data in a large-scale epigenome-wide association study (EWAS) has yet to be examined. We used publicly available data deposited in the Gene Expression Omnibus (GEO) database (accession number GSE37008), which consisted of both blood-derived epigenome-wide DNA methylation data assayed using the Illumina Infinium HumanMethylation27 BeadArray and complete blood cell (CBC) counts among a community cohort of 94 non-diseased individuals. Constrained projection (CP) was used to obtain predictions of the proportions of lymphocytes, monocytes and granulocytes for each of the study samples based on their DNA methylation signatures. Our findings demonstrated high consistency between the average CBC-derived and predicted percentage of monocytes and lymphocytes (17.9% and 17.6% for monocytes and 82.1% and 81.4% for lymphocytes), with root mean squared error (rMSE) of 5% and 6%, for monocytes and lymphocytes, respectively. Similarly, there was moderate-high correlation between the CP-predicted and CBC-derived percentages of monocytes and lymphocytes (0.60 and 0.61, respectively), and these results were robust to the number of leukocyte differentially methylated regions (L-DMRs) used for CP prediction. These results serve as further validation of the CP approach and highlight the promise of this technique for EWAS where DNA methylation is profiled using whole-blood genomic DNA.  相似文献   

20.
Interindividual variability in the epigenome has gained tremendous attention for its potential in pathophysiological investigation, disease diagnosis, and evaluation of clinical intervention. DNA methylation is the most studied epigenetic mark in epigenome-wide association studies (EWAS) as it can be detected from limited starting material. Infinium 450K methylation array is the most popular platform for high-throughput profiling of this mark in clinical samples, as it is cost-effective and requires small amounts of DNA. However, this method suffers from low genome coverage and errors introduced by probe cross-hybridization. Whole-genome bisulfite sequencing can overcome these limitations but elevates the costs tremendously. Methyl-Capture Sequencing (MC Seq) is an attractive intermediate solution to increase the methylome coverage in large sample sets. Here we first demonstrate that MC Seq can be employed using DNA amounts comparable to the amounts used for Infinium 450K. Second, to provide guidance when choosing between the 2 platforms for EWAS, we evaluate and compare MC Seq and Infinium 450K in terms of coverage, technical variation, and concordance of methylation calls in clinical samples. Last, since the focus in EWAS is to study interindividual variation, we demonstrate the utility of MC Seq in studying interindividual variation in subjects from different ethnicities.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号