首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
Family studies of individual tissues have shown that gene expression traits are genetically heritable. Here, we investigate cis and trans components of heritability both within and across tissues by applying variance-components methods to 722 Icelanders from family cohorts, using identity-by-descent (IBD) estimates from long-range phased genome-wide SNP data and gene expression measurements for approximately 19,000 genes in blood and adipose tissue. We estimate the proportion of gene expression heritability attributable to cis regulation as 37% in blood and 24% in adipose tissue. Our results indicate that the correlation in gene expression measurements across these tissues is primarily due to heritability at cis loci, whereas there is little sharing of trans regulation across tissues. One implication of this finding is that heritability in tissues composed of heterogeneous cell types is expected to be more dominated by cis regulation than in tissues composed of more homogeneous cell types, consistent with our blood versus adipose results as well as results of previous studies in lymphoblastoid cell lines. Finally, we obtained similar estimates of the cis components of heritability using IBD between unrelated individuals, indicating that transgenerational epigenetic inheritance does not contribute substantially to the "missing heritability" of gene expression in these tissue types.  相似文献   

3.
Hi-C data provide population averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. Effective analysis of Hi-C data entails controlling for the potential confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive simulations to test THUNDER based on combining two published single-cell Hi-C (scHi-C) datasets. THUNDER more accurately estimates the underlying cell type proportions compared to reference-free methods (e.g., TOAST, and NMF) and is more robust than reference-dependent methods (e.g. MuSiC). We further demonstrate the practical utility of THUNDER to estimate cell type proportions and identify cell-type-specific interactions in Hi-C data from adult human cortex tissue samples. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data is still rare.  相似文献   

4.
5.
6.
7.
Allele-specific gene expression, ASE, is an important aspect of gene regulation. We developed a novel method MBASED, meta-analysis based allele-specific expression detection for ASE detection using RNA-seq data that aggregates information across multiple single nucleotide variation loci to obtain a gene-level measure of ASE, even when prior phasing information is unavailable. MBASED is capable of one-sample and two-sample analyses and performs well in simulations. We applied MBASED to a panel of cancer cell lines and paired tumor-normal tissue samples, and observed extensive ASE in cancer, but not normal, samples, mainly driven by genomic copy number alterations.  相似文献   

8.
9.
10.
RNA sequencing (RNA-Seq) is popular for measuring gene expression in non-model organisms, including wild populations. While RNA-Seq can detect gene expression variation among wild-caught individuals and yield important insights into biological function, sampling methods can also affect gene expression estimates. We examined the influence of multiple technical variables on estimated gene expression in a non-model fish, the westslope cutthroat trout (Oncorhynchus clarkii lewisi), using two RNA-Seq library types: 3′ RNA-Seq (QuantSeq) and whole mRNA-Seq (NEB). We evaluated effects of dip netting versus electrofishing, and of harvesting tissue immediately versus 5 min after euthanasia on estimated gene expression in blood, gill, and muscle. We found no significant differences in gene expression between sampling methods or tissue collection times with either library type. When library types were compared using the same blood samples, 58% of genes detected by both NEB and QuantSeq showed significantly different expression between library types, and NEB detected 31% more genes than QuantSeq. Although the two library types recovered different numbers of genes and expression levels, results with NEB and QuantSeq were consistent in that neither library type showed differences in gene expression between sampling methods and tissue harvesting times. Our study suggests that researchers can safely rely on different fish sampling strategies in the field. In addition, while QuantSeq is more cost effective, NEB detects more expressed genes. Therefore, when it is crucial to detect as many genes as possible (especially low expressed genes), when alternative splicing is of interest, or when working with an organism lacking good genomic resources, whole mRNA-Seq is more powerful.  相似文献   

11.
12.
13.
Wang Y  Robbins KR  Rekaya R 《PloS one》2010,5(10):e13239
Assessing conservation/divergence of gene expression across species is important for the understanding of gene regulation evolution. Although advances in microarray technology have provided massive high-dimensional gene expression data, the analysis of such data is still challenging. To date, assessing cross-species conservation of gene expression using microarray data has been mainly based on comparison of expression patterns across corresponding tissues, or comparison of co-expression of a gene with a reference set of genes. Because direct and reliable high-throughput experimental data on conservation of gene expression are often unavailable, the assessment of these two computational models is very challenging and has not been reported yet. In this study, we compared one corresponding tissue based method and three co-expression based methods for assessing conservation of gene expression, in terms of their pair-wise agreements, using a frequently used human-mouse tissue expression dataset. We find that 1) the co-expression based methods are only moderately correlated with the corresponding tissue based methods, 2) the reliability of co-expression based methods is affected by the size of the reference ortholog set, and 3) the corresponding tissue based methods may lose some information for assessing conservation of gene expression. We suggest that the use of either of these two computational models to study the evolution of a gene's expression may be subject to great uncertainty, and the investigation of changes in both gene expression patterns over corresponding tissues and co-expression of the gene with other genes is necessary.  相似文献   

14.
15.
16.
17.
Maximum-likelihood estimation of admixture proportions from genetic data   总被引:9,自引:0,他引:9  
Wang J 《Genetics》2003,164(2):747-765
For an admixed population, an important question is how much genetic contribution comes from each parental population. Several methods have been developed to estimate such admixture proportions, using data on genetic markers sampled from parental and admixed populations. In this study, I propose a likelihood method to estimate jointly the admixture proportions, the genetic drift that occurred to the admixed population and each parental population during the period between the hybridization and sampling events, and the genetic drift in each ancestral population within the interval between their split and hybridization. The results from extensive simulations using various combinations of relevant parameter values show that in general much more accurate and precise estimates of admixture proportions are obtained from the likelihood method than from previous methods. The likelihood method also yields reasonable estimates of genetic drift that occurred to each population, which translate into relative effective sizes (N(e)) or absolute average N(e)'s if the times when the relevant events (such as population split, admixture, and sampling) occurred are known. The proposed likelihood method also has features such as relatively low computational requirement compared with previous ones, flexibility for admixture models, and marker types. In particular, it allows for missing data from a contributing parental population. The method is applied to a human data set and a wolflike canids data set, and the results obtained are discussed in comparison with those from other estimators and from previous studies.  相似文献   

18.
19.
20.
Capture–recapture techniques provide valuable information, but are often more cost‐prohibitive at large spatial and temporal scales than less‐intensive sampling techniques. Model development combining multiple data sources to leverage data source strengths and for improved parameter precision has increased, but with limited discussion on precision gain versus effort. We present a general framework for evaluating trade‐offs between precision gained and costs associated with acquiring multiple data sources, useful for designing future or new phases of current studies.We illustrated how Bayesian hierarchical joint models using detection/non‐detection and banding data can improve abundance, survival, and recruitment inference, and quantified data source costs in a northern Arizona, USA, western bluebird (Sialia mexicana) population. We used an 8‐year detection/non‐detection (distributed across the landscape) and banding (subset of locations within landscape) data set to estimate parameters. We constructed separate models using detection/non‐detection and banding data, and a joint model using both data types to evaluate parameter precision gain relative to effort.Joint model parameter estimates were more precise than single data model estimates, but parameter precision varied (apparent survival > abundance > recruitment). Banding provided greater apparent survival precision than detection/non‐detection data. Therefore, little precision was gained when detection/non‐detection data were added to banding data. Additional costs were minimal; however, additional spatial coverage and ability to estimate abundance and recruitment improved inference. Conversely, more precision was gained when adding banding to detection/non‐detection data at higher cost. Spatial coverage was identical, yet survival and abundance estimates were more precise. Justification of increased costs associated with additional data types depends on project objectives.We illustrate a general framework for evaluating precision gain relative to effort, applicable to joint data models with any data type combination. This framework evaluates costs and benefits from and effort levels between multiple data types, thus improving population monitoring designs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号