首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Chromosomes are giant chain molecules organized into an ensemble of three-dimensional structures characterized with its genomic state and the corresponding biological functions. Despite the strong cell-to-cell heterogeneity, the cell-type specific pattern demonstrated in high-throughput chromosome conformation capture (Hi-C) data hints at a valuable link between structure and function, which makes inference of chromatin domains (CDs) from the pattern of Hi-C a central problem in genome research. Here we present a unified method for analyzing Hi-C data to determine spatial organization of CDs over multiple genomic scales. By applying statistical physics-based clustering analysis to a polymer physics model of the chromosome, our method identifies the CDs that best represent the global pattern of correlation manifested in Hi-C. The multi-scale intra-chromosomal structures compared across different cell types uncover the principles underlying the multi-scale organization of chromatin chain: (i) Sub-TADs, TADs, and meta-TADs constitute a robust hierarchical structure. (ii) The assemblies of compartments and TAD-based domains are governed by different organizational principles. (iii) Sub-TADs are the common building blocks of chromosome architecture. Our physically principled interpretation and analysis of Hi-C not only offer an accurate and quantitative view of multi-scale chromatin organization but also help decipher its connections with genome function.  相似文献   

2.
The recently developed Hi-C technique has been widely applied to map genome-wide chromatin interactions. However, current methods for analyzing diploid Hi-C data cannot fully distinguish between homologous chromosomes. Consequently, the existing diploid Hi-C analyses are based on sparse and inaccurate allele-specific contact matrices, which might lead to incorrect modeling of diploid genome architecture. Here we present ASHIC, a hierarchical Bayesian framework to model allele-specific chromatin organizations in diploid genomes. We developed two models under the Bayesian framework: the Poisson-multinomial (ASHIC-PM) model and the zero-inflated Poisson-multinomial (ASHIC-ZIPM) model. The proposed ASHIC methods impute allele-specific contact maps from diploid Hi-C data and simultaneously infer allelic 3D structures. Through simulation studies, we demonstrated that ASHIC methods outperformed existing approaches, especially under low coverage and low SNP density conditions. Additionally, in the analyses of diploid Hi-C datasets in mouse and human, our ASHIC-ZIPM method produced fine-resolution diploid chromatin maps and 3D structures and provided insights into the allelic chromatin organizations and functions. To summarize, our work provides a statistically rigorous framework for investigating fine-scale allele-specific chromatin conformations. The ASHIC software is publicly available at https://github.com/wmalab/ASHIC.  相似文献   

3.
4.
《Biophysical journal》2020,118(9):2220-2228
The one-dimensional information of genomic DNA is hierarchically packed inside the eukaryotic cell nucleus and organized in a three-dimensional (3D) space. Genome-wide chromosome conformation capture (Hi-C) methods have uncovered the 3D genome organization and revealed multiscale chromatin domains of compartments and topologically associating domains (TADs). Moreover, single-nucleosome live-cell imaging experiments have revealed the dynamic organization of chromatin domains caused by stochastic thermal fluctuations. However, the mechanism underlying the dynamic regulation of such hierarchical and structural chromatin units within the microscale thermal medium remains unclear. Microrheology is a way to measure dynamic viscoelastic properties coupling between thermal microenvironment and mechanical response. Here, we propose a new, to our knowledge, microrheology for Hi-C data to analyze the dynamic compliance property as a measure of rigidness and flexibility of genomic regions along with the time evolution. Our method allows the conversion of an Hi-C matrix into the spectrum of the dynamic rheological property along the genomic coordinate of a single chromosome. To demonstrate the power of the technique, we analyzed Hi-C data during the neural differentiation of mouse embryonic stem cells. We found that TAD boundaries behave as more rigid nodes than the intra-TAD regions. The spectrum clearly shows the dynamic viscoelasticity of chromatin domain formation at different timescales. Furthermore, we characterized the appearance of synchronous and liquid-like intercompartment interactions in differentiated cells. Together, our microrheology data derived from Hi-C data provide physical insights into the dynamics of the 3D genome organization.  相似文献   

5.
Single-cell Hi-C (scHi-C) sequencing technologies allow us to investigate three-dimensional chromatin organization at the single-cell level. However, we still need computational tools to deal with the sparsity of the contact maps from single cells and embed single cells in a lower-dimensional Euclidean space. This embedding helps us understand relationships between the cells in different dimensions, such as cell-cycle dynamics and cell differentiation. We present an open-source computational toolbox, scHiCTools, for analyzing single-cell Hi-C data comprehensively and efficiently. The toolbox provides two methods for screening single cells, three common methods for smoothing scHi-C data, three efficient methods for calculating the pairwise similarity of cells, three methods for embedding single cells, three methods for clustering cells, and a build-in function to visualize the cells embedding in a two-dimensional or three-dimensional plot. scHiCTools, written in Python3, is compatible with different platforms, including Linux, macOS, and Windows.  相似文献   

6.
Hi-C data provide population averaged estimates of three-dimensional chromatin contacts across cell types and states in bulk samples. Effective analysis of Hi-C data entails controlling for the potential confounding factor of differential cell type proportions across heterogeneous bulk samples. We propose a novel unsupervised deconvolution method for inferring cell type composition from bulk Hi-C data, the Two-step Hi-c UNsupervised DEconvolution appRoach (THUNDER). We conducted extensive simulations to test THUNDER based on combining two published single-cell Hi-C (scHi-C) datasets. THUNDER more accurately estimates the underlying cell type proportions compared to reference-free methods (e.g., TOAST, and NMF) and is more robust than reference-dependent methods (e.g. MuSiC). We further demonstrate the practical utility of THUNDER to estimate cell type proportions and identify cell-type-specific interactions in Hi-C data from adult human cortex tissue samples. THUNDER will be a useful tool in adjusting for varying cell type composition in population samples, facilitating valid and more powerful downstream analysis such as differential chromatin organization studies. Additionally, THUNDER estimated contact profiles provide a useful exploratory framework to investigate cell-type-specificity of the chromatin interactome while experimental data is still rare.  相似文献   

7.
The spatial structure of the orderly organized chromatin in the nucleus has important roles in maintaining normal cell function and in regulation of gene expression, and the high-throughput Hi-C and Ch IA-PET methods have been widely used in various biological studies for determining potential spatial genome structures and their functions. However, there are still great difficulties and challenges in three-dimensional(3D) genomics research. More efficient, economical, and unbiased approaches to studying 3D genomics need to be developed for more widespread and easier applications. Here, we review the most recent studies on new 3D genomics research technologies, such as improvements of the traditional Hi-C and Ch IA-PET methods, new approaches based on non-proximal-ligation strategies, and imaging-based methods improved in recent years. Especially, we review the CRISPR-based methods for functional validations in 3D genomics, which could be the forthcoming directions. We hope this review can show some insights into the potential improvements for future 3D genomics.  相似文献   

8.
9.
Defining the target genes of distal regulatory elements (DREs), such as enhancer, repressors and insulators, is a challenging task. The recently developed Hi-C technology is designed to capture chromosome conformation structure by high-throughput sequencing, and can be potentially used to determine the target genes of DREs. However, Hi-C data are noisy, making it difficult to directly use Hi-C data to identify DRE–target gene relationships. In this study, we show that DREs–gene pairs that are confirmed by Hi-C data are strongly phylogenetic correlated, and have thus developed a method that combines Hi-C read counts with phylogenetic correlation to predict long-range DRE–target gene relationships. Analysis of predicted DRE–target gene pairs shows that genes regulated by large number of DREs tend to have essential functions, and genes regulated by the same DREs tend to be functionally related and co-expressed. In addition, we show with a couple of examples that the predicted target genes of DREs can help explain the causal roles of disease-associated single-nucleotide polymorphisms located in the DREs. As such, these predictions will be of importance not only for our understanding of the function of DREs but also for elucidating the causal roles of disease-associated noncoding single-nucleotide polymorphisms.  相似文献   

10.
Over the last decade the 3C-based (Chromosome Conformation Capture, 3C) approaches have been developed to describe the frequency of chromatin interaction. The invention of Hi-C allows us to obtain genome-wide chromatin interaction map. However, it is challenging to develop efficient and robust analytical tools to interpret the Hi-C data. Here we present a new method called Clustering based Hi-C Domain Finder (CHDF), which is based on the difference of interaction intensity inside/outside domains, to identify Hi-C domains. We also compared CHDF with existing methods including Direction Index (DI) and HiCseg. CHDF can define more chromatin domains validated by higher resolution local chromatin structure data (Chromosome Conformation Capture Carbon Copy (5C) data). Using Hi-C data of lower sequencing depth, chromatin structure identified by CHDF is closer to that discovered by data of higher sequencing depth. Furthermore, the implement of CHDF is faster than the other two. Using CHDF, we are potentially able to discover more hints and clues about chromatin structural elements at domain level.  相似文献   

11.
In recent years, much effort has been devoted to understanding the three-dimensional (3D) organization of the genome and how genomic structure mediates nuclear function. The development of experimental techniques that combine DNA proximity ligation with high-throughput sequencing, such as Hi-C, have substantially improved our knowledge about chromatin organization. Numerous experimental advancements, not only utilizing DNA proximity ligation but also high-resolution genome imaging (DNA tracing), have required theoretical modeling to determine the structural ensembles consistent with such data. These 3D polymer models of the genome provide an understanding of the physical mechanisms governing genome architecture. Here, we present an overview of the recent advances in modeling the ensemble of 3D chromosomal structures by employing the maximum entropy approach combined with polymer physics. Particularly, we discuss the minimal chromatin model (MiChroM) along with the “maximum entropy genomic annotations from biomarkers associated with structural ensembles” (MEGABASE) model, which have been remarkably successful in the accurate modeling of chromosomes consistent with both Hi-C and DNA-tracing data.  相似文献   

12.
基因组三维结构在基因表达调控中发挥重要作用,染色质拓扑关联结构域(topologically associated domain,TAD)是DNA复制和基因转录的基本功能单位,也是DNA损伤修复的功能单元,在辐射诱导的DNA损伤修复中发挥重要作用。近期研究表明,TAD并非是完全独立的结构单元,其内部常呈现多层级结构,对基因表达具有重要调控作用。为探究TAD多层级结构在细胞辐射响应中的作用,本研究使用TAD层级结构识别算法OnTAD对Gene expression omnibus数据库中5Gy X射线照射的淋巴细胞、成纤维细胞和毛细血管扩张性共济失调突变(ataxia telangiectasia mutated,ATM)基因缺陷的成纤维细胞,共26个样本的Hi-C(high-through chromosome conformation capture,Hi-C)数据进行分析,发现辐射后细胞的TAD层级结构出现规律性变化,高层级TAD缺失较多,低层级TAD相对保守;辐射诱导的TAD层级结构变化通过调节基因表达参与细胞辐射响应;ATM是辐射诱导TAD层级结构变化和恢复的重要因子。本研究为从TAD多层级结构角度理解基因组三维结构在细胞辐射响应中的作用提供了新思路。  相似文献   

13.

Background

The study of nuclear architecture using Chromosome Conformation Capture (3C) technologies is a novel frontier in biology. With further reduction in sequencing costs, the potential of Hi-C in describing nuclear architecture as a phenotype is only about to unfold. To use Hi-C for phenotypic comparisons among different cell types, conditions, or genetic backgrounds, Hi-C data processing needs to be more accessible to biologists.

Results

HiCdat provides a simple graphical user interface for data pre-processing and a collection of higher-level data analysis tools implemented in R. Data pre-processing also supports a wide range of additional data types required for in-depth analysis of the Hi-C data (e.g. RNA-Seq, ChIP-Seq, and BS-Seq).

Conclusions

HiCdat is easy-to-use and provides solutions starting from aligned reads up to in-depth analyses. Importantly, HiCdat is focussed on the analysis of larger structural features of chromosomes, their correlation to genomic and epigenomic features, and on comparative studies. It uses simple input and output formats and can therefore easily be integrated into existing workflows or combined with alternative tools.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0678-x) contains supplementary material, which is available to authorized users.  相似文献   

14.
HiChIP and PLAC-Seq are emerging technologies for studying genome-wide long-range chromatin interactions mediated by the protein of interest, enabling more sensitive and cost-efficient interrogation of protein-centric chromatin conformation. However, due to the unbalanced read distribution introduced by protein immunoprecipitation, existing reproducibility measures developed for Hi-C data are not appropriate for the analysis of HiChIP and PLAC-Seq data. Here, we present HPRep, a stratified and weighted correlation metric derived from normalized contact counts, to quantify reproducibility in HiChIP and PLAC-Seq data. We applied HPRep to multiple real datasets and demonstrate that HPRep outperforms existing reproducibility measures developed for Hi-C data. Specifically, we applied HPRep to H3K4me3 PLAC-Seq data from mouse embryonic stem cells and mouse brain tissues as well as H3K27ac HiChIP data from human lymphoblastoid cell line GM12878 and leukemia cell line K562, showing that HPRep can more clearly separate among pseudo-replicates, real replicates, and non-replicates. Furthermore, in an H3K4me3 PLAC-Seq dataset consisting of 11 samples from four human brain cell types, HPRep demonstrated the expected clustering of data that could not be achieved by existing methods developed for Hi-C data, highlighting the need for a reproducibility metric tailored to HiChIP and PLAC-Seq data.  相似文献   

15.
Genome-wide mapping of three dimensional chromatin organization is an important yet technically challenging task. To aid experimental effort and to understand the determinants of long-range chromatin interactions, we have developed a computational model integrating Hi-C and histone mark ChIP-seq data to predict two important features of chromatin organization: chromatin interaction hubs and topologically associated domain (TAD) boundaries. Our model accurately and robustly predicts these features across datasets and cell types. Cell-type specific histone mark information is required for prediction of chromatin interaction hubs but not for TAD boundaries. Our predictions provide a useful guide for the exploration of chromatin organization.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-015-0740-z) contains supplementary material, which is available to authorized users.  相似文献   

16.
While the reciprocal effects of ecological and evolutionary dynamics are increasingly recognized as an important driver for biodiversity, detection of such eco-evolutionary feedbacks, their underlying mechanisms, and their consequences remains challenging. Eco-evolutionary dynamics occur at different spatial and temporal scales and can leave signatures at different levels of organization (e.g., gene, protein, trait, community) that are often difficult to detect. Recent advances in statistical methods combined with alternative hypothesis testing provides a promising approach to identify potential eco-evolutionary drivers for observed data even in non-model systems that are not amenable to experimental manipulation. We discuss recent advances in eco-evolutionary modeling and statistical methods and discuss challenges for fitting mechanistic models to eco-evolutionary data.  相似文献   

17.
18.
Hi-C is a genome-wide chromosome conformation capture technology that detects interactions between pairs of genomic regions and exploits higher order chromatin structures. Conceptually Hi-C data counts interaction frequencies between every position in the genome and every other position. Biologically functional interactions are expected to occur more frequently than transient background and artefactual interactions. To identify biologically relevant interactions, several background models that take biases such as distance, GC content and mappability into account have been proposed. Here we introduce MaxHiC, a background correction tool that deals with these complex biases and robustly identifies statistically significant interactions in both Hi-C and capture Hi-C experiments. MaxHiC uses a negative binomial distribution model and a maximum likelihood technique to correct biases in both Hi-C and capture Hi-C libraries. We systematically benchmark MaxHiC against major Hi-C background correction tools including Hi-C significant interaction callers (SIC) and Hi-C loop callers using published Hi-C, capture Hi-C, and Micro-C datasets. Our results demonstrate that 1) Interacting regions identified by MaxHiC have significantly greater levels of overlap with known regulatory features (e.g. active chromatin histone marks, CTCF binding sites, DNase sensitivity) and also disease-associated genome-wide association SNPs than those identified by currently existing models, 2) the pairs of interacting regions are more likely to be linked by eQTL pairs and 3) more likely to link known regulatory features including known functional enhancer-promoter pairs validated by CRISPRi than any of the existing methods. We also demonstrate that interactions between different genomic region types have distinct distance distributions only revealed by MaxHiC. MaxHiC is publicly available as a python package for the analysis of Hi-C, capture Hi-C and Micro-C data.  相似文献   

19.
Genome-wide chromosome conformation capture (3C)-based high-throughput sequencing (Hi-C) has enabled identification of genome-wide chromatin loops. Because the Hi-C map with restriction fragment resolution is intrinsically associated with sparsity and stochastic noise, Hi-C data are usually binned at particular intervals; however, the binning method has limited reliability, especially at high resolution. Here, we describe a new method called HiCORE, which provides simple pipelines and algorithms to overcome the limitations of single-layered binning and predict core chromatin regions with three-dimensional physical interactions. In this approach, multiple layers of binning with slightly shifted genome coverage are generated, and interacting bins at each layer are integrated to infer narrower regions of chromatin interactions. HiCORE predicts chromatin looping regions with higher resolution, both in human and Arabidopsis genomes, and contributes to the identification of the precise positions of potential genomic elements in an unbiased manner.  相似文献   

20.
Significant efforts have been recently made to obtain the three-dimensional (3D) structure of the genome with the goal of understanding how structures may affect gene regulation and expression. Chromosome conformational capture techniques such as Hi-C, have been key in uncovering the quantitative information needed to determine chromatin organization. Complementing these experimental tools, co-polymers theoretical methods are necessary to determine the ensemble of three-dimensional structures associated to the experimental data provided by Hi-C maps. Going beyond just structural information, these theoretical advances also start to provide an understanding of the underlying mechanisms governing genome assembly and function. Recent theoretical work, however, has been focused on single chromosome structures, missing the fact that, in the full nucleus, interactions between chromosomes play a central role in their organization. To overcome this limitation, MiChroM (Minimal Chromatin Model) has been modified to become capable of performing these multi-chromosome simulations. It has been upgraded into a fast and scalable software version, which is able to perform chromosome simulations using GPUs via OpenMM Python API, called Open-MiChroM. To validate the efficiency of this new version, analyses for GM12878 individual autosomes were performed and compared to earlier studies. This validation was followed by multi-chain simulations including the four largest human chromosomes (C1-C4). These simulations demonstrated the full power of this new approach. Comparison to Hi-C data shows that these multiple chromosome interactions are essential for a more accurate agreement with experimental results. Without any changes to the original MiChroM potential, it is now possible to predict experimentally observed inter-chromosome contacts. This scalability of Open-MiChroM allow for more audacious investigations, looking at interactions of multiple chains as well as moving towards higher resolution chromosomes models.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号