首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
We describe methods with enhanced power and specificity to identify genes targeted by somatic copy-number alterations (SCNAs) that drive cancer growth. By separating SCNA profiles into underlying arm-level and focal alterations, we improve the estimation of background rates for each category. We additionally describe a probabilistic method for defining the boundaries of selected-for SCNA regions with user-defined confidence. Here we detail this revised computational approach, GISTIC2.0, and validate its performance in real and simulated datasets.  相似文献   

2.
The accumulation of data on structural variation in cancer genomes provides an opportunity to better understand the mechanisms of genomic alterations and the forces of selection that act upon these alterations in cancer. Here we test evidence supporting the influence of two major forces, spatial chromosome structure and purifying (or negative) selection, on the landscape of somatic copy-number alterations (SCNAs) in cancer. Using a maximum likelihood approach, we compare SCNA maps and three-dimensional genome architecture as determined by genome-wide chromosome conformation capture (HiC) and described by the proposed fractal-globule model. This analysis suggests that the distribution of chromosomal alterations in cancer is spatially related to three-dimensional genomic architecture and that purifying selection, as well as positive selection, influences SCNAs during somatic evolution of cancer cells.  相似文献   

3.
Copy Number Alterations (CNAs) such as deletions and duplications; compose a larger percentage of genetic variations than single nucleotide polymorphisms or other structural variations in cancer genomes that undergo major chromosomal re-arrangements. It is, therefore, imperative to identify cancer-specific somatic copy number alterations (SCNAs), with respect to matched normal tissue, in order to understand their association with the disease. We have devised an accurate, sensitive, and easy-to-use tool, COPS, COpy number using Paired Samples, for detecting SCNAs. We rigorously tested the performance of COPS using short sequence simulated reads at various sizes and coverage of SCNAs, read depths, read lengths and also with real tumor:normal paired samples. We found COPS to perform better in comparison to other known SCNA detection tools for all evaluated parameters, namely, sensitivity (detection of true positives), specificity (detection of false positives) and size accuracy. COPS performed well for sequencing reads of all lengths when used with most upstream read alignment tools. Additionally, by incorporating a downstream boundary segmentation detection tool, the accuracy of SCNA boundaries was further improved. Here, we report an accurate, sensitive and easy to use tool in detecting cancer-specific SCNAs using short-read sequence data. In addition to cancer, COPS can be used for any disease as long as sequence reads from both disease and normal samples from the same individual are available. An added boundary segmentation detection module makes COPS detected SCNA boundaries more specific for the samples studied. COPS is available at ftp://115.119.160.213 with username “cops” and password “cops”.  相似文献   

4.
Cancer genomes exhibit profound somatic copy number alterations (SCNAs). Studying tumor SCNAs using massively parallel sequencing provides unprecedented resolution and meanwhile gives rise to new challenges in data analysis, complicated by tumor aneuploidy and heterogeneity as well as normal cell contamination. While the majority of read depth based methods utilize total sequencing depth alone for SCNA inference, the allele specific signals are undervalued. We proposed a joint segmentation and inference approach using both signals to meet some of the challenges. Our method consists of four major steps: 1) extracting read depth supporting reference and alternative alleles at each SNP/Indel locus and comparing the total read depth and alternative allele proportion between tumor and matched normal sample; 2) performing joint segmentation on the two signal dimensions; 3) correcting the copy number baseline from which the SCNA state is determined; 4) calling SCNA state for each segment based on both signal dimensions. The method is applicable to whole exome/genome sequencing (WES/WGS) as well as SNP array data in a tumor-control study. We applied the method to a dataset containing no SCNAs to test the specificity, created by pairing sequencing replicates of a single HapMap sample as normal/tumor pairs, as well as a large-scale WGS dataset consisting of 88 liver tumors along with adjacent normal tissues. Compared with representative methods, our method demonstrated improved accuracy, scalability to large cancer studies, capability in handling both sequencing and SNP array data, and the potential to improve the estimation of tumor ploidy and purity.  相似文献   

5.
BackgroundWhile RB1 loss initiates retinoblastoma development, additional somatic copy number alterations (SCNAs) can drive tumor progression. Although SCNAs have been identified with good concordance between studies at a cytoband resolution, accurate identification of single genes for all recurrent SCNAs is still challenging. This study presents a comprehensive meta-analysis of genome-wide SCNAs integrated with gene expression profiling data, narrowing down the list of plausible retinoblastoma driver genes.MethodsWe performed SCNA profiling of 45 primary retinoblastoma samples and eight retinoblastoma cell lines by high-resolution microarrays. We combined our data with genomic, clinical and histopathological data of ten published genome-wide SCNA studies, which strongly enhanced the power of our analyses (N = 310).ResultsComprehensive recurrence analysis of SCNAs in all studies integrated with gene expression data allowed us to reduce candidate gene lists for 1q, 2p, 6p, 7q and 13q to a limited gene set. Besides the well-established driver genes RB1 (13q-loss) and MYCN (2p-gain) we identified CRB1 and NEK7 (1q-gain), SOX4 (6p-gain) and NUP205 (7q-gain) as novel retinoblastoma driver candidates. Depending on the sample subset and algorithms used, alternative candidates were identified including MIR181 (1q-gain) and DEK (6p gain). Remarkably, our study showed that copy number gains rarely exceeded change of one copy, even in pure tumor samples with 100% homozygosity at the RB1 locus (N = 34), which is indicative for intra-tumor heterogeneity. In addition, profound between-tumor variability was observed that was associated with age at diagnosis and differentiation grades.InterpretationSince focal alterations at commonly altered chromosome regions were rare except for 2p24.3 (MYCN), further functional validation of the oncogenic potential of the described candidate genes is now required. For further investigations, our study provides a refined and revised set of candidate retinoblastoma driver genes.  相似文献   

6.
The establishment of human chromosomal regions as distinct and characteristic domains has been demonstrated by the reproducible banding patterns observed on metaphase chromosomes as a result of various staining techniques. Although the exact molecular properties responsible for the patterns are not well understood, a general correlation has been established between the time of replication of a particular region of DNA and its banding characteristics. Using a replication timing assay based on fluorescence in situ hybridization patterns, we investigated replication timing properties across chromosomal regions with potentially distinct chromatin properties. Relative replication timing values were determined using cosmid DNA probes around the pseudoautosomal region boundary in Xp22.3 and the cytogenetic band boundary regions surrounding Xp22.2. Although we observed replication timing domains that were generally consistent with cytogenetic banding patterns, we did not find sharp replication timing boundaries at either the pseudoautosomal region boundary or at the cytogenetic band boundaries. Received: 6 September 1997; in revised form: 16 December 1997 / Accepted: 5 January 1998  相似文献   

7.
Genome-wide replication timing studies have suggested that mammalian chromosomes consist of megabase-scale domains of coordinated origin firing separated by large originless transition regions. Here, we report a quantitative genome-wide analysis of DNA replication kinetics in several human cell types that contradicts this view. DNA combing in HeLa cells sorted into four temporal compartments of S phase shows that replication origins are spaced at 40 kb intervals and fire as small clusters whose synchrony increases during S phase and that replication fork velocity (mean 0.7 kb/min, maximum 2.0 kb/min) remains constant and narrowly distributed through S phase. However, multi-scale analysis of a genome-wide replication timing profile shows a broad distribution of replication timing gradients with practically no regions larger than 100 kb replicating at less than 2 kb/min. Therefore, HeLa cells lack large regions of unidirectional fork progression. Temporal transition regions are replicated by sequential activation of origins at a rate that increases during S phase and replication timing gradients are set by the delay and the spacing between successive origin firings rather than by the velocity of single forks. Activation of internal origins in a specific temporal transition region is directly demonstrated by DNA combing of the IGH locus in HeLa cells. Analysis of published origin maps in HeLa cells and published replication timing and DNA combing data in several other cell types corroborate these findings, with the interesting exception of embryonic stem cells where regions of unidirectional fork progression seem more abundant. These results can be explained if origins fire independently of each other but under the control of long-range chromatin structure, or if replication forks progressing from early origins stimulate initiation in nearby unreplicated DNA. These findings shed a new light on the replication timing program of mammalian genomes and provide a general model for their replication kinetics.  相似文献   

8.
9.
Eukaryotic cells must inhibit re-initiation of DNA replication at each of the thousands of origins in their genome because re-initiation can generate genomic alterations with extraordinary frequency. To minimize the probability of re-initiation from so many origins, cells use a battery of regulatory mechanisms that reduce the activity of replication initiation proteins. Given the global nature of these mechanisms, it has been presumed that all origins are inhibited identically. However, origins re-initiate with diverse efficiencies when these mechanisms are disabled, and this diversity cannot be explained by differences in the efficiency or timing of origin initiation during normal S phase replication. This observation raises the possibility of an additional layer of replication control that can differentially regulate re-initiation at distinct origins. We have identified novel genetic elements that are necessary for preferential re-initiation of two origins and sufficient to confer preferential re-initiation on heterologous origins when the control of re-initiation is partially deregulated. The elements do not enhance the S phase timing or efficiency of adjacent origins and thus are specifically acting as re-initiation promoters (RIPs). We have mapped the two RIPs to ∼60 bp AT rich sequences that act in a distance- and sequence-dependent manner. During the induction of re-replication, Mcm2-7 reassociates both with origins that preferentially re-initiate and origins that do not, suggesting that the RIP elements can overcome a block to re-initiation imposed after Mcm2-7 associates with origins. Our findings identify a local level of control in the block to re-initiation. This local control creates a complex genomic landscape of re-replication potential that is revealed when global mechanisms preventing re-replication are compromised. Hence, if re-replication does contribute to genomic alterations, as has been speculated for cancer cells, some regions of the genome may be more susceptible to these alterations than others.  相似文献   

10.
Facioscapulohumeral muscular dystrophy (FSHD) is linked to contraction of an array of tandem 3.3-kb repeats (D4Z4) at 4q35.2 from 11-100 copies to 1-10 copies. The extent to which D4Z4 contraction at 4q35.2 affects overall 4q35.2 chromatin organization remains unclear. Because DNA replication timing is highly predictive of long-range chromatin interactions, we generated genome-wide replication-timing profiles for FSHD and control myogenic precursor cells. We compared non-immortalized myoblasts from four FSHD patients and three control individuals to each other and to a variety of other human cell types. This study also represents the first genome-wide comparison of replication timing profiles in non-immortalized human cell cultures. Myoblasts from both control and FSHD individuals all shared a myoblast-specific replication profile. In contrast, male and female individuals were readily distinguished by monoallelic differences in replication timing at DXZ4 and other regions across the X chromosome affected by X inactivation. We conclude that replication timing is a robust cell-type specific feature that is unaffected by FSHD-related D4Z4 contraction.  相似文献   

11.
A wavelet transform of the DNA "walk" constructed from a genomic sequence offers a direct visualization of short and long-range patterns in nucleotide sequences. We study sequences that encode diverse biological functions, taken from a variety of genomes. Pattern irregularities in the transform are frequently associated with sequences of biological interest. Exonic regions, for example, visualize differently under wavelet analysis than introns, and ribosomal RNA regions display distinct universal signatures. DNA walk wavelet analysis can provide a sensitive and rapid assessment of the putative biological significance of genomic DNA.  相似文献   

12.
Recent evidence suggests that the timing of DNA replication is coordinated across megabase-scale domains in metazoan genomes, yet the importance of this aspect of genome organization is unclear. Here we show that replication timing is remarkably conserved between human and mouse, uncovering large regions that may have been governed by similar replication dynamics since these species have diverged. This conservation is both tissue-specific and independent of the genomic G+C content conservation. Moreover, we show that time of replication is globally conserved despite numerous large-scale genome rearrangements. We systematically identify rearrangement fusion points and demonstrate that replication time can be locally diverged at these loci. Conversely, rearrangements are shown to be correlated with early replication and physical chromosomal proximity. These results suggest that large chromosomal domains of coordinated replication are shuffled by evolution while conserving the large-scale nuclear architecture of the genome.  相似文献   

13.
14.
Razin SV 《Genetika》2003,39(2):173-181
In this review, of problems concerning initiation of DNA replication in higher eukaryotes is discussed, with special emphasis on the methods of replication origin mapping and biological tests for the activity of DNA replication origins in higher eukaryotes. Protein factors interacting with replication origins are considered in detail. The main events of replication initiation in higher eukaryotes are briefly analyzed. New data on the control of replication timing of large genomic regions are discussed.  相似文献   

15.

Background

Somatic copy number alternations (SCNAs) can be utilized to infer tumor subclonal populations in whole genome seuqncing studies, where usually their read count ratios between tumor-normal paired samples serve as the inferring proxy. Existing SCNA based subclonal population inferring tools consider the GC bias of tumor and normal sample is of the same fature, and could be fully offset by read count ratio. However, we found that, the read count ratio on SCNA segments presents a Log linear biased pattern, which influence existing read count ratios based subclonal inferring tools performance. Currently no correction tools take into account the read ratio bias.

Results

We present Pre-SCNAClonal, a tool that improving tumor subclonal population inferring by correcting GC-bias at SCNAs level. Pre-SCNAClonal first corrects GC bias using Markov chain Monte Carlo probability model, then accurately locates baseline DNA segments (not containing any SCNAs) with a hierarchy clustering model. We show Pre-SCNAClonal’s superiority to exsiting GC-bias correction methods at any level of subclonal population.

Conclusions

Pre-SCNAClonal could be run independently as well as serving as pre-processing/gc-correction step in conjuntion with exsiting SCNA-based subclonal inferring tools.
  相似文献   

16.
In this review, the problems concerning initiation of DNA replication in higher eukaryotes are discussed, with special emphasis on the methods of replication origin mapping and biological tests for the activity of DNA replication origins in higher eukaryotes. Protein factors interacting with replication origins are considered in detail. The main events of replication initiation in higher eukaryotes are briefly analyzed. New data on the control of replication timing of large genomic regions are discussed.  相似文献   

17.
Watanabe Y  Tenzen T  Nagasaka Y  Inoko H  Ikemura T 《Gene》2000,252(1-2):163-172
The human genome is composed of long-range G+C% mosaic structures, which are thought to be related to chromosome bands. Replication timing during S phase is associated with chromosomal band zones; thus, band boundaries are thought to correspond to regions where replication timing switches. The proximal limit of the human X-inactivation center (XIC) has been localized cytologically to the junction zone between Xq13.1 and Xq13.2. Using PCR-based quantification of the newly replicated DNA from cell-cycle fractionated THP-1 cells, the replication timing in and around the XIC was determined at the genome sequence level. We found two regions where replication timing changes from the early to late period during S phase. One is located near a large inverted duplication proximal to the XIC, and the other is near the XIST locus. We propose that the 1Mb late-replicated zone (from the large inverted duplication to XIST) corresponds to a G-band Xq13.2. Several common characteristics were observed in the XIST region and the MHC class II-III junction which was previously defined as a band boundary. These characteristics included differential high-density clustering of Alu and LINE repeats, and the presence of polypurine/polypyrimidine tracts, MER41A, MER57 and MER58B.  相似文献   

18.
Although chromatin folding is known to be of functional importance to control the gene expression program, less is known regarding its interplay with DNA replication. Here, using Circular Chromatin Conformation Capture combined with high-throughput sequencing, we identified megabase-sized self-interacting domains in the nucleus of a human lymphoblastoid cell line, as well as in cycling and resting peripheral blood mononuclear cells (PBMC). Strikingly, the boundaries of those domains coincide with early-initiation zones in every cell types. Preferential interactions have been observed between the consecutive early-initiation zones, but also between those separated by several tens of megabases. Thus, the 3D conformation of chromatin is strongly correlated with the replication timing along the whole chromosome. We furthermore provide direct clues that, in addition to the timing value per se, the shape of the timing profile at a given locus defines its set of genomic contacts. As this timing-related scheme of chromatin organization exists in lymphoblastoid cells, resting and cycling PBMC, this indicates that it is maintained several weeks or months after the previous S-phase. Lastly, our work highlights that the major chromatin changes accompanying PBMC entry into cell cycle occur while keeping largely unchanged the long-range chromatin contacts.  相似文献   

19.
Establishing how mammalian chromosome replication is regulated and how groups of replication origins are organized into replication bands will significantly increase our understanding of chromosome organization. Replication time bands in mammalian chromosomes show overall congruency with structural R- and G-banding patterns as revealed by different chromosome banding techniques. Thus, chromosome bands reflect variations in the longitudinal structure and function of the chromosome, but little is known about the structural basis of the metaphase chromosome banding pattern. At the microscopic level, both structural R and G bands and replication bands occupy discrete domains along chromosomes, suggesting separation by distinct boundaries. The purpose of this study was to determine replication timing differences encompassing a boundary between differentially replicating chromosomal bands. Using competitive PCR on replicated DNA from flow-sorted cell cycle fractions, we have analyzed the replication timing of markers spanning roughly 5 Mb of human chromosome 13q14.3/q21.1. This is only the second report of high-resolution analysis of replication timing differences across an R/G-band boundary. In contrast to previous work, however, we find that band boundaries are defined by a gradient in replication timing rather than by a sharp boundary separating R and G bands into functionally distinct chromatin compartments. These findings indicate that topographical band boundaries are not defined by specific sequences or structures.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号