首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Deviations in the amount of genomic content that arise during tumorigenesis, called copy number alterations, are structural rearrangements that can critically affect gene expression patterns. Additionally, copy number alteration profiles allow insight into cancer discrimination, progression and complexity. On data obtained from high-throughput sequencing, improving quality through GC bias correction and keeping false positives to a minimum help build reliable copy number alteration profiles.

Results

We introduce seqCNA, a parallelized R package for an integral copy number analysis of high-throughput sequencing cancer data. The package includes novel methodology on (i) filtering, reducing false positives, and (ii) GC content correction, improving copy number profile quality, especially under great read coverage and high correlation between GC content and copy number. Adequate analysis steps are automatically chosen based on availability of paired-end mapping, matched normal samples and genome annotation.

Conclusions

seqCNA, available through Bioconductor, provides accurate copy number predictions in tumoural data, thanks to the extensive filtering and better GC bias correction, while providing an integrated and parallelized workflow.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-178) contains supplementary material, which is available to authorized users.  相似文献   

2.

Background

Domestic goats (Capra hircus) have been selected to play an essential role in agricultural production systems, since being domesticated from their wild progenitor, bezoar (Capra aegagrus). A detailed understanding of the genetic consequences imparted by the domestication process remains a key goal of evolutionary genomics.

Results

We constructed the reference genome of bezoar and sequenced representative breeds of domestic goats to search for genomic changes that likely have accompanied goat domestication and breed formation. Thirteen copy number variation genes associated with coat color were identified in domestic goats, among which ASIP gene duplication contributes to the generation of light coat-color phenotype in domestic goats. Analysis of rapidly evolving genes identified genic changes underlying behavior-related traits, immune response and production-related traits.

Conclusion

Based on the comparison studies of copy number variation genes and rapidly evolving genes between wild and domestic goat, our findings and methodology shed light on the genetic mechanism of animal domestication and will facilitate future goat breeding.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1606-1) contains supplementary material, which is available to authorized users.  相似文献   

3.

Background

Urothelial bladder cancer is a highly heterogeneous disease. Cancer cell lines are useful tools for its study. This is a comprehensive genomic characterization of 40 urothelial bladder carcinoma (UBC) cell lines including information on origin, mutation status of genes implicated in bladder cancer (FGFR3, PIK3CA, TP53, and RAS), copy number alterations assessed using high density SNP arrays, uniparental disomy (UPD) events, and gene expression.

Results

Based on gene mutation patterns and genomic changes we identify lines representative of the FGFR3-driven tumor pathway and of the TP53/RB tumor suppressor-driven pathway. High-density array copy number analysis identified significant focal gains (1q32, 5p13.1-12, 7q11, and 7q33) and losses (i.e. 6p22.1) in regions altered in tumors but not previously described as affected in bladder cell lines. We also identify new evidence for frequent regions of UPD, often coinciding with regions reported to be lost in tumors. Previously undescribed chromosome X losses found in UBC lines also point to potential tumor suppressor genes. Cell lines representative of the FGFR3-driven pathway showed a lower number of UPD events.

Conclusions

Overall, there is a predominance of more aggressive tumor subtypes among the cell lines. We provide a cell line classification that establishes their relatedness to the major molecularly-defined bladder tumor subtypes. The compiled information should serve as a useful reference to the bladder cancer research community and should help to select cell lines appropriate for the functional analysis of bladder cancer genes, for example those being identified through massive parallel sequencing.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1450-3) contains supplementary material, which is available to authorized users.  相似文献   

4.
5.

Background

To determine which changes in the host cell genome are crucial for cervical carcinogenesis, a longitudinal in vitro model system of HPV-transformed keratinocytes was profiled in a genome-wide manner. Four cell lines affected with either HPV16 or HPV18 were assayed at 8 sequential time points for gene expression (mRNA) and gene copy number (DNA) using high-resolution microarrays. Available methods for temporal differential expression analysis are not designed for integrative genomic studies.

Results

Here, we present a method that allows for the identification of differential gene expression associated with DNA copy number changes over time. The temporal variation in gene expression is described by a generalized linear mixed model employing low-rank thin-plate splines. Model parameters are estimated with an empirical Bayes procedure, which exploits integrated nested Laplace approximation for fast computation. Iteratively, posteriors of hyperparameters and model parameters are estimated. The empirical Bayes procedure shrinks multiple dispersion-related parameters. Shrinkage leads to more stable estimates of the model parameters, better control of false positives and improvement of reproducibility. In addition, to make estimates of the DNA copy number more stable, model parameters are also estimated in a multivariate way using triplets of features, imposing a spatial prior for the copy number effect.

Conclusion

With the proposed method for analysis of time-course multilevel molecular data, more profound insight may be gained through the identification of temporal differential expression induced by DNA copy number abnormalities. In particular, in the analysis of an integrative oncogenomics study with a time-course set-up our method finds genes previously reported to be involved in cervical carcinogenesis. Furthermore, the proposed method yields improvements in sensitivity, specificity and reproducibility compared to existing methods. Finally, the proposed method is able to handle count (RNAseq) data from time course experiments as is shown on a real data set.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-327) contains supplementary material, which is available to authorized users.  相似文献   

6.

Background

The detection and functional characterization of genomic structural variations are important for understanding the landscape of genetic variation in the chicken. A recently recognized aspect of genomic structural variation, called copy number variation (CNV), is gaining interest in chicken genomic studies. The aim of the present study was to investigate the pattern and functional characterization of CNVs in five characteristic chicken breeds, which will be important for future studies associating phenotype with chicken genome architecture.

Results

Using a commercial 385 K array-based comparative genomic hybridization (aCGH) genome array, we performed CNV discovery using 10 chicken samples from four local Chinese breeds and the French breed Houdan chicken. The female Anka broiler was used as a reference. A total of 281 copy number variation regions (CNVR) were identified, covering 12.8 Mb of polymorphic sequences or 1.07% of the entire chicken genome. The functional annotation of CNVRs indicated that these regions completely or partially overlapped with 231 genes and 1032 quantitative traits loci, suggesting these CNVs have important functions and might be promising resources for exploring differences among various breeds. In addition, we employed quantitative PCR (qPCR) to further validate several copy number variable genes, such as prolactin receptor, endothelin 3 (EDN3), suppressor of cytokine signaling 2, CD8a molecule, with important functions, and the results suggested that EDN3 might be a molecular marker for the selection of dark skin color in poultry production. Moreover, we also identified a new CNVR (chr24: 3484617–3512275), encoding the sortilin-related receptor gene, with copy number changes in only black-bone chicken.

Conclusions

Here, we report a genome-wide analysis of the CNVs in five chicken breeds using aCGH. The association between EDN3 and melanoblast proliferation was further confirmed using qPCR. These results provide additional information for understanding genomic variation and related phenotypic characteristics.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-934) contains supplementary material, which is available to authorized users.  相似文献   

7.
8.

Background

Copy number variations (CNVs) confer significant effects on genetic innovation and phenotypic variation. Previous CNV studies in swine seldom focused on in-depth characterization of global CNVs.

Results

Using whole-genome assembly comparison (WGAC) and whole-genome shotgun sequence detection (WSSD) approaches by next generation sequencing (NGS), we probed formation signatures of both segmental duplications (SDs) and individualized CNVs in an integrated fashion, building the finest resolution CNV and SD maps of pigs so far. We obtained copy number estimates of all protein-coding genes with copy number variation carried by individuals, and further confirmed two genes with high copy numbers in Meishan pigs through an enlarged population. We determined genome-wide CNV hotspots, which were significantly enriched in SD regions, suggesting evolution of CNV hotspots may be affected by ancestral SDs. Through systematically enrichment analyses based on simulations and bioinformatics analyses, we revealed CNV-related genes undergo a different selective constraint from those CNV-unrelated regions, and CNVs may be associated with or affect pig health and production performance under recent selection.

Conclusions

Our studies lay out one way for characterization of CNVs in the pig genome, provide insight into the pig genome variation and prompt CNV mechanisms studies when using pigs as biomedical models for human diseases.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-593) contains supplementary material, which is available to authorized users.  相似文献   

9.
10.

Background

Wheat is an excellent plant species for nuclear mitochondrial interaction studies due to availability of large collection of alloplasmic lines. These lines exhibit different vegetative and physiological properties than their parents. To investigate the level of sequence changes introduced into the mitochondrial genome under the alloplasmic condition, three mitochondrial genomes of the Triticum-Aegilops species were sequenced: 1) durum alloplasmic line with the Ae. longissima cytoplasm that carries the T. turgidum nucleus designated as (lo) durum, 2) the cytoplasmic donor line, and 3) the nuclear donor line.

Results

The mitochondrial genome of the T. turgidum was 451,678 bp in length with high structural and nucleotide identity to the previously characterized T. aestivum genome. The assembled mitochondrial genome of the (lo) durum and the Ae. longissima were 431,959 bp and 399,005 bp in size, respectively. The high sequence coverage for all three genomes allowed analysis of heteroplasmy within each genome. The mitochondrial genome structure in the alloplasmic line was genetically distant from both maternal and paternal genomes. The alloplasmic durum and the Ae. longissima carry the same versions of atp6, nad6, rps19-p, cob and cox2 exon 2 which are different from the T. turgidum parent. Evidence of paternal leakage was also observed by analyzing nad9 and orf359 among all three lines. Nucleotide search identified a number of open reading frames, of which 27 were specific to the (lo) durum line.

Conclusions

Several heteroplasmic regions were observed within genes and intergenic regions of the mitochondrial genomes of all three lines. The number of rearrangements and nucleotide changes in the mitochondrial genome of the alloplasmic line that have occurred in less than half a century was significant considering the high sequence conservation between the T. turgidum and the T. aestivum that diverged from each other 10,000 years ago. We showed that the changes in genes were not limited to paternal leakage but were sufficiently significant to suggest that other mechanisms, such as recombination and mutation, were responsible. The newly formed ORFs, differences in gene sequences and copy numbers, heteroplasmy, and substoichiometric changes show the potential of the alloplasmic condition to accelerate evolution towards forming new mitochondrial genomes.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-67) contains supplementary material, which is available to authorized users.  相似文献   

11.

Background

Survival outcomes for patients with osteosarcoma (OS) have remained stagnant over the past three decades. Insulin-like growth factor 1 receptor (IGF1R) is over-expressed in a number of malignancies, and anti-IGF1R antibodies have and are currently being studied in clinical trials. Understanding the molecular aberrations which result in increased tumor response to anti-IGF1R therapy could allow for the selection of patients most likely to benefit from IGF1R targeted therapy.

Methods

IGF1R mRNA expression was assessed by RT PCR in OS patient primary tumors, cell lines, and xenograft tumors. IGF1R copy number was assessed by 3 approaches: PCR, FISH, and dot blot analysis. Exons 1–20 of IGF1R were sequenced in xenograft tumors and 87 primary OS tumors, and surface expression of IGF1R was assessed by flow cytometry. Levels of mRNA and protein expression, copy number, and mutation status were compared with tumor response to anti-IGF1R antibody therapy in 4 OS xenograft models.

Results

IGF1R mRNA is expressed in OS. Primary patient samples and xenograft samples had higher mRNA expression and copy number compared with corresponding cell lines. IGF1R mRNA expression, cell surface expression, copy number, and mutation status were not associated with tumor responsiveness to anti-IGF1R antibody therapy.

Conclusions

IGF1R is expressed in OS, however, no clear molecular markers predict response to IGF1R antibody-mediated therapy. Additional pre-clinical studies assessing potential predictive biomarkers and investigating targetable molecular pathways critical to the proliferation of OS cells are needed.  相似文献   

12.

Background

Lateral gene transfer (LGT) from bacterial Wolbachia endosymbionts has been detected in ~20% of arthropod and nematode genome sequencing projects. Many of these transfers are large and contain a substantial part of the Wolbachia genome.

Results

Here, we re-sequenced three D. ananassae genomes from Asia and the Pacific that contain large LGTs from Wolbachia. We find that multiple copies of the Wolbachia genome are transferred to the Drosophila nuclear genome in all three lines. In the D. ananassae line from Indonesia, the copies of Wolbachia DNA in the nuclear genome are nearly identical in size and sequence yielding an even coverage of mapped reads over the Wolbachia genome. In contrast, the D. ananassae lines from Hawaii and India show an uneven coverage of mapped reads over the Wolbachia genome suggesting that different parts of these LGTs are present in different copy numbers. In the Hawaii line, we find that this LGT is underrepresented in third instar larvae indicative of being heterochromatic. Fluorescence in situ hybridization of mitotic chromosomes confirms that the LGT in the Hawaii line is heterochromatic and represents ~20% of the sequence on chromosome 4 (dot chromosome, Muller element F).

Conclusions

This collection of related lines contain large lateral gene transfers composed of multiple Wolbachia genomes that constitute >2% of the D. ananassae genome (~5 Mbp) and partially explain the abnormally large size of chromosome 4 in D. ananassae.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1097) contains supplementary material, which is available to authorized users.  相似文献   

13.

Background

Despite having predominately deleterious fitness effects, transposable elements (TEs) are major constituents of eukaryote genomes in general and of plant genomes in particular. Although the proportion of the genome made up of TEs varies at least four-fold across plants, the relative importance of the evolutionary forces shaping variation in TE abundance and distributions across taxa remains unclear. Under several theoretical models, mating system plays an important role in governing the evolutionary dynamics of TEs. Here, we use the recently sequenced Capsella rubella reference genome and short-read whole genome sequencing of multiple individuals to quantify abundance, genome distributions, and population frequencies of TEs in three recently diverged species of differing mating system, two self-compatible species (C. rubella and C. orientalis) and their self-incompatible outcrossing relative, C. grandiflora.

Results

We detect different dynamics of TE evolution in our two self-compatible species; C. rubella shows a small increase in transposon copy number, while C. orientalis shows a substantial decrease relative to C. grandiflora. The direction of this change in copy number is genome wide and consistent across transposon classes. For insertions near genes, however, we detect the highest abundances in C. grandiflora. Finally, we also find differences in the population frequency distributions across the three species.

Conclusion

Overall, our results suggest that the evolution of selfing may have different effects on TE evolution on a short and on a long timescale. Moreover, cross-species comparisons of transposon abundance are sensitive to reference genome bias, and efforts to control for this bias are key when making comparisons across species.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-602) contains supplementary material, which is available to authorized users.  相似文献   

14.

Background

Disseminated cancer cells (DCCs) and circulating tumor cells (CTCs) are extremely rare, but comprise the precursors cells of distant metastases or therapy resistant cells. The detailed molecular analysis of these cells may help to identify key events of cancer cell dissemination, metastatic colony formation and systemic therapy escape.

Methodology/Principal Findings

Using the Ampli1™ whole genome amplification (WGA) technology and high-resolution oligonucleotide aCGH microarrays we optimized conditions for the analysis of structural copy number changes. The protocol presented here enables reliable detection of numerical genomic alterations as small as 0.1 Mb in a single cell. Analysis of single cells from well-characterized cell lines and single normal cells confirmed the stringent quantitative nature of the amplification and hybridization protocol. Importantly, fixation and staining procedures used to detect DCCs showed no significant impact on the outcome of the analysis, proving the clinical usability of our method. In a proof-of-principle study we tracked the chromosomal changes of single DCCs over a full course of high-dose chemotherapy treatment by isolating and analyzing DCCs of an individual breast cancer patient at four different time points.

Conclusions/Significance

The protocol enables detailed genome analysis of DCCs and thereby assessment of the clonal evolution during the natural course of the disease and under selection pressures. The results from an exemplary patient provide evidence that DCCs surviving selective therapeutic conditions may be recruited from a pool of genomically less advanced cells, which display a stable subset of specific genomic alterations.  相似文献   

15.

Background

Signatures of selection are regions in the genome that have been preferentially increased in frequency and fixed in a population because of their functional importance in specific processes. These regions can be detected because of their lower genetic variability and specific regional linkage disequilibrium (LD) patterns.

Methods

By comparing the differences in regional LD variation between dairy and beef cattle types, and between indicine and taurine subspecies, we aim at finding signatures of selection for production and adaptation in cattle breeds. The VarLD method was applied to compare the LD variation in the autosomal genome between breeds, including Angus and Brown Swiss, representing taurine breeds, and Nelore and Gir, representing indicine breeds. Genomic regions containing the top 0.01 and 0.1 percentile of signals were characterized using the UMD3.1 Bos taurus genome assembly to identify genes in those regions and compared with previously reported selection signatures and regions with copy number variation.

Results

For all comparisons, the top 0.01 and 0.1 percentile included 26 and 165 signals and 17 and 125 genes, respectively, including TECRL, BT.23182 or FPPS, CAST, MYOM1, UVRAG and DNAJA1.

Conclusions

The VarLD method is a powerful tool to identify differences in linkage disequilibrium between cattle populations and putative signatures of selection with potential adaptive and productive importance.  相似文献   

16.

Background

The determination of structural haplotypes at copy number variable regions can indicate the mechanisms responsible for changes in copy number, as well as explain the relationship between gene copy number and expression. However, obtaining spatial information at regions displaying extensive copy number variation, such as the DEFA1A3 locus, is complex, because of the difficulty in the phasing and assembly of these regions. The DEFA1A3 locus is intriguing in that it falls within a region of high linkage disequilibrium, despite its high variability in copy number (n = 3–16); hence, the mechanisms responsible for changes in copy number at this locus are unclear.

Results

In this study, a region flanking the DEFA1A3 locus was sequenced across 120 independent haplotypes with European ancestry, identifying five common classes of DEFA1A3 haplotype. Assigning DEFA1A3 class to haplotypes within the 1000 Genomes project highlights a significant difference in DEFA1A3 class frequencies between populations with different ancestry. The features of each DEFA1A3 class, for example, the associated DEFA1A3 copy numbers, were initially assessed in a European cohort (n = 599) and replicated in the 1000 Genomes samples, showing within-class similarity, but between-class and between-population differences in the features of the DEFA1A3 locus. Emulsion haplotype fusion-PCR was used to generate 61 structural haplotypes at the DEFA1A3 locus, showing a high within-class similarity in structure.

Conclusions

Structural haplotypes across the DEFA1A3 locus indicate that intra-allelic rearrangement is the predominant mechanism responsible for changes in DEFA1A3 copy number, explaining the conservation of linkage disequilibrium across the locus. The identification of common structural haplotypes at the DEFA1A3 locus could aid studies into how DEFA1A3 copy number influences expression, which is currently unclear.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-614) contains supplementary material, which is available to authorized users.  相似文献   

17.

Background

CRISPR-Cas9 is a revolutionary genome editing technique that allows for efficient and directed alterations of the eukaryotic genome. This relatively new technology has already been used in a large number of ‘loss of function’ experiments in cultured cells. Despite its simplicity and efficiency, screening for mutated clones remains time-consuming, laborious and/or expensive.

Results

Here we report a high-throughput screening strategy that allows parallel screening of up to 96 clones, using next-generation sequencing. As a proof of principle, we used CRISPR-Cas9 to disrupt the coding sequence of the homeobox gene, Evx1 in mouse embryonic stem cells. We screened 67 CRISPR-Cas9 transfected clones simultaneously by next-generation sequencing on the Ion Torrent PGM. We were able to identify both homozygous and heterozygous Evx1 mutants, as well as mixed clones, which must be identified to maintain the integrity of subsequent experiments.

Conclusions

Our CRISPR-Cas9 screening strategy could be widely applied to screen for CRISPR-Cas9 mutants in a variety of contexts including the generation of mutant cell lines for in vitro research, the generation of transgenic organisms and for assessing the veracity of CRISPR-Cas9 homology directed repair. This technique is cost and time-effective, provides information on clonal heterogeneity and is adaptable for use on various sequencing platforms.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1002) contains supplementary material, which is available to authorized users.  相似文献   

18.

Background

Copy number variation (CNV) is important and widespread in the genome, and is a major cause of disease and phenotypic diversity. Herein, we performed a genome-wide CNV analysis in 12 diversified chicken genomes based on whole genome sequencing.

Results

A total of 8,840 CNV regions (CNVRs) covering 98.2 Mb and representing 9.4% of the chicken genome were identified, ranging in size from 1.1 to 268.8 kb with an average of 11.1 kb. Sequencing-based predictions were confirmed at a high validation rate by two independent approaches, including array comparative genomic hybridization (aCGH) and quantitative PCR (qPCR). The Pearson’s correlation coefficients between sequencing and aCGH results ranged from 0.435 to 0.755, and qPCR experiments revealed a positive validation rate of 91.71% and a false negative rate of 22.43%. In total, 2,214 (25.0%) predicted CNVRs span 2,216 (36.4%) RefSeq genes associated with specific biological functions. Besides two previously reported copy number variable genes EDN3 and PRLR, we also found some promising genes with potential in phenotypic variation. Two genes, FZD6 and LIMS1, related to disease susceptibility/resistance are covered by CNVRs. The highly duplicated SOCS2 may lead to higher bone mineral density. Entire or partial duplication of some genes like POPDC3 may have great economic importance in poultry breeding.

Conclusions

Our results based on extensive genetic diversity provide a more refined chicken CNV map and genome-wide gene copy number estimates, and warrant future CNV association studies for important traits in chickens.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-962) contains supplementary material, which is available to authorized users.  相似文献   

19.

Background

Osteosarcomas are the most common non-haematological primary malignant tumours of bone, and all conventional osteosarcomas are high-grade tumours showing complex genomic aberrations. We have integrated genome-wide genetic and epigenetic profiles from the EuroBoNeT panel of 19 human osteosarcoma cell lines based on microarray technologies.

Principal Findings

The cell lines showed complex patterns of DNA copy number changes, where genomic copy number gains were significantly associated with gene-rich regions and losses with gene-poor regions. By integrating the datasets, 350 genes were identified as having two types of aberrations (gain/over-expression, hypo-methylation/over-expression, loss/under-expression or hyper-methylation/under-expression) using a recurrence threshold of 6/19 (>30%) cell lines. The genes showed in general alterations in either DNA copy number or DNA methylation, both within individual samples and across the sample panel. These 350 genes are involved in embryonic skeletal system development and morphogenesis, as well as remodelling of extracellular matrix. The aberrations of three selected genes, CXCL5, DLX5 and RUNX2, were validated in five cell lines and five tumour samples using PCR techniques. Several genes were hyper-methylated and under-expressed compared to normal osteoblasts, and expression could be reactivated by demethylation using 5-Aza-2′-deoxycytidine treatment for four genes tested; AKAP12, CXCL5, EFEMP1 and IL11RA. Globally, there was as expected a significant positive association between gain and over-expression, loss and under-expression as well as hyper-methylation and under-expression, but gain was also associated with hyper-methylation and under-expression, suggesting that hyper-methylation may oppose the effects of increased copy number for detrimental genes.

Conclusions

Integrative analysis of genome-wide genetic and epigenetic alterations identified dependencies and relationships between DNA copy number, DNA methylation and mRNA expression in osteosarcomas, contributing to better understanding of osteosarcoma biology.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号