首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.

Background

Deviations in the amount of genomic content that arise during tumorigenesis, called copy number alterations, are structural rearrangements that can critically affect gene expression patterns. Additionally, copy number alteration profiles allow insight into cancer discrimination, progression and complexity. On data obtained from high-throughput sequencing, improving quality through GC bias correction and keeping false positives to a minimum help build reliable copy number alteration profiles.

Results

We introduce seqCNA, a parallelized R package for an integral copy number analysis of high-throughput sequencing cancer data. The package includes novel methodology on (i) filtering, reducing false positives, and (ii) GC content correction, improving copy number profile quality, especially under great read coverage and high correlation between GC content and copy number. Adequate analysis steps are automatically chosen based on availability of paired-end mapping, matched normal samples and genome annotation.

Conclusions

seqCNA, available through Bioconductor, provides accurate copy number predictions in tumoural data, thanks to the extensive filtering and better GC bias correction, while providing an integrated and parallelized workflow.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-178) contains supplementary material, which is available to authorized users.  相似文献   

2.

Background

Array comparative genomic hybridization (aCGH) to detect copy number variants (CNVs) in mammalian genomes has led to a growing awareness of the potential importance of this category of sequence variation as a cause of phenotypic variation. Yet there are large discrepancies between studies, so that the extent of the genome affected by CNVs is unknown. We combined molecular and aCGH analyses of CNVs in inbred mouse strains to investigate this question.

Principal Findings

Using a 2.1 million probe array we identified 1,477 deletions and 499 gains in 7 inbred mouse strains. Molecular characterization indicated that approximately one third of the CNVs detected by the array were false positives and we estimate the false negative rate to be more than 50%. We show that low concordance between studies is largely due to the molecular nature of CNVs, many of which consist of a series of smaller deletions and gains interspersed by regions where the DNA copy number is normal.

Conclusions

Our results indicate that CNVs detected by arrays may be the coincidental co-localization of smaller CNVs, whose presence is more likely to perturb an aCGH hybridization profile than the effect of an isolated, small, copy number alteration. Our findings help explain the hitherto unexplored discrepancies between array-based studies of copy number variation in the mouse genome.  相似文献   

3.

Introduction

Tonic immobility (TI) is fear-induced freezing that animals may undergo when confronted by a threat. It is principally observed in prey species as defence mechanisms. In our preliminary research, we detected large inter-individual variations in the frequency and duration of freezing behavior among newly hatched domestic chicks (Gallus gallus). In this study we aim to identify the copy number variations (CNVs) in the genome of chicks as genetic candidates that underlie the behavioral plasticity to fearful stimuli.

Methods

A total of 110 domestic chicks were used for an association study between TI responses and copy number polymorphisms. Array comparative genomic hybridization (aCGH) was conducted between chicks with high and low TI scores using an Agilent 4×180 custom microarray. We specifically focused on 3 genomic regions (>60 Mb) of chromosome 1 where previous quantitative trait loci (QTL) analysis showed significant F-values for fearful responses.

Results

ACGH successfully detected short CNVs within the regions overlapping 3 QTL peaks. Eleven of these identified loci were validated by real-time quantitative polymerase chain reaction (qPCR) as copy number polymorphisms. Although there wkas no significant p value in the correlation analysis between TI scores and the relative copy number within each breed, several CNV loci showed significant differences in the relative copy number between 2 breeds of chicken (White Leghorn and Nagoya) which had different quantitative characteristics of fear-induced responses.

Conclusion

Our data shows the potential CNVs that may be responsible for innate fear response in domestic chicks.  相似文献   

4.

Introduction

Mutations in the NLRP3 gene are associated with the dominantly inherited cryopyrin-associated periodic syndrome (CAPS). The significance of the V198M variant is unclear; it has been reported in association with various CAPS phenotypes and as a variant of uncertain consequence. The aim of this study was to characterize the clinical phenotypes and treatments in individuals with V198M assessed in a single UK center.

Methods

DNA samples from 830 subjects with fever syndromes or a family history of CAPS were screened for mutations in the NLRP3 gene with polymerase chain reaction (PCR) and sequencing. A detailed medical history was available in all cases. Inflammatory disease activity was monitored monthly with measurements of serum amyloid A protein (SAA) and C-reactive protein (CRP) in symptomatic individuals.

Results

NLRP3 V198M was identified in 19 subjects. It was found in association with CAPS in five cases, in one patient with Schnitzler syndrome, in three patients who also had a nucleotide alteration in another fever gene, and in three other patients with evidence of an autoinflammatory phenotype. Seven asymptomatic individuals were detected during screening of family members.

Conclusions

The NLRP3 V198M variant shows variable expressivity and reduced penetrance. It may be associated with classical inherited or apparently sporadic CAPS and with atypical autoinflammatory disease of varying severity, intriguingly including Schnitzler syndrome. The factors that influence the pathogenic consequences of this variant remain unknown. However, the remarkable response to interleukin 1 (IL-1) blockade in all but one individual in our series confirms that their clinical features are indeed mediated by IL-1.  相似文献   

5.
N Kumar  H Cai  C von Mering  M Baudis 《PloS one》2012,7(8):e43689

Background

Regional genomic copy number alterations (CNA) are observed in the vast majority of cancers. Besides specifically targeting well-known, canonical oncogenes, CNAs may also play more subtle roles in terms of modulating genetic potential and broad gene expression patterns of developing tumors. Any significant differences in the overall CNA patterns between different cancer types may thus point towards specific biological mechanisms acting in those cancers. In addition, differences among CNA profiles may prove valuable for cancer classifications beyond existing annotation systems.

Principal Findings

We have analyzed molecular-cytogenetic data from 25579 tumors samples, which were classified into 160 cancer types according to the International Classification of Disease (ICD) coding system. When correcting for differences in the overall CNA frequencies between cancer types, related cancers were often found to cluster together according to similarities in their CNA profiles. Based on a randomization approach, distance measures from the cluster dendrograms were used to identify those specific genomic regions that contributed significantly to this signal. This approach identified 43 non-neutral genomic regions whose propensity for the occurrence of copy number alterations varied with the type of cancer at hand. Only a subset of these identified loci overlapped with previously implied, highly recurrent (hot-spot) cytogenetic imbalance regions.

Conclusions

Thus, for many genomic regions, a simple null-hypothesis of independence between cancer type and relative copy number alteration frequency can be rejected. Since a subset of these regions display relatively low overall CNA frequencies, they may point towards second-tier genomic targets that are adaptively relevant but not necessarily essential for cancer development.  相似文献   

6.

Background

The ability to accurately detect DNA copy number variation in both a sensitive and quantitative manner is important in many research areas. However, genome-wide DNA copy number analyses are complicated by variations in detection signal.

Results

While GC content has been used to correct for this, here we show that coverage biases are tissue-specific and independent of the detection method as demonstrated by next-generation sequencing and array CGH. Moreover, we show that DNA isolation stringency affects the degree of equimolar coverage and that the observed biases coincide with chromatin characteristics like gene expression, genomic isochores, and replication timing.

Conclusion

These results indicate that chromatin organization is a main determinant for differential DNA retrieval. These findings are highly relevant for germline and somatic DNA copy number variation analyses.  相似文献   

7.

Introduction

In breast cancer, the basal-like subtype has high levels of genomic instability relative to other breast cancer subtypes with many basal-like-specific regions of aberration. There is evidence that this genomic instability extends to smaller scale genomic aberrations, as shown by a previously described micro-deletion event in the PTEN gene in the Basal-like SUM149 breast cancer cell line.

Methods

We sought to identify if small regions of genomic DNA copy number changes exist by using a high density, gene-centric Comparative Genomic Hybridizations (CGH) array on cell lines and primary tumors. A custom tiling array for CGH (244,000 probes, 200 bp tiling resolution) was created to identify small regions of genomic change, which was focused on previously identified basal-like-specific, and general cancer genes. Tumor genomic DNA from 94 patients and 2 breast cancer cell lines was labeled and hybridized to these arrays. Aberrations were called using SWITCHdna and the smallest 25% of SWITCHdna-defined genomic segments were called micro-aberrations (<64 contiguous probes, ∼ 15 kb).

Results

Our data showed that primary tumor breast cancer genomes frequently contained many small-scale copy number gains and losses, termed micro-aberrations, most of which are undetectable using typical-density genome-wide aCGH arrays. The basal-like subtype exhibited the highest incidence of these events. These micro-aberrations sometimes altered expression of the involved gene. We confirmed the presence of the PTEN micro-amplification in SUM149 and by mRNA-seq showed that this resulted in loss of expression of all exons downstream of this event. Micro-aberrations disproportionately affected the 5′ regions of the affected genes, including the promoter region, and high frequency of micro-aberrations was associated with poor survival.

Conclusion

Using a high-probe-density, gene-centric aCGH microarray, we present evidence of small-scale genomic aberrations that can contribute to gene inactivation. These events may contribute to tumor formation through mechanisms not detected using conventional DNA copy number analyses.  相似文献   

8.

Background

Deidentified newborn screening bloodspot samples (NBS) represent a valuable potential resource for genomic research if impediments to whole exome sequencing of NBS deoxyribonucleic acid (DNA), including the small amount of genomic DNA in NBS material, can be overcome. For instance, genomic analysis of NBS could be used to define allele frequencies of disease-associated variants in local populations, or to conduct prospective or retrospective studies relating genomic variation to disease emergence in pediatric populations over time. In this study, we compared the recovery of variant calls from exome sequences of amplified NBS genomic DNA to variant calls from exome sequencing of non-amplified NBS DNA from the same individuals.

Results

Using a standard alignment-based Genome Analysis Toolkit (GATK), we find 62,000–76,000 additional variants in amplified samples. After application of a unique kmer enumeration and variant detection method (RUFUS), only 38,000–47,000 additional variants are observed in amplified gDNA. This result suggests that roughly half of the amplification-introduced variants identified using GATK may be the result of mapping errors and read misalignment.

Conclusions

Our results show that it is possible to obtain informative, high-quality data from exome analysis of whole genome amplified NBS with the important caveat that different data generation and analysis methods can affect variant detection accuracy, and the concordance of variant calls in whole-genome amplified and non-amplified exomes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1747-2) contains supplementary material, which is available to authorized users.  相似文献   

9.

Background

Mutations in the PRRT2 gene have been identified as the major cause of benign familial infantile epilepsy (BFIE), paroxysmal kinesigenic dyskinesia (PKD) and infantile convulsions with paroxysmal choreoathetosis/dyskinesias (ICCA). Here, we analyzed the phenotypes and PRRT2 mutations in Chinese families with BFIE and ICCA.

Methods

Clinical data were collected from 22 families with BFIE and eight families with ICCA. PRRT2 mutations were screened using PCR and direct sequencing.

Results

Ninety-five family members were clinically affected in the 22 BFIE families. During follow-up, two probands had one seizure induced by diarrhea at the age of two years. Thirty-one family members were affected in the eight ICCA families, including 11 individuals with benign infantile epilepsy, nine with PKD, and 11 with benign infantile epilepsy followed by PKD. Two individuals in one ICCA family had PKD or ICCA co-existing with migraine. One affected member in another ICCA family had experienced a fever-induced seizure at 7 years old. PRRT2 mutations were detected in 13 of the 22 BFIE families. The mutation c.649_650insC (p.R217PfsX8) was found in nine families. The mutations c.649delC (p.R217EfsX12) and c.904_905insG (p.D302GfsX39) were identified in three families and one family, respectively. PRRT2 mutations were identified in all eight ICCA families, including c.649_650insC (p.R217PfsX8), c.649delC (p.R217EfsX12), c.514_517delTCTG (p.S172RfsX3) and c.1023A?>?T (X341C). c.1023A?>?T is a novel mutation predicted to elongate the C-terminus of the protein by 28 residues.

Conclusions

Our data demonstrated that PRRT2 is the major causative gene of BFIE and ICCA in Chinese families. Site c.649 is a mutation hotspot: c.649_650insC is the most common mutation, and c.649delC is the second most common mutation in Chinese families with BFIE and ICCA. As far as we know, c.1023A?>?T is the first reported mutation in exon 4 of PRRT2. c.649delC was previously reported in PKD, ICCA and hemiplegic migraine families, but we further detected it in BFIE-only families. c.904_905insG was reported in an ICCA family, but we identified it in a BFIE family. c.514_517delTCTG was previously reported in a PKD family, but we identified it in an ICCA family. Migraine and febrile seizures plus could co-exist in ICCA families.
  相似文献   

10.

Background

Numerous linkage studies have been performed in pedigrees of Autism Spectrum Disorders, and these studies point to diverse loci and etiologies of autism in different pedigrees. The underlying pattern may be identified by an integrative approach, especially since ASD is a complex disorder manifested through many loci.

Method

Autism spectrum disorder (ASD) was studied through two different and independent genome-scale measurement modalities. We analyzed the results of copy number variation in autism and triangulated these with linkage studies.

Results

Consistently across both genome-scale measurements, the same two molecular themes emerged: immune/chemokine pathways and developmental pathways.

Conclusion

Linkage studies in aggregate do indeed share a thematic consistency, one which structural analyses recapitulate with high significance. These results also show for the first time that genomic profiling of pathways using a recombination distance metric can capture pathways that are consistent with those obtained from copy number variations (CNV).  相似文献   

11.

Background

In single-cell human genome analysis using whole-genome amplified product, a strong amplification bias involving allele dropout and preferential amplification hampers the quality of results. Using an oligonucleotide single nucleotide polymorphism (SNP) array, we systematically examined the nature of this amplification bias, including frequency, degree, and preference for genomic location, and we assessed the effects of this amplification bias on subsequent genotype and chromosomal copy number analyses.

Methodology/Principal Findings

We found a large variability in amplification bias among the amplified products obtained by multiple displacement amplification (MDA), and this bias had a severe effect on the genotype and chromosomal copy number analyses. We established optimal experimental conditions for pre-screening for high-quality amplified products, processing array data, and analyzing chromosomal structural alterations. Using this optimized protocol, we successfully detected previously unidentified chromosomal structural alterations in single cells from a lymphoblastoid cell line. These alterations were subsequently confirmed by karyotype analysis. In addition, we successfully obtained reproducible chromosomal copy number profiles of single cells from the cell line with a complex karyotype, indicating the applicability and potential of our optimized workflow.

Conclusions/Significance

Our results suggest that the quality of amplification products should be critically assessed before using them for genomic analyses. The method of MDA-based whole-genome amplification followed by SNP array analysis described here will be useful for exploring chromosomal alterations in single cells.  相似文献   

12.

Background

Personal genome assembly is a critical process when studying tumor genomes and other highly divergent sequences. The accuracy of downstream analyses, such as RNA-seq and ChIP-seq, can be greatly enhanced by using personal genomic sequences rather than standard references. Unfortunately, reads sequenced from these types of samples often have a heterogeneous mix of various subpopulations with different variants, making assembly extremely difficult using existing assembly tools. To address these challenges, we developed SHEAR (Sample Heterogeneity Estimation and Assembly by Reference; http://vk.cs.umn.edu/SHEAR), a tool that predicts SVs, accounts for heterogeneous variants by estimating their representative percentages, and generates personal genomic sequences to be used for downstream analysis.

Results

By making use of structural variant detection algorithms, SHEAR offers improved performance in the form of a stronger ability to handle difficult structural variant types and better computational efficiency. We compare against the lead competing approach using a variety of simulated scenarios as well as real tumor cell line data with known heterogeneous variants. SHEAR is shown to successfully estimate heterogeneity percentages in both cases, and demonstrates an improved efficiency and better ability to handle tandem duplications.

Conclusion

SHEAR allows for accurate and efficient SV detection and personal genomic sequence generation. It is also able to account for heterogeneous sequencing samples, such as from tumor tissue, by estimating the subpopulation percentage for each heterogeneous variant.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-84) contains supplementary material, which is available to authorized users.  相似文献   

13.
14.

Background

Usher syndrome (USH) is a genetically heterogeneous condition with ten disease-causing genes. The spectrum of genes and mutations causing USH in the Lebanese and Middle Eastern populations has not been described. Consequently, diagnostic approaches designed to screen for previously reported mutations were unlikely to identify the mutations in 11 unrelated families, eight of Lebanese and three of Middle Eastern origins. In addition, six of the ten USH genes consist of more than 20 exons, each, which made mutational analysis by Sanger sequencing of PCR-amplified exons from genomic DNA tedious and costly. The study was aimed at the identification of USH causing genes and mutations in 11 unrelated families with USH type I or II.

Methods

Whole exome sequencing followed by expanded familial validation by Sanger sequencing.

Results

We identified disease-causing mutations in all the analyzed patients in four USH genes, MYO7A, USH2A, GPR98 and CDH23. Eleven of the mutations were novel and protein truncating, including a complex rearrangement in GPR98.

Conclusion

Our data highlight the genetic diversity of Usher syndrome in the Lebanese population and the time and cost-effectiveness of whole exome sequencing approach for mutation analysis of genetically heterogeneous conditions caused by large genes.  相似文献   

15.

Background

Peptidases are key proteins involved in essential plant physiological processes. Although protein peptidase inhibitors are essential molecules that modulate peptidase activity, their global presence in different plant species remains still unknown. Comparative genomic analyses are powerful tools to get advanced knowledge into the presence and evolution of both, peptidases and their inhibitors across the Viridiplantae kingdom.

Results

A genomic comparative analysis of peptidase inhibitors and several groups of peptidases in representative species of different plant taxonomic groups has been performed. The results point out: i) clade-specific presence is common to many families of peptidase inhibitors, being some families present in most land plants; ii) variability is a widespread feature for peptidase inhibitory families, with abundant species-specific (or clade-specific) gene family proliferations; iii) peptidases are more conserved in different plant clades, being C1A papain and S8 subtilisin families present in all species analyzed; and iv) a moderate correlation among peptidases and their inhibitors suggests that inhibitors proliferated to control both endogenous and exogenous peptidases.

Conclusions

Comparative genomics has provided valuable insights on plant peptidase inhibitor families and could explain the evolutionary reasons that lead to the current variable repertoire of peptidase inhibitors in specific plant clades.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-812) contains supplementary material, which is available to authorized users.  相似文献   

16.

Background

Numerous efforts have been made to elucidate the etiology and improve the treatment of lung cancer, but the overall five-year survival rate is still only 15%. Although cigarette smoking is the primary risk factor for lung cancer, only 7% of female lung cancer patients in Taiwan have a history of smoking. Since cancer results from progressive accumulation of genetic aberrations, genomic rearrangements may be early events in carcinogenesis.

Results

In order to identify biomarkers of early-stage adenocarcinoma, the genome-wide DNA aberrations of 60 pairs of lung adenocarcinoma and adjacent normal lung tissue in non-smoking women were examined using Affymetrix Genome-Wide Human SNP 6.0 arrays. Common copy number variation (CNV) regions were identified by ≥30% of patients with copy number beyond 2 ± 0.5 of copy numbers for each single nucleotide polymorphism (SNP) and at least 100 continuous SNP variant loci. SNPs associated with lung adenocarcinoma were identified by McNemar’s test. Loss of heterozygosity (LOH) SNPs were identified in ≥18% of patients with LOH in the locus. Aberration of SNP rs10248565 at HDAC9 in chromosome 7p21.1 was identified from concurrent analyses of CNVs, SNPs, and LOH.

Conclusion

The results elucidate the genetic etiology of lung adenocarcinoma by demonstrating that SNP rs10248565 may be a potential biomarker of cancer susceptibility.  相似文献   

17.

Background

We have previously identified genome-wide DNA methylation changes in a cell line model of breast cancer metastasis. These complex epigenetic changes that we observed, along with concurrent karyotype analyses, have led us to hypothesize that complex genomic alterations in cancer cells (deletions, translocations and ploidy) are superimposed over promoter-specific methylation events that are responsible for gene-specific expression changes observed in breast cancer metastasis.

Methodology/Principal Findings

We undertook simultaneous high-resolution, whole-genome analyses of MDA-MB-468GFP and MDA-MB-468GFP-LN human breast cancer cell lines (an isogenic, paired lymphatic metastasis cell line model) using Affymetrix gene expression (U133), promoter (1.0R), and SNP/CNV (SNP 6.0) microarray platforms to correlate data from gene expression, epigenetic (DNA methylation), and combination copy number variant/single nucleotide polymorphism microarrays. Using Partek Software and Ingenuity Pathway Analysis we integrated datasets from these three platforms and detected multiple hypomethylation and hypermethylation events. Many of these epigenetic alterations correlated with gene expression changes. In addition, gene dosage events correlated with the karyotypic differences observed between the cell lines and were reflected in specific promoter methylation patterns. Gene subsets were identified that correlated hyper (and hypo) methylation with the loss (or gain) of gene expression and in parallel, with gene dosage losses and gains, respectively. Individual gene targets from these subsets were also validated for their methylation, expression and copy number status, and susceptible gene pathways were identified that may indicate how selective advantage drives the processes of tumourigenesis and metastasis.

Conclusions/Significance

Our approach allows more precisely profiling of functionally relevant epigenetic signatures that are associated with cancer progression and metastasis.  相似文献   

18.

Objective

The current study aimed to develop a reliable targeted array comparative genomic hybridization (aCGH) to detect microdeletions and microduplications in congenital conotruncal defects (CTDs), especially on 22q11.2 region, and for some other chromosomal aberrations, such as 5p15-5p, 7q11.23 and 4p16.3.

Methods

Twenty-seven patients with CTDs, including 12 pulmonary atresia (PA), 10 double-outlet right ventricle (DORV), 3 transposition of great arteries (TGA), 1 tetralogy of Fallot (TOF) and one ventricular septal defect (VSD), were enrolled in this study and screened for pathogenic copy number variations (CNVs), using Agilent 8 x 15K targeted aCGH. Real-time quantitative polymerase chain reaction (qPCR) was performed to test the molecular results of targeted aCGH.

Results

Four of 27 patients (14.8%) had 22q11.2 CNVs, 1 microdeletion and 3 microduplications. qPCR test confirmed the microdeletion and microduplication detected by the targeted aCGH.

Conclusion

Chromosomal abnormalities were a well-known cause of multiple congenital anomalies (MCA). This aCGH using arrays with high-density coverage in the targeted regions can detect genomic imbalances including 22q11.2 and other 10 kinds CNVs effectively and quickly. This approach has the potential to be applied to detect aneuploidy and common microdeletion/microduplication syndromes on a single microarray.  相似文献   

19.
20.

Background

The extent of intratumoral mutational heterogeneity remains unclear in gliomas, the most common primary brain tumors, especially with respect to point mutation. To address this, we applied single molecule molecular inversion probes targeting 33 cancer genes to assay both point mutations and gene amplifications within spatially distinct regions of 14 glial tumors.

Results

We find evidence of regional mutational heterogeneity in multiple tumors, including mutations in TP53 and RB1 in an anaplastic oligodendroglioma and amplifications in PDGFRA and KIT in two glioblastomas (GBMs). Immunohistochemistry confirms heterogeneity of TP53 mutation and PDGFRA amplification. In all, 3 out of 14 glial tumors surveyed have evidence for heterogeneity for clinically relevant mutations.

Conclusions

Our results underscore the need to sample multiple regions in GBM and other glial tumors when devising personalized treatments based on genomic information, and furthermore demonstrate the importance of measuring both point mutation and copy number alteration while investigating genetic heterogeneity within cancer samples.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0530-z) contains supplementary material, which is available to authorized users.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号