Found 20 similar articles; search took 15 ms
1.
Andrew Feber Paul Guilhamon Matthias Lechner Tim Fenton Gareth A Wilson Christina Thirlwell Tiffany J Morris Adrienne M Flanagan Andrew E Teschendorff John D Kelly Stephan Beck 《Genome biology》2014,15(2):R30
The integration of genomic and epigenomic data is an increasingly popular approach for studying the complex mechanisms driving cancer development. We have developed a method for evaluating both methylation and copy number from high-density DNA methylation arrays. Comparing copy number data from Infinium HumanMethylation450 BeadChips and SNP arrays, we demonstrate that Infinium arrays detect copy number alterations with the sensitivity of SNP platforms. These results show that high-density methylation arrays provide a robust and economic platform for detecting copy number and methylation changes in a single experiment. Our method is available in the ChAMP Bioconductor package: http://www.bioconductor.org/packages/2.13/bioc/html/ChAMP.html.
2.
Rigaill G Hupé P Almeida A La Rosa P Meyniel JP Decraene C Barillot E 《Bioinformatics (Oxford, England)》2008,24(6):768-774
MOTIVATION: Affymetrix SNP arrays can be used to determine the DNA copy number measurement of 11 000-500 000 SNPs along the genome. Their high density facilitates the precise localization of genomic alterations and makes them a powerful tool for studies of cancers and copy number polymorphism. Like other microarray technologies, they are influenced by non-relevant sources of variation, requiring correction. Moreover, the amplitude of variation induced by non-relevant effects is similar to or greater than the biologically relevant effect (i.e. true copy number), making it difficult to estimate non-relevant effects accurately without including the biologically relevant effect. RESULTS: We addressed this problem by developing ITALICS, a normalization method that estimates both biological and non-relevant effects in an alternate, iterative manner, accurately eliminating irrelevant effects. We compared our normalization method with other existing and available methods, and found that ITALICS outperformed these methods for several in-house datasets and one public dataset. These results were validated biologically by quantitative PCR. AVAILABILITY: The R package ITALICS (ITerative and Alternative normaLIzation and Copy number calling for affymetrix Snp arrays) has been submitted to Bioconductor.
3.
4.
We have isolated E. coli mutants which can grow at 30 °C but not at 42 °C and are able to harbor the oriC plasmid (minichromosome) at a higher copy number than the parental wild-type strain at the permissive temperature. The mutants were found to contain higher amounts of chromosomal DNA per mg protein than the wild-type, whether or not they harbor the plasmid. Experimental results suggest that the higher amount of chromosomal DNA is due to a higher copy number of chromosomes and not to a larger amount of DNA per chromosome. These properties in each of the mutants are caused by a single mutation in the rpoB or rpoC gene, which code for the beta and beta' subunits of RNA polymerase, respectively. The mutations are thought to affect the regulation of replication of oriC-bearing replicons, that is, the E. coli chromosome and oriC plasmids, but not the miniF plasmid.
5.
Genomic copy number variations (CNVs) are considered a significant source of genetic diversity and are widely involved in gene expression and regulatory mechanisms, genetic disorders and disease risk, susceptibility to certain diseases and conditions, and resistance to medical drugs. Many studies have targeted the identification, profiling, analysis, and associations of genetic CNVs. We propose herein two new fuzzy methods, that is, one based on fuzzy inference from the pre-processed input, and another based on fuzzy C-means clustering. Our solutions present a higher true positive rate and a lower false negative rate with no false positives, efficient performance, and the lowest resource consumption.
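As a rough illustration of the second approach named in this abstract, the sketch below runs a plain fuzzy C-means clustering on one-dimensional log2 ratio values. It is not the authors' method: the function name `fuzzy_c_means`, the fuzzifier m = 2, the initial centers, and the toy data are all assumptions made for illustration.

```python
def fuzzy_c_means(values, centers, m=2.0, iterations=50):
    """Plain 1-D fuzzy C-means (hypothetical helper, illustration only).

    Returns the final centers and the memberships from the last update step.
    """
    centers = list(centers)
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(iterations):
        # Membership update: u_ik = 1 / sum_j (d_ik / d_ij)^(2 / (m - 1))
        memberships = []
        for x in values:
            dists = [max(abs(x - c), 1e-12) for c in centers]
            row = [1.0 / sum((d_k / d_j) ** exponent for d_j in dists)
                   for d_k in dists]
            memberships.append(row)
        # Center update: mean of the values weighted by u_ik^m
        for k in range(len(centers)):
            weights = [row[k] ** m for row in memberships]
            centers[k] = sum(w * x for w, x in zip(weights, values)) / sum(weights)
    return centers, memberships


# Made-up log2 ratios: losses near -1, neutral near 0, gains near 0.6
log_ratios = [-1.05, -0.95, -1.0, 0.02, -0.03, 0.0, 0.05, 0.6, 0.55, 0.62]
centers, memberships = fuzzy_c_means(log_ratios, centers=[-0.5, 0.1, 0.4])

# Assign each probe to the cluster with the highest membership
labels = [row.index(max(row)) for row in memberships]
```

Each row of `memberships` sums to 1, so a probe near a cluster boundary keeps a graded membership in both neighbouring states rather than a hard call.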
6.
Thomas Kuilman Arno Velds Kristel Kemper Marco Ranzani Lorenzo Bombardelli Marlous Hoogstraat Ekaterina Nevedomskaya Guotai Xu Julian de Ruiter Martijn P Lolkema Bauke Ylstra Jos Jonkers Sven Rottenberg Lodewyk F Wessels David J Adams Daniel S Peeper Oscar Krijgsman 《Genome biology》2015,16(1)
Current methods for detection of copy number variants (CNV) and aberrations (CNA) from targeted sequencing data are based on the depth of coverage of captured exons. Accurate CNA determination is complicated by uneven genomic distribution and non-uniform capture efficiency of targeted exons. Here we present CopywriteR, which eludes these problems by exploiting ‘off-target’ sequence reads. CopywriteR allows for extracting uniformly distributed copy number information, can be used without reference, and can be applied to sequencing data obtained from various techniques including chromatin immunoprecipitation and target enrichment on small gene panels. CopywriteR outperforms existing methods and constitutes a widely applicable alternative to available tools.
Electronic supplementary material
The online version of this article (doi:10.1186/s13059-015-0617-1) contains supplementary material, which is available to authorized users.
7.
A genome-wide detection of copy number variation using SNP genotyping arrays in Beijing-You chickens
Wei Zhou Ranran Liu Jingjing Zhang Maiqing Zheng Peng Li Guobin Chang Jie Wen Guiping Zhao 《Genetica》2014,142(5):441-450
Copy number variation (CNV) has been recently examined in many species and is recognized as being a source of genetic variability, especially for disease-related phenotypes. In this study, the PennCNV software, a genome-wide CNV detection system, was used with the 60 K SNP BeadChip on a total of 1,310 Beijing-You chickens (a Chinese local breed). After quality control, 137 high-confidence CNVRs were identified, covering 27.31 Mb and corresponding to 2.61% of the whole chicken genome. Within these regions, 131 known genes or coding sequences were involved. Q-PCR was applied to verify some of the genes related to disease development. Results showed that the copy number of genes such as phosphatidylinositol-5-phosphate 4-kinase II alpha, PHD finger protein 14, RHACD8 (a CD8α-like messenger RNA), MHC B-G, zinc finger protein, sarcosine dehydrogenase and ficolin 2 varied between individual chickens, which also supports the reliability of chip-based detection of the CNVs. As one source of genomic variation, CNVs may provide new insight into the relationship between the genome and phenotypic characteristics.
8.
9.
SUMMARY: Gene copy number and DNA methylation alterations are key regulators of gene expression in cancer. Accordingly, genes that show simultaneous methylation, copy number and expression alterations are likely to have a key role in tumor progression. We have implemented a novel software package (CNAmet) for integrative analysis of high-throughput copy number, DNA methylation and gene expression data. To demonstrate the utility of CNAmet, we use copy number, DNA methylation and gene expression data from 50 glioblastoma multiforme and 188 ovarian cancer primary tumor samples. Our results reveal a synergistic effect of DNA methylation and copy number alterations on gene expression for several known oncogenes as well as novel candidate oncogenes. AVAILABILITY: CNAmet R-package and user guide are freely available under GNU General Public License at http://csbi.ltdk.helsinki.fi/CNAmet.
10.
11.
Andrew E. Dellinger Seang-Mei Saw Liang K. Goh Mark Seielstad Terri L. Young Yi-Ju Li 《Nucleic acids research》2010,38(9):e105
Determination of copy number variants (CNVs) inferred in genome wide single nucleotide polymorphism arrays has shown increasing utility in genetic variant disease associations. Several CNV detection methods are available, but differences in CNV call thresholds and characteristics exist. We evaluated the relative performance of seven methods: circular binary segmentation, CNVFinder, cnvPartition, gain and loss of DNA, Nexus algorithms, PennCNV and QuantiSNP. Tested data included real and simulated Illumina HumHap 550 data from the Singapore cohort study of the risk factors for Myopia (SCORM) and simulated data from Affymetrix 6.0 and platform-independent distributions. The normalized singleton ratio (NSR) is proposed as a metric for parameter optimization before enacting full analysis. We used 10 SCORM samples for optimizing parameter settings for each method and then evaluated method performance at optimal parameters using 100 SCORM samples. The statistical power, false positive rates, and receiver operating characteristic (ROC) curve residuals were evaluated by simulation studies. Optimal parameters, as determined by NSR and ROC curve residuals, were consistent across datasets. QuantiSNP outperformed other methods based on ROC curve residuals over most datasets. Nexus Rank and SNPRank have low specificity and high power. Nexus Rank calls oversized CNVs. PennCNV detects one of the fewest numbers of CNVs.
12.
Lin Wan Kelian Sun Qi Ding Yuehua Cui Ming Li Yalu Wen Robert C. Elston Minping Qian Wenjiang J Fu 《Nucleic acids research》2009,37(17):e117
Affymetrix SNP arrays have been widely used for single-nucleotide polymorphism (SNP) genotype calling and DNA copy number variation inference. Although numerous methods have achieved high accuracy in these fields, most studies have paid little attention to the modeling of hybridization of probes to off-target allele sequences, which can affect the accuracy greatly. In this study, we address this issue and demonstrate that hybridization with mismatch nucleotides (HWMMN) occurs in all SNP probe-sets and has a critical effect on the estimation of allelic concentrations (ACs). We study sequence binding through binding free energy and then binding affinity, and develop a probe intensity composite representation (PICR) model. The PICR model allows the estimation of ACs at a given SNP through statistical regression. Furthermore, we demonstrate with cell-line data of known true copy numbers that the PICR model can achieve reasonable accuracy in copy number estimation at a single SNP locus, by using the ratio of the estimated AC of each sample to that of the reference sample, and can reveal subtle genotype structure of SNPs at abnormal loci. We also demonstrate with HapMap data that the PICR model yields accurate SNP genotype calls consistently across samples, laboratories and even across array platforms.
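The ratio-based copy number estimate mentioned in this abstract can be sketched as follows. This is only one reading of the abstract, not the PICR implementation: the function name, the assumption that the reference sample carries two copies at the locus, and the example concentrations are all hypothetical.

```python
def copy_number_from_ac(sample_ac, reference_ac, reference_copies=2.0):
    """Scale the sample/reference AC ratio by the reference copy number.

    sample_ac and reference_ac stand for the total estimated allelic
    concentrations at one SNP locus (hypothetical inputs in matching units).
    """
    if reference_ac <= 0:
        raise ValueError("reference_ac must be positive")
    return reference_copies * sample_ac / reference_ac


# Made-up values: the sample shows 1.5 times the reference concentration
estimate = copy_number_from_ac(sample_ac=3.0, reference_ac=2.0)
print(estimate)  # 3.0
```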
13.
Magi A Tattini L Pippucci T Torricelli F Benelli M 《Bioinformatics (Oxford, England)》2012,28(4):470-478
MOTIVATION: The advent of high-throughput sequencing technologies is revolutionizing our ability to discover and genotype DNA copy number variants (CNVs). Read count-based approaches are able to detect CNV regions with unprecedented resolution. Although this computational strategy has only recently been introduced in the literature, much work has already been done on the preparation, normalization and analysis of this kind of data. RESULTS: Here we address the many aspects involved in detecting CNVs with the read count approach. We first study the characteristics and systematic biases of read count distributions, focusing on the normalization methods designed for removing these biases. Subsequently, we compare the algorithms designed to detect the boundaries of CNVs and we investigate the ability of read count data to predict the exact DNA copy number. Finally, we review the tools publicly available for analysing read count data. To better understand the state of the art of read count approaches, we compare the performance of the three most widely used sequencing technologies (Illumina Genome Analyzer, Roche 454 and Life Technologies SOLiD) in all the analyses that we perform.
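A minimal sketch of the read count idea, assuming fixed-size genomic windows, median normalization of each sample, and a matched control. Real pipelines also correct for biases such as GC content, which this example omits. The function name, the pseudocount, the threshold, and all numbers are illustrative assumptions.

```python
import math
from statistics import median


def log2_ratios(sample_counts, control_counts, pseudocount=0.5):
    """Per-window log2 ratio of median-normalized read counts."""
    sample_median = median(sample_counts)
    control_median = median(control_counts)
    ratios = []
    for s, c in zip(sample_counts, control_counts):
        # The pseudocount avoids division by zero in empty windows
        s_norm = (s + pseudocount) / (sample_median + pseudocount)
        c_norm = (c + pseudocount) / (control_median + pseudocount)
        ratios.append(math.log2(s_norm / c_norm))
    return ratios


# Made-up read counts per window; indices 3 and 4 look amplified in the sample
sample = [100, 98, 102, 151, 149, 101, 99]
control = [100, 100, 100, 100, 100, 100, 100]
ratios = log2_ratios(sample, control)

# Flag windows whose ratio exceeds an arbitrary gain threshold
gained = [i for i, r in enumerate(ratios) if r > 0.4]
```

A segmentation step would normally replace the fixed threshold, since neighbouring windows with the same copy number state should be merged into one call.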
14.
15.
Fuhai Li Xiaobo Zhou Wanting Huang Chung-Che Chang Stephen TC Wong 《BMC bioinformatics》2010,11(1):200
Background
DNA copy number aberration (CNA) is very important in the pathogenesis of tumors and other diseases. For example, CNAs may result in suppression of anti-oncogenes and activation of oncogenes, which would cause certain types of cancers. High density single nucleotide polymorphism (SNP) array data is widely used for CNA detection. However, it is nontrivial to detect CNAs automatically because the signals obtained from high density SNP arrays often have a low signal-to-noise ratio (SNR), which might be caused by whole genome amplification, mixtures of normal and tumor cells, experimental noise or other technical limitations. As the SNR decreases, many false CNA regions are detected and true CNA regions are missed. Thus, more sophisticated statistical models are needed to make CNA detection from low SNR signals more robust and reliable.
16.
17.
Plasmid pKKH: An improved vector with higher copy number for expression of foreign genes in Escherichia coli (total citations: 1; self-citations: 0; citations by others: 1)
Summary: An improved vector of 2889 bp was constructed by mutating the copy number control system; foreign genes with or without the start codon ATG can be inserted into it directly for high-level expression in Escherichia coli. Deletion of the rop gene, which encodes the negative regulatory protein ROP, increases the plasmid copy number and raises the vector's expression level by more than 60% compared with the initial vector.
18.
19.
The effects of mRNA stability and plasmid copy number on gene expression in Escherichia coli were evaluated by constructing multicopy (pMB1-based) and low-copy (F-based) plasmids containing an arabinose-inducible promoter system, the lacZ reporter gene, and mRNA-stabilizing 5' hairpin structures. Product formation and cell growth were evaluated under a number of inducer concentrations. The introduction of a 5' hairpin into the untranslated region of the mRNA resulted in significantly higher gene expression from the multicopy plasmids at low inducer concentrations and increased gene expression from the low-copy plasmids across all inducer concentrations investigated. With high inducer concentrations, expression from high-copy plasmids significantly slowed cell growth, whereas expression from the low-copy plasmids had little effect on growth rate. At inducer concentrations between 1 × 10⁻⁴ and 4 × 10⁻⁴%, the productivity of low-copy plasmids containing the 5' hairpin was equal to or greater than that from multicopy plasmids. Together, these two gene expression strategies may find important use in metabolic engineering and heterologous gene expression.
20.
Alistair H.A. Bingham 《FEMS microbiology letters》1991,79(2-3):239-246
A series of six expression vectors, pXM184Lac.A, B, C, pXM184Z.A, B, C, based on the low copy plasmid pACYC184 that allow for expression of proteins fused to beta-galactosidase in Escherichia coli is described. A level of 50,000 units of beta-galactosidase is routinely observed and is easily identifiable on protein gels. This paper also reports the tight regulation of expression of the Trc promoter in these vectors using the LacIq repressor.