期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Using high-density DNA methylation arrays to profile copy number alterations

Andrew Feber Paul Guilhamon Matthias Lechner Tim Fenton Gareth A Wilson Christina Thirlwell Tiffany J Morris Adrienne M Flanagan Andrew E Teschendorff John D Kelly Stephan Beck 《Genome biology》2014,15(2):R30

The integration of genomic and epigenomic data is an increasingly popular approach for studying the complex mechanisms driving cancer development. We have developed a method for evaluating both methylation and copy number from high-density DNA methylation arrays. Comparing copy number data from Infinium HumanMethylation450 BeadChips and SNP arrays, we demonstrate that Infinium arrays detect copy number alterations with the sensitivity of SNP platforms. These results show that high-density methylation arrays provide a robust and economic platform for detecting copy number and methylation changes in a single experiment. Our method is available in the ChAMP Bioconductor package: http://www.bioconductor.org/packages/2.13/bioc/html/ChAMP.html. 相似文献

2.

SomatiCA: Identifying,Characterizing and Quantifying Somatic Copy Number Aberrations from Cancer Genome Sequencing Data

Mengjie Chen Murat Gunel Hongyu Zhao 《PloS one》2013,8(11)

Whole genome sequencing of matched tumor-normal sample pairs is becoming routine in cancer research. However, analysis of somatic copy-number changes from sequencing data is still challenging because of insufficient sequencing coverage, unknown tumor sample purity and subclonal heterogeneity. Here we describe a computational framework, named SomatiCA, which explicitly accounts for tumor purity and subclonality in the analysis of somatic copy-number profiles. Taking read depths (RD) and lesser allele frequencies (LAF) as input, SomatiCA will output 1) admixture rate for each tumor sample, 2) somatic allelic copy-number for each genomic segment, 3) fraction of tumor cells with subclonal change in each somatic copy number aberration (SCNA), and 4) a list of substantial genomic aberration events including gain, loss and LOH. SomatiCA is available as a Bioconductor R package at http://www.bioconductor.org/packages/2.13/bioc/html/SomatiCA.html. 相似文献

3.

ontoCAT: an R package for ontology traversal and search

Kurbatova N Adamusiak T Kurnosov P Swertz MA Kapushesky M 《Bioinformatics (Oxford, England)》2011,27(17):2468-2470

MOTIVATION: There exist few simple and easily accessible methods to integrate ontologies programmatically in the R environment. We present ontoCAT-an R package to access ontologies in widely used standard formats, stored locally in the filesystem or available online. The ontoCAT package supports a number of traversal and search functions on a single ontology, as well as searching for ontology terms across multiple ontologies and in major ontology repositories. AVAILABILITY: The package and sources are freely available in Bioconductor starting from version 2.8: http://bioconductor.org/help/bioc-views/release/bioc/html/ontoCAT.html or via the OntoCAT website http://www.ontocat.org/wiki/r. CONTACT: natalja@ebi.ac.uk; natalja@ebi.ac.uk. 相似文献

4.

SeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps

Manu Setty Christina S. Leslie 《PLoS computational biology》2015,11(5)

相似文献

5.

Gain in 1q is a common abnormality in phyllodes tumours of the breast. 总被引：4，自引：0，他引：4

Kowan J Jee Gyungyub Gong Sei Hyun Ahn Jeong Mi Park Sakari Knuutila 《Analytical cellular pathology》2003,25(2):89-93

We studied DNA copy number changes by CGH and allelic imbalance (AI) on 3p by LOH analysis on 22 phyllodes tumours (PT) of the breast in order to gain insight into the genetic basis of tumour progression in PT. Copy number changes were observed in 14 cases (63%). Gain in 1q with 1q21-23 as the minimal overlapping area was seen in 12 cases (55%). The gain was observed both in benign and malignant tumours. Our study did not reveal any DNA copy number changes or allelic loss on 3p. The results suggest that DNA copy number changes are not associated with the histological grade or clinical behaviour of PT and the chromosomal changes on 3p appear to be rare. Colour figure can be viewed on http://www.esacp.org/acp/2003/25-2/jee.htm 相似文献

6.

Patchwork: allele-specific copy number analysis of whole-genome sequenced tumor tissue

Markus Mayrhofer Sebastian DiLorenzo Anders Isaksson 《Genome biology》2013,14(3):R24

Whole-genome sequencing of tumor tissue has the potential to provide comprehensive characterization of genomic alterations in tumor samples. We present Patchwork, a new bioinformatic tool for allele-specific copy number analysis using whole-genome sequencing data. Patchwork can be used to determine the copy number of homologous sequences throughout the genome, even in aneuploid samples with moderate sequence coverage and tumor cell content. No prior knowledge of average ploidy or tumor cell content is required. Patchwork is freely available as an R package, installable via R-Forge (http://patchwork.r-forge.r-project.org/). 相似文献

7.

SpeCond: a method to detect condition-specific gene expression

Cavalli FM Bourgon R Huber W Vaquerizas JM Luscombe NM 《Genome biology》2011,12(10):R101-12

相似文献

8.

PureCN: copy number calling and SNV classification using targeted short read sequencing

Markus?Riester Email author View author&#;s OrcID profile Angad?P.?Singh A.?Rose?Brannon Kun?Yu Catarina?D.?Campbell Derek?Y.?Chiang Michael?P.?Morrissey 《Source code for biology and medicine》2016,11(1):13

Background

Matched sequencing of both tumor and normal tissue is routinely used to classify variants of uncertain significance (VUS) into somatic vs. germline. However, assays used in molecular diagnostics focus on known somatic alterations in cancer genes and often only sequence tumors. Therefore, an algorithm that reliably classifies variants would be helpful for retrospective exploratory analyses. Contamination of tumor samples with normal cells results in differences in expected allelic fractions of germline and somatic variants, which can be exploited to accurately infer genotypes after adjusting for local copy number. However, existing algorithms for determining tumor purity, ploidy and copy number are not designed for unmatched short read sequencing data.

Results

We describe a methodology and corresponding open source software for estimating tumor purity, copy number, loss of heterozygosity (LOH), and contamination, and for classification of single nucleotide variants (SNVs) by somatic status and clonality. This R package, PureCN, is optimized for targeted short read sequencing data, integrates well with standard somatic variant detection pipelines, and has support for matched and unmatched tumor samples. Accuracy is demonstrated on simulated data and on real whole exome sequencing data.

Conclusions

Our algorithm provides accurate estimates of tumor purity and ploidy, even if matched normal samples are not available. This in turn allows accurate classification of SNVs. The software is provided as open source (Artistic License 2.0) R/Bioconductor package PureCN (http://bioconductor.org/packages/PureCN/).

相似文献

9.

phangorn: phylogenetic analysis in R 总被引：4，自引：0，他引：4

Schliep KP 《Bioinformatics (Oxford, England)》2011,27(4):592-593

SUMMARY: phangorn is a package for phylogenetic reconstruction and analysis in the R language. Previously it was only possible to estimate phylogenetic trees with distance methods in R. phangorn, now offers the possibility of reconstructing phylogenies with distance based methods, maximum parsimony or maximum likelihood (ML) and performing Hadamard conjugation. Extending the general ML framework, this package provides the possibility of estimating mixture and partition models. Furthermore, phangorn offers several functions for comparing trees, phylogenetic models or splits, simulating character data and performing congruence analyses. AVAILABILITY: phangorn can be obtained through the CRAN homepage http://cran.r-project.org/web/packages/phangorn/index.html. phangorn is licensed under GPL 2. 相似文献

10.

MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells

Zongzhi Liu Ao Li Vincent Schulz Min Chen David Tuck 《PloS one》2010,5(6)

相似文献

11.

Implementation of a gene expression index calculation method based on the PDNN model

Nielsen HB Gautier L Knudsen S 《Bioinformatics (Oxford, England)》2005,21(5):687-688

SUMMARY: Gene expression index calculations from Affymetrix GeneChips have been dominated by the Affymetrix MAS, dChip, and RMA methods. A new method to estimate the gene expression value utilizing the probe sequence information named position-dependent nearest-neighbor (PDNN) has been suggested by Zhang et al. (2003). Here we describe an open source implementation of the PDNN method for the statistical language R. AVAILABILITY: The package can be downloaded from http://www.bioconductor.org/repository/devel/package/html/affypdnn.html CONTACT: hbjorn@cbs.dtu.dk. 相似文献

12.

Prophet, a web-based tool for class prediction using microarray data

Medina I Montaner D Tárraga J Dopazo J 《Bioinformatics (Oxford, England)》2007,23(3):390-391

Sample classification and class prediction is the aim of many gene expression studies. We present a web-based application, Prophet, which builds prediction rules and allows using them for further sample classification. Prophet automatically chooses the best classifier, along with the optimal selection of genes, using a strategy that renders unbiased cross-validated errors. Prophet is linked to different microarray data analysis modules, and includes a unique feature: the possibility of performing the functional interpretation of the molecular signature found. Availability: Prophet can be found at the URL http://prophet.bioinfo.cipf.es/ or within the GEPAS package at http://www.gepas.org/ Supplementary information: http://gepas.bioinfo.cipf.es/tutorial/prophet.html. 相似文献

13.

Gibbs Recursive Sampler: finding transcription factor binding sites

Thompson W Rouchka EC Lawrence CE 《Nucleic acids research》2003,31(13):3580-3585

相似文献

14.

TROM: A Testing-Based Method for Finding Transcriptomic Similarity of Biological Samples

Wei Vivian Li Yiling Chen Jingyi Jessica Li 《Statistics in biosciences》2017,9(1):105-136

相似文献

15.

EXP-PAC: providing comparative analysis and storage of next generation gene expression data

Church PC Goscinski A Lefèvre C 《Genomics》2012,100(1):8-13

Microarrays and more recently RNA sequencing has led to an increase in available gene expression data. How to manage and store this data is becoming a key issue. In response we have developed EXP-PAC, a web based software package for storage, management and analysis of gene expression and sequence data. Unique to this package is SQL based querying of gene expression data sets, distributed normalization of raw gene expression data and analysis of gene expression data across experiments and species. This package has been populated with lactation data in the international milk genomic consortium web portal (http://milkgenomics.org/). Source code is also available which can be hosted on a Windows, Linux or Mac APACHE server connected to a private or public network (http://mamsap.it.deakin.edu.au/~pcc/Release/EXP_PAC.html). 相似文献

16.

A comparison of normalization methods for high density oligonucleotide array data based on variance and bias 总被引：74，自引：0，他引：74

Bolstad BM Irizarry RA Astrand M Speed TP 《Bioinformatics (Oxford, England)》2003,19(2):185-193

MOTIVATION: When running experiments that involve multiple high density oligonucleotide arrays, it is important to remove sources of variation between arrays of non-biological origin. Normalization is a process for reducing this variation. It is common to see non-linear relations between arrays and the standard normalization provided by Affymetrix does not perform well in these situations. RESULTS: We present three methods of performing normalization at the probe intensity level. These methods are called complete data methods because they make use of data from all arrays in an experiment to form the normalizing relation. These algorithms are compared to two methods that make use of a baseline array: a one number scaling based algorithm and a method that uses a non-linear normalizing relation by comparing the variability and bias of an expression measure. Two publicly available datasets are used to carry out the comparisons. The simplest and quickest complete data method is found to perform favorably. AVAILABILITY: Software implementing all three of the complete data normalization methods is available as part of the R package Affy, which is a part of the Bioconductor project http://www.bioconductor.org. SUPPLEMENTARY INFORMATION: Additional figures may be found at http://www.stat.berkeley.edu/~bolstad/normalize/index.html 相似文献

17.

OTUbase: an R infrastructure package for operational taxonomic unit data

Beck D Settles M Foster JA 《Bioinformatics (Oxford, England)》2011,27(12):1700-1701

SUMMARY: OTUbase is an R package designed to facilitate the analysis of operational taxonomic unit (OTU) data and sequence classification (taxonomic) data. Currently there are programs that will cluster sequence data into OTUs and/or classify sequence data into known taxonomies. However, there is a need for software that can take the summarized output of these programs and organize it into easily accessed and manipulated formats. OTUbase provides this structure and organization within R, to allow researchers to easily manipulate the data with the rich library of R packages currently available for additional analysis. AVAILABILITY: OTUbase is an R package available through Bioconductor. It can be found at http://www.bioconductor.org/packages/release/bioc/html/OTUbase.html. 相似文献

18.

snp.plotter: an R-based SNP/haplotype association and linkage disequilibrium plotting package

Luna A Nicodemus KK 《Bioinformatics (Oxford, England)》2007,23(6):774-776

snp.plotter is a newly developed R package which produces high-quality plots of results from genetic association studies. The main features of the package include options to display a linkage disequilibrium (LD) plot below the P-value plot using either the r2 or D' LD metric, to set the X-axis to equal spacing or to use the physical map of markers, and to specify plot labels, colors, symbols and LD heatmap color scheme. snp.plotter can plot single SNP and/or haplotype data and simultaneously plot multiple sets of results. R is a free software environment for statistical computing and graphics available for most platforms. The proposed package provides a simple way to convey both association and LD information in a single appealing graphic for genetic association studies. AVAILABILITY: Downloadable R package and example datasets are available at http://cbdb.nimh.nih.gov/~kristin/snp.plotter.html and http://www.r-project.org. 相似文献

19.

Gene Expression Dynamics Inspector (GEDI): for integrative analysis of expression profiles 总被引：2，自引：0，他引：2

Eichler GS Huang S Ingber DE 《Bioinformatics (Oxford, England)》2003,19(17):2321-2322

Genome-wide expression profiles contain global patterns that evade visual detection in current gene clustering analysis. Here, a Gene Expression Dynamics Inspector (GEDI) is described that uses self-organizing maps to translate high-dimensional expression profiles of time courses or sample classes into animated, coherent and robust mosaics images. GEDI facilitates identification of interesting patterns of molecular activity simultaneously across gene, time and sample space without prior assumption of any structure in the data, and then permits the user to retrieve genes of interest. Important changes in genome-wide activities may be quickly identified based on 'Gestalt' recognition and hence, GEDI may be especially useful for non-specialist end users, such as physicians. AVAILABILITY: GEDI v1.0 is written in Matlab, and binary Matlab.dll files which require Matlab to run can be downloaded for free by academic institutions at http://www.chip.org/~ge/gedihome.html Supplementary information: http://www.chip.org/~ge/gedihome.html 相似文献

20.

ATAQS: A computational software tool for high throughput transition optimization and validation for selected reaction monitoring mass spectrometry

Mi-Youn K Brusniak Sung-Tat Kwok Mark Christiansen David Campbell Lukas Reiter Paola Picotti Ulrike Kusebauch Hector Ramos Eric W Deutsch Jingchun Chen Robert L Moritz Ruedi Aebersold 《BMC bioinformatics》2011,12(1):1-15

Background

Copy number variants (CNVs), including deletions, amplifications, and other rearrangements, are common in human and cancer genomes. Copy number data from array comparative genome hybridization (aCGH) and next-generation DNA sequencing is widely used to measure copy number variants. Comparison of copy number data from multiple individuals reveals recurrent variants. Typically, the interior of a recurrent CNV is examined for genes or other loci associated with a phenotype. However, in some cases, such as gene truncations and fusion genes, the target of variant lies at the boundary of the variant.

Results

We introduce Neighborhood Breakpoint Conservation (NBC), an algorithm for identifying rearrangement breakpoints that are highly conserved at the same locus in multiple individuals. NBC detects recurrent breakpoints at varying levels of resolution, including breakpoints whose location is exactly conserved and breakpoints whose location varies within a gene. NBC also identifies pairs of recurrent breakpoints such as those that result from fusion genes. We apply NBC to aCGH data from 36 primary prostate tumors and identify 12 novel rearrangements, one of which is the well-known TMPRSS2-ERG fusion gene. We also apply NBC to 227 glioblastoma tumors and predict 93 novel rearrangements which we further classify as gene truncations, germline structural variants, and fusion genes. A number of these variants involve the protein phosphatase PTPN12 suggesting that deregulation of PTPN12, via a variety of rearrangements, is common in glioblastoma.

Conclusions

We demonstrate that NBC is useful for detection of recurrent breakpoints resulting from copy number variants or other structural variants, and in particular identifies recurrent breakpoints that result in gene truncations or fusion genes. Software is available at http://http.//cs.brown.edu/people/braphael/software.html. 相似文献