
1.

methylKit: a comprehensive R package for the analysis of genome-wide DNA methylation profiles

Altuna Akalin Matthias Kormaksson Sheng Li Francine E Garrett-Bakelman Maria E Figueroa Ari Melnick Christopher E Mason 《Genome biology》2012,13(10):R87

DNA methylation is a chemical modification of cytosine bases that is pivotal for gene regulation, cellular specification and cancer development. Here, we describe an R package, methylKit, that rapidly analyzes genome-wide cytosine epigenetic profiles from high-throughput methylation and hydroxymethylation sequencing experiments. methylKit includes functions for clustering, sample quality visualization, differential methylation analysis and annotation features, thus automating and simplifying many of the steps for discerning statistically significant bases or regions of DNA methylation. Finally, we demonstrate methylKit on breast cancer data, in which we find statistically significant regions of differential methylation and stratify tumor subtypes. methylKit is available at http://code.google.com/p/methylkit.

2.

Fast and Sensitive Alignment of Microbial Whole Genome Sequencing Reads to Large Sequence Datasets on a Desktop PC: Application to Metagenomic Datasets and Pathogen Identification

Lőrinc S. Pongor Roberto Vera Balázs Ligeti 《PloS one》2014,9(7)

Next generation sequencing (NGS) of metagenomic samples is becoming a standard approach to detect individual species or pathogenic strains of microorganisms. Computer programs used in the NGS community have to balance between speed and sensitivity and as a result, species or strain level identification is often inaccurate and low abundance pathogens can sometimes be missed. We have developed Taxoner, an open source, taxon assignment pipeline that includes a fast aligner (e.g. Bowtie2) and a comprehensive DNA sequence database. We tested the program on simulated datasets as well as experimental data from Illumina, IonTorrent, and Roche 454 sequencing platforms. We found that Taxoner performs as well as, and often better than BLAST, but requires two orders of magnitude less running time, meaning that it can be run on desktop or laptop computers. Taxoner is slower than the approaches that use small marker databases but is more sensitive due to the comprehensive reference database. In addition, it can be easily tuned to specific applications using small tailored databases. When applied to metagenomic datasets, Taxoner can provide a functional summary of the genes mapped and can provide strain level identification. Taxoner is written in C for Linux operating systems. The code and documentation are available for research applications at http://code.google.com/p/taxoner.

3.

Masking as an effective quality control method for next-generation sequencing data analysis

Sajung Yun Sijung Yun 《BMC bioinformatics》2014,15(1)


4.

MOABS: model based analysis of bisulfite sequencing data

Deqiang Sun Yuanxin Xi Benjamin Rodriguez Hyun Jung Park Pan Tong Mira Meong Margaret A Goodell Wei Li 《Genome biology》2014,15(2):R38

Bisulfite sequencing (BS-seq) is the gold standard for studying genome-wide DNA methylation. We developed MOABS to increase the speed, accuracy, statistical power and biological relevance of BS-seq data analysis. MOABS detects differential methylation with 10-fold coverage at single-CpG resolution based on a Beta-Binomial hierarchical model and is capable of processing two billion reads in 24 CPU hours. Here, using simulated and real BS-seq data, we demonstrate that MOABS outperforms other leading algorithms, such as Fisher’s exact test and BSmooth. Furthermore, MOABS analysis can be easily extended to differential 5hmC analysis using RRBS and oxBS-seq. MOABS is available at http://code.google.com/p/moabs/.
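
The abstract above names Fisher's exact test as one of the baselines MOABS is compared against. As a rough illustration of that baseline only (this is not MOABS's Beta-Binomial model), the sketch below tests a single CpG from methylated and unmethylated read counts in two samples. The function name and the example counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(meth_a, unmeth_a, meth_b, unmeth_b):
    """Two-sided Fisher's exact test p-value for a 2x2 table of read counts."""
    row_a = meth_a + unmeth_a          # total reads in sample A
    row_b = meth_b + unmeth_b          # total reads in sample B
    col_meth = meth_a + meth_b         # methylated reads across both samples
    denom = comb(row_a + row_b, col_meth)

    def prob(k):
        # Hypergeometric probability that sample A holds k of the methylated reads
        return comb(row_a, k) * comb(row_b, col_meth - k) / denom

    observed = prob(meth_a)
    lowest = max(0, col_meth - row_b)
    highest = min(row_a, col_meth)
    # Sum the probabilities of every table no more likely than the observed one
    p_value = sum(
        prob(k)
        for k in range(lowest, highest + 1)
        if prob(k) <= observed * (1 + 1e-9)
    )
    return min(1.0, p_value)


# Hypothetical CpG: 3 of 4 reads methylated in sample A, 1 of 4 in sample B
p = fisher_exact_two_sided(3, 1, 1, 3)
```

With counts this small the test has little power, which is one reason model-based approaches that pool information across sites are attractive at low coverage.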

5.

DistMap: A Toolkit for Distributed Short Read Mapping on a Hadoop Cluster

Ram Vinay Pandey Christian Schlötterer 《PloS one》2013,8(8)

With the rapid and steady increase of next generation sequencing data output, the mapping of short reads has become a major data analysis bottleneck. On a single computer, it can take several days to map the vast quantity of reads produced from a single Illumina HiSeq lane. In an attempt to ameliorate this bottleneck we present a new tool, DistMap - a modular, scalable and integrated workflow to map reads in the Hadoop distributed computing framework. DistMap is easy to use, currently supports nine different short read mapping tools and can be run on all Unix-based operating systems. It accepts reads in FASTQ format as input and provides mapped reads in a SAM/BAM format. DistMap supports both paired-end and single-end reads thereby allowing the mapping of read data produced by different sequencing platforms. DistMap is available from http://code.google.com/p/distmap/

6.

Comparison of Metatranscriptomic Samples Based on k-Tuple Frequencies

Ying Wang Lin Liu Lina Chen Ting Chen Fengzhu Sun 《PloS one》2014,9(1)


7.

PathVisio 3: An Extendable Pathway Analysis Toolbox

Martina Kutmon Martijn P. van Iersel Anwesha Bohler Thomas Kelder Nuno Nunes Alexander R. Pico Chris T. Evelo 《PLoS computational biology》2015,11(2)

PathVisio is a commonly used pathway editor, visualization and analysis software. Biological pathways have been used by biologists for many years to describe the detailed steps in biological processes. Those powerful, visual representations help researchers to better understand, share and discuss knowledge. Since the first publication of PathVisio in 2008, the original paper was cited more than 170 times and PathVisio was used in many different biological studies. As an online editor PathVisio is also integrated in the community curated pathway database WikiPathways. Here we present the third version of PathVisio with the newest additions and improvements of the application. The core features of PathVisio are pathway drawing, advanced data visualization and pathway statistics. Additionally, PathVisio 3 introduces a new powerful extension system that allows other developers to contribute additional functionality in the form of plugins without changing the core application. PathVisio can be downloaded from http://www.pathvisio.org and in 2014 PathVisio 3 was downloaded over 5,500 times. There are already more than 15 plugins available in the central plugin repository. PathVisio is a freely available, open-source tool published under the Apache 2.0 license (http://www.apache.org/licenses/LICENSE-2.0). It is implemented in Java and thus runs on all major operating systems. The code repository is available at http://svn.bigcat.unimaas.nl/pathvisio. The support mailing list for users is available on https://groups.google.com/forum/#!forum/wikipathways-discuss and for developers on https://groups.google.com/forum/#!forum/wikipathways-devel.

This is a PLOS Computational Biology software article.


8.

Dynamic evolution of clonal epialleles revealed by methclone

Sheng Li Francine Garrett-Bakelman Alexander E Perl Selina M Luger Chao Zhang Bik L To Ian D Lewis Anna L Brown Richard J D’Andrea M Elizabeth Ross Ross Levine Martin Carroll Ari Melnick Christopher E Mason 《Genome biology》2014,15(9)

We describe methclone, a novel method to identify epigenetic loci that harbor large changes in the clonality of their epialleles (epigenetic alleles). Methclone efficiently analyzes genome-wide DNA methylation sequencing data. We quantify the changes using a composition entropy difference calculation and also introduce a new measure of global clonality shift, loci with epiallele shift per million loci covered, which enables comparisons between different samples to gauge overall epiallelic dynamics. Finally, we demonstrate the utility of methclone in capturing functional epiallele shifts in leukemia patients from diagnosis to relapse. Methclone is open-source and freely available at https://code.google.com/p/methclone.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0472-5) contains supplementary material, which is available to authorized users.
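
The methclone abstract describes quantifying clonality changes with a composition entropy difference but does not give the formula. As a loose illustration, the sketch below uses the Shannon entropy of epiallele pattern frequencies at one locus as a stand-in. That choice, the function names, and the toy read patterns are all assumptions and should not be read as methclone's actual calculation.

```python
from collections import Counter
from math import log2


def composition_entropy(patterns):
    """Shannon entropy (bits) of the epiallele pattern frequencies at one locus."""
    counts = Counter(patterns)
    total = sum(counts.values())
    return -sum((n / total) * log2(n / total) for n in counts.values())


def entropy_shift(patterns_before, patterns_after):
    """Entropy at the second time point minus entropy at the first."""
    return composition_entropy(patterns_after) - composition_entropy(patterns_before)


# Hypothetical reads over four CpGs: 1 = methylated, 0 = unmethylated
diagnosis = ["1111", "0000", "1100", "0011"]  # four equally frequent epialleles
relapse = ["1111", "1111", "1111", "1111"]    # a single dominant epiallele
shift = entropy_shift(diagnosis, relapse)     # negative: composition became more clonal
```

A strongly negative shift at a locus indicates that one epiallele has come to dominate, which is the kind of change the abstract describes tracking from diagnosis to relapse.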

9.

Integrating biological pathways and genomic profiles with ChiBE 2

Özgün Babur Ugur Dogrusoz Merve Çakır Bülent Arman Aksoy Nikolaus Schultz Chris Sander Emek Demir 《BMC genomics》2014,15(1)

Background

Dynamic visual exploration of detailed pathway information can help researchers digest and interpret complex mechanisms and genomic datasets.

Results

ChiBE is a free, open-source software tool for visualizing, querying, and analyzing human biological pathways in BioPAX format. The recently released version 2 can search for neighborhoods, paths between molecules, and common regulators/targets of molecules, on large integrated cellular networks in the Pathway Commons database as well as in local BioPAX models. Resulting networks can be automatically laid out for visualization using a graphically rich, process-centric notation. Profiling data from the cBioPortal for Cancer Genomics and expression data from the Gene Expression Omnibus can be overlaid on these networks.

Conclusions

ChiBE’s new capabilities are organized around a genomics-oriented workflow and offer a unique comprehensive pathway analysis solution for genomics researchers. The software is freely available at http://code.google.com/p/chibe.

10.

Corset: enabling differential gene expression analysis for de novo assembled transcriptomes

Nadia M Davidson Alicia Oshlack 《Genome biology》2014,15(7)


11.

Inference of the Properties of the Recombination Process from Whole Bacterial Genomes

M. Azim Ansari Xavier Didelot 《Genetics》2014,196(1):253-265

Patterns of linkage disequilibrium, homoplasy, and incompatibility are difficult to interpret because they depend on several factors, including the recombination process and the population structure. Here we introduce a novel model-based framework to infer recombination properties from such summary statistics in bacterial genomes. The underlying model is sequentially Markovian so that data can be simulated very efficiently, and we use approximate Bayesian computation techniques to infer parameters. As this does not require us to calculate the likelihood function, the model can be easily extended to investigate less probed aspects of recombination. In particular, we extend our model to account for the bias in the recombination process whereby closely related bacteria recombine more often with one another. We show that this model provides a good fit to a data set of Bacillus cereus genomes and estimate several recombination properties, including the rate of bias in recombination. All the methods described in this article are implemented in a software package that is freely available for download at http://code.google.com/p/clonalorigin/.
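
The abstract above relies on approximate Bayesian computation (ABC), which infers parameters by simulating data and comparing summary statistics instead of evaluating a likelihood. The sketch below shows the simplest ABC variant, rejection sampling, on a deliberately toy simulator. The simulator, the uniform prior, the tolerance, and all names are assumptions for illustration and are unrelated to the authors' sequentially Markovian model.

```python
import random


def simulate_summary(rate, n_sites, rng):
    """Toy simulator: fraction of sites hit by an event that occurs with probability `rate`."""
    return sum(rng.random() < rate for _ in range(n_sites)) / n_sites


def abc_rejection(observed, n_sites, n_draws, tolerance, seed=0):
    """Keep prior draws whose simulated summary lies within `tolerance` of the observed one."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_draws):
        rate = rng.random()  # draw a candidate from a Uniform(0, 1) prior
        simulated = simulate_summary(rate, n_sites, rng)
        if abs(simulated - observed) <= tolerance:
            accepted.append(rate)
    return accepted


# Hypothetical observed summary statistic: 20% of 200 sites affected
posterior_sample = abc_rejection(observed=0.2, n_sites=200, n_draws=5000, tolerance=0.02)
posterior_mean = sum(posterior_sample) / len(posterior_sample)
```

The accepted draws approximate the posterior; tightening the tolerance improves the approximation at the cost of fewer accepted samples.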

12.

Exploratory Analysis of the Copy Number Alterations in Glioblastoma Multiforme

Pablo Freire Marco Vilela Helena Deus Yong-Wan Kim Dimpy Koul Howard Colman Kenneth D. Aldape Oliver Bogler W. K. Alfred Yung Kevin Coombes Gordon B. Mills Ana T. Vasconcelos Jonas S. Almeida 《PloS one》2008,3(12)


13.

GobyWeb: Simplified Management and Analysis of Gene Expression and DNA Methylation Sequencing Data

Kevin C. Dorff Nyasha Chambwe Zachary Zeno Manuele Simi Rita Shaknovich Fabien Campagne 《PloS one》2013,8(7)

We present GobyWeb, a web-based system that facilitates the management and analysis of high-throughput sequencing (HTS) projects. The software provides integrated support for a broad set of HTS analyses and offers a simple plugin extension mechanism. Analyses currently supported include quantification of gene expression for messenger and small RNA sequencing, estimation of DNA methylation (i.e., reduced bisulfite sequencing and whole genome methyl-seq), or the detection of pathogens in sequenced data. In contrast to previous analysis pipelines developed for analysis of HTS data, GobyWeb requires significantly less storage space, runs analyses efficiently on a parallel grid, scales gracefully to process tens or hundreds of multi-gigabyte samples, yet can be used effectively by researchers who are comfortable using a web browser. We conducted performance evaluations of the software and found it to either outperform or have similar performance to analysis programs developed for specialized analyses of HTS data. We found that most biologists who took a one-hour GobyWeb training session were readily able to analyze RNA-Seq data with state of the art analysis tools. GobyWeb can be obtained at http://gobyweb.campagnelab.org and is freely available for non-commercial use. GobyWeb plugins are distributed in source code and licensed under the open source LGPL3 license to facilitate code inspection, reuse and independent extensions http://github.com/CampagneLaboratory/gobyweb2-plugins.

14.

miRAFinder and GeneAFinder scripts: large-scale searching for miRNA and related information in indexed literature abstracts

Olga Berillo Mireille Régnier Anatoly Ivashchenko 《Bioinformation》2014,10(8):539-543


15.

Bringing non-human primate research into the post-genomic era: how monkeys are teaching us about elite controllers of HIV/AIDS

Eric J Vallender 《Genome biology》2014,15(11)

Whole-genome sequencing of Mauritian cynomolgus macaques reveals novel candidate loci for controlling simian immunodeficiency virus replication. See related Research, http://genomebiology.com/2014/15/11/478

16.

Hidden Markov Modeling with HMMTeacher

Camilo Fuentes-Beals Alejandro Valdés-Jiménez Gonzalo Riadi 《PLoS computational biology》2022,18(2)

Is it possible to learn and create a first Hidden Markov Model (HMM) without programming skills or understanding the algorithms in detail? In this concise tutorial, we present the HMM through the 2 general questions it was initially developed to answer and describe its elements. The HMM elements include variables, hidden and observed parameters, the vector of initial probabilities, and the transition and emission probability matrices. Then, we suggest a set of ordered steps, for modeling the variables and illustrate them with a simple exercise of modeling and predicting transmembrane segments in a protein sequence. Finally, we show how to interpret the results of the algorithms for this particular problem. To guide the process of information input and explicit solution of the basic HMM algorithms that answer the HMM questions posed, we developed an educational webserver called HMMTeacher. Additional solved HMM modeling exercises can be found in the user’s manual and answers to frequently asked questions. HMMTeacher is available at https://hmmteacher.mobilomics.org, mirrored at https://hmmteacher1.mobilomics.org. A repository with the code of the tool and the webpage is available at https://gitlab.com/kmilo.f/hmmteacher.
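
The HMM elements listed in the abstract above (initial probabilities, transition matrix, emission matrix) can be made concrete with a small decoding example. The sketch below runs the standard Viterbi algorithm on a two-state toy model of membrane versus soluble segments, with residues reduced to hydrophobic (H) or polar (P). All probabilities and names are invented for illustration and are not taken from HMMTeacher.

```python
from math import log

STATES = ("membrane", "soluble")
START = {"membrane": 0.5, "soluble": 0.5}           # vector of initial probabilities
TRANS = {                                           # transition probability matrix
    "membrane": {"membrane": 0.9, "soluble": 0.1},
    "soluble": {"membrane": 0.1, "soluble": 0.9},
}
EMIT = {                                            # emission probability matrix
    "membrane": {"H": 0.8, "P": 0.2},
    "soluble": {"H": 0.3, "P": 0.7},
}


def viterbi(observations):
    """Return the most probable hidden-state path for a non-empty symbol sequence."""
    # Log-probability of the best path ending in each state at the first position
    score = {s: log(START[s]) + log(EMIT[s][observations[0]]) for s in STATES}
    backpointers = []
    for symbol in observations[1:]:
        new_score, pointer = {}, {}
        for s in STATES:
            # Best predecessor state for arriving in state s at this position
            best_prev = max(STATES, key=lambda p: score[p] + log(TRANS[p][s]))
            pointer[s] = best_prev
            new_score[s] = score[best_prev] + log(TRANS[best_prev][s]) + log(EMIT[s][symbol])
        backpointers.append(pointer)
        score = new_score
    # Trace back from the best final state to recover the full path
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointer in reversed(backpointers):
        state = pointer[state]
        path.append(state)
    return path[::-1]


path = viterbi("HHHHPPPP")
```

Working in log space avoids numerical underflow on long sequences, which matters once the toy example is replaced by a real protein.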

17.

Rising from the crypt: decreasing DNA methylation during differentiation of the small intestine

Sean M Cullen Margaret A Goodell 《Genome biology》2013,14(5):116

The differentiation of intestinal stem cells involves few DNA methylation changes, assayed by bisulfite sequencing, in contrast to other adult somatic stem cell hierarchies. Please see related Research article: http://genomebiology.com/2013/14/5/R50

18.

Accurate Diagnostics for Bovine tuberculosis Based on High-Throughput Sequencing

Alexander Churbanov Brook Milligan 《PloS one》2012,7(11)


19.

PD5: A General Purpose Library for Primer Design Software

Michael C. Riley Wayne Aubrey Michael Young Amanda Clare 《PloS one》2013,8(11)

Background

Complex PCR applications for large genome-scale projects require fast, reliable and often highly sophisticated primer design software applications. Presently, such applications use pipelining methods to utilise many third party applications and this involves file parsing, interfacing and data conversion, which is slow and prone to error. A fully integrated suite of software tools for primer design would considerably improve the development time, the processing speed, and the reliability of bespoke primer design software applications.

Results

The PD5 software library is an open-source collection of classes and utilities, providing a complete collection of software building blocks for primer design and analysis. It is written in object-oriented C++ with an emphasis on classes suitable for efficient and rapid development of bespoke primer design programs. The modular design of the software library simplifies the development of specific applications and also integration with existing third party software where necessary. We demonstrate several applications created using this software library that have already proved to be effective, but we view the project as a dynamic environment for building primer design software and it is open for future development by the bioinformatics community. Therefore, the PD5 software library is published under the terms of the GNU General Public License, which guarantee access to source-code and allow redistribution and modification.

Conclusions

The PD5 software library is downloadable from Google Code and the accompanying Wiki includes instructions and examples: http://code.google.com/p/primer-design

20.

Allele Workbench: Transcriptome Pipeline and Interactive Graphics for Allele-Specific Expression

Carol A. Soderlund William M. Nelson Stephen A. Goff 《PloS one》2014,9(12)
