期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read

Juan Falgueras Antonio J Lara Noé Fernández-Pozo Francisco R Cantón Guillermo Pérez-Trabado M Gonzalo Claros 《BMC bioinformatics》2010,11(1):38

Background

High-throughput automated sequencing has enabled an exponential growth rate of sequencing data. This requires increasing sequence quality and reliability in order to avoid database contamination with artefactual sequences. The arrival of pyrosequencing enhances this problem and necessitates customisable pre-processing algorithms. 相似文献

2.

Making sense of EST sequences by CLOBBing them

John Parkinson David B Guiliano Mark Blaxter 《BMC bioinformatics》2002,3(1):31-8

相似文献

3.

An integrative variant analysis suite for whole exome next-generation sequencing data

Danny Challis Jin Yu Uday S Evani Andrew R Jackson Sameer Paithankar Cristian Coarfa Aleksandar Milosavljevic Richard A Gibbs Fuli Yu 《BMC bioinformatics》2012,13(1):8

Background

Whole exome capture sequencing allows researchers to cost-effectively sequence the coding regions of the genome. Although the exome capture sequencing methods have become routine and well established, there is currently a lack of tools specialized for variant calling in this type of data. 相似文献

4.

Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map

Jean-Marc Celton Alan Christoffels Daniel J Sargent Xiangming Xu D Jasper G Rees 《BMC biology》2010,8(1):155

Background

Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method. 相似文献

5.

Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing

Cristian Coarfa Fuli Yu Christopher A Miller Zuozhou Chen R Alan Harris Aleksandar Milosavljevic 《BMC bioinformatics》2010,11(1):572

Background

Massively parallel sequencing readouts of epigenomic assays are enabling integrative genome-wide analyses of genomic and epigenomic variation. Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including ChIP-Seq and methylome mapping by whole-genome bisulfite sequencing. 相似文献

6.

Gene capture prediction and overlap estimation in EST sequencing from one or multiple libraries

Ji-Ping?Z?Wang Email author Bruce?G?Lindsay Liying?Cui P?Kerr?Wall Josh?Marion Jiaxuan?Zhang Claude?W?dePamphilis 《BMC bioinformatics》2005,6(1):300

Background

In expressed sequence tag (EST) sequencing, we are often interested in how many genes we can capture in an EST sample of a targeted size. This information provides insights to sequencing efficiency in experimental design, as well as clues to the diversity of expressed genes in the tissue from which the library was constructed. 相似文献

7.

Finding sRNA generative locales from high-throughput sequencing data with NiBLS

Daniel MacLean Vincent Moulton David J Studholme 《BMC bioinformatics》2010,11(1):93

Background

Next-generation sequencing technologies allow researchers to obtain millions of sequence reads in a single experiment. One important use of the technology is the sequencing of small non-coding regulatory RNAs and the identification of the genomic locales from which they originate. Currently, there is a paucity of methods for finding small RNA generative locales. 相似文献

8.

<Emphasis Type="Italic">Tracembler</Emphasis> – software for <Emphasis Type="Italic">in-silico</Emphasis> chromosome walking in unassembled genomes

Qunfeng Dong Matthew D Wilkerson Volker Brendel 《BMC bioinformatics》2007,8(1):151

Background

Whole genome shotgun sequencing produces increasingly higher coverage of a genome with random sequence reads. Progressive whole genome assembly and eventual finishing sequencing is a process that typically takes several years for large eukaryotic genomes. In the interim, all sequence reads of public sequencing projects are made available in repositories such as the NCBI Trace Archive. For a particular locus, sequencing coverage may be high enough early on to produce a reliable local genome assembly. We have developed software, Tracembler, that facilitates in silico chromosome walking by recursively assembling reads of a selected species from the NCBI Trace Archive starting with reads that significantly match sequence seeds supplied by the user. 相似文献

9.

MEMOSys: Bioinformatics platform for genome-scale metabolic models

Stephan Pabinger Robert Rader Rasmus Agren Jens Nielsen Zlatko Trajanoski 《BMC systems biology》2011,5(1):20

Background

Recent advances in genomic sequencing have enabled the use of genome sequencing in standard biological and biotechnological research projects. The challenge is how to integrate the large amount of data in order to gain novel biological insights. One way to leverage sequence data is to use genome-scale metabolic models. We have therefore designed and implemented a bioinformatics platform which supports the development of such metabolic models. 相似文献

10.

Aspects of coverage in medical DNA sequencing

Michael C Wendl Richard K Wilson 《BMC bioinformatics》2008,9(1):239

Background

DNA sequencing is now emerging as an important component in biomedical studies of diseases like cancer. Short-read, highly parallel sequencing instruments are expected to be used heavily for such projects, but many design specifications have yet to be conclusively established. Perhaps the most fundamental of these is the redundancy required to detect sequence variations, which bears directly upon genomic coverage and the consequent resolving power for discerning somatic mutations. 相似文献

11.

FastGroupII: A web-based bioinformatics platform for analyses of large 16S rDNA libraries

Yanan Yu Mya Breitbart Pat McNairnie Forest Rohwer 《BMC bioinformatics》2006,7(1):57

Background

High-throughput sequencing makes it possible to rapidly obtain thousands of 16S rDNA sequences from environmental samples. Bioinformatic tools for the analyses of large 16S rDNA sequence databases are needed to comprehensively describe and compare these datasets. 相似文献

12.

Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes

Matthew Parks Richard Cronn Aaron Liston 《BMC biology》2009,7(1):84

Background

Molecular evolutionary studies share the common goal of elucidating historical relationships, and the common challenge of adequately sampling taxa and characters. Particularly at low taxonomic levels, recent divergence, rapid radiations, and conservative genome evolution yield limited sequence variation, and dense taxon sampling is often desirable. Recent advances in massively parallel sequencing make it possible to rapidly obtain large amounts of sequence data, and multiplexing makes extensive sampling of megabase sequences feasible. Is it possible to efficiently apply massively parallel sequencing to increase phylogenetic resolution at low taxonomic levels? 相似文献

13.

Clustering of protein domains for functional and evolutionary studies

Pavle Goldstein Jurica Zucko Du?ica Vujaklija Anita Kri?ko Daslav Hranueli Paul F Long Catherine Etchebest Bojan Basrak John Cullum 《BMC bioinformatics》2009,10(1):335

Background

The number of protein family members defined by DNA sequencing is usually much larger than those characterised experimentally. This paper describes a method to divide protein families into subtypes purely on sequence criteria. Comparison with experimental data allows an independent test of the quality of the clustering. 相似文献

14.

Extension of Lander-Waterman theory for sequencing filtered DNA libraries

Michael?C?Wendl Email author W?Brad?Barbazuk 《BMC bioinformatics》2005,6(1):245

Background

The degree to which conventional DNA sequencing techniques will be successful for highly repetitive genomes is unclear. Investigators are therefore considering various filtering methods to select against high-copy sequence in DNA clone libraries. The standard model for random sequencing, Lander-Waterman theory, does not account for two important issues in such libraries, discontinuities and position-based sampling biases (the so-called "edge effect"). We report an extension of the theory for analyzing such configurations. 相似文献

15.

Large scale clustering of protein sequences with FORCE -A layout based heuristic for weighted cluster editing

Tobias Wittkop Jan Baumbach Francisco P Lobo Sven Rahmann 《BMC bioinformatics》2007,8(1):396

Background

Detecting groups of functionally related proteins from their amino acid sequence alone has been a long-standing challenge in computational genome research. Several clustering approaches, following different strategies, have been published to attack this problem. Today, new sequencing technologies provide huge amounts of sequence data that has to be efficiently clustered with constant or increased accuracy, at increased speed. 相似文献

16.

Predicting phenotypic traits of prokaryotes from protein domain frequencies

Thomas Lingner Stefanie Mühlhausen Toni Gabaldón Cedric Notredame Peter Meinicke 《BMC bioinformatics》2010,11(1):481

Background

Establishing the relationship between an organism's genome sequence and its phenotype is a fundamental challenge that remains largely unsolved. Accurately predicting microbial phenotypes solely based on genomic features will allow us to infer relevant phenotypic characteristics when the availability of a genome sequence precedes experimental characterization, a scenario that is favored by the advent of novel high-throughput and single cell sequencing techniques. 相似文献

17.

BSMAP: whole genome bisulfite sequence MAPping program

Yuanxin Xi Wei Li 《BMC bioinformatics》2009,10(1):232-9

Background

Bisulfite sequencing is a powerful technique to study DNA cytosine methylation. Bisulfite treatment followed by PCR amplification specifically converts unmethylated cytosines to thymine. Coupled with next generation sequencing technology, it is able to detect the methylation status of every cytosine in the genome. However, mapping high-throughput bisulfite reads to the reference genome remains a great challenge due to the increased searching space, reduced complexity of bisulfite sequence, asymmetric cytosine to thymine alignments, and multiple CpG heterogeneous methylation. 相似文献

18.

Local alignment of two-base encoded DNA sequence

Nils Homer Barry Merriman Stanley F Nelson 《BMC bioinformatics》2009,10(1):175-11

Background

DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. 相似文献

19.

preAssemble: a tool for automatic sequencer trace data processing

Alexei A Adzhubei Jon K Laerdahl Anna V Vlasova 《BMC bioinformatics》2006,7(1):22-5

Background

Trace or chromatogram files (raw data) are produced by automatic nucleic acid sequencing equipment or sequencers. Each file contains information which can be interpreted by specialised software to reveal the sequence (base calling). This is done by the sequencer proprietary software or publicly available programs. Depending on the size of a sequencing project the number of trace files can vary from just a few to thousands of files. Sequencing quality assessment on various criteria is important at the stage preceding clustering and contig assembly. Two major publicly available packages – Phred and Staden are used by preAssemble to perform sequence quality processing. 相似文献

20.

JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

Chunguang Liang Alexander Schmid María José López-Sánchez Andres Moya Roy Gross J?rg Bernhardt Thomas Dandekar 《BMC bioinformatics》2009,10(1):391

相似文献