期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Efficient algorithms for training the parameters of hidden Markov models using stochastic expectation maximization (EM) training and Viterbi training

Tin Y Lam Irmtraud M Meyer 《Algorithms for molecular biology : AMB》2010,5(1):38

Background

Hidden Markov models are widely employed by numerous bioinformatics programs used today. Applications range widely from comparative gene prediction to time-series analyses of micro-array data. The parameters of the underlying models need to be adjusted for specific data sets, for example the genome of a particular species, in order to maximize the prediction accuracy. Computationally efficient algorithms for parameter training are thus key to maximizing the usability of a wide range of bioinformatics applications. 相似文献

2.

Multithreaded comparative RNA secondary structure prediction using stochastic context-free grammars

Zsuzsanna Sükösd Bjarne Knudsen Morten Værum Jørgen Kjems Ebbe S Andersen 《BMC bioinformatics》2011,12(1):103

Background

The prediction of the structure of large RNAs remains a particular challenge in bioinformatics, due to the computational complexity and low levels of accuracy of state-of-the-art algorithms. The pfold model couples a stochastic context-free grammar to phylogenetic analysis for a high accuracy in predictions, but the time complexity of the algorithm and underflow errors have prevented its use for long alignments. Here we present PPfold, a multithreaded version of pfold, which is capable of predicting the structure of large RNA alignments accurately on practical timescales. 相似文献

3.

Impact of RNA structure on the prediction of donor and acceptor splice sites

Sayed-Amir Marashi Changiz Eslahchi Hamid Pezeshk Mehdi Sadeghi 《BMC bioinformatics》2006,7(1):297-8

Background

gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural prediction algorithms. 相似文献

4.

Improvement in accuracy of multiple sequence alignment using novel group-to-group sequence alignment algorithm with piecewise linear gap cost

Shinsuke Yamada Osamu Gotoh Hayato Yamana 《BMC bioinformatics》2006,7(1):524-17

Background

Multiple sequence alignment (MSA) is a useful tool in bioinformatics. Although many MSA algorithms have been developed, there is still room for improvement in accuracy and speed. In the alignment of a family of protein sequences, global MSA algorithms perform better than local ones in many cases, while local ones perform better than global ones when some sequences have long insertions or deletions (indels) relative to others. Many recent leading MSA algorithms have incorporated pairwise alignment information obtained from a mixture of sources into their scoring system to improve accuracy of alignment containing long indels. 相似文献

5.

SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms

Tim Van den Bulcke Koenraad Van Leemput Bart Naudts Piet van Remortel Hongwu Ma Alain Verschoren Bart De Moor Kathleen Marchal 《BMC bioinformatics》2006,7(1):43

Background

The development of algorithms to infer the structure of gene regulatory networks based on expression data is an important subject in bioinformatics research. Validation of these algorithms requires benchmark data sets for which the underlying network is known. Since experimental data sets of the appropriate size and design are usually not available, there is a clear need to generate well-characterized synthetic data sets that allow thorough testing of learning algorithms in a fast and reproducible manner. 相似文献

6.

FITBAR: a web tool for the robust prediction of prokaryotic regulons

Jacques Oberto 《BMC bioinformatics》2010,11(1):554

Background

The binding of regulatory proteins to their specific DNA targets determines the accurate expression of the neighboring genes. The in silico prediction of new binding sites in completely sequenced genomes is a key aspect in the deeper understanding of gene regulatory networks. Several algorithms have been described to discriminate against false-positives in the prediction of new binding targets; however none of them has been implemented so far to assist the detection of binding sites at the genomic scale. 相似文献

7.

Representative transcript sets for evaluating a translational initiation sites predictor

Jia Zeng Reda Alhajj Douglas J Demetrick 《BMC bioinformatics》2009,10(1):206

Background

Translational initiation site (TIS) prediction is a very important and actively studied topic in bioinformatics. In order to complete a comparative analysis, it is desirable to have several benchmark data sets which can be used to test the effectiveness of different algorithms. An ideal benchmark data set should be reliable, representative and readily available. Preferably, proteins encoded by members of the data set should also be representative of the protein population actually expressed in cellular specimens. 相似文献

8.

Protein secondary structure prediction for a single-sequence using hidden semi-Markov models

Zafer Aydin Yucel Altunbasak Mark Borodovsky 《BMC bioinformatics》2006,7(1):178-15

Background

The accuracy of protein secondary structure prediction has been improving steadily towards the 88% estimated theoretical limit. There are two types of prediction algorithms: Single-sequence prediction algorithms imply that information about other (homologous) proteins is not available, while algorithms of the second type imply that information about homologous proteins is available, and use it intensively. The single-sequence algorithms could make an important contribution to studies of proteins with no detected homologs, however the accuracy of protein secondary structure prediction from a single-sequence is not as high as when the additional evolutionary information is present. 相似文献

9.

A simplified approach to disulfide connectivity prediction from protein sequences

Marc Vincent Andrea Passerini Matthieu Labbé Paolo Frasconi 《BMC bioinformatics》2008,9(1):20

Background

Prediction of disulfide bridges from protein sequences is useful for characterizing structural and functional properties of proteins. Several methods based on different machine learning algorithms have been applied to solve this problem and public domain prediction services exist. These methods are however still potentially subject to significant improvements both in terms of prediction accuracy and overall architectural complexity. 相似文献

10.

GASP: Gapped Ancestral Sequence Prediction for proteins

Richard?J?Edwards Email author Denis?C?Shields 《BMC bioinformatics》2004,5(1):123

Background

The prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments. 相似文献

11.

Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps

Tomas Ohlson Varun Aggarwal Arne Elofsson Robert M MacCallum 《BMC bioinformatics》2006,7(1):357

Background

Protein sequence alignment is one of the basic tools in bioinformatics. Correct alignments are required for a range of tasks including the derivation of phylogenetic trees and protein structure prediction. Numerous studies have shown that the incorporation of predicted secondary structure information into alignment algorithms improves their performance. Secondary structure predictors have to be trained on a set of somewhat arbitrarily defined states (e.g. helix, strand, coil), and it has been shown that the choice of these states has some effect on alignment quality. However, it is not unlikely that prediction of other structural features also could provide an improvement. In this study we use an unsupervised clustering method, the self-organizing map, to assign sequence profile windows to "structural states" and assess their use in sequence alignment. 相似文献

12.

Rank-based edge reconstruction for scale-free genetic regulatory networks

Guanrao Chen Peter Larsen Eyad Almasri Yang Dai 《BMC bioinformatics》2008,9(1):75

Background

The reconstruction of genetic regulatory networks from microarray gene expression data has been a challenging task in bioinformatics. Various approaches to this problem have been proposed, however, they do not take into account the topological characteristics of the targeted networks while reconstructing them. 相似文献

13.

A comprehensive comparison of comparative RNA structure prediction approaches

Paul?P?Gardner Email author Robert?Giegerich 《BMC bioinformatics》2004,5(1):140

Background

An increasing number of researchers have released novel RNA structure analysis and prediction algorithms for comparative approaches to structure prediction. Yet, independent benchmarking of these algorithms is rarely performed as is now common practice for protein-folding, gene-finding and multiple-sequence-alignment algorithms. 相似文献

14.

A comparison of common programming languages used in bioinformatics

Mathieu Fourment Michael R Gillings 《BMC bioinformatics》2008,9(1):82

Background

The performance of different programming languages has previously been benchmarked using abstract mathematical algorithms, but not using standard bioinformatics algorithms. We compared the memory usage and speed of execution for three standard bioinformatics methods, implemented in programs using one of six different programming languages. Programs for the Sellers algorithm, the Neighbor-Joining tree construction algorithm and an algorithm for parsing BLAST file outputs were implemented in C, C++, C#, Java, Perl and Python. 相似文献

15.

BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC

Rahul Satija Ádám Novák István Miklós Rune Lyngsø Jotun Hein 《BMC evolutionary biology》2009,9(1):217-14

Background

We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden markov model algorithms to analyze up to four sequences. 相似文献

16.

Pol II promoter prediction using characteristic 4-mer motifs: a machine learning approach

Firoz Anwar Syed Murtuza Baker Taskeed Jabid Md Mehedi Hasan Mohammad Shoyaib Haseena Khan Ray Walshe 《BMC bioinformatics》2008,9(1):414

Background

Eukaryotic promoter prediction using computational analysis techniques is one of the most difficult jobs in computational genomics that is essential for constructing and understanding genetic regulatory networks. The increased availability of sequence data for various eukaryotic organisms in recent years has necessitated for better tools and techniques for the prediction and analysis of promoters in eukaryotic sequences. Many promoter prediction methods and tools have been developed to date but they have yet to provide acceptable predictive performance. One obvious criteria to improve on current methods is to devise a better system for selecting appropriate features of promoters that distinguish them from non-promoters. Secondly improved performance can be achieved by enhancing the predictive ability of the machine learning algorithms used. 相似文献

17.

Characterization and simulation of cDNA microarray spots using a novel mathematical model

Hye Young Kim Seo Eun Lee Min Jung Kim Jin Il Han Bo Kyung Kim Yong Sung Lee Young Seek Lee Jin Hyuk Kim 《BMC bioinformatics》2007,8(1):485

Background

The quality of cDNA microarray data is crucial for expanding its application to other research areas, such as the study of gene regulatory networks. Despite the fact that a number of algorithms have been suggested to increase the accuracy of microarray gene expression data, it is necessary to obtain reliable microarray images by improving wet-lab experiments. As the first step of a cDNA microarray experiment, spotting cDNA probes is critical to determining the quality of spot images. 相似文献

18.

Testing the additional predictive value of high-dimensional molecular data

Anne-Laure Boulesteix Torsten Hothorn 《BMC bioinformatics》2010,11(1):78

Background

While high-dimensional molecular data such as microarray gene expression data have been used for disease outcome prediction or diagnosis purposes for about ten years in biomedical research, the question of the additional predictive value of such data given that classical predictors are already available has long been under-considered in the bioinformatics literature. 相似文献

19.

Bioclipse: an open source workbench for chemo- and bioinformatics

Ola Spjuth Tobias Helmus Egon L Willighagen Stefan Kuhn Martin Eklund Johannes Wagener Peter Murray-Rust Christoph Steinbeck Jarl ES Wikberg 《BMC bioinformatics》2007,8(1):59

Background

There is a need for software applications that provide users with a complete and extensible toolkit for chemo- and bioinformatics accessible from a single workbench. Commercial packages are expensive and closed source, hence they do not allow end users to modify algorithms and add custom functionality. Existing open source projects are more focused on providing a framework for integrating existing, separately installed bioinformatics packages, rather than providing user-friendly interfaces. No open source chemoinformatics workbench has previously been published, and no sucessful attempts have been made to integrate chemo- and bioinformatics into a single framework. 相似文献

20.

Reliable transfer of transcriptional gene regulatory networks between taxonomically related organisms

Jan Baumbach Sven Rahmann Andreas Tauch 《BMC systems biology》2009,3(1):8-8

相似文献