
1.

Predicting functional associations from metabolism using bi-partite network algorithms

Balaji Veeramani Joel S Bader 《BMC systems biology》2010,4(1):95

Background

Metabolic reconstructions contain detailed information about metabolic enzymes and their reactants and products. These networks can be used to infer functional associations between metabolic enzymes. Many methods are based on the number of metabolites shared by two enzymes, or the shortest path between two enzymes. Metabolite sharing can miss associations between non-consecutive enzymes in a serial pathway, and shortest-path algorithms are sensitive to high-degree metabolites such as water and ATP that create connections between enzymes with little functional similarity.
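The two baseline measures described in this abstract, and their weaknesses, can be sketched on a toy network. This is only an illustration under stated assumptions: the enzyme names (E1, E2, E3, E9), the metabolites, and both helper functions are hypothetical, and the sketch is not the bi-partite algorithm of the paper, which the abstract does not describe.

```python
from collections import deque

# Hypothetical toy network: enzyme -> metabolites it consumes or produces.
# E1 -> E2 -> E3 form a serial pathway; E9 is unrelated but also uses ATP.
network = {
    "E1": {"A", "B", "ATP"},
    "E2": {"B", "C"},
    "E3": {"C", "D"},
    "E9": {"X", "Y", "ATP"},
}

def shared_metabolites(net, e1, e2):
    """Number of metabolites that two enzymes have in common."""
    return len(net[e1] & net[e2])

def enzyme_distance(net, src, dst, excluded=frozenset()):
    """Breadth-first distance between enzymes, where two enzymes are
    adjacent if they share at least one non-excluded metabolite.
    Returns None when dst cannot be reached from src."""
    dist = {src: 0}
    queue = deque([src])
    while queue:
        enzyme = queue.popleft()
        if enzyme == dst:
            return dist[enzyme]
        for other in net:
            if other not in dist and (net[enzyme] & net[other]) - excluded:
                dist[other] = dist[enzyme] + 1
                queue.append(other)
    return None

# Metabolite sharing misses the non-consecutive pair E1/E3 in the pathway.
print(shared_metabolites(network, "E1", "E3"))
# The hub metabolite ATP makes the unrelated enzyme E9 look closer to E1
# than the pathway member E3 is.
print(enzyme_distance(network, "E1", "E3"), enzyme_distance(network, "E1", "E9"))
# Excluding the hub removes the spurious connection.
print(enzyme_distance(network, "E1", "E9", excluded={"ATP"}))
```

Excluding hubs by hand, as in the last call, is one common workaround; it is shown here only to make the sensitivity to high-degree metabolites concrete.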

2.

Efficient algorithms for the discovery of gapped factors

Alberto Apostolico Cinzia Pizzi Esko Ukkonen 《Algorithms for molecular biology : AMB》2011,6(1):5

Background

The discovery of surprisingly frequent patterns is of paramount interest in bioinformatics and computational biology. Among the patterns considered, those consisting of pairs of solid words that co-occur within a prescribed maximum distance (gapped factors) emerge in a variety of contexts of DNA and protein sequence analysis. A few algorithms and tools have been developed in connection with specific formulations of the problem; however, none can comprehensively handle each of the multiple ways in which the distance between the two terms in a pair may be defined.

3.

EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences

Jianjun Hu Yifeng D Yang Daisuke Kihara 《BMC bioinformatics》2006,7(1):342-13

Background

Understanding gene regulatory networks has become one of the central research problems in bioinformatics. More than thirty algorithms have been proposed to identify DNA regulatory sites during the past thirty years. However, the prediction accuracy of these algorithms is still quite low. Ensemble algorithms have emerged as an effective strategy in bioinformatics for improving the prediction accuracy by exploiting the synergetic prediction capability of multiple algorithms.

4.

Simulation of microarray data with realistic characteristics

Matti Nykter Tommi Aho Miika Ahdesmäki Pekka Ruusuvuori Antti Lehmussola Olli Yli-Harja 《BMC bioinformatics》2006,7(1):349-17

Background

Microarray technologies have become common tools in biological research. As a result, a need for effective computational methods for data analysis has emerged. Numerous different algorithms have been proposed for analyzing the data. However, an objective evaluation of the proposed algorithms is not possible due to the lack of biological ground truth information. To overcome this fundamental problem, the use of simulated microarray data for algorithm validation has been proposed.

5.

PCAS – a precomputed proteome annotation database resource

Zhang Y Yin Y Chen Y Gao G Yu P Luo J Jiang Y 《BMC genomics》2003,4(1):42

Background

Many model proteomes or "complete" sets of proteins of given organisms are now publicly available. Much effort has been invested in computational annotation of those "draft" proteomes. Motif or domain based algorithms play a pivotal role in functional classification of proteins. Employing most available computational algorithms, mainly motif or domain recognition algorithms, we set out to develop an online proteome annotation system with integrated proteome annotation data to complement existing resources.

6.

Statistical significance of cis-regulatory modules

Dustin E Schones Andrew D Smith Michael Q Zhang 《BMC bioinformatics》2007,8(1):19


7.

Evaluation of methods for detection of fluorescence labeled subcellular objects in microscope images

Pekka Ruusuvuori Tarmo Äijö Sharif Chowdhury Cecilia Garmendia-Torres Jyrki Selinummi Mirko Birbaumer Aimée M Dudley Lucas Pelkmans Olli Yli-Harja 《BMC bioinformatics》2010,11(1):248

Background

Several algorithms have been proposed for detecting fluorescently labeled subcellular objects in microscope images. Many of these algorithms have been designed for specific tasks and validated with limited image data. But despite the potential of using extensive comparisons between algorithms to provide useful information to guide method selection and thus more accurate results, relatively few studies have been performed.

8.

Considerations in the identification of functional RNA structural elements in genomic alignments

Tomas Babak Benjamin J Blencowe Timothy R Hughes 《BMC bioinformatics》2007,8(1):33

Background

Accurate identification of novel, functional noncoding (nc) RNA features in genome sequence has proven more difficult than for exons. Current algorithms identify and score potential RNA secondary structures on the basis of thermodynamic stability, conservation, and/or covariance in sequence alignments. Neither the algorithms nor the information gained from the individual inputs have been independently assessed. Furthermore, due to issues in modelling background signal, it has been difficult to gauge the precision of these algorithms on a genomic scale, in which even a seemingly small false-positive rate can result in a vast excess of false discoveries.

9.

A mass accuracy sensitive probability based scoring algorithm for database searching of tandem mass spectrometry data

Hua Xu Michael A Freitas 《BMC bioinformatics》2007,8(1):133

Background

Liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) has become one of the most used tools in mass spectrometry based proteomics. Various algorithms have since been developed to automate the process for modern high-throughput LC-MS/MS experiments.

10.

Computing paths and cycles in biological interaction graphs

Steffen Klamt Axel von Kamp 《BMC bioinformatics》2009,10(1):181

Background

Interaction graphs (signed directed graphs) provide an important qualitative modeling approach for Systems Biology. They enable the analysis of causal relationships in cellular networks and can even be useful for predicting qualitative aspects of systems dynamics. Fundamental issues in the analysis of interaction graphs are the enumeration of paths and cycles (feedback loops) and the calculation of shortest positive/negative paths. These computational problems have been discussed only to a minor extent in the context of Systems Biology and in particular the shortest signed paths problem requires algorithmic developments.
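To make the notion of positive and negative connections in a signed directed graph concrete, the following minimal sketch runs a breadth-first search over (node, sign) states. It is an illustration only and not the algorithm of the paper: it computes shortest signed walks, which may revisit a node, and does not attempt the harder restriction to simple paths. The function name and the toy edge list are assumptions.

```python
from collections import deque

def shortest_signed_walks(edges, source):
    """Breadth-first search over (node, sign) states.

    edges: iterable of (u, v, s) with s = +1 (activation) or -1 (inhibition).
    Returns a dict mapping (node, sign) to the length of the shortest walk
    from source whose edge signs multiply to that sign. Walks may revisit
    nodes, so these are not necessarily simple paths."""
    adjacency = {}
    for u, v, s in edges:
        adjacency.setdefault(u, []).append((v, s))
    dist = {(source, 1): 0}  # the empty walk counts as positive
    queue = deque([(source, 1)])
    while queue:
        node, sign = queue.popleft()
        for nxt, s in adjacency.get(node, []):
            state = (nxt, sign * s)
            if state not in dist:
                dist[state] = dist[(node, sign)] + 1
                queue.append(state)
    return dist

# Hypothetical toy interaction graph.
toy_edges = [("A", "B", +1), ("B", "C", -1), ("A", "C", +1), ("C", "D", -1)]
walks = shortest_signed_walks(toy_edges, "A")
print(walks[("D", -1)])  # negative influence of A on D: A -> C -| D
print(walks[("D", 1)])   # positive influence of A on D: A -> B -| C -| D
```

Tracking the sign as part of the search state is the design choice that matters here: a node reached positively and the same node reached negatively are treated as two distinct states.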

11.

A comparison of common programming languages used in bioinformatics

Mathieu Fourment Michael R Gillings 《BMC bioinformatics》2008,9(1):82

Background

The performance of different programming languages has previously been benchmarked using abstract mathematical algorithms, but not using standard bioinformatics algorithms. We compared the memory usage and speed of execution for three standard bioinformatics methods, implemented in programs using one of six different programming languages. Programs for the Sellers algorithm, the Neighbor-Joining tree construction algorithm and an algorithm for parsing BLAST file outputs were implemented in C, C++, C#, Java, Perl and Python.
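For readers unfamiliar with the first of the three benchmarked methods, here is a minimal sketch of Sellers-style approximate string matching in Python, one of the six languages compared. It assumes unit costs for substitution, insertion, and deletion; the function name and test strings are illustrative and are not taken from the paper's benchmark programs.

```python
def sellers(pattern, text):
    """Approximate string matching by dynamic programming.

    Returns a list d where d[j] is the smallest edit distance between
    `pattern` and any substring of `text` that ends at position j.
    Unit costs are assumed for substitution, insertion, and deletion."""
    # Row 0 is all zeros: a match may start anywhere in the text.
    prev = [0] * (len(text) + 1)
    for i, p in enumerate(pattern, 1):
        curr = [i]  # matching i pattern characters against nothing costs i
        for j, t in enumerate(text, 1):
            curr.append(min(
                prev[j - 1] + (p != t),  # match or substitution
                prev[j] + 1,             # pattern character left unmatched
                curr[j - 1] + 1,         # extra text character
            ))
        prev = curr
    return prev

distances = sellers("abc", "xxabcxx")
print(min(distances), distances.index(min(distances)))
```

The only difference from a global edit-distance table is the zero-initialised first row, which lets the pattern align to any substring of the text rather than to the whole text.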

12.

Protein secondary structure prediction for a single-sequence using hidden semi-Markov models

Zafer Aydin Yucel Altunbasak Mark Borodovsky 《BMC bioinformatics》2006,7(1):178-15

Background

The accuracy of protein secondary structure prediction has been improving steadily towards the 88% estimated theoretical limit. There are two types of prediction algorithms: single-sequence prediction algorithms assume that information about other (homologous) proteins is not available, while algorithms of the second type assume that information about homologous proteins is available and use it intensively. The single-sequence algorithms could make an important contribution to studies of proteins with no detected homologs; however, the accuracy of protein secondary structure prediction from a single sequence is not as high as when the additional evolutionary information is present.

13.

TargetSpy: a supervised machine learning approach for microRNA target prediction

Martin Sturm Michael Hackenberg David Langenberger Dmitrij Frishman 《BMC bioinformatics》2010,11(1):292

Background

Virtually all currently available microRNA target site prediction algorithms require the presence of a (conserved) seed match to the 5' end of the microRNA. Recently, however, it has been shown that this requirement might be too stringent, leading to a substantial number of missed target sites.

14.

Linear-time computation of minimal absent words using suffix array

Carl Barton Alice Heliou Laurent Mouchard Solon P Pissis 《BMC bioinformatics》2014,15(1)

Background

An absent word of a word y of length n is a word that does not occur in y. It is a minimal absent word if all its proper factors occur in y. Minimal absent words have been computed in genomes of organisms from all domains of life; their computation also provides a fast alternative for measuring approximation in sequence comparison. There exists an O(n)-time and O(n)-space algorithm for computing all minimal absent words on a fixed-sized alphabet based on the construction of suffix automata (Crochemore et al., 1998). No implementation of this algorithm is publicly available. There also exists an O(n²)-time and O(n)-space algorithm for the same problem based on the construction of suffix arrays (Pinho et al., 2009). An implementation of this algorithm was also provided by the authors and is currently the fastest available.

Results

Our contribution in this article is twofold: first, we bridge this unpleasant gap by presenting an O(n)-time and O(n)-space algorithm for computing all minimal absent words based on the construction of suffix arrays; and second, we provide the respective implementation of this algorithm. Experimental results, using real and synthetic data, show that this implementation outperforms the one by Pinho et al. The open-source code of our implementation is freely available at http://github.com/solonas13/maw.

Conclusions

Classical notions for sequence comparison are increasingly being replaced by other similarity measures that refer to the composition of sequences in terms of their constituent patterns. One such measure is based on minimal absent words. In this article, we present a new linear-time and linear-space algorithm for the computation of minimal absent words based on the suffix array.
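The definition of a minimal absent word given in the Background of this entry can be illustrated with a brute-force sketch. This is deliberately not the linear-time suffix-array algorithm of the paper (nor the code at the repository above); it stores every factor of y explicitly and is meant only to make the definition concrete on short words. The function name is an assumption.

```python
def minimal_absent_words(y, alphabet):
    """Brute-force enumeration of the minimal absent words of y.

    A word w is a minimal absent word if w does not occur in y but all
    of its proper factors do. It suffices to check the longest proper
    prefix and the longest proper suffix of w, since every other proper
    factor of w is a factor of one of those two."""
    # Every factor (substring) of y, including the empty word.
    factors = {y[i:j] for i in range(len(y) + 1) for j in range(i, len(y) + 1)}
    result = set()
    for prefix in factors:          # the longest proper prefix occurs in y
        for letter in alphabet:
            w = prefix + letter
            # w is absent and its longest proper suffix occurs in y
            if w not in factors and w[1:] in factors:
                result.add(w)
    return sorted(result, key=lambda w: (len(w), w))

print(minimal_absent_words("abaab", "ab"))
# ['bb', 'aaa', 'bab', 'aaba']
```

Because the set of factors can hold on the order of n² strings, this sketch is only suitable for toy inputs; the gap between it and an O(n) method is what the article addresses.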

15.

An efficient genetic algorithm for structural RNA pairwise alignment and its application to non-coding RNA discovery in yeast

Akito Taneda 《BMC bioinformatics》2008,9(1):521

Background

Aligning RNA sequences with low sequence identity has been a challenging problem since such a computation essentially needs an algorithm with high complexities for taking structural conservation into account. Although many sophisticated algorithms for the purpose have been proposed to date, further improvement in efficiency is necessary to accelerate its large-scale applications including non-coding RNA (ncRNA) discovery.

16.

Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis

Chao Yang Zengyou He Weichuan Yu 《BMC bioinformatics》2009,10(1):4

Background

In mass spectrometry (MS) based proteomic data analysis, peak detection is an essential step for subsequent analysis. Recently, there has been significant progress in the development of various peak detection algorithms. However, neither a comprehensive survey nor an experimental comparison of these algorithms is yet available. The main objective of this paper is to provide such a survey and to compare the performance of single spectrum based peak detection methods.

17.

Intergeneric transfer of ribosomal genes between two fungi

Jiatao Xie Yanping Fu Daohong Jiang Guoqing Li Junbin Huang Bo Li Tom Hsiang Youliang Peng 《BMC evolutionary biology》2008,8(1):87

Background

Horizontal gene transfer, also called lateral gene transfer, frequently occurs among prokaryotic organisms, and is considered an important force in their evolution. However, there are relatively few reports of transfer to or from fungi, with some notable exceptions in the acquisition of prokaryotic genes. Some fungal species have been found to contain sequences resembling those of bacterial genes, and with such sequences absent in other fungal species, this has been interpreted as horizontal gene transfer. Similarly, a few fungi have been found to contain genes absent in close relatives but present in more distantly related taxa, and horizontal gene transfer has been invoked as a parsimonious explanation. There is a paucity of direct experimental evidence demonstrating the occurrence of horizontal gene transfer in fungi.

18.

Data processing and classification analysis of proteomic changes: a case study of oil pollution in the mussel, Mytilus edulis

Monsinjon T Andersen OK Leboulenger F Knigge T 《Proteome science》2006,4(1):17-13

Background

Proteomics may help to detect subtle pollution-related changes, such as responses to mixture pollution at low concentrations, where clear signs of toxicity are absent. The challenges associated with the analysis of large-scale multivariate proteomic datasets have been widely discussed in medical research and biomarker discovery. This concept has been introduced to ecotoxicology only recently, so data processing and classification analysis need to be refined before they can be readily applied in biomarker discovery and monitoring studies.

19.

Shape-based peak identification for ChIP-Seq

Valerie Hower Steven N Evans Lior Pachter 《BMC bioinformatics》2011,12(1):15

Background

The identification of binding targets for proteins using ChIP-Seq has gained popularity as an alternative to ChIP-chip. Sequencing can, in principle, eliminate artifacts associated with microarrays, and cheap sequencing offers the ability to sequence deeply and obtain a comprehensive survey of binding. A number of algorithms have been developed to call "peaks" representing bound regions from mapped reads. Most current algorithms incorporate multiple heuristics, and despite much work it remains difficult to accurately determine individual peaks corresponding to distinct binding events.

20.

GenClust: A genetic algorithm for clustering gene expression data

Vito Di Gesú Raffaele Giancarlo Giosué Lo Bosco Alessandra Raimondi Davide Scaturro 《BMC bioinformatics》2005,6(1):289

Background

Clustering is a key step in the analysis of gene expression data; many classical clustering algorithms are used for the task, and more innovative ones have been designed and validated for it. Despite the widespread use of artificial intelligence techniques in bioinformatics and, more generally, data analysis, there are very few clustering algorithms based on the genetic paradigm, yet that paradigm has great potential in finding good heuristic solutions to a difficult optimization problem such as clustering.