
1.

Bayesian coestimation of phylogeny and sequence alignment

Gerton Lunter István Miklós Alexei Drummond Jens Ledet Jensen Jotun Hein 《BMC bioinformatics》2005,6(1):83

Background

Two central problems in computational biology are the determination of the alignment and phylogeny of a set of biological sequences. The traditional approach to this problem is to first build a multiple alignment of these sequences, followed by a phylogenetic reconstruction step based on this multiple alignment. However, alignment and phylogenetic inference are fundamentally interdependent, and ignoring this fact leads to biased and overconfident estimations. Whether the main interest be in sequence alignment or phylogeny, a major goal of computational biology is the co-estimation of both.

2.

A combinatorial optimization approach for diverse motif finding applications

Elena Zaslavsky Mona Singh 《Algorithms for molecular biology : AMB》2006,1(1):13-13

Background

Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied.

3.

Large scale hierarchical clustering of protein sequences

Antje Krause Jens Stoye Martin Vingron 《BMC bioinformatics》2005,6(1):15

Background

Searching a biological sequence database with a query sequence looking for homologues has become a routine operation in computational biology. In spite of the high degree of sophistication of currently available search routines it is still virtually impossible to identify quickly and clearly a group of sequences that a given query sequence belongs to.

4.

Word correlation matrices for protein sequence analysis and remote homology detection

Thomas Lingner Peter Meinicke 《BMC bioinformatics》2008,9(1):259

Background

Classification of protein sequences is a central problem in computational biology. Currently, among computational methods, discriminative kernel-based approaches provide the most accurate results. However, kernel-based methods often lack an interpretable model for analysis of discriminative sequence features, and predictions on new sequences are usually computationally expensive.

5.

Motif kernel generated by genetic programming improves remote homology and fold detection

Tony Håndstad Arne JH Hestnes Pål Sætrom 《BMC bioinformatics》2007,8(1):23

Background

Protein remote homology detection is a central problem in computational biology. Most recent methods train support vector machines to discriminate between related and unrelated sequences, and these studies have introduced several types of kernels. One successful approach is to base a kernel on shared occurrences of discrete sequence motifs. Still, many protein sequences fail to be classified correctly for lack of a suitable set of motifs for these sequences.

6.

Efficient mining gapped sequential patterns for motifs in biological sequences

Vance Chiang-Chi Liao Ming-Syan Chen 《BMC systems biology》2013,7(Z4):S7

Background

Pattern mining for biological sequences is an important problem in bioinformatics and computational biology. Biological data mining has an impact in diverse biological fields, such as the discovery of co-occurring biosequences, which is important for biological data analyses. Sequential pattern mining approaches can discover motifs of all lengths in biological sequences. Nevertheless, traditional sequential pattern mining approaches are inefficient on DNA and protein data, since the data have small alphabets and long sequences. Furthermore, gap constraints are important in computational biology since they cope with irrelevant regions, which are not conserved in the evolution of biological sequences.

Results

We devise an approach to efficiently mine sequential patterns (motifs) with gap constraints in biological sequences. The approach, termed DFSG, is the Depth-First Spelling algorithm for mining sequential patterns of biological sequences with Gap constraints.

Conclusions

PrefixSpan is one of the most efficient traditional approaches to mining sequential patterns, and it is the basis of GenPrefixSpan, which extends PrefixSpan with gap constraints. We therefore compare DFSG with GenPrefixSpan. In the experimental results, DFSG mines biological sequences much faster than GenPrefixSpan.


7.

MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction (Total citations: 2; self-citations: 0; citations by others: 2)

Torsten Blum Sebastian Briesemeister Oliver Kohlbacher 《BMC bioinformatics》2009,10(1):274

Background

Knowledge of subcellular localization of proteins is crucial to proteomics, drug target discovery and systems biology since localization and biological function are highly correlated. In recent years, numerous computational prediction methods have been developed. Nevertheless, there is still a need for prediction methods that show more robustness and higher accuracy.

8.

HitKeeper, a generic software package for hit list management

Jörg Hau Michael Muller Marco Pagni 《Source code for biology and medicine》2007,2(1):1-8

Background

The automated annotation of biological sequences (protein, DNA) relies on the computation of hits (predicted features) on the sequences using various algorithms. Public databases of biological sequences provide a wealth of biological "knowledge", for example manually validated annotations (features) that are located on the sequences, but mining the sequence annotations and especially the predicted and curated features requires dedicated tools. Due to the heterogeneity and diversity of the biological information, it is difficult to handle redundancy, frequent updates, taxonomic information and "private" data together with computational algorithms in a common workflow.

9.

Detecting disease associated modules and prioritizing active genes based on high throughput data

Yu-Qing Qiu Shihua Zhang Xiang-Sun Zhang Luonan Chen 《BMC bioinformatics》2010,11(1):26

Background

The accumulation of high-throughput data greatly promotes computational investigation of gene function in the context of complex biological systems. However, a biological function is not simply controlled by an individual gene, since genes function in a cooperative manner to achieve biological processes. In the study of human diseases, identifying disease-associated pathways and modules, rather than individual disease-related genes, has become an essential problem in the field of systems biology.

10.

Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change

Andrew V Uzilov Joshua M Keegan David H Mathews 《BMC bioinformatics》2006,7(1):173-30

Background

Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, computational methods that can accurately detect ncRNAs in sequenced genomes are therefore desirable. The increasing number of genomic sequences provides a rich dataset for computational comparative sequence analysis and detection of novel ncRNAs.

11.

Towards a lightweight generic computational grid framework for biological research

Mark D Halling-Brown David S Moss Adrian J Shepherd 《BMC bioinformatics》2008,9(1):407

Background

An increasing number of scientific research projects require access to large-scale computational resources. This is particularly true in the biological field, whether to facilitate the analysis of large high-throughput data sets, or to perform large numbers of complex simulations – a characteristic of the emerging field of systems biology.

12.

Finding evolutionarily conserved cis-regulatory modules with a universal set of motifs

Bartek Wilczynski Norbert Dojer Mateusz Patelak Jerzy Tiuryn 《BMC bioinformatics》2009,10(1):82


13.

FunnyBase: a systems level functional annotation of Fundulus ESTs for the analysis of gene expression

Paschall JE Oleksiak MF VanWye JD Roach JL Whitehead JA Wyckoff GJ Kolell KJ Crawford DL 《BMC genomics》2004,5(1):96

Background

While studies of non-model organisms are critical for many research areas, such as evolution, development, and environmental biology, they present particular challenges for both experimental and computational genomic level research. Resources such as mass-produced microarrays and the computational tools linking these data to functional annotation at the system and pathway level are rarely available for non-model species. This type of "systems-level" analysis is critical to the understanding of patterns of gene expression that underlie biological processes.

14.

Pattern statistics on Markov chains and sensitivity to parameter estimation

Grégory Nuel 《Algorithms for molecular biology : AMB》2006,1(1):17-13

Background

In order to compute pattern statistics in computational biology, a Markov model is commonly used to take into account the sequence composition. Usually its parameters must be estimated. The aim of this paper is to determine how sensitive these statistics are to parameter estimation, and what the consequences of this variability are for pattern studies (finding the most over-represented words in a genome, the most significant words common to a set of sequences, and so on).

15.

The new biology: beyond the Modern Synthesis

Michael R Rose Todd H Oakley 《Biology direct》2007,2(1):30-17

Background

The last third of the 20th Century featured an accumulation of research findings that severely challenged the assumptions of the "Modern Synthesis" which provided the foundations for most biological research during that century. The foundations of that "Modernist" biology had thus largely crumbled by the start of the 21st Century. This in turn raises the question of foundations for biology in the 21st Century.

16.

Gene ontology based transfer learning for protein subcellular localization

Suyu Mei Wang Fei Shuigeng Zhou 《BMC bioinformatics》2011,12(1):44

Background

Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as GO, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenate the GO terms into a flat binary vector or apply majority-vote based ensemble learning for protein subcellular localization, neither of which can estimate the individual discriminative abilities of the three aspects of gene ontology.

17.

Towards the high-resolution protein structure prediction. Fast refinement of reduced models with all-atom force field

Sebastian Kmiecik Dominik Gront Andrzej Kolinski 《BMC structural biology》2007,7(1):43

Background

Although experimental methods for determining protein structure are providing high resolution structures, they cannot keep pace with the rate at which amino acid sequences are resolved on the scale of entire genomes. For a considerable fraction of proteins whose structures will not be determined experimentally, computational methods can provide valuable information. The value of structural models in biological research depends critically on their quality. Development of high-accuracy computational methods that reliably generate near-experimental quality structural models is an important, unsolved problem in protein structure modeling.

18.

Discriminating between rival biochemical network models: three approaches to optimal experiment design

Bence Mélykúti Antonis Papachristodoulou Hana El-Samad 《BMC systems biology》2010,4(1):38

Background

The success of molecular systems biology hinges on the ability to use computational models to design predictive experiments, and ultimately unravel underlying biological mechanisms. A problem commonly encountered in the computational modelling of biological networks is that alternative, structurally different models of similar complexity fit a set of experimental data equally well. In this case, more than one molecular mechanism can explain available data. In order to rule out the incorrect mechanisms, one needs to invalidate incorrect models. At this point, new experiments maximizing the difference between the measured values of alternative models should be proposed and conducted. Such experiments should be optimally designed to produce data that are most likely to invalidate incorrect model structures.

19.

Understanding dynamics using sensitivity analysis: caveat and solution

Thanneer M Perumal Rudiyanto Gunawan 《BMC systems biology》2011,5(1):41

Background

Parametric sensitivity analysis (PSA) has become one of the most commonly used tools in computational systems biology, in which the sensitivity coefficients are used to study the parametric dependence of biological models. As many of these models describe the dynamical behaviour of biological systems, PSA has subsequently been used to elucidate important cellular processes that regulate these dynamics. However, in this paper, we show that the PSA coefficients are not suitable for inferring the mechanisms by which dynamical behaviour arises, and in fact they can even lead to incorrect conclusions.

20.

Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences

Yungki Park 《BMC bioinformatics》2009,10(1):419

Background

Protein-protein interactions underlie many important biological processes. Computational prediction methods can nicely complement experimental approaches for identifying protein-protein interactions. Recently, a unique category of sequence-based prediction methods has been put forward - unique in the sense that it does not require homologous protein sequences. This enables it to be universally applicable to all protein sequences, unlike many previous sequence-based prediction methods. If effective as claimed, these new sequence-based, universally applicable prediction methods would have far-reaching utility in many areas of biology research.