期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

A multistep bioinformatic approach detects putative regulatory elements in gene promoters

Stefania?Bortoluzzi Alessandro?Coppe Andrea?Bisognin Cinzia?Pizzi Gian?Antonio?Danieli Email author 《BMC bioinformatics》2005,6(1):121

Background

Searching for approximate patterns in large promoter sequences frequently produces an exceedingly high numbers of results. Our aim was to exploit biological knowledge for definition of a sheltered search space and of appropriate search parameters, in order to develop a method for identification of a tractable number of sequence motifs. 相似文献

2.

A tryptophan-rich peptide acts as a transcription activation domain

Chen-Huan Lin Grace Lin Chia-Pei Chang Chien-Chia Wang 《BMC molecular biology》2010,11(1):85

相似文献

3.

Fast index based algorithms and software for matching position specific scoring matrices

Michael Beckstette Robert Homann Robert Giegerich Stefan Kurtz 《BMC bioinformatics》2006,7(1):389-25

Background

In biological sequence analysis, position specific scoring matrices (PSSMs) are widely used to represent sequence motifs in nucleotide as well as amino acid sequences. Searching with PSSMs in complete genomes or large sequence databases is a common, but computationally expensive task. 相似文献

4.

Automatic annotation of protein motif function with Gene Ontology terms

Xinghua?Lu Email author Chengxiang?Zhai Vanathi?Gopalakrishnan Bruce?G?Buchanan 《BMC bioinformatics》2004,5(1):122

Background

Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. 相似文献

5.

An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance

Ana C Casimiro Susana Vinga Ana T Freitas Arlindo L Oliveira 《BMC bioinformatics》2008,9(1):89

Background

Motif finding algorithms have developed in their ability to use computationally efficient methods to detect patterns in biological sequences. However the posterior classification of the output still suffers from some limitations, which makes it difficult to assess the biological significance of the motifs found. Previous work has highlighted the existence of positional bias of motifs in the DNA sequences, which might indicate not only that the pattern is important, but also provide hints of the positions where these patterns occur preferentially. 相似文献

6.

c-REDUCE: Incorporating sequence conservation to detect motifs that correlate with expression

Katerina Kechris Hao Li 《BMC bioinformatics》2008,9(1):506

相似文献

7.

Evolutionary conservation and over-representation of functionally enriched network patterns in the yeast regulatory network

Ofer Meshi Tomer Shlomi Eytan Ruppin 《BMC systems biology》2007,1(1):1-7

Background

Localized network patterns are assumed to represent an optimal design principle in different biological networks. A widely used method for identifying functional components in biological networks is looking for network motifs – over-represented network patterns. A number of recent studies have undermined the claim that these over-represented patterns are indicative of optimal design principles and question whether localized network patterns are indeed of functional significance. This paper examines the functional significance of regulatory network patterns via their biological annotation and evolutionary conservation. 相似文献

8.

Statistical tests to compare motif count exceptionalities

Stéphane Robin Sophie Schbath Vincent Vandewalle 《BMC bioinformatics》2007,8(1):84

Background

Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. 相似文献

9.

Soon-Heng Tan Willy Hugo Wing-Kin Sung See-Kiong Ng 《BMC bioinformatics》2006,7(1):502-16

Background

An important class of interaction switches for biological circuits and disease pathways are short binding motifs. However, the biological experiments to find these binding motifs are often laborious and expensive. With the availability of protein interaction data, novel binding motifs can be discovered computationally: by applying standard motif extracting algorithms on protein sequence sets each interacting with either a common protein or a protein group with similar properties. The underlying assumption is that proteins with common interacting partners will share some common binding motifs. Although novel binding motifs have been discovered with such approach, it is not applicable if a protein interacts with very few other proteins or when prior knowledge of protein group is not available or erroneous. Experimental noise in input interaction data can further deteriorate the dismal performance of such approaches. 相似文献

10.

WildSpan: mining structured motifs from protein sequences

Hsu CM Chen CY Liu BJ 《Algorithms for molecular biology : AMB》2011,6(1):6

Background

Automatic extraction of motifs from biological sequences is an important research problem in study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards) are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions) that incorporates several pruning strategies to largely reduce the mining cost. 相似文献

11.

tacg – a grep for DNA

Harry J Mangalam 《BMC bioinformatics》2002,3(1):8-4

Background

Pattern matching is the core of bioinformatics; it is used in database searching, restriction enzyme mapping, and finding open reading frames. It is done repeatedly over increasingly long sequences, thus codes must be efficient and insensitive to sequence length. Such patterns of interest include simple motifs with IUPAC degeneracies, regular expressions, patterns allowing mismatches, and probability matrices. 相似文献

12.

Discovering structural motifs using a structural alphabet: Application to magnesium-binding sites

Minko Dudev Carmay Lim 《BMC bioinformatics》2007,8(1):106

Background

For many metalloproteins, sequence motifs characteristic of metal-binding sites have not been found or are so short that they would not be expected to be metal-specific. Striking examples of such metalloproteins are those containing Mg²⁺, one of the most versatile metal cofactors in cellular biochemistry. Even when Mg²⁺-proteins share insufficient sequence homology to identify Mg²⁺-specific sequence motifs, they may still share similarity in the Mg²⁺-binding site structure. However, no structural motifs characteristic of Mg²⁺-binding sites have been reported. Thus, our aims are (i) to develop a general method for discovering structural patterns/motifs characteristic of ligand-binding sites, given the 3D protein structures, and (ii) to apply it to Mg²⁺-proteins sharing <30% sequence identity. Our motif discovery method employs structural alphabet encoding to convert 3D structures to the corresponding 1D structural letter sequences, where the Mg²⁺-structural motifs are identified as recurring structural patterns. 相似文献

13.

NBLAST: a cluster variant of BLAST for NxN comparisons

Michel?Dumontier Christopher?WV?Hogue Email author 《BMC bioinformatics》2002,3(1):13

Background

The BLAST algorithm compares biological sequences to one another in order to determine shared motifs and common ancestry. However, the comparison of all non-redundant (NR) sequences against all other NR sequences is a computationally intensive task. We developed NBLAST as a cluster computer implementation of the BLAST family of sequence comparison programs for the purpose of generating pre-computed BLAST alignments and neighbour lists of NR sequences. 相似文献

14.

An intragenic distribution bias of DNA uptake sequences in <Emphasis Type="Italic">Pasteurellaceae</Emphasis> and <Emphasis Type="Italic">Neisseriae</Emphasis>

Mark WJ van Passel 《Biology direct》2008,3(1):12

Most sequenced strains from Pasteurellaceae and Neisseriae contain hundreds to thousands of uptake sequence (US) motifs in their genome, which are associated with natural competence for DNA uptake. The mechanism of their recognition is still unclear, and I searched for intragenic location patterns of these motifs for clues about their distribution. In all cases, one orientation of the US has a higher occurrence in the reading frame, and in all Pasteurellaceae, the US and the reverse complement motifs are biased towards the gene termini. These findings could help design experimental set-ups to study preferential DNA uptake, thereby further unravelling the phenomenon of natural competence. 相似文献

15.

A combinatorial optimization approach for diverse motif finding applications

Elena Zaslavsky Mona Singh 《Algorithms for molecular biology : AMB》2006,1(1):13-13

Background

Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied. 相似文献

16.

NestedMICA as an ab initio protein motif discovery tool

Mutlu Doğruel Thomas A Down Tim JP Hubbard 《BMC bioinformatics》2008,9(1):19

相似文献

17.

Kalign – an accurate and fast multiple sequence alignment algorithm

Timo?Lassmann Email author Erik?LL?Sonnhammer 《BMC bioinformatics》2005,6(1):298

Background

The alignment of multiple protein sequences is a fundamental step in the analysis of biological data. It has traditionally been applied to analyzing protein families for conserved motifs, phylogeny, structural properties, and to improve sensitivity in homology searching. The availability of complete genome sequences has increased the demands on multiple sequence alignment (MSA) programs. Current MSA methods suffer from being either too inaccurate or too computationally expensive to be applied effectively in large-scale comparative genomics. 相似文献

18.

False occurrences of functional motifs in protein sequences highlight evolutionary constraints

Allegra Via Pier Federico Gherardini Enrico Ferraro Gabriele Ausiello Gianpaolo Scalia Tomba Manuela Helmer-Citterich 《BMC bioinformatics》2007,8(1):68

Background

False occurrences of functional motifs in protein sequences can be considered as random events due solely to the sequence composition of a proteome. Here we use a numerical approach to investigate the random appearance of functional motifs with the aim of addressing biological questions such as: How are organisms protected from undesirable occurrences of motifs otherwise selected for their functionality? Has the random appearance of functional motifs in protein sequences been affected during evolution?

Results

Here we analyse the occurrence of functional motifs in random sequences and compare it to that observed in biological proteomes; the behaviour of random motifs is also studied. Most motifs exhibit a number of false positives significantly similar to the number of times they appear in randomized proteomes (=expected number of false positives). Interestingly, about 3% of the analysed motifs show a different kind of behaviour and appear in biological proteomes less than they do in random sequences. In some of these cases, a mechanism of evolutionary negative selection is apparent; this helps to prevent unwanted functionalities which could interfere with cellular mechanisms.

Conclusion

Our thorough statistical and biological analysis showed that there are several mechanisms and evolutionary constraints both of which affect the appearance of functional motifs in protein sequences.

相似文献

19.

Species-specific analysis of protein sequence motifs using mutual information

Jan?Hummel Nima?Keshvari Wolfram?Weckwerth Joachim?Selbig Email author 《BMC bioinformatics》2005,6(1):164

相似文献

20.

Mining, compressing and classifying with extensible motifs

Apostolico A Comin M Parida L 《Algorithms for molecular biology : AMB》2006,1(1):4-7

Background

Motif patterns of maximal saturation emerged originally in contexts of pattern discovery in biomolecular sequences and have recently proven a valuable notion also in the design of data compression schemes. Informally, a motif is a string of intermittently solid and wild characters that recurs more or less frequently in an input sequence or family of sequences. Motif discovery techniques and tools tend to be computationally imposing, however, special classes of "rigid" motifs have been identified of which the discovery is affordable in low polynomial time. 相似文献