
1.

POWRS: position-sensitive motif discovery

IW Davis C Benninger PN Benfey T Elich 《PloS one》2012,7(7):e40373

2.

A tree-based approach for motif discovery and sequence classification

Yan R Boutros PC Jurisica I 《Bioinformatics (Oxford, England)》2011,27(15):2054-2061

3.

Bayesian multiple-instance motif discovery with BAMBI: inference of recombinase and transcription factor binding sites

Jajamovich GH Wang X Arkin AP Samoilov MS 《Nucleic acids research》2011,39(21):e146

Finding conserved motifs in genomic sequences is one of the essential problems in bioinformatics. However, achieving high discovery performance without imposing substantial auxiliary constraints on possible motif features remains a key algorithmic challenge. This work describes BAMBI, a sequential Monte Carlo motif-identification algorithm based on a position weight matrix model that does not require additional constraints and is able to estimate such motif properties as length, logo, number of instances and their locations solely on the basis of primary nucleotide sequence data. Furthermore, should biologically meaningful information about motif attributes be available, BAMBI takes advantage of this knowledge to further refine the discovery results. In practical applications, we show that the proposed approach can be used to find sites of such diverse DNA-binding molecules as the cAMP receptor protein (CRP) and Din-family site-specific serine recombinases. Results obtained by BAMBI in these and other settings demonstrate better statistical performance than any of the four widely used profile-based motif discovery methods (MEME, BioProspector with BioOptimizer, SeSiMCMC and Motif Sampler), as measured by the nucleotide-level correlation coefficient. Additionally, in the case of Din-family recombinase target site discovery, the BAMBI-inferred motif is found to be the only one functionally accurate from the underlying biochemical mechanism standpoint. C++ and Matlab code is available at http://www.ee.columbia.edu/~guido/BAMBI or http://genomics.lbl.gov/BAMBI/.
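To make the position weight matrix (PWM) model mentioned in this abstract concrete, the following is a minimal sketch of how a PWM can be built from aligned sites and used to score windows of a sequence. It is not BAMBI's sequential Monte Carlo procedure; the function names `build_pwm` and `best_hit`, the uniform background, the pseudocount, and the toy sites are all assumptions made for illustration.

```python
import math


def build_pwm(sites, alphabet="ACGT", pseudocount=1.0):
    """Build a log-odds PWM (one dict per column) from equal-length sites.

    Assumes a uniform background distribution; this is an illustrative
    choice, not something taken from the abstract.
    """
    width = len(sites[0])
    if any(len(site) != width for site in sites):
        raise ValueError("all sites must have the same length")
    background = 1.0 / len(alphabet)
    pwm = []
    for pos in range(width):
        # Start every count at the pseudocount so no probability is zero.
        counts = {base: pseudocount for base in alphabet}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append(
            {base: math.log2(counts[base] / total / background) for base in alphabet}
        )
    return pwm


def best_hit(pwm, sequence):
    """Return (start, score) of the highest-scoring window, or None."""
    width = len(pwm)
    best = None
    for start in range(len(sequence) - width + 1):
        score = sum(col[sequence[start + i]] for i, col in enumerate(pwm))
        if best is None or score > best[1]:
            best = (start, score)
    return best


# Hypothetical toy sites, not real binding-site data.
toy_sites = ["TGTGA", "TGTGA", "TGTGA", "TGAGA"]
toy_pwm = build_pwm(toy_sites)
hit = best_hit(toy_pwm, "CCCTGTGACCC")
```

In this sketch the scan is exhaustive and deterministic. A sampling-based method such as the one the abstract describes would instead treat the motif length, the site locations and the matrix itself as unknowns to be inferred.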

4.

HeliCis: a DNA motif discovery tool for colocalized motif pairs with periodic spacing

Erik Larsson Per Lindahl Petter Mostad 《BMC bioinformatics》2007,8(1):418

5.

Combining phylogenetic motif discovery and motif clustering to predict co-regulated genes (total citations: 2; self-citations: 0; citations by others: 2)

Jensen ST Shen L Liu JS 《Bioinformatics (Oxford, England)》2005,21(20):3832-3839

6.

PhenomeNET: a whole-phenome approach to disease gene discovery

Hoehndorf R Schofield PN Gkoutos GV 《Nucleic acids research》2011,39(18):e119

Phenotypes are investigated in model organisms to understand and reveal the molecular mechanisms underlying disease. Phenotype ontologies were developed to capture and compare phenotypes within the context of a single species. Recently, these ontologies were augmented with formal class definitions that may be utilized to integrate phenotypic data and enable the direct comparison of phenotypes between different species. We have developed a method to transform phenotype ontologies into a formal representation, combine phenotype ontologies with anatomy ontologies, and apply a measure of semantic similarity to construct the PhenomeNET cross-species phenotype network. We demonstrate that PhenomeNET can identify orthologous genes, genes involved in the same pathway and gene-disease associations through the comparison of mutant phenotypes. We provide evidence that the Adam19 and Fgf15 genes in mice are involved in the tetralogy of Fallot, and, using zebrafish phenotypes, propose the hypothesis that the mammalian homologs of Cx36.7 and Nkx2.5 lie in a pathway controlling cardiac morphogenesis and electrical conductivity which, when defective, causes the tetralogy of Fallot phenotype. Our method implements a whole-phenome approach toward disease gene discovery and can be applied to prioritize genes for rare and orphan diseases for which the molecular basis is unknown.
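The abstract does not say which semantic similarity measure PhenomeNET applies, so the sketch below uses a plain Jaccard index over ontology terms and their ancestors as a stand-in. The term names, the `parents` mapping and both function names are hypothetical and serve only to illustrate the general idea of comparing two phenotype annotations through a shared ontology.

```python
def ancestor_closure(terms, parents):
    """Return the given terms together with all of their ancestors."""
    closed = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in closed:
            closed.add(term)
            # Terms without an entry in `parents` are treated as roots.
            stack.extend(parents.get(term, ()))
    return closed


def jaccard_similarity(terms_a, terms_b, parents):
    """Jaccard index of the two ancestor-closed term sets."""
    closed_a = ancestor_closure(terms_a, parents)
    closed_b = ancestor_closure(terms_b, parents)
    union = closed_a | closed_b
    if not union:
        return 0.0
    return len(closed_a & closed_b) / len(union)


# Hypothetical toy ontology: child term -> list of parent terms.
toy_parents = {
    "ventricular septal defect": ["heart defect"],
    "overriding aorta": ["heart defect"],
    "heart defect": ["phenotype"],
}
similarity = jaccard_similarity(
    {"ventricular septal defect"}, {"overriding aorta"}, toy_parents
)
```

Closing each annotation set under the ancestor relation is what lets two distinct but related phenotypes score above zero: here the two terms share `heart defect` and `phenotype` even though neither annotation mentions them directly.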

7.

Evolutionary HMMs: a Bayesian approach to multiple alignment

Holmes I Bruno WJ 《Bioinformatics (Oxford, England)》2001,17(9):803-820

MOTIVATION: We review proposed syntheses of probabilistic sequence alignment, profiling and phylogeny. We develop a multiple alignment algorithm for Bayesian inference in the links model proposed by Thorne et al. (1991, J. Mol. Evol., 33, 114-124). The algorithm, described in detail in Section 3, samples from and/or maximizes the posterior distribution over multiple alignments for any number of DNA or protein sequences, conditioned on a phylogenetic tree. The individual sampling and maximization steps of the algorithm require no more computational resources than pairwise alignment. METHODS: We present a software implementation (Handel) of our algorithm and report test results on (i) simulated data sets and (ii) the structurally informed protein alignments of BAliBASE (Thompson et al., 1999, Nucleic Acids Res., 27, 2682-2690). RESULTS: We find that the mean sum-of-pairs score (a measure of residue-pair correspondence) for the BAliBASE alignments is only 13% lower for Handel than for CLUSTALW (Thompson et al., 1994, Nucleic Acids Res., 22, 4673-4680), despite the relative simplicity of the links model (CLUSTALW uses affine gap scores and increased penalties for indels in hydrophobic regions). With reference to these benchmarks, we discuss potential improvements to the links model and implications for Bayesian multiple alignment and phylogenetic profiling. AVAILABILITY: The source code to Handel is freely distributed on the Internet at http://www.biowiki.org/Handel under the terms of the GNU Public License (GPL, 2000, http://www.fsf.org./copyleft/gpl.html).
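The sum-of-pairs score cited in the RESULTS can be illustrated with a short sketch. It assumes the common definition, namely the fraction of residue pairs aligned in a reference alignment that are also aligned in the test alignment; the abstract itself only calls it "a measure of residue-pair correspondence". The function names and the toy alignments are hypothetical.

```python
from itertools import combinations


def aligned_pairs(alignment, gap="-"):
    """Collect every pair of residues that share a column.

    Each residue is identified by (row index, ungapped residue index),
    so the result is independent of where gaps were placed.
    """
    n_cols = len(alignment[0])
    if any(len(row) != n_cols for row in alignment):
        raise ValueError("all rows must have the same number of columns")
    residue_index = [0] * len(alignment)
    pairs = set()
    for col in range(n_cols):
        present = []
        for row_id, row in enumerate(alignment):
            if row[col] != gap:
                present.append((row_id, residue_index[row_id]))
                residue_index[row_id] += 1
        # Every two non-gap residues in this column form an aligned pair.
        pairs.update(combinations(present, 2))
    return pairs


def sum_of_pairs_score(test, reference):
    """Fraction of reference residue pairs reproduced by the test alignment."""
    reference_pairs = aligned_pairs(reference)
    if not reference_pairs:
        raise ValueError("reference alignment contains no aligned pairs")
    return len(aligned_pairs(test) & reference_pairs) / len(reference_pairs)


# Hypothetical toy alignments of the same two sequences (ACGT and ACAGT).
reference = ["AC-GT", "ACAGT"]
test = ["ACGT-", "ACAGT"]
score = sum_of_pairs_score(test, reference)
```

Under this definition a score of 1.0 means the test alignment reproduces every aligned residue pair of the reference. In the toy case the misplaced gap preserves only the first two of the four reference pairs.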

8.

Recognizing complex, asymmetric functional sites in protein structures using a Bayesian scoring function

Wei L Altman RB 《Journal of bioinformatics and computational biology》2003,1(1):119-138

The increase in known three-dimensional protein structures enables us to build statistical profiles of important functional sites in protein molecules. These profiles can then be used to recognize sites in large-scale automated annotations of new protein structures. We report an improved FEATURE system which recognizes functional sites in protein structures. FEATURE defines multi-level physico-chemical properties and recognizes sites based on the spatial distribution of these properties in the sites' microenvironments. It uses a Bayesian scoring function to compare a query region with the statistical profile built from known examples of sites and control nonsites. We have previously shown that FEATURE can accurately recognize calcium-binding sites and have reported interesting results scanning for calcium-binding sites in the entire Protein Data Bank. Here we report the ability of the improved FEATURE to characterize and recognize geometrically complex and asymmetric sites such as ATP-binding sites and disulfide bond-forming sites. FEATURE does not rely on conserved residues or conserved residue geometry of the sites. We also demonstrate that, in the absence of a statistical profile of the sites, FEATURE can use an artificially constructed profile based on a priori knowledge to recognize the sites in new structures, using redoxin active sites as an example.

9.

A general approach for discriminative de novo motif discovery from high-throughput data

Jan Grau Stefan Posch Ivo Grosse Jens Keilwagen 《Nucleic acids research》2013,41(21):e197

10.

CoMoDis: composite motif discovery in mammalian genomes

Donaldson IJ Göttgens B 《Nucleic acids research》2007,35(1):e1

11.

Refining gene signatures: a Bayesian approach

Amira Djebbari Aurélie Labbe 《BMC bioinformatics》2009,10(1):410

Background

In high-density arrays, the identification of relevant genes for disease classification is complicated not only by the curse of dimensionality but also by the highly correlated nature of the array data. In this paper, we address the question of how many and which genes should be selected for disease class prediction. Our work consists of a Bayesian supervised statistical learning approach that refines gene signatures using a regularization which penalizes correlation between the selected variables.

12.

GibbsST: a Gibbs sampling method for motif discovery with enhanced resistance to local optima

Kazuhito Shida 《BMC bioinformatics》2006,7(1):486

13.

SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences

Federico Agostini Davide Cirillo Riccardo Delli Ponti Gian Gaetano Tartaglia 《BMC genomics》2014,15(1)

14.

Regulatory motif discovery using a population clustering evolutionary algorithm (total citations: 2; self-citations: 0; citations by others: 2)

Lones MA Tyrrell AM 《IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM》2007,4(3):403-414

This paper describes a novel evolutionary algorithm for regulatory motif discovery in DNA promoter sequences. The algorithm uses data clustering to logically distribute the evolving population across the search space. Mating then takes place within local regions of the population, promoting overall solution diversity and encouraging discovery of multiple solutions. Experiments using synthetic data sets have demonstrated the algorithm's capacity to find position frequency matrix models of known regulatory motifs in relatively long promoter sequences. These experiments have also shown the algorithm's ability to maintain diversity during search and discover multiple motifs within a single population. The utility of the algorithm for discovering motifs in real biological data is demonstrated by its ability to find meaningful motifs within muscle-specific regulatory sequences.

15.

A Bayesian approach to bioassay (total citations: 2; self-citations: 0; citations by others: 2)

F L Ramsey 《Biometrics》1972,28(3):841-858

16.

DREME: motif discovery in transcription factor ChIP-seq data

Bailey TL 《Bioinformatics (Oxford, England)》2011,27(12):1653-1659

17.

Assessment of composite motif discovery methods

Kjetil Klepper Geir K Sandve Osman Abul Jostein Johansen Finn Drablos 《BMC bioinformatics》2008,9(1):123

18.

Improved benchmarks for computational motif discovery

Geir Kjetil Sandve Osman Abul Vegard Walseng Finn Drabløs 《BMC bioinformatics》2007,8(1):193

19.

Incremental paradigms of motif discovery.

Alberto Apostolico Laxmi Parida 《Journal of computational biology》2004,11(1):15-25

We examine the problem of extracting maximal irredundant motifs from a string. A combinatorial argument poses a linear bound on the total number of such motifs, thereby opening the way to the quest for the fastest and most efficient methods of extraction. The basic paradigm explored here is that of iterated updates of the set of irredundant motifs in a string under consecutive unit symbol extensions of the string itself. This approach exposes novel characterizations for the base set of motifs in a string, hinged on notions of partial order. Such properties support the design of ad hoc data structures and constructs, and lead to the development of an O(n^3) time incremental discovery algorithm.

20.

RSIR: regularized sliced inverse regression for motif discovery (total citations: 3; self-citations: 0; citations by others: 3)

Zhong W Zeng P Ma P Liu JS Zhu Y 《Bioinformatics (Oxford, England)》2005,21(22):4169-4175
