排序方式: 共有50条查询结果,搜索用时 140 毫秒
41.
42.
43.
Bayesian adaptive sequence alignment algorithms 总被引:3,自引:1,他引:2
The selection of a scoring matrix and gap penalty parameters continues to
be an important problem in sequence alignment. We describe here an
algorithm, the 'Bayes block aligner, which bypasses this requirement.
Instead of requiring a fixed set of parameter settings, this algorithm
returns the Bayesian posterior probability for the number of gaps and for
the scoring matrices in any series of interest. Furthermore, instead of
returning the single best alignment for the chosen parameter settings, this
algorithm returns the posterior distribution of all alignments considering
the full range of gapping and scoring matrices selected, weighing each in
proportion to its probability based on the data. We compared the Bayes
aligner with the popular Smith-Waterman algorithm with parameter settings
from the literature which had been optimized for the identification of
structural neighbors, and found that the Bayes aligner correctly identified
more structural neighbors. In a detailed examination of the alignment of a
pair of kinase and a pair of GTPase sequences, we illustrate the
algorithm's potential to identify subsequences that are conserved to
different degrees. In addition, this example shows that the Bayes aligner
returns an alignment-free assessment of the distance between a pair of
sequences.
相似文献
44.
45.
Ingolf Sommer Stefano Toppo Oliver Sander Thomas Lengauer Silvio CE Tosatto 《BMC bioinformatics》2006,7(1):364
Background
In the area of protein structure prediction, recently a lot of effort has gone into the development of Model Quality Assessment Programs (MQAPs). MQAPs distinguish high quality protein structure models from inferior models. Here, we propose a new method to use an MQAP to improve the quality of models. With a given target sequence and template structure, we construct a number of different alignments and corresponding models for the sequence. The quality of these models is scored with an MQAP and used to choose the most promising model. An SVM-based selection scheme is suggested for combining MQAP partial potentials, in order to optimize for improved model selection. 相似文献46.
Background
The impressive increase of novel RNA structures, during the past few years, demands automated methods for structure comparison. While many algorithms handle only small motifs, few techniques, developed in recent years, (ARTS, DIAL, SARA, SARSA, and LaJolla) are available for the structural comparison of large and intact RNA molecules. 相似文献47.
Tandem repeats (TRs) are often present in proteins with crucial functions, responsible for resistance, pathogenicity and associated with infectious or neurodegenerative diseases. This motivates numerous studies of TRs and their evolution, requiring accurate multiple sequence alignment. TRs may be lost or inserted at any position of a TR region by replication slippage or recombination, but current methods assume fixed unit boundaries, and yet are of high complexity. We present a new global graph-based alignment method that does not restrict TR unit indels by unit boundaries. TR indels are modeled separately and penalized using the phylogeny-aware alignment algorithm. This ensures enhanced accuracy of reconstructed alignments, disentangling TRs and measuring indel events and rates in a biologically meaningful way. Our method detects not only duplication events but also all changes in TR regions owing to recombination, strand slippage and other events inserting or deleting TR units. We evaluate our method by simulation incorporating TR evolution, by either sampling TRs from a profile hidden Markov model or by mimicking strand slippage with duplications. The new method is illustrated on a family of type III effectors, a pathogenicity determinant in agriculturally important bacteria Ralstonia solanacearum. We show that TR indel rate variation contributes to the diversification of this protein family. 相似文献
48.
49.
D. T. Truong A. Che A. R. Rendall C. E. Szalkowski J. J. LoTurco A. M. Galaburda R. Holly Fitch 《Genes, Brain & Behavior》2014,13(8):802-811
Dyslexia is a complex neurodevelopmental disorder characterized by impaired reading ability despite normal intellect, and is associated with specific difficulties in phonological and rapid auditory processing (RAP), visual attention and working memory. Genetic variants in Doublecortin domain‐containing protein 2 (DCDC2) have been associated with dyslexia, impairments in phonological processing and in short‐term/working memory. The purpose of this study was to determine whether sensory and behavioral impairments can result directly from mutation of the Dcdc2 gene in mice. Several behavioral tasks, including a modified pre‐pulse inhibition paradigm (to examine auditory processing), a 4/8 radial arm maze (to assess/dissociate working vs. reference memory) and rotarod (to examine sensorimotor ability and motor learning), were used to assess the effects of Dcdc2 mutation. Behavioral results revealed deficits in RAP, working memory and reference memory in Dcdc2del2/del2 mice when compared with matched wild types. Current findings parallel clinical research linking genetic variants of DCDC2 with specific impairments of phonological processing and memory ability. 相似文献
50.
Szalkowski AM 《BMC bioinformatics》2012,13(1):129
ABSTRACT: BACKGROUND: ProGraphMSA is a state-of-the-art multiple sequence alignment tool which produces phylogenetically sensiblegap patterns while maintaining robustness by allowing alternative splicings and errors in the branching pattern ofthe guide tree. RESULTS: This is achieved by incorporating a graph-based sequence representation combined with the advantages of thephylogeny-aware gap placement algorithm of Prank. Further, we account for variations in the substitution patternby implementing context-specific profiles as in CS-Blast and by estimating amino acid frequencies from inputdata. CONCLUSIONS: ProGraphMSA shows good performance and competitive execution times in various benchmarks. 相似文献