
1.

Improving pairwise sequence alignment accuracy using near-optimal protein sequence alignments

Michael L Sierk Michael E Smoot Ellen J Bass William R Pearson 《BMC bioinformatics》2010,11(1):146

Background

While the pairwise alignments produced by sequence similarity searches are a powerful tool for identifying homologous proteins (proteins that share a common ancestor and a similar structure), pairwise sequence alignments often fail to accurately represent the structural alignments inferred from three-dimensional coordinates. Since sequence alignment algorithms produce optimal alignments, the best structural alignments must reflect suboptimal sequence alignment scores. Thus, we have examined a range of suboptimal sequence alignments and a range of scoring parameters to better understand which sequence alignments are likely to be more structurally accurate.

2.

Statistical distributions of optimal global alignment scores of random protein sequences

Hongxia Pang Jiaowei Tang Su-Shing Chen Shiheng Tao 《BMC bioinformatics》2005,6(1):257

Background

The inference of homology from statistically significant sequence similarity is a central issue in sequence alignments. So far the statistical distribution function underlying the optimal global alignments has not been completely determined.

3.

Accuracy of structure-based sequence alignment of automatic methods

Changhoon Kim Byungkook Lee 《BMC bioinformatics》2007,8(1):355

Background

Accurate sequence alignments are essential for homology searches and for building three-dimensional structural models of proteins. Since structure is better conserved than sequence, structure alignments have been used to guide sequence alignments and are commonly used as the gold standard for sequence alignment evaluation. Nonetheless, as far as we know, there is no report of a systematic evaluation of pairwise structure alignment programs in terms of the sequence alignment accuracy.

4.

State of the art: refinement of multiple sequence alignments

Saikat Chakrabarti Christopher J Lanczycki Anna R Panchenko Teresa M Przytycka Paul A Thiessen Stephen H Bryant 《BMC bioinformatics》2006,7(1):499

Background

Accurate multiple sequence alignments of proteins are very important in computational biology today. Despite the numerous efforts made in this field, all alignment strategies have certain shortcomings, resulting in alignments that are not always correct. Refinement of an existing alignment can prove to be an intelligent choice considering the increasing importance of high-quality alignments in large-scale high-throughput analysis.

5.

A configuration space of homologous proteins conserving mutual information and allowing a phylogeny inference based on pair-wise Z-score probabilities

Olivier Bastien Philippe Ortet Sylvaine Roy Eric Maréchal 《BMC bioinformatics》2005,6(1):49

Background

Popular methods to reconstruct molecular phylogenies are based on multiple sequence alignments, in which addition or removal of data may change the resulting tree topology. We have sought a representation of homologous proteins that would conserve the information of pair-wise sequence alignments, respect probabilistic properties of Z-scores (Monte Carlo methods applied to pair-wise comparisons) and be the basis for a novel method of consistent and stable phylogenetic reconstruction.
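The Z-score mentioned in this abstract can be illustrated with a minimal sketch. It assumes a simple Needleman-Wunsch global alignment score with a linear gap penalty and a shuffle-based null model; the scoring values, the function names `global_score` and `z_score`, and the number of shuffles are illustrative assumptions, not the scheme used by the authors.

```python
import random
import statistics


def global_score(a, b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment score (linear gap penalty, toy scoring values)."""
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def z_score(a, b, n_shuffles=200, seed=0):
    """Monte Carlo Z-score: observed score compared with scores against shuffled copies of b."""
    rng = random.Random(seed)
    observed = global_score(a, b)
    letters = list(b)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(letters)  # shuffling preserves the composition of b
        null_scores.append(global_score(a, "".join(letters)))
    sd = statistics.stdev(null_scores)
    if sd == 0:
        raise ValueError("null scores have zero variance; Z-score is undefined")
    return (observed - statistics.mean(null_scores)) / sd
```

A large positive value of `z_score(a, b)` indicates that the observed alignment scores well above what shuffled sequences of the same composition achieve.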

6.

GASP: Gapped Ancestral Sequence Prediction for proteins

Richard J Edwards Denis C Shields 《BMC bioinformatics》2004,5(1):123

Background

The prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments.

7.

ProfileGrids as a new visual representation of large multiple sequence alignments: a case study of the RecA protein family

Alberto I Roca Albert E Almada Aaron C Abajian 《BMC bioinformatics》2008,9(1):554

Background

Multiple sequence alignments are a fundamental tool for the comparative analysis of proteins and nucleic acids. However, large data sets are no longer manageable for visualization and investigation using the traditional stacked sequence alignment representation.

8.

AIR: A batch-oriented web program package for construction of supermatrices ready for phylogenomic analyses

Surendra Kumar Åsmund Skjæveland Russell JS Orr Pål Enger Torgeir Ruden Bjørn-Helge Mevik Fabien Burki Andreas Botnen Kamran Shalchian-Tabrizi 《BMC bioinformatics》2009,10(1):357

Background

Large multigene sequence alignments have over recent years been increasingly employed for phylogenomic reconstruction of the eukaryote tree of life. Such supermatrices of sequence data are preferred over single gene alignments as they contain vastly more information about ancient sequence characteristics, and are thus more suitable for resolving deeply diverging relationships. However, as alignments are expanded, increasing numbers of sites with misleading phylogenetic information are also added. Therefore, a major goal in phylogenomic analyses is to maximize the ratio of information to noise; this can be achieved by the reduction of fast evolving sites.

9.

High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABERTOOTH

Florian Teichert Jonas Minning Ugo Bastolla Markus Porto 《BMC bioinformatics》2010,11(1):251

Background

Protein alignments are an essential tool for many bioinformatics analyses. While sequence alignments are accurate for proteins of high sequence similarity, they become unreliable as they approach the so-called 'twilight zone', where sequence similarity becomes indistinguishable from random. For such distant pairs, structure alignment is of much better quality. Nevertheless, sequence alignment is the only choice in the majority of cases where structural data is not available. This situation demands development of methods that extend the applicability of accurate sequence alignment to distantly related proteins.

10.

Progressive multiple sequence alignments from triplets

Matthias Kruspe Peter F Stadler 《BMC bioinformatics》

Background

The quality of progressive sequence alignments strongly depends on the accuracy of the individual pairwise alignment steps since gaps that are introduced at one step cannot be removed at later aggregation steps. Adjacent insertions and deletions necessarily appear in arbitrary order in pairwise alignments and hence form an unavoidable source of errors.

11.

ConStruct: Improved construction of RNA consensus structures

Andreas Wilm Kornelia Linnenbrink Gerhard Steger 《BMC bioinformatics》2008,9(1):219

Background

Aligning homologous non-coding RNAs (ncRNAs) correctly in terms of sequence and structure is an unresolved problem, due to both mathematical complexity and imperfect scoring functions. High quality alignments, however, are a prerequisite for most consensus structure prediction approaches, homology searches, and tools for phylogeny inference. Automatically created ncRNA alignments often need manual corrections, yet this manual refinement is tedious and error-prone.

12.

Incorporating background frequency improves entropy-based residue conservation measures

Kai Wang Ram Samudrala 《BMC bioinformatics》2006,7(1):385

Background

Several entropy-based methods have been developed for scoring sequence conservation in protein multiple sequence alignments. High scoring amino acid positions may correlate with structurally or functionally important residues. However, amino acid background frequencies are usually not taken into account in these entropy-based scoring schemes.
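The contrast drawn in this abstract can be sketched as follows. The abstract does not state the authors' exact measure, so this sketch assumes one common way of bringing in background frequencies: relative entropy (Kullback-Leibler divergence) of the column distribution from a background distribution, set against plain Shannon entropy. The function names and the toy two-letter background are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(column):
    """Shannon entropy (bits) of one alignment column; gaps ('-') are ignored."""
    residues = [r for r in column if r != "-"]
    total = len(residues)
    counts = Counter(residues)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def relative_entropy(column, background):
    """Relative entropy (bits) of the column distribution from `background`."""
    residues = [r for r in column if r != "-"]
    total = len(residues)
    counts = Counter(residues)
    return sum(
        (n / total) * math.log2((n / total) / background[r])
        for r, n in counts.items()
    )


# Toy background over a two-letter alphabet (illustrative values, not real amino acid frequencies).
toy_background = {"W": 0.1, "L": 0.9}
```

With this toy background, a fully conserved column of the rare residue `W` and a fully conserved column of the common residue `L` both have zero Shannon entropy, but the `W` column has the higher relative entropy, which is the kind of distinction a background-aware measure can make.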

13.

CoSMoS: Conserved Sequence Motif Search in the proteome

Xiao I Liu Neeraj Korde Ursula Jakob Lars I Leichert 《BMC bioinformatics》2006,7(1):37-6

Background

With the ever-increasing number of gene sequences in the public databases, generating and analyzing multiple sequence alignments becomes increasingly time consuming. Nevertheless it is a task performed on a regular basis by researchers in many labs.

14.

Improving model construction of profile HMMs for remote homology detection through structural alignment

Juliana S Bernardes Alberto MR Dávila Vítor S Costa Gerson Zaverucha 《BMC bioinformatics》2007,8(1):435

Background

Remote homology detection is a challenging problem in Bioinformatics. Arguably, profile Hidden Markov Models (pHMMs) are one of the most successful approaches in addressing this important problem. pHMM packages present a relatively small computational cost, and perform particularly well at recognizing remote homologies. This raises the question of whether structural alignments could impact the performance of pHMMs trained from proteins in the Twilight Zone, as structural alignments are often more accurate than sequence alignments at identifying motifs and functional residues. Next, we assess the impact of using structural alignments in pHMM performance.

15.

Evolutionary rates at codon sites may be used to align sequences and infer protein domain function

Pierre M Durand Scott Hazelhurst Theresa L Coetzer 《BMC bioinformatics》2010,11(1):151

Background

Sequence alignments form part of many investigations in molecular biology, including the determination of phylogenetic relationships, the prediction of protein structure and function, and the measurement of evolutionary rates. However, to obtain meaningful results, a significant degree of sequence similarity is required to ensure that the alignments are accurate and the inferences correct. Limitations arise when sequence similarity is low, which is particularly problematic when working with fast-evolving genes, evolutionarily distant taxa, genomes with nucleotide biases, and cases of convergent evolution.

16.

Searching for evolutionary distant RNA homologs within genomic sequences using partition function posterior probabilities

Usman Roshan Satish Chikkagoudar Dennis R Livesay 《BMC bioinformatics》2008,9(1):61

Background

Identification of RNA homologs within genomic stretches is difficult when pairwise sequence identity is low or unalignable flanking residues are present. In both cases structure-sequence or profile/family-sequence alignment programs become difficult to apply because of unreliable RNA structures or family alignments. As such, local sequence-sequence alignment programs are frequently used instead. We have recently demonstrated that maximal expected accuracy alignments using partition function match probabilities (implemented in Probalign) are significantly better than contemporary methods on heterogeneous length protein sequence datasets, thus suggesting an affinity for local alignment.

17.

Phylo-mLogo: an interactive and hierarchical multiple-logo visualization tool for alignment of many sequences

Arthur Chun-Chieh Shih DT Lee Chin-Lin Peng Yu-Wei Wu 《BMC bioinformatics》2007,8(1):63

Background

When several hundred or several thousand sequences are aligned, such as epidemic virus sequences or homologous/orthologous sequences of some large gene families, to reconstruct their epidemiological history or phylogenies, analyzing and visualizing the alignment results has become a new challenge for computational biologists. Although there are several tools available for visualization of very long sequence alignments, few of them are applicable to the alignments of many sequences.

18.

Multiple non-collinear TF-map alignments of promoter regions

Enrique Blanco Roderic Guigó Xavier Messeguer 《BMC bioinformatics》2007,8(1):138

Background

The analysis of the promoter sequence of genes with similar expression patterns is a basic tool to annotate common regulatory elements. Multiple sequence alignments are the basis of most comparative approaches. The characterization of regulatory regions from co-expressed genes at the sequence level, however, does not yield satisfactory results on many occasions, as promoter regions of genes sharing similar expression programs often do not show nucleotide sequence conservation.

19.

Detecting the limits of regulatory element conservation and divergence estimation using pairwise and multiple alignments

Daniel A Pollard Alan M Moses Venky N Iyer Michael B Eisen 《BMC bioinformatics》2006,7(1):376-14

Background

Molecular evolutionary studies of noncoding sequences rely on multiple alignments. Yet how multiple alignment accuracy varies across sequence types, tree topologies, divergences and tools, and further how this variation impacts specific inferences, remains unclear.

20.

A weighted average difference method for detecting differentially expressed genes from microarray data

Koji Kadota Yuji Nakai Kentaro Shimizu 《Algorithms for molecular biology : AMB》2008,3(1):1-12
