期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

γ-MYN: a new algorithm for estimating Ka and Ks with consideration of variable substitution rates

Da-Peng Wang Hao-Lei Wan Song Zhang Jun Yu 《Biology direct》2009,4(1):20-18

Background

Over the past two decades, there have been several approximate methods that adopt different mutation models and used for estimating nonsynonymous and synonymous substitution rates (Ka and Ks) based on protein-coding sequences across species or even different evolutionary lineages. Among them, MYN method (a Modified version of Yang-Nielsen method) considers three major dynamic features of evolving DNA sequences–bias in transition/transversion rate, nucleotide frequency, and unequal transitional substitution but leaves out another important feature: unequal substitution rates among different sites or nucleotide positions. 相似文献

2.

Pitfalls of the most commonly used models of context dependent substitution

Helen Lindsay Von Bing Yap Hua Ying Gavin A Huttley 《Biology direct》2008,3(1):52

Background

Neighboring nucleotides exert a striking influence on mutation, with the hypermutability of CpG dinucleotides in many genomes being an exemplar. Among the approaches employed to measure the relative importance of sequence neighbors on molecular evolution have been continuous-time Markov process models for substitutions that treat sequences as a series of independent tuples. The most widely used examples are the codon substitution models. We evaluated the suitability of derivatives of the nucleotide frequency weighted (hereafter NF) and tuple frequency weighted (hereafter TF) models for measuring sequence context dependent substitution. Critical properties we address are their relationships to an independent nucleotide process and the robustness of parameter estimation to changes in sequence composition. We then consider the impact on inference concerning dinucleotide substitution processes from application of these two forms to intron sequence alignments from primates. 相似文献

3.

Rooting a phylogenetic tree with nonreversible substitution models

Von?Bing?Yap Email author Terry?Speed 《BMC evolutionary biology》2005,5(1):2

Background

We compared two methods of rooting a phylogenetic tree: the stationary and the nonstationary substitution processes. These methods do not require an outgroup. 相似文献

4.

Weak preservation of local neutral substitution rates across mammalian genomes

Hideo Imamura John E Karro Jeffrey H Chuang 《BMC evolutionary biology》2009,9(1):89

Background

The rate at which neutral (non-functional) bases undergo substitution is highly dependent on their location within a genome. However, it is not clear how fast these location-dependent rates change, or to what extent the substitution rate patterns are conserved between lineages. To address this question, which is critical not only for understanding the substitution process but also for evaluating phylogenetic footprinting algorithms, we examine ancestral repeats: a predominantly neutral dataset with a significantly higher genomic density than other datasets commonly used to study substitution rate variation. Using this repeat data, we measure the extent to which orthologous ancestral repeat sequences exhibit similar substitution patterns in separate mammalian lineages, allowing us to ascertain how well local substitution rates have been preserved across species. 相似文献

5.

Evaluating the protein coding potential of exonized transposable element sequences

Jittima Piriyapongsa Mark T Rutledge Sanil Patel Mark Borodovsky I King Jordan 《Biology direct》2007,2(1):31-24

Background

Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences have turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: i-by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and ii-through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. 相似文献

6.

IdentiCS – Identification of coding sequence and <Emphasis Type="Italic">in silico</Emphasis> reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

Jibin?Sun An-Ping?Zeng Email author 《BMC bioinformatics》2004,5(1):112

Background

A necessary step for a genome level analysis of the cellular metabolism is the in silico reconstruction of the metabolic network from genome sequences. The available methods are mainly based on the annotation of genome sequences including two successive steps, the prediction of coding sequences (CDS) and their function assignment. The annotation process takes time. The available methods often encounter difficulties when dealing with unfinished error-containing genomic sequence. 相似文献

7.

ProfNet,a method to derive profile-profile alignment scoring functions that improves the alignments of distantly related proteins

Tomas?Ohlson Arne?Elofsson Email author 《BMC bioinformatics》2005,6(1):253

Background

Profile-profile methods have been used for some years now to detect and align homologous proteins. The best such methods use information from the background distribution of amino acids and substitution tables either when constructing the profiles or in the scoring. This makes the methods dependent on the quality and choice of substitution table as well as the construction of the profiles. 相似文献

8.

Empirical codon substitution matrix

Adrian?Schneider Gina?M?Cannarozzi Email author Gaston?H?Gonnet 《BMC bioinformatics》2005,6(1):134

Background

Codon substitution probabilities are used in many types of molecular evolution studies such as determining Ka/Ks ratios, creating ancestral DNA sequences or aligning coding DNA. Until the recent dramatic increase in genomic data enabled construction of empirical matrices, researchers relied on parameterized models of codon evolution. Here we present the first empirical codon substitution matrix entirely built from alignments of coding sequences from vertebrate DNA and thus provide an alternative to parameterized models of codon evolution. 相似文献

9.

A model of evolution with constant selective pressure for regulatory DNA sites

Farida N Enikeeva Ekaterina A Kotelnikova Mikhail S Gelfand Vsevolod J Makeev 《BMC evolutionary biology》2007,7(1):125

相似文献

10.

Directed acyclic graph kernels for structural RNA analysis

Kengo Sato Toutai Mituyama Kiyoshi Asai Yasubumi Sakakibara 《BMC bioinformatics》2008,9(1):318

Background

Recent discoveries of a large variety of important roles for non-coding RNAs (ncRNAs) have been reported by numerous researchers. In order to analyze ncRNAs by kernel methods including support vector machines, we propose stem kernels as an extension of string kernels for measuring the similarities between two RNA sequences from the viewpoint of secondary structures. However, applying stem kernels directly to large data sets of ncRNAs is impractical due to their computational complexity. 相似文献

11.

FRAGS: estimation of coding sequence substitution rates from fragmentary data

Estienne?C?Swart Winston?A?Hide Cathal?Seoighe Email author 《BMC bioinformatics》2004,5(1):8

相似文献

12.

Physicochemical property distributions for accurate and rapid pairwise protein homology detection

Bobbie-Jo M Webb-Robertson Kyle G Ratuiste Christopher S Oehmen 《BMC bioinformatics》2010,11(1):145

Background

The challenge of remote homology detection is that many evolutionarily related sequences have very little similarity at the amino acid level. Kernel-based discriminative methods, such as support vector machines (SVMs), that use vector representations of sequences derived from sequence properties have been shown to have superior accuracy when compared to traditional approaches for the task of remote homology detection. 相似文献

13.

Phylogeny based discovery of regulatory elements

Jason Gertz Justin C Fay Barak A Cohen 《BMC bioinformatics》2006,7(1):266-9

相似文献

14.

PhyloSim - Monte Carlo simulation of sequence evolution in the R statistical computing environment

Botond Sipos Tim Massingham Gregory E Jordan Nick Goldman 《BMC bioinformatics》2011,12(1):104

Background

The Monte Carlo simulation of sequence evolution is routinely used to assess the performance of phylogenetic inference methods and sequence alignment algorithms. Progress in the field of molecular evolution fuels the need for more realistic and hence more complex simulations, adapted to particular situations, yet current software makes unreasonable assumptions such as homogeneous substitution dynamics or a uniform distribution of indels across the simulated sequences. This calls for an extensible simulation framework written in a high-level functional language, offering new functionality and making it easy to incorporate further complexity. 相似文献

15.

MTRAP: Pairwise sequence alignment algorithm by a new measure based on transition probability between two consecutive pairs of residues

Toshihide Hara Keiko Sato Masanori Ohya 《BMC bioinformatics》2010,11(1):235

Background

Sequence alignment is one of the most important techniques to analyze biological systems. It is also true that the alignment is not complete and we have to develop it to look for more accurate method. In particular, an alignment for homologous sequences with low sequence similarity is not in satisfactory level. Usual methods for aligning protein sequences in recent years use a measure empirically determined. As an example, a measure is usually defined by a combination of two quantities (1) and (2) below: (1) the sum of substitutions between two residue segments, (2) the sum of gap penalties in insertion/deletion region. Such a measure is determined on the assumption that there is no an intersite correlation on the sequences. In this paper, we improve the alignment by taking the correlation of consecutive residues. 相似文献

16.

Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified

Thomas M Keane Christopher J Creevey Melissa M Pentony Thomas J Naughton James O Mclnerney 《BMC evolutionary biology》2006,6(1):29-17

Background

In recent years, model based approaches such as maximum likelihood have become the methods of choice for constructing phylogenies. A number of authors have shown the importance of using adequate substitution models in order to produce accurate phylogenies. In the past, many empirical models of amino acid substitution have been derived using a variety of different methods and protein datasets. These matrices are normally used as surrogates, rather than deriving the maximum likelihood model from the dataset being examined. With few exceptions, selection between alternative matrices has been carried out in an ad hoc manner. 相似文献

17.

Optimizing amino acid substitution matrices with a local alignment kernel

Hiroto Saigo Jean-Philippe Vert Tatsuya Akutsu 《BMC bioinformatics》2006,7(1):246-12

Background

Detecting remote homologies by direct comparison of protein sequences remains a challenging task. We had previously developed a similarity score between sequences, called a local alignment kernel, that exhibits good performance for this task in combination with a support vector machine. The local alignment kernel depends on an amino acid substitution matrix. Since commonly used BLOSUM or PAM matrices for scoring amino acid matches have been optimized to be used in combination with the Smith-Waterman algorithm, the matrices optimal for the local alignment kernel can be different. 相似文献

18.

Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines

Alvaro J González Li Liao 《BMC bioinformatics》2010,11(1):537

Background

Protein-protein interaction (PPI) plays essential roles in cellular functions. The cost, time and other limitations associated with the current experimental methods have motivated the development of computational methods for predicting PPIs. As protein interactions generally occur via domains instead of the whole molecules, predicting domain-domain interaction (DDI) is an important step toward PPI prediction. Computational methods developed so far have utilized information from various sources at different levels, from primary sequences, to molecular structures, to evolutionary profiles. 相似文献

19.

Browsing repeats in genomes: Pygram and an application to non-coding region analysis

Patrick Durand Frédéric Mahé Anne-Sophie Valin Jacques Nicolas 《BMC bioinformatics》2006,7(1):477-17

Background

A large number of studies on genome sequences have revealed the major role played by repeated sequences in the structure, function, dynamics and evolution of genomes. In-depth repeat analysis requires specialized methods, including visualization techniques, to achieve optimum exploratory power. 相似文献

20.

Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals

Daniel J Gaffney Peter D Keightley 《BMC evolutionary biology》2008,8(1):265

Background

Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related. 相似文献