期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

LineageSpecificSeqgen: generating sequence data with lineage-specific variation in the proportion of variable sites

Liat Shavit Grievink David Penny Mike D Hendy Barbara R Holland 《BMC evolutionary biology》2008,8(1):317

Background

Commonly used phylogenetic models assume a homogeneous evolutionary process throughout the tree. It is known that these homogeneous models are often too simplistic, and that with time some properties of the evolutionary process can change (due to selection or drift). In particular, as constraints on sequences evolve, the proportion of variable sites can vary between lineages. This affects the ability of phylogenetic methods to correctly estimate phylogenetic trees, especially for long timescales. To date there is no phylogenetic model that allows for change in the proportion of variable sites, and the degree to which this affects phylogenetic reconstruction is unknown. 相似文献

2.

BEAST: Bayesian evolutionary analysis by sampling trees 总被引：2，自引：0，他引：2

Alexei J Drummond Andrew Rambaut 《BMC evolutionary biology》2007,7(1):214

Background

The evolutionary analysis of molecular sequence variation is a statistical enterprise. This is reflected in the increased use of probabilistic models for phylogenetic inference, multiple sequence alignment, and molecular population genetics. Here we present BEAST: a fast, flexible software architecture for Bayesian analysis of molecular sequences related by an evolutionary tree. A large number of popular stochastic models of sequence evolution are provided and tree-based models suitable for both within- and between-species sequence data are implemented. 相似文献

3.

Scaling statistical multiple sequence alignment to large datasets

Nute Michael Warnow Tandy 《BMC genomics》2016,17(10):764-144

Background

Multiple sequence alignment is an important task in bioinformatics, and alignments of large datasets containing hundreds or thousands of sequences are increasingly of interest. While many alignment methods exist, the most accurate alignments are likely to be based on stochastic models where sequences evolve down a tree with substitutions, insertions, and deletions. While some methods have been developed to estimate alignments under these stochastic models, only the Bayesian method BAli-Phy has been able to run on even moderately large datasets, containing 100 or so sequences. A technique to extend BAli-Phy to enable alignments of thousands of sequences could potentially improve alignment and phylogenetic tree accuracy on large-scale data beyond the best-known methods today.

Results

We use simulated data with up to 10,000 sequences representing a variety of model conditions, including some that are significantly divergent from the statistical models used in BAli-Phy and elsewhere. We give a method for incorporating BAli-Phy into PASTA and UPP, two strategies for enabling alignment methods to scale to large datasets, and give alignment and tree accuracy results measured against the ground truth from simulations. Comparable results are also given for other methods capable of aligning this many sequences.

Conclusions

Extensions of BAli-Phy using PASTA and UPP produce significantly more accurate alignments and phylogenetic trees than the current leading methods.

相似文献

4.

Exploration of phylogenetic data using a global sequence analysis method

Charles?Chapus Christine?Dufraigne Scott?Edwards Alain?Giron Bernard?Fertil Patrick?Deschavanne Email author 《BMC evolutionary biology》2005,5(1):63

Background

Molecular phylogenetic methods are based on alignments of nucleic or peptidic sequences. The tremendous increase in molecular data permits phylogenetic analyses of very long sequences and of many species, but also requires methods to help manage large datasets. 相似文献

5.

Taxonomic colouring of phylogenetic trees of protein sequences

Gareth Palidwor Emmanuel G Reynaud Miguel A Andrade-Navarro 《BMC bioinformatics》2006,7(1):79-4

Background

Phylogenetic analyses of protein families are used to define the evolutionary relationships between homologous proteins. The interpretation of protein-sequence phylogenetic trees requires the examination of the taxonomic properties of the species associated to those sequences. However, there is no online tool to facilitate this interpretation, for example, by automatically attaching taxonomic information to the nodes of a tree, or by interactively colouring the branches of a tree according to any combination of taxonomic divisions. This is especially problematic if the tree contains on the order of hundreds of sequences, which, given the accelerated increase in the size of the protein sequence databases, is a situation that is becoming common. 相似文献

6.

Detecting lateral gene transfers by statistical reconciliation of phylogenetic forests

Sophie S Abby Eric Tannier Manolo Gouy Vincent Daubin 《BMC bioinformatics》2010,11(1):324

Background

To understand the evolutionary role of Lateral Gene Transfer (LGT), accurate methods are needed to identify transferred genes and infer their timing of acquisition. Phylogenetic methods are particularly promising for this purpose, but the reconciliation of a gene tree with a reference (species) tree is computationally hard. In addition, the application of these methods to real data raises the problem of sorting out real and artifactual phylogenetic conflict. 相似文献

7.

BranchClust: a phylogenetic algorithm for selecting gene families

Maria S Poptsova J Peter Gogarten 《BMC bioinformatics》2007,8(1):120

Background

Automated methods for assembling families of orthologous genes include those based on sequence similarity scores and those based on phylogenetic approaches. The first are easy to automate but usually they do not distinguish between paralogs and orthologs or have restriction on the number of taxa. Phylogenetic methods often are based on reconciliation of a gene tree with a known rooted species tree; a limitation of this approach, especially in case of prokaryotes, is that the species tree is often unknown, and that from the analyses of single gene families the branching order between related organisms frequently is unresolved. 相似文献

8.

Phylogenetic identification of lateral genetic transfer events

Robert G Beiko Nicholas Hamilton 《BMC evolutionary biology》2006,6(1):15-17

Background

Lateral genetic transfer can lead to disagreements among phylogenetic trees comprising sequences from the same set of taxa. Where topological discordance is thought to have arisen through genetic transfer events, tree comparisons can be used to identify the lineages that may have shared genetic information. An 'edit path' of one or more transfer events can be represented with a series of subtree prune and regraft (SPR) operations, but finding the optimal such set of operations is NP-hard for comparisons between rooted trees, and may be so for unrooted trees as well. 相似文献

9.

How reliably can we predict the reliability of protein structure predictions?

István Miklós Ádám Novák ' Balázs Dombai Jotun Hein 《BMC bioinformatics》2008,9(1):137

Background

Comparative methods have been the standard techniques for in silico protein structure prediction. The prediction is based on a multiple alignment that contains both reference sequences with known structures and the sequence whose unknown structure is predicted. Intensive research has been made to improve the quality of multiple alignments, since misaligned parts of the multiple alignment yield misleading predictions. However, sometimes all methods fail to predict the correct alignment, because the evolutionary signal is too weak to find the homologous parts due to the large number of mutations that separate the sequences. 相似文献

10.

PHY·FI: fast and easy online creation and manipulation of phylogeny color figures

Jakob Fredslund 《BMC bioinformatics》2006,7(1):315-7

Background

The need to depict a phylogeny, or some other kind of abstract tree, is very frequently experienced by researchers from a broad range of biological and computational disciplines. Thousands of papers and talks include phylogeny figures, and often during everyday work, one would like to quickly get a graphical display of, e.g., the phylogenetic relationship between a set of sequences as calculated by an alignment program such as ClustalW or the phylogenetic package Phylip. A wealth of software tools capable of tree drawing exists; most are comprehensive packages that also perform various types of analysis, and hence they are available only for download and installing. Some online tools exist, too. 相似文献

11.

TaxMan: a taxonomic database manager

Martin Jones Mark Blaxter 《BMC bioinformatics》2006,7(1):536

Background

Phylogenetic analysis of large, multiple-gene datasets, assembled from public sequence databases, is rapidly becoming a popular way to approach difficult phylogenetic problems. Supermatrices (concatenated multiple sequence alignments of multiple genes) can yield more phylogenetic signal than individual genes. However, manually assembling such datasets for a large taxonomic group is time-consuming and error-prone. Additionally, sequence curation, alignment and assessment of the results of phylogenetic analysis are made particularly difficult by the potential for a given gene in a given species to be unrepresented, or to be represented by multiple or partial sequences. We have developed a software package, TaxMan, that largely automates the processes of sequence acquisition, consensus building, alignment and taxon selection to facilitate this type of phylogenetic study. 相似文献

12.

MicroSyn: A user friendly tool for detection of microsynteny in a gene family

Bin Cai Xiaohan Yang Gerald A Tuskan Zong-Ming Cheng 《BMC bioinformatics》2011,12(1):79

Background

The traditional phylogeny analysis within gene family is mainly based on DNA or amino acid sequence homologies. However, these phylogenetic tree analyses are not suitable for those "non-traditional" gene families like microRNA with very short sequences. For the normal protein-coding gene families, low bootstrap values are frequently encountered in some nodes, suggesting low confidence or likely inappropriateness of placement of those members in those nodes. 相似文献

13.

Morphology and molecular phylogeny of a marine interstitial tetraflagellate with putative endosymbionts: <Emphasis Type="Italic">Auranticordis quadriverberis</Emphasis> n. gen. et sp. (Cercozoa)

Chitchai Chantangsi Heather J Esson Brian S Leander 《BMC microbiology》2008,8(1):123

Background

Comparative morphological studies and environmental sequencing surveys indicate that marine benthic environments contain a diverse assortment of microorganisms that are just beginning to be explored and characterized. The most conspicuous predatory flagellates in these habitats range from about 20–150 μm in size and fall into three major groups of eukaryotes that are very distantly related to one another: dinoflagellates, euglenids and cercozoans. The Cercozoa is a diverse group of amoeboflagellates that cluster together in molecular phylogenies inferred mainly from ribosomal gene sequences. These molecular phylogenetic studies have demonstrated that several enigmatic taxa, previously treated as Eukaryota insertae sedis, fall within the Cercozoa, and suggest that the actual diversity of this group is largely unknown. Improved knowledge of cercozoan diversity is expected to help resolve major branches in the tree of eukaryotes and demonstrate important cellular innovations for understanding eukaryote evolution. 相似文献

14.

PhyloExplorer: a web server to validate, explore and query phylogenetic trees

Vincent Ranwez Nicolas Clairon Frédéric Delsuc Saeed Pourali Nicolas Auberval Sorel Diser Vincent Berry 《BMC evolutionary biology》2009,9(1):108-13

相似文献

15.

The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs 总被引：1，自引：0，他引：1

Jamie J Cannone Sankar Subramanian Murray N Schnare James R Collett Lisa M D'Souza Yushi Du Brian Feng Nan Lin Lakshmi V Madabusi Kirsten M Müller Nupur Pande Zhidi Shang Nan Yu Robin R Gutell 《BMC bioinformatics》2002,3(1):1-31

Background

Comparative analysis of RNA sequences is the basis for the detailed and accurate predictions of RNA structure and the determination of phylogenetic relationships for organisms that span the entire phylogenetic tree. Underlying these accomplishments are very large, well-organized, and processed collections of RNA sequences. This data, starting with the sequences organized into a database management system and aligned to reveal their higher-order structure, and patterns of conservation and variation for organisms that span the phylogenetic tree, has been collected and analyzed. This type of information can be fundamental for and have an influence on the study of phylogenetic relationships, RNA structure, and the melding of these two fields.

Results

We have prepared a large web site that disseminates our comparative sequence and structure models and data. The four major types of comparative information and systems available for the three ribosomal RNAs (5S, 16S, and 23S rRNA), transfer RNA (tRNA), and two of the catalytic intron RNAs (group I and group II) are: (1) Current Comparative Structure Models; (2) Nucleotide Frequency and Conservation Information; (3) Sequence and Structure Data; and (4) Data Access Systems.

Conclusions

This online RNA sequence and structure information, the result of extensive analysis, interpretation, data collection, and computer program and web development, is accessible at our Comparative RNA Web (CRW) Site http://www.rna.icmb.utexas.edu. In the future, more data and information will be added to these existing categories, new categories will be developed, and additional RNAs will be studied and presented at the CRW Site. 相似文献

16.

Variance adjusted weighted UniFrac: a powerful beta diversity measure for comparing communities based on phylogeny

Qin Chang Yihui Luan Fengzhu Sun 《BMC bioinformatics》2011,12(1):118

Background

Beta diversity, which involves the assessment of differences between communities, is an important problem in ecological studies. Many statistical methods have been developed to quantify beta diversity, and among them, UniFrac and weighted-UniFrac (W-UniFrac) are widely used. The W-UniFrac is a weighted sum of branch lengths in a phylogenetic tree of the sequences from the communities. However, W-UniFrac does not consider the variation of the weights under random sampling resulting in less power detecting the differences between communities. 相似文献

17.

Assessing what is needed to resolve a molecular phylogeny: simulations and empirical data from emydid turtles

Phillip Q Spinks Robert C Thomson Geoff A Lovely H Bradley Shaffer 《BMC evolutionary biology》2009,9(1):1-17

Background

Section Calochroi is one of the most species-rich lineages in the genus Cortinarius (Agaricales, Basidiomycota) and is widely distributed across boreo-nemoral areas, with some extensions into meridional zones. Previous phylogenetic studies of Calochroi (incl. section Fulvi) have been geographically restricted; therefore, phylogenetic and biogeographic relationships within this lineage at a global scale have been largely unknown. In this study, we obtained DNA sequences from a nearly complete taxon sampling of known species from Europe, Central America and North America. We inferred intra- and interspecific phylogenetic relationships as well as major morphological evolutionary trends within section Calochroi based on 576 ITS sequences, 230 ITS + 5.8S + D1/D2 sequences, and a combined dataset of ITS + 5.8S + D1/D2 and RPB1 sequences of a representative subsampling of 58 species. 相似文献

18.

libcov: A C++ bioinformatic library to manipulate protein structures,sequence alignments and phylogeny

Davin?Butt Andrew?J?Roger Christian?Blouin Email author 《BMC bioinformatics》2005,6(1):138

Background

An increasing number of bioinformatics methods are considering the phylogenetic relationships between biological sequences. Implementing new methodologies using the maximum likelihood phylogenetic framework can be a time consuming task. 相似文献

19.

Inferring angiosperm phylogeny from EST data with widespread gene duplication

Sanderson MJ McMahon MM 《BMC evolutionary biology》2007,7(Z1):S3

Background

Most studies inferring species phylogenies use sequences from single copy genes or sets of orthologs culled from gene families. For taxa such as plants, with very high levels of gene duplication in their nuclear genomes, this has limited the exploitation of nuclear sequences for phylogenetic studies, such as those available in large EST libraries. One rarely used method of inference, gene tree parsimony, can infer species trees from gene families undergoing duplication and loss, but its performance has not been evaluated at a phylogenomic scale for EST data in plants.

Results

A gene tree parsimony analysis based on EST data was undertaken for six angiosperm model species and Pinus, an outgroup. Although a large fraction of the tentative consensus sequences obtained from the TIGR database of ESTs was assembled into homologous clusters too small to be phylogenetically informative, some 557 clusters contained promising levels of information. Based on maximum likelihood estimates of the gene trees obtained from these clusters, gene tree parsimony correctly inferred the accepted species tree with strong statistical support. A slight variant of this species tree was obtained when maximum parsimony was used to infer the individual gene trees instead.

Conclusion

Despite the complexity of the EST data and the relatively small fraction eventually used in inferring a species tree, the gene tree parsimony method performed well in the face of very high apparent rates of duplication.

相似文献

20.

Consistency of the Neighbor-Net Algorithm

David Bryant Vincent Moulton Andreas Spillner 《Algorithms for molecular biology : AMB》2007,2(1):8-11

Background

Neighbor-Net is a novel method for phylogenetic analysis that is currently being widely used in areas such as virology, bacteriology, and plant evolution. Given an input distance matrix, Neighbor-Net produces a phylogenetic network, a generalization of an evolutionary or phylogenetic tree which allows the graphical representation of conflicting phylogenetic signals. 相似文献