共查询到20条相似文献,搜索用时 0 毫秒
1.
Cyprinidae is the biggest family of freshwater fish, but the phylogenetic relationships among its higher-level taxa are not yet fully resolved. In this study, we used the nuclear recombination activating gene 2 and the mitochondrial 16S ribosomal RNA and cytochrome b genes to reconstruct cyprinid phylogeny. Our aims were to (i) demonstrate the effects of partitioned phylogenetic analyses on phylogeny reconstruction of cyprinid fishes; (ii) provide new insights into the phylogeny of cyprinids. Our study indicated that unpartitioned strategy was optimal for our analyses; partitioned analyses did not provide better-resolved or -supported estimates of cyprinid phylogeny. Bayesian analyses support the following relationships among the major monophyletic groups within Cyprinidae: (Cyprininae, Labeoninae), ((Acheilognathinae, ((Leuciscinae, Tincinae), Gobioninae)), Xenocyprininae). The placement of Danioninae was poorly resolved. Estimates of divergence dates within the family showed that radiation of the major cyprinid groups occurred during the Late Oligocene through the Late Miocene. Our phylogenetic analyses improved our understanding of the evolutionary history of this important fish family. 相似文献
2.
There has recently been increased interest in the use of Markov Chain Monte Carlo (MCMC)-based Bayesian methods for estimating genetic maps. The advantage of these methods is that they can deal accurately with missing data and genotyping errors. Here we present an extension of the previous methods that makes the Bayesian method applicable to large data sets. We present an extensive simulation study examining the statistical properties of the method and comparing it with the likelihood method implemented in Mapmaker. We show that the Maximum A Posteriori (MAP) estimator of the genetic distances, corresponding to the maximum likelihood estimator, performs better than estimators based on the posterior expectation. We also show that while the performance is similar between Mapmaker and the MCMC-based method in the absence of genotyping errors, the MCMC-based method has a distinct advantage in the presence of genotyping errors. A similar advantage of the Bayesian method was not observed for missing data. We also re-analyse a recently published set of data from the eggplant and show that the use of the MCMC-based method leads to smaller estimates of genetic distances. 相似文献
3.
Kuhner MK 《Bioinformatics (Oxford, England)》2006,22(6):768-770
We present a Markov chain Monte Carlo coalescent genealogy sampler, LAMARC 2.0, which estimates population genetic parameters from genetic data. LAMARC can co-estimate subpopulation Theta = 4N(e)mu, immigration rates, subpopulation exponential growth rates and overall recombination rate, or a user-specified subset of these parameters. It can perform either maximum-likelihood or Bayesian analysis, and accomodates nucleotide sequence, SNP, microsatellite or elecrophoretic data, with resolved or unresolved haplotypes. It is available as portable source code and executables for all three major platforms. AVAILABILITY: LAMARC 2.0 is freely available at http://evolution.gs.washington.edu/lamarc 相似文献
4.
The 30 living species of armadillos, anteaters, and sloths (Mammalia: Xenarthra) represent one of the three major clades of placentals. Armadillos (Cingulata: Dasypodidae) are the earliest and most speciose xenarthran lineage with 21 described species. The question of their tricky phylogeny was here studied by adding two mitochondrial genes (NADH dehydrogenase subunit 1 [ND1] and 12S ribosomal RNA [12S rRNA]) to the three protein-coding nuclear genes (alpha2B adrenergic receptor [ADRA2B], breast cancer susceptibility exon 11 [BRCA1], and von Willebrand factor exon 28 [VWF]) yielding a total of 6869 aligned nucleotide sites for thirteen xenarthran species. The two mitochondrial genes were characterized by marked excesses of transitions over transversions-with a strong bias toward CT transitions for the 12S rRNA-and exhibited two- to fivefold faster evolutionary rates than the fastest nuclear gene (ADRA2B). Maximum likelihood and Bayesian phylogenetic analyses supported the monophyly of Dasypodinae, Tolypeutinae, and Euphractinae, with the latter two armadillo subfamilies strongly clustering together. Conflicting branching points between individual genes involved relationships within the subfamilies Tolypeutinae and Euphractinae. Owing to a greater number of informative sites, the overall concatenation favored the mitochondrial topology with the classical grouping of Cabassous and Priodontes within Tolypeutinae, and a close relationship between Euphractus and Chaetophractus within Euphractinae. However, low statistical support values associated with almost equal distributions of apomorphies among alternatives suggested that two parallel events of rapid speciation occurred within these two armadillo subfamilies. 相似文献
5.
Background
Bayesian phylogenetic inference holds promise as an alternative to maximum likelihood, particularly for large molecular-sequence data sets. We have investigated the performance of Bayesian inference with empirical and simulated protein-sequence data under conditions of relative branch-length differences and model violation. 相似文献6.
Y Sugaya 《BMC research notes》2012,5(1):465
ABSTRACT: BACKGROUND: Linkage analysis is a useful tool for detecting genetic variants that regulate a trait of interest, especially genes associated with a given disease. Although penetrance parameters play an important role in determining gene location, they are assigned arbitrary values according to the researcher's intuition or as estimated by the maximum likelihood principle. Several methods exist by which to evaluate the maximum likelihood estimates of penetrance, although not all of these are supported by software packages and some are biased by marker genotype information, even when disease development is due solely to the genotype of a single allele. FINDINGS: Programs for exploring the maximum likelihood estimates of penetrance parameters were developed using the R statistical programming language supplemented by external C functions. The software returns a vector of polynomial coefficients of penetrance parameters, representing the likelihood of pedigree data. From the likelihood polynomial supplied by the proposed method, the likelihood value and its gradient can be precisely computed. To reduce the effect of the supplied dataset on the likelihood function, feasible parameter constraints can be introduced into maximum likelihood estimates, thus enabling flexible exploration of the penetrance estimates. An auxiliary program generates a perspective plot allowing visual validation of the model's convergence. The functions are collectively available as the MLEP R package. CONCLUSIONS: Linkage analysis using penetrance parameters estimated by the MLEP package enables feasible localization of a disease locus. This is shown through a simulation study and by demonstrating how the package is used to explore maximum likelihood estimates. Although the input dataset tends to bias the likelihood estimates, the method yields accurate results superior to the analysis using intuitive penetrance values for disease with low allele frequencies. MLEP is part of the Comprehensive R Archive Network and is freely available at http://cran.r-project.org/web/packages/MLEP/index.html. 相似文献
7.
Burbrink FT 《Molecular phylogenetics and evolution》2002,25(3):465-476
Most phylogeographic studies have used maximum likelihood or maximum parsimony to infer phylogeny and bootstrap analysis to evaluate support for trees. Recently, Bayesian methods using Marlov chain Monte Carlo to search tree space and simultaneously estimate tree support have become popular due to its fast search speed and ability to create a posterior distribution of parameters of interest. Here, I present a study that utilizes Bayesian methods to infer phylogenetic relationships of the cornsnake (Elaphe guttata) complex using cytochrome b sequences. Examination of the posterior probability distributions confirms the existence of three geographic lineages. Additionally, there is no support for the monophyly of the subspecies of E. guttata. Results suggest the three geographic lineages partially conform to the ranges of previously defined subspecies, although Shimodaira-Hasegawa tests suggest that subspecies-constrained trees produce significantly poorer likelihood estimates than the most likely trees reflecting the evolution of three geographic assemblages. Based on molecular support, these three geographic assemblages are recognized as species using evolutionary species criteria: E. guttata, Elaphe slowinskii, and Elaphe emoryi [phylogeographic, maximum likelihood, maximum parsimony, bootstrap, Bayesian, Markov chain Monte Carlo, cornsnake, Cytochrome b, geographic lineages, E. guttta, E. slowinskii, and E. emoryi]. 相似文献
8.
KS Lee RN Kim BH Yoon DS Kim SH Choi DW Kim SH Nam A Kim A Kang KH Park JE Jung SH Chae HS Park 《Bioinformation》2012,8(11):532-534
Recently, next generation sequencing (NGS) technologies have led to a revolutionary increase in sequencing speed and costefficacy. Consequently, a vast number of contigs from many recently sequenced bacterial genomes remain to be accurately mapped and annotated, requiring the development of more convenient bioinformatics programs. In this paper, we present a newly developed web-based bioinformatics program, Bacterial Genome Mapper, which is suitable for mapping and annotating contigs that have been assembled from bacterial genome sequence raw data. By constructing a multiple alignment map between target contig sequences and two reference bacterial genome sequences, this program also provides very useful comparative genomics analysis of draft bacterial genomes. AVAILABILITY: The database is available for free at http://mbgm.kribb.re.kr. 相似文献
9.
Hummingbirds are an important model system in avian biology, but to date the group has been the subject of remarkably few phylogenetic investigations. Here we present partitioned Bayesian and maximum likelihood phylogenetic analyses for 151 of approximately 330 species of hummingbirds and 12 outgroup taxa based on two protein-coding mitochondrial genes (ND2 and ND4), flanking tRNAs, and two nuclear introns (AK1 and BFib). We analyzed these data under several partitioning strategies ranging between unpartitioned and a maximum of nine partitions. In order to select a statistically justified partitioning strategy following partitioned Bayesian analysis, we considered four alternative criteria including Bayes factors, modified versions of the Akaike information criterion for small sample sizes (AIC(c)), Bayesian information criterion (BIC), and a decision-theoretic methodology (DT). Following partitioned maximum likelihood analyses, we selected a best-fitting strategy using hierarchical likelihood ratio tests (hLRTS), the conventional AICc, BIC, and DT, concluding that the most stringent criterion, the performance-based DT, was the most appropriate methodology for selecting amongst partitioning strategies. In the context of our well-resolved and well-supported phylogenetic estimate, we consider the historical biogeography of hummingbirds using ancestral state reconstructions of (1) primary geographic region of occurrence (i.e., South America, Central America, North America, Greater Antilles, Lesser Antilles), (2) Andean or non-Andean geographic distribution, and (3) minimum elevational occurrence. These analyses indicate that the basal hummingbird assemblages originated in the lowlands of South America, that most of the principle clades of hummingbirds (all but Mountain Gems and possibly Bees) originated on this continent, and that there have been many (at least 30) independent invasions of other primary landmasses, especially Central America. 相似文献
10.
Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability 总被引:16,自引:0,他引:16
Douady CJ Delsuc F Boucher Y Doolittle WF Douzery EJ 《Molecular biology and evolution》2003,20(2):248-254
Owing to the exponential growth of genome databases, phylogenetic trees are now widely used to test a variety of evolutionary hypotheses. Nevertheless, computation time burden limits the application of methods such as maximum likelihood nonparametric bootstrap to assess reliability of evolutionary trees. As an alternative, the much faster Bayesian inference of phylogeny, which expresses branch support as posterior probabilities, has been introduced. However, marked discrepancies exist between nonparametric bootstrap proportions and Bayesian posterior probabilities, leading to difficulties in the interpretation of sometimes strongly conflicting results. As an attempt to reconcile these two indices of node reliability, we apply the nonparametric bootstrap resampling procedure to the Bayesian approach. The correlation between posterior probabilities, bootstrap maximum likelihood percentages, and bootstrapped posterior probabilities was studied for eight highly diverse empirical data sets and were also investigated using experimental simulation. Our results show that the relation between posterior probabilities and bootstrapped maximum likelihood percentages is highly variable but that very strong correlations always exist when Bayesian node support is estimated on bootstrapped character matrices. Moreover, simulations corroborate empirical observations in suggesting that, being more conservative, the bootstrap approach might be less prone to strongly supporting a false phylogenetic hypothesis. Thus, apparent conflicts in topology recovered by the Bayesian approach were reduced after bootstrapping. Both posterior probabilities and bootstrap supports are of great interest to phylogeny as potential upper and lower bounds of node reliability, but they are surely not interchangeable and cannot be directly compared. 相似文献
11.
McCoy AJ 《Current opinion in structural biology》2002,12(5):670-673
Maximum likelihood methods are well known to macromolecular crystallographers as the methods of choice for isomorphous phasing and structure refinement. Recently, the use of maximum likelihood and Bayesian statistics has extended to the areas of molecular replacement and density modification, placing these methods on a stronger statistical foundation and making them more accurate and effective. 相似文献
12.
13.
Two molecular cytogenetics methods, PRINS (primed in situ DNA labeling) and C-PRINS (cycling PRINS), were optimized for the physical mapping of several types of DNA sequences on the
mitotic chromosomes of the narrow-leafed lupin (Lupinus angustifolius L.). The fragment of the FokI element from Vicia faba was localised by indirect PRINS reaction. Two other sequences, fragments of the coding sequences of L. luteus and of L. angustifolius, were localised by indirect C-PRINS. These techniques are faster and more sensitive than FISH, and they allowed the mapping
of short DNA fragments. The data obtained shows that both types of PRINS are valuable tools for chromosome identification
in lupin. 相似文献
14.
15.
Background
Estimates of divergence dates between species improve our understanding of processes ranging from nucleotide substitution to speciation. Such estimates are frequently based on molecular genetic differences between species; therefore, they rely on accurate estimates of the number of such differences (i.e. substitutions per site, measured as branch length on phylogenies). We used simulations to determine the effects of dataset size, branch length heterogeneity, branch depth, and analytical framework on branch length estimation across a range of branch lengths. We then reanalyzed an empirical dataset for plethodontid salamanders to determine how inaccurate branch length estimation can affect estimates of divergence dates. 相似文献16.
Here, we present a comprehensive analysis of solute transport systems encoded within the completely sequenced genomes of 18 prokaryotic organisms. These organisms include four Gram-positive bacteria, seven Gram-negative bacteria, two spirochetes, one cyanobacterium and four archaea. Membrane proteins are analyzed in terms of putative membrane topology, and the recognized transport systems are classified into 76 families, including four families of channel proteins, four families of primary carriers, 54 families of secondary carriers, six families of group translocators, and eight unclassified families. These families are analyzed in terms of the paralogous and orthologous relationships of their protein members, the substrate specificities of their constituent transporters and their distributions in each of the 18 organisms studied. The families vary from large superfamilies with hundreds of represented members, to small families with only one or a few members. The mode of transport generally correlates with the primary mechanism of energy generation, and the numbers of secondary transporters relative to primary transporters are roughly proportional to the total numbers of primary H(+) and Na(+) pumps in the cell. The phosphotransferase system is less prevalent in the analyzed bacteria than previously thought (only six of 14 bacteria transport sugars via this system) and is completely lacking in archaea and eukaryotes. Escherichia coli is shown to be exceptionally broad in its transport capabilities and therefore, at a membrane transport level, does not appear representative of the bacteria thus far sequenced. Archaea and spirochetes exhibit fewer proteins with multiple transmembrane segments and fewer net transporters than most bacteria. These results provide insight into the relevance of transport to the overall physiology of prokaryotes. 相似文献
17.
Several methods or tests and various software are currently being developed for analyzing data in population genetics and ecology, which often rely on computer-intensive algorithms. The potential user is thus confronted with the painful experience of freedom and, in particular, has to makea priori choices between different methods. Using examples drawn from population genetics, we explain some of these recently developed tools. 相似文献
18.
Hitte C Derrien T André C Ostrander EA Galibert F 《Bioinformatics (Oxford, England)》2004,20(18):3665-3667
SUMMARY: CRH_Server is an on line Comparative and Radiation Hybrid mapping Server dedicated to canine genomics. CRH_Server allows users to compute their own RH data using the current canine RH map, and allows comparative dog/human mapping analyses. Finally, it suggests multiple options for storage and queries of the dog RH database. AVAILABILITY: http://idefix.univ-rennes1.fr:8080/Dogs/rh-server.html. SUPPLEMENTARY INFORMATION: All information is available at http://idefix.univ-rennes1.fr:8080/Dogs/help_rh-server.html. 相似文献
19.
20.
All bovine chromosomes are now represented by a syntenic group, a linkage map and at least one in situ hybridization. Almost 1000 loci are mapped, about 300 of which are coding sequences useful for comparative mapping and evolutionary inference. Economically important loci are beginning to appear on bovine linkage maps and enhanced comparative maps are likely to be necessary to identify many of these genes by a comparative positional candidate gene approach. 相似文献