Similar literature (20 results)
1.
The increasing ability to extract and sequence DNA from noncontemporaneous tissue offers biologists the opportunity to analyse ancient DNA (aDNA) together with modern DNA (mDNA) to address the taxonomy of extinct species, evolutionary origins, historical phylogeography and biogeography. Perhaps more exciting are recent developments in coalescence-based Bayesian inference that offer the potential to use temporal information from aDNA and mDNA for the estimation of substitution rates and divergence dates as an alternative to fossil and geological calibration. This comes at a time of growing interest in the possibility of time dependency for molecular rate estimates. In this study, we provide a critical assessment of Bayesian Markov chain Monte Carlo (MCMC) analysis for the estimation of substitution rate using simulated samples of aDNA and mDNA. We conclude that the current models and priors employed in Bayesian MCMC analysis of heterochronous mtDNA are susceptible to an upward bias in the estimation of substitution rates because of model misspecification when the data come from populations with less than simple demographic histories, including sudden short-lived population bottlenecks or pronounced population structure. However, when model misspecification is only mild, then the 95% highest posterior density intervals provide adequate frequentist coverage of the true rates.

2.
3.
Knowledge of DNA evolution is central to our understanding of biological history, but how fast does DNA change? Previously, pedigree and ancient DNA studies--focusing on evolution in the short term--have yielded molecular rate estimates substantially faster than those based on deeper phylogenies. It has recently been suggested that short-term, elevated molecular rates decay exponentially over 1-2 Myr to long-term, phylogenetic rates, termed "time dependency of molecular rates." This transition has potential to confound molecular inferences of demographic parameters and dating of many important evolutionary events. Here, we employ a novel approach--geologically dated changes in river drainages and isolation of fish populations--to document rates of mitochondrial DNA change over a range of temporal scales. This method utilizes precise spatiotemporal disruptions of linear freshwater systems and hence avoids many of the limitations associated with typical DNA calibration methods involving fossil data or island formation. Studies of freshwater-limited fishes across the South Island of New Zealand have revealed that genetic relationships reflect past, rather than present, drainage connections. Here, we use this link between drainage geology and genetics to calibrate rates of molecular evolution across nine events ranging in age from 0.007 Myr (Holocene) to 5.0 Myr (Pliocene). Molecular rates of change in galaxiid fishes from calibration points younger than 200 kyr were faster than those based on older calibration points. This study provides conclusive evidence of time dependency in molecular rates as it is based on a robust calibration system that was applied to closely related taxa, and analyzed using a consistent and rigorous methodology. 
The time dependency observed here appears short-lived relative to previous suggestions (1-2 Myr), which has bearing on the accuracy of molecular inferences drawn from processes operating within the Quaternary and mechanisms invoked to explain the decay of rates with time.

4.
We develop a new model for studying the molecular evolution of protein-coding DNA sequences. In contrast to existing models, we incorporate the potential for site-to-site heterogeneity of both synonymous and nonsynonymous substitution rates. We demonstrate that within-gene heterogeneity of synonymous substitution rates appears to be common. Using the new family of models, we investigate the utility of a variety of new statistical inference procedures, and we pay particular attention to issues surrounding the detection of sites undergoing positive selection. We discuss how failure to model synonymous rate variation in the model can lead to misidentification of sites as positively selected.

5.
Felsenstein's maximum-likelihood approach for inferring phylogeny from DNA sequences assumes that the rate of nucleotide substitution is constant over different nucleotide sites. This assumption is sometimes unrealistic, as has been revealed by analysis of real sequence data. In the present paper Felsenstein's method is extended to the case where substitution rates over sites are described by the gamma distribution. A numerical example is presented to show that the method fits the data better than do previous models.
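
For orientation, the gamma model of among-site rate variation mentioned in this abstract is commonly written as below. This is the standard textbook formulation, not a quotation from the paper; the symbols \alpha, \beta, r and x_h are notation assumed here for illustration.

```latex
% Assumed notation: r is the relative substitution rate at a site,
% \alpha the gamma shape parameter, \beta the gamma rate parameter.
% Setting \beta = \alpha fixes the mean relative rate at 1.
f(r) = \frac{\beta^{\alpha}}{\Gamma(\alpha)}\, r^{\alpha-1} e^{-\beta r},
\qquad r > 0,\quad \beta = \alpha

% Likelihood of the data x_h at site h: the fixed-rate likelihood
% L(x_h \mid r) averaged over the gamma density of rates.
L(x_h) = \int_{0}^{\infty} L(x_h \mid r)\, f(r)\, \mathrm{d}r
```

Under this parameterization a small \alpha corresponds to strong rate heterogeneity among sites, and the constant-rate model is recovered as \alpha tends to infinity.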

6.
A number of DNA sequences were cloned from Streptomyces cattleya which hybridized to more than one chromosomal DNA sequence. These sequences were unrelated and have a minimum copy number of between 4 and 10. One of these sequences showed hybridization to multiple DNA fragments from a wide range of other Streptomyces.

7.
We introduce the Bayesian skyline plot, a new method for estimating past population dynamics through time from a sample of molecular sequences without dependence on a prespecified parametric model of demographic history. We describe a Markov chain Monte Carlo sampling procedure that efficiently samples a variant of the generalized skyline plot, given sequence data, and combines these plots to generate a posterior distribution of effective population size through time. We apply the Bayesian skyline plot to simulated data sets and show that it correctly reconstructs demographic history under canonical scenarios. Finally, we compare the Bayesian skyline plot model to previous coalescent approaches by analyzing two real data sets (hepatitis C virus in Egypt and mitochondrial DNA of Beringian bison) that have been previously investigated using alternative coalescent methods. In the bison analysis, we detect a severe but previously unrecognized bottleneck, estimated to have occurred 10,000 radiocarbon years ago, which coincides with both the earliest undisputed record of large numbers of humans in Alaska and the megafaunal extinctions in North America at the beginning of the Holocene.

8.
Microbial genome sequences provide us with the fossil records for inferring their origination and evolution. Assuming that current microbial genomes are the evolutionary results of ancient genomes or fragments and the neighboring genes in ancient genomes are more likely neighbors in current genomes, in this paper we proposed a paleontological algorithm and assembled the orthologous gene groups from 66 complete and current microbial genome sequences into a pseudo-ancient genome, which consists of continuous fragments of various sizes. We performed bootstrap resampling and correlation analyses and the results showed that the assembled ancient genome and fragments are statistically significant and the genes of the same fragment are inherently related and likely derived from common ancestors. This method provides a new computational tool for studying microbial genome structure and evolution.

9.
The tribe Bovini contains a number of commercially and culturally important species, such as cattle. Understanding their evolutionary time scale is important for distinguishing between post-glacial and domestication-associated population expansions, but estimates of bovine divergence times have been hindered by a lack of reliable calibration points. We present a Bayesian phylogenetic analysis of 481 mitochondrial D-loop sequences, including 228 radiocarbon-dated ancient DNA sequences, using a multi-demographic coalescent model. By employing the radiocarbon dates as internal calibrations, we co-estimate the bovine phylogeny and divergence times in a relaxed-clock framework. The analysis yields evidence for significant population expansions in both taurine and zebu cattle, European aurochs and yak clades. The divergence age estimates support domestication-associated expansion times (less than 12 kyr) for the major haplogroups of cattle. We compare the molecular and palaeontological estimates for the Bison-Bos divergence.

10.
Genetic sequence data typically exhibit variability in substitution rates across sites. In practice, there is often too little variation to fit a different rate for each site in the alignment, but the distribution of rates across sites may not be well modeled using simple parametric families. Mixtures of different distributions can capture more complex patterns of rate variation, but are often parameter-rich and difficult to fit. We present a simple hierarchical model in which a baseline rate distribution, such as a gamma distribution, is discretized into several categories, the quantiles of which are estimated using a discretized beta distribution. Although this approach involves adding only two extra parameters to a standard distribution, a wide range of rate distributions can be captured. Using simulated data, we demonstrate that a "beta-" model can reproduce the moments of the rate distribution more accurately than the distribution used to simulate the data, even when the baseline rate distribution is misspecified. Using hepatitis C virus and mammalian mitochondrial sequences, we show that a beta- model can fit as well or better than a model with multiple discrete rate categories, and compares favorably with a model which fits a separate rate category to each site. We also demonstrate this discretization scheme in the context of codon models specifically aimed at identifying individual sites undergoing adaptive or purifying evolution.

11.
We develop codon-based models for simultaneously inferring the mutational effects of CpG and CpNpG methylation in coding regions. In a data set of 369 tomato genes, we show that there is very little effect of CpNpG methylation but a strong effect of CpG methylation affecting almost all genes. We further show that the CpNpG and CpG effects are largely uncorrelated. Our results suggest different roles of CpG and CpNpG methylation, with CpNpG methylation possibly playing a specialized role in defense against transposons and RNA viruses.

12.
Evolutionary trees from DNA sequences: A maximum likelihood approach (total citations: 129; self-citations: 0; citations by others: 129)
The application of maximum likelihood techniques to the estimation of evolutionary trees from nucleic acid sequence data is discussed. A computationally feasible method for finding such maximum likelihood estimates is developed, and a computer program is available. This method has advantages over the traditional parsimony algorithms, which can give misleading results if rates of evolution differ in different lineages. It also allows the testing of hypotheses about the constancy of evolutionary rates by likelihood ratio tests, and gives rough indication of the error of the estimate of the tree.

13.
The patterns of genetic variation within and among individuals and populations can be used to make inferences about the evolutionary forces that generated those patterns. Numerous population genetic approaches have been developed in order to infer evolutionary history. Here, we present the “Two-Two (TT)” and the “Two-Two-outgroup (TTo)” methods; two closely related approaches for estimating divergence time based in coalescent theory. They rely on sequence data from two haploid genomes (or a single diploid individual) from each of two populations. Under a simple population-divergence model, we derive the probabilities of the possible sample configurations. These probabilities form a set of equations that can be solved to obtain estimates of the model parameters, including population split times, directly from the sequence data. This transparent and computationally efficient approach to infer population divergence time makes it possible to estimate time scaled in generations (assuming a mutation rate), and not as a compound parameter of genetic drift. Using simulations under a range of demographic scenarios, we show that the method is relatively robust to migration and that the TTo method can alleviate biases that can appear from drastic ancestral population size changes. We illustrate the utility of the approaches with some examples, including estimating split times for pairs of human populations as well as providing further evidence for the complex relationship among Neandertals and Denisovans and their ancestors.

14.
Time‐resolved fluorescence as well as steady‐state absorption and fluorescence were detected in order to study the interactions between tetramethylrhodamine (TAMRA) and DNA when TAMRA was covalently labeled on single‐ and double‐stranded oligonucleotides. Fluorescence intensity quenching and lifetime changes were characterized and correlated with different DNA sequences. The results demonstrated that the photoinduced electron transfer interaction between guanosine residues and TAMRA introduced a short lifetime fluorescence component when guanosine residues were at the TAMRA‐attached terminal of the DNA sequences. The discrepancy of two‐state and three‐state models in previous studies was due to the DNA sequence selection and sensitivity of techniques used to detect the short lifetime component. The results will help the design of fluorescence‐based experiments related to a dye labeled probe. Copyright © 2012 John Wiley & Sons, Ltd.

15.
Phylogeny of the Betulaceae is assessed on the basis of rbcL, ITS, and morphological data. Based upon 26 rbcL sequences representing most “higher” hamamelid families, the Betulaceae are monophyletic, with Casuarinaceae as its sister group, regardless of whether the outgroup is Cunoniaceae, Cercidiphyllaceae, Hamamelidaceae, or Nothofagus. Within the Betulaceae, two sister clades are evident, corresponding to the subfamilies Betuloideae and Coryloideae. However, with only 13 phylogenetically informative sites, the rbcL sequences provide limited intra-subfamilial resolution. Internal transcribed spacer (ITS) sequences provided 96 phylogenetically informative sites from 491 aligned sites resulting in a single most parsimonious tree of 374 steps (consistency index = 0.791) with two major lineages corresponding to the two traditional subfamilies: Betuloideae (Alnus, Betula) and Coryloideae (Corylus, Ostryopsis, Carpinus, Ostrya). This arrangement is mostly consistent with those from rbcL and morphology and is greatly reinforced by analyses with the three data sets combined. In the Coryloideae, the Ostryopsis–Carpinus–Ostrya clade is well supported, with Corylus as its sister group. The sister-group relationship between Ostryopsis and the Carpinus–Ostrya clade is well supported by ITS, rbcL, and morphological data. Phylogenetic relationships among the extant genera deduced by these analyses are compatible with inferences from ecological evolution and the extensive fossil record.

16.
In order to elucidate some features of the topological organization of DNA within the plant nucleus, DNA fragments involved in the attachment of the DNA loops to the nuclear matrix in maize were studied. The matrix-associated DNA from dry embryo and meristematic cells after extensive digestion with DNase I and high salt treatment was about 2% of the total DNA, sized within the range of 50 and 250 bp. This DNA was found to be enriched in repetitive DNA sequences, both for nuclei from dry embryo and meristematic cells. The loop size of the DNA in cells of Zea mays appeared to be between 5 and 25 kbp. Abbreviations: EDTA, diamino-ethanetetraacetic acid; EtBr, ethidium bromide; LIS, lithium diiodosalicylate; PMSF, phenylmethylsulfonyl fluoride; SDS, sodium dodecyl sulfate.

17.
Phylogenetic relationships among major clades of anuran amphibians were studied using partial sequences of three nuclear protein coding genes, Rag-1, Rag-2, and rhodopsin in 26 frog species from 18 families. The concatenated nuclear data set comprised 2,616 nucleotides and was complemented by sequences of the mitochondrial 12S and 16S rRNA genes for analyses of evolutionary rates. Separate and combined analyses of the nuclear markers supported the monophyly of modern frogs (Neobatrachia), whereas they did not provide support for the monophyly of archaic frog lineages (Archaeobatrachia), contrary to previous studies based on mitochondrial data. The Neobatrachia contain two well supported clades that correspond to the subfamilies Ranoidea (Hyperoliidae, Mantellidae, Microhylidae, Ranidae, and Rhacophoridae) and Hyloidea (Bufonidae, Hylidae, Leptodactylidae, and Pseudidae). Two other families (Heleophrynidae and Sooglossidae) occupied basal positions and probably represent ancient relicts within the Neobatrachia, which had been less clearly indicated by previous mitochondrial analyses. Branch lengths of archaeobatrachians were consistently shorter in all separate analyses, and nonparametric rate smoothing indicated accelerated substitution rates in neobatrachians. However, relative rate tests confirmed this tendency only for mitochondrial genes. In contrast, nuclear gene sequences from our study and from an additional GenBank survey showed no clear phylogenetic trends in terms of differences in rates of molecular evolution. Maximum likelihood trees based on Rag-1 and using only one neobatrachian and one archaeobatrachian sequence, respectively, even had longer archaeobatrachian branches averaged over all pairwise comparisons. More data are necessary to understand the significance of a possibly general assignation of short branches to basal and species-poor taxa by tree-reconstruction algorithms.

18.
Molecular phylogenetic research on Selaginellaceae has focused on the plastid gene rbcL, which in this family has unusually high substitution rates. Here we develop a molecular data set from the nuclear 26S ribosomal DNA gene with the aim of evaluating and extending the results of previous phylogenetic research. The 26S rDNA and the rbcL regions were sequenced for a sample of 23 species, which represent the main elements of species diversity in the family. The data were analysed independently and in combination using both maximum parsimony and Bayesian inference. Although several between-genome differences were found, the general pattern of relationships uncovered by all analyses was very similar. Results corroborate the previous study supporting new groupings not previously recognised on morphological grounds. Substitution rates in the 26S rDNA were also found to be high (26% informative) for the region analysed, but lower than for rbcL (37% informative). These data indicate that high substitution rates might be widespread in all three genomes (i.e., plastid, mitochondrion, and nucleus).

19.
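Qin Dan (秦丹), Xu Cunshuan (徐存拴). 《遗传》, 2013, 35(11): 1253-1264
Non-coding DNA sequences are the DNA sequences in a genome that do not encode proteins. These sequences can bind regulatory factors, be transcribed into functional RNAs, and regulate physiological activities and pathological processes either individually or cooperatively. Focusing on the regulation of gene expression, this article summarizes recent research findings on non-coding DNA sequences, gives a preliminary account of their structure, function, and possible mechanisms of action, introduces the computational methods and experimental techniques currently used to identify functional elements in non-coding DNA sequences, and discusses prospects for future research on non-coding DNA.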

20.
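To study nucleotide variation, a dynamical equation for DNA sequence evolution was established from the homology rate of DNA sequences, which yielded a new evolutionary distance between species, dy (the selection evolutionary distance). Because many nucleotide substitution models exist, four of them were chosen and the corresponding selection evolutionary distance dy was calculated for each; this distance encompasses, under the four models, the p-distance, the distance d with a constant substitution rate, and the gamma distance dG with substitution rates following a Γ distribution. Based on the form of the dynamical equation, the model was then converted into a simple linear regression problem, and least squares was used to obtain the dynamical parameter b of the selection model and the average substitution rate r per nucleotide site per year. Mitochondrial gene sequences from 16 species were used as an example to illustrate the new evolutionary distance, and gene trees were constructed under the different evolutionary distances in order to compare them. The results show that the selection evolutionary distance dy is an effective way of constructing an evolutionary distance.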
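
The three conventional distances named in this abstract (the p-distance, a constant-rate distance d, and a gamma distance dG) can be illustrated with a minimal sketch. The abstract does not say which four substitution models were used, so the Jukes-Cantor (JC69) model is assumed here purely for illustration, and the function names are hypothetical. The new distance dy and the regression for b and r are not reproduced, because their equations are not given in the abstract.

```python
import math


def p_distance(seq_a, seq_b):
    """Proportion of aligned sites that differ; sites with gaps or
    ambiguous bases in either sequence are skipped."""
    valid = set("ACGT")
    pairs = [
        (a, b)
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a in valid and b in valid
    ]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(a != b for a, b in pairs) / len(pairs)


def jc69_distance(p):
    """JC69 distance d, assuming one substitution rate for all sites."""
    if not 0 <= p < 0.75:
        raise ValueError("JC69 distance is undefined for p >= 0.75")
    return -0.75 * math.log(1 - 4 * p / 3)


def jc69_gamma_distance(p, alpha):
    """JC69 distance dG with gamma-distributed rates (shape alpha)."""
    if not 0 <= p < 0.75:
        raise ValueError("JC69 distance is undefined for p >= 0.75")
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    return 0.75 * alpha * ((1 - 4 * p / 3) ** (-1 / alpha) - 1)


if __name__ == "__main__":
    # Toy aligned sequences, invented for this example only.
    p = p_distance("ACGTACGT", "ACGTACGA")
    print(p, jc69_distance(p), jc69_gamma_distance(p, alpha=0.5))
```

For the same p, the corrected distances are ordered p ≤ d ≤ dG, and dG approaches d as alpha grows, which is why the choice of distance can change the branch lengths of a tree built from it.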
