Similar articles
1.
The phylogenetic inference of ancestral protein sequences is a powerful technique for the study of molecular evolution, but any conclusions drawn from such studies are only as good as the accuracy of the reconstruction method. Every inference method leads to errors in the ancestral protein sequence, resulting in potentially misleading estimates of the ancestral protein's properties. To assess the accuracy of ancestral protein reconstruction methods, we performed computational population evolution simulations featuring near-neutral evolution under purifying selection, speciation, and divergence using an off-lattice protein model where fitness depends on the ability to be stable in a specified target structure. We were thus able to compare the thermodynamic properties of the true ancestral sequences with the properties of “ancestral sequences” inferred by maximum parsimony, maximum likelihood, and Bayesian methods. Surprisingly, we found that methods such as maximum parsimony and maximum likelihood that reconstruct a “best guess” amino acid at each position overestimate thermostability, while a Bayesian method that sometimes chooses less-probable residues from the posterior probability distribution does not. Maximum likelihood and maximum parsimony apparently tend to eliminate variants at a position that are slightly detrimental to structural stability simply because such detrimental variants are less frequent. Other properties of ancestral proteins might be similarly overestimated. This suggests that ancestral reconstruction studies require greater care to come to credible conclusions regarding functional evolution. Inferred functional patterns that mimic reconstruction bias should be reevaluated.
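The contrast between a "best guess" reconstruction and sampling from the posterior can be illustrated with a toy calculation. The sketch below is not the off-lattice simulation described in the abstract: the per-site posterior distributions are hypothetical, and the idea that the rarer residue at each site is the slightly destabilizing one is an assumption made only for illustration. It shows that picking the most probable residue at every site never retains a minority variant, whereas a sequence sampled from the posterior is expected to carry some.

```python
import random

# Hypothetical per-site posteriors (assumption, not data from the study).
# At each site the more probable residue is assumed to be slightly
# stabilizing and the rarer one slightly destabilizing.
posteriors = [
    {"L": 0.7, "A": 0.3},
    {"V": 0.6, "G": 0.4},
    {"I": 0.9, "S": 0.1},
    {"F": 0.8, "N": 0.2},
]


def best_guess(posteriors):
    """Pick the most probable residue at every site ("best guess")."""
    return "".join(max(p, key=p.get) for p in posteriors)


def sample_sequence(posteriors, rng):
    """Draw one residue per site in proportion to its posterior probability."""
    return "".join(
        rng.choices(list(p), weights=list(p.values()))[0] for p in posteriors
    )


def expected_minor_count(posteriors):
    """Expected number of less-probable residues in one sampled sequence."""
    return sum(1.0 - max(p.values()) for p in posteriors)


rng = random.Random(0)  # fixed seed so the run is repeatable
ml_seq = best_guess(posteriors)
samples = [sample_sequence(posteriors, rng) for _ in range(10000)]

# Average number of sites at which a sampled sequence differs from the
# best-guess sequence, i.e. carries the rarer variant.
mean_minor = sum(
    sum(a != b for a, b in zip(s, ml_seq)) for s in samples
) / len(samples)
```

In this toy setting the best-guess sequence contains no minority residues at all, while sampled sequences carry about one on average, which is the kind of systematic difference the abstract attributes to maximum likelihood and maximum parsimony.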

2.
The first comprehensive cladistic analysis of Miridae, the plant bugs, is presented based on analysis of 3935 base pairs of mitochondrial (16S, COI) and nuclear (18S, 28SD3) DNA for 91 taxa in seven subfamilies. Data were analysed using maximum likelihood (ML), parsimony and Bayesian inference (BI) phylogenetic frameworks. The phylogenetic results are compared with previous hypotheses of higher relationships in the family using alternative hypothesis tests. A Bayesian relaxed molecular clock is used to examine divergence times, and ancestral feeding habits are reconstructed using parsimony and a Bayesian approach. Clades recovered in all analyses are as follows: Cimicomorpha, Miroidea and Miridae; Bryocorinae: Bryocorini; Stenodemini; Mirinae; Deraeocorinae (Clevinemini + Deraeocorini); Cylapinae; Isometopinae; Bryocorinae: Dicyphini; Orthotylini; Phylinae (Phylini + Pilophorini), and Phylinae as sister group to all the remaining mirid taxa. These results are largely congruent with former hypotheses based on morphological data with respect to the monophyly of various subfamilies and tribes; however, our results indicate that the subfamily Bryocorinae is not monophyletic, as the two tribes, Dicyphini and Bryocorini, were separated in the phylogenetic results. Divergence time estimates indicate that the radiation of the Miridae began in the Permian; most genus‐level radiations within subfamilies began in the late Cretaceous, probably in response to the angiosperm radiation. Ancestral feeding state reconstructions based on Bayesian and parsimony inference were largely congruent and both reconstructed phytophagy as the ancestral state of the Miridae. Furthermore, the feeding habits of the common ancestors of Mirinae + Deraeocorinae, Bryocorinae + Cylapinae + Isometopinae + Orthotylinae, and the remaining taxa excluding Phylinae, were inferred as phytophagous. Therefore, at least three shifts from phytophagy or polyphagy to predation occurred within the Miridae. Additionally, based on the mirid host‐plant records, we discovered several trends, such as a strong relationship between host‐plant ranges and a facultative feeding habit. © The Willi Hennig Society 2011.

3.
Akashi H, Goel P, John A. PLoS ONE, 2007, 2(10): e1065
Reliable inference of ancestral sequences can be critical to identifying both patterns and causes of molecular evolution. Robustness of ancestral inference is often assumed among closely related species, but tests of this assumption have been limited. Here, we examine the performance of inference methods for data simulated under scenarios of codon bias evolution within the Drosophila melanogaster subgroup. Genome sequence data for multiple, closely related species within this subgroup make it an important system for studying molecular evolutionary genetics. The effects of asymmetric and lineage-specific substitution rates (i.e., varying levels of codon usage bias and departures from equilibrium) on the reliability of ancestral codon usage inference were investigated. Maximum parsimony inference, which has been widely employed in analyses of Drosophila codon bias evolution, was compared to an approach that attempts to account for uncertainty in ancestral inference by weighting ancestral reconstructions by their posterior probabilities. The latter approach employs maximum likelihood estimation of rate and base composition parameters. For equilibrium and most non-equilibrium scenarios that were investigated, the probabilistic method appears to generate reliable ancestral codon bias inferences for molecular evolutionary studies within the D. melanogaster subgroup. These reconstructions are more reliable than parsimony inference, especially when codon usage is strongly skewed. However, inference biases are considerable for both methods under particular departures from stationarity (i.e., when adaptive evolution is prevalent). Reliability of inference can be sensitive to branch lengths, asymmetry in substitution rates, and the locations and nature of lineage-specific processes within a gene tree. Inference reliability, even among closely related species, can be strongly affected by (potentially unknown) patterns of molecular evolution in lineages ancestral to those of interest.

4.
We propose two approximate methods (one based on parsimony and one on pairwise sequence comparison) for estimating the pattern of nucleotide substitution and a parsimony-based method for estimating the gamma parameter for variable substitution rates among sites. The matrix of substitution rates that represents the substitution pattern can be recovered through its relationship with the observable matrix of site pattern frequencies in pairwise sequence comparisons. In the parsimony approach, the ancestral sequences reconstructed by the parsimony algorithm were used, and the two sequences compared are those at the ends of a branch in the phylogenetic tree. The method for estimating the gamma parameter was based on a reinterpretation of the numbers of changes at sites inferred by parsimony. Three data sets were analyzed to examine the utility of the approximate methods compared with the more reliable likelihood methods. The new methods for estimating the substitution pattern were found to produce estimates quite similar to those obtained from the likelihood analyses. The new method for estimating the gamma parameter was effective in reducing the bias in conventional parsimony estimates, although it also overestimated the parameter. The approximate methods are computationally very fast and appear useful for analyzing large data sets, for which use of the likelihood method requires excessive computation.
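The "numbers of changes at sites inferred by parsimony" mentioned above can be obtained with the bottom-up pass of Fitch's small-parsimony algorithm. The sketch below is only an illustration of that counting step, not the abstract's estimator of the gamma parameter; the tree, the taxon names and the alignment are hypothetical, and the tree is assumed to be binary and given as nested tuples.

```python
def fitch(tree, states):
    """Bottom-up Fitch pass on a binary tree given as nested 2-tuples.

    Leaves are taxon names looked up in `states`. Returns the candidate
    state set at the node and the minimum number of changes below it.
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_cost = fitch(tree[0], states)
    right_set, right_cost = fitch(tree[1], states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed.
        return common, left_cost + right_cost
    # Children disagree: take the union and count one change.
    return left_set | right_set, left_cost + right_cost + 1


def changes_per_site(tree, alignment):
    """Minimum number of changes at each alignment column."""
    n_sites = len(next(iter(alignment.values())))
    return [
        fitch(tree, {name: seq[i] for name, seq in alignment.items()})[1]
        for i in range(n_sites)
    ]


# Hypothetical four-taxon example (names and sequences are made up).
tree = (("a", "b"), ("c", "d"))
alignment = {"a": "ACGT", "b": "ACGA", "c": "ATGA", "d": "ATCA"}
counts = changes_per_site(tree, alignment)
```

Per-site counts of this kind are the raw input that a parsimony-based rate-variation estimate would start from; the abstract notes that using them without reinterpretation gives biased estimates of the gamma parameter.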

5.
A major assumption of many molecular phylogenetic methods is the homogeneity of nucleotide frequencies among taxa, which refers to the equality of the nucleotide frequency bias among species. Changes in nucleotide frequency among different lineages in a data set are thought to lead to erroneous phylogenetic inference because unrelated clades may appear similar because of evolutionarily unrelated similarities in nucleotide frequencies. We tested the effects of the heterogeneity of nucleotide frequency bias on phylogenetic inference, along with the interaction between this heterogeneity and stratified taxon sampling, by means of computer simulations using evolutionary parameters derived from genomic databases. We found that the phylogenetic trees inferred from data sets simulated under realistic, observed levels of heterogeneity for mammalian genes were reconstructed with accuracy comparable to those simulated with homogeneous nucleotide frequencies; the results hold for Neighbor-Joining, minimum evolution, maximum parsimony, and maximum-likelihood methods. The LogDet distance method, specifically designed to deal with heterogeneous nucleotide frequencies, does not perform better than distance methods that assume substitution pattern homogeneity among sequences. In these specific simulation conditions, we did not find a significant interaction between phylogenetic accuracy and substitution pattern heterogeneity among lineages, even when the taxon sampling is increased.

6.
This paper describes the inferential method, an approach for reconstructing protein and nucleotide sequences of ancestral species, starting from known, homologous, contemporary sequences. The method requires knowledge of the topology of the phylogenetic tree, whose nodes are the species to whom the reconstructed sequences belong. The method has been tested by computer simulation of speciation and nucleotide substitutions, starting from a single ancestral sequence, and by subsequent reconstruction of nodal sequences. Results have shown that reconstructions obtained by the inferential method are affected by limited error frequencies, which (1) are proportional to the squares of nucleotide substitution rates and of internodal distances, and (2) are little influenced by non-uniformity of transformation rates of nucleotides. Furthermore, good agreement of the results has been obtained by comparing protein-sequence reconstructions carried out with the inferential method with those obtained using the maximum parsimony method in two different cases: a reconstruction of simulated sequences and a reconstruction of mammalian ribonuclease sequences. Abbreviations used: MP, maximum parsimony method; ML, maximum likelihood method; IM, inferential method; MY, millions of years; N-tree, natural-like phylogenetic tree; E-tree, equibranched phylogenetic tree; EA, percentage number of erroneous amino acids in a reconstructed sequence; EC, percentage number of erroneous codons in a reconstructed sequence; t n, time interval between a P- and its F-sequence. Nucleotides and amino acids are indicated by their I.U.B. codes (N.C.-I.U.B., 1985). Correspondence to: A. Di Donato

7.
Protein-coding genes in eukaryotes are interrupted by introns, but intron densities widely differ between eukaryotic lineages. Vertebrates, some invertebrates and green plants have intron-rich genes, with 6-7 introns per kilobase of coding sequence, whereas most of the other eukaryotes have intron-poor genes. We reconstructed the history of intron gain and loss using a probabilistic Markov model (Markov Chain Monte Carlo, MCMC) on 245 orthologous genes from 99 genomes representing three of the five supergroups of eukaryotes for which multiple genome sequences are available. Intron-rich ancestors are confidently reconstructed for each major group, with 53 to 74% of the human intron density inferred with 95% confidence for the Last Eukaryotic Common Ancestor (LECA). The results of the MCMC reconstruction are compared with the reconstructions obtained using Maximum Likelihood (ML) and Dollo parsimony methods. An excellent agreement between the MCMC and ML inferences is demonstrated whereas Dollo parsimony introduces a noticeable bias in the estimations, typically yielding lower ancestral intron densities than MCMC and ML. Evolution of eukaryotic genes was dominated by intron loss, with substantial gain only at the bases of several major branches including plants and animals. The highest intron density, 120 to 130% of the human value, is inferred for the last common ancestor of animals. The reconstruction shows that the entire line of descent from LECA to mammals was intron-rich, a state conducive to the evolution of alternative splicing.

8.
The Bryaceae are a large cosmopolitan moss family including genera of significant morphological and taxonomic complexity. Phylogenetic relationships within the Bryaceae were reconstructed based on DNA sequence data from all three genomic compartments. In addition, maximum parsimony and Bayesian inference were employed to reconstruct ancestral character states of 38 morphological plus four habitat characters and eight insertion/deletion events. The recovered phylogenetic patterns are generally in accord with previous phylogenies based on chloroplast DNA sequence data and three major clades are identified. The first clade comprises Bryum bornholmense, B. rubens, B. caespiticium, and Plagiobryum. This corroborates the hypothesis suggested by previous studies that several Bryum species are more closely related to Plagiobryum than to the core Bryum species. The second clade includes Acidodontium, Anomobryum, and Haplodontium, while the third clade contains the core Bryum species plus Imbribryum. Within the latter clade, B. subapiculatum and B. tenuisetum form the sister clade to Imbribryum. Reconstructions of ancestral character states under maximum parsimony and Bayesian inference suggest fourteen morphological synapomorphies for the ingroup and synapomorphies are detected for most clades within the ingroup. Maximum parsimony and Bayesian reconstructions of ancestral character states are mostly congruent although Bayesian inference shows that the posterior probability of ancestral character states may decrease dramatically when node support is taken into account. Bayesian inference also indicates that reconstructions may be ambiguous at internal nodes for highly polymorphic characters.

9.
Phylogenetic analysis of large datasets using complex nucleotide substitution models under a maximum likelihood framework can be computationally infeasible, especially when attempting to infer confidence values by way of nonparametric bootstrapping. Recent developments in phylogenetics suggest the computational burden can be reduced by using Bayesian methods of phylogenetic inference. However, few empirical phylogenetic studies exist that explore the efficiency of Bayesian analysis of large datasets. To this end, we conducted an extensive phylogenetic analysis of the wide-ranging and geographically variable Eastern Fence Lizard (Sceloporus undulatus). Maximum parsimony, maximum likelihood, and Bayesian phylogenetic analyses were performed on a combined mitochondrial DNA dataset (12S and 16S rRNA, ND1 protein-coding gene, and associated tRNA; 3,688 bp total) for 56 populations of S. undulatus (78 total terminals including other S. undulatus group species and outgroups). Maximum parsimony analysis resulted in numerous equally parsimonious trees (82,646 from equally weighted parsimony and 335 from weighted parsimony). The majority rule consensus tree derived from the Bayesian analysis was topologically identical to the single best phylogeny inferred from the maximum likelihood analysis, but required approximately 80% less computational time. The mtDNA data provide strong support for the monophyly of the S. undulatus group and the paraphyly of "S. undulatus" with respect to S. belli, S. cautus, and S. woodi. Parallel evolution of ecomorphs within "S. undulatus" has masked the actual number of species within this group. This evidence, along with convincing patterns of phylogeographic differentiation suggests "S. undulatus" represents at least four lineages that should be recognized as evolutionary species.

10.
Due to morphological reduction and absence of amplifiable plastid genes, the identification of photosynthetic relatives of heterotrophic plants is problematic. Although nuclear and mitochondrial gene sequences may offer a welcome alternative source of phylogenetic markers, the presence of rate heterogeneity in these genes may introduce bias/systematic error in phylogenetic analyses. We examine the phylogenetic position of Thismiaceae based on nuclear 18S rDNA and mitochondrial atpA DNA sequence data, using parsimony, likelihood and Bayesian inference methods. Significant differences in evolutionary rates of these genes between closely related taxa lead to conflicting results: while parsimony analyses of 18S rDNA and combined data strongly support the monophyly of Thismiaceae, Bayesian inference, with and without a relaxed molecular clock, as well as the Swofford–Olsen–Waddell–Hillis (SOWH) test confidently reject this hypothesis. We show that rate heterogeneity in our data leads to long-branch attraction artifacts in parsimony analysis. However, using model-based inference methods the question of whether Thismiaceae are monophyletic remains elusive. On the one hand, maximum likelihood nonparametric bootstrapping and parametric hypothesis tests fail to support a paraphyletic Thismiaceae; on the other hand, Bayesian inference methods (both without and with a relaxed clock) significantly reject a monophyletic Thismiaceae. These results show that an adequate sampling, the use of rate homogeneous data, and the application of different inference methods are important factors for developing phylogenetic hypotheses of myco-heterotrophic plants. © The Willi Hennig Society 2009.

11.
Ixobrychus cinnamomeus is a member of the large wading bird family, known as Ardeidae. In the present study, we determined the complete mitochondrial genome of I. cinnamomeus for use in future phylogenetic analysis. This circular mitochondrial genome is 17,180 bp in length and composed of 13 protein-coding genes, 22 tRNA genes, two rRNA genes and one putative control region. Three conserved domains and a minisatellite of 17 nucleotides with 22 tandem repeats were detected at the end of the control region. Phylogenetic relationships were reconstructed using the nucleotide and corresponding amino acid datasets of 12 concatenated protein-coding genes from the mitochondrial genome. Using maximum likelihood, maximum parsimony and Bayesian inference methods, the monophyly of Ciconiidae, Ardeidae and Threskiornithidae was confirmed; however, the monophyly of traditional Ciconiiformes and Pelecaniformes failed to be recovered. Although further studies are recommended to clarify relationships among and within the orders of Ciconiiformes, Pelecaniformes, Suliformes and Phaethontiformes, our preliminary, exploratory results can be useful for the current understanding of avian phylogenetics.

12.
Tian T, Yuan H, Chen B. Acta Entomologica Sinica (昆虫学报), 1950, 63(8): 1016-1027
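[Aim] To characterize the mitochondrial genomes of the aquatic groups of Adephaga and to analyze the phylogenetic relationships of these groups based on mitochondrial genome sequences. [Methods] The complete mitochondrial genomes of the whirligig beetle Dineutus mellyi and the diving beetle Eretes sticticus were sequenced with Illumina HiSeq X Ten technology and annotated, and the secondary structures of their tRNA genes were predicted. Together with the published mitochondrial genome sequences of 17 species of aquatic Adephaga (Coleoptera), the mitochondrial protein-coding genes (PCGs) of 19 species in total were subjected to comparative genomic analysis, including AT content, codon usage bias and selection pressure. Based on the amino acid and nucleotide sequences of the 13 PCGs, the phylogenetic relationships of aquatic Adephaga were reconstructed using maximum likelihood (ML) and Bayesian inference (BI), and the phylogenetic positions of Noteridae and Meruidae were further assessed by FcLM analysis. [Results] The complete mitochondrial genomes of D. mellyi and E. sticticus are 16 123 bp (GenBank accession no.: MN781126) and 16 196 bp (GenBank accession no.: MN781132) in length, respectively, and each contains 13 PCGs, 22 tRNA genes, 2 rRNA genes and one D-loop (control region). The PCGs of the mitochondrial genomes of all 19 aquatic Adephaga species show an A+T bias in base composition and also preferentially use A+T-rich codons. During evolution the 13 PCGs followed the same evolutionary pattern, all being under purifying selection. The phylogenetic relationships of aquatic Adephaga based on the amino acid sequences of the 13 mitochondrial PCGs are (Gyrinidae + (Haliplidae + ((Aspidytidae + (Amphizoidae + Dytiscidae)) + (Hygrobiidae + (Meruidae + Noteridae))))). [Conclusion] The results indicate that Gyrinidae is the basal group of aquatic Adephaga, followed by Haliplidae and Dytiscoidea; Noteridae and Meruidae are sister groups and together form a clade within Dytiscoidea; Amphizoidae is more closely related to Dytiscidae.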

13.
Yang Z, Kumar S, Nei M. Genetics, 1995, 141(4): 1641-1650
A statistical method was developed for reconstructing the nucleotide or amino acid sequences of extinct ancestors, given the phylogeny and sequences of the extant species. A model of nucleotide or amino acid substitution was employed to analyze data of the present-day sequences, and maximum likelihood estimates of parameters such as branch lengths were used to compare the posterior probabilities of assignments of character states (nucleotides or amino acids) to interior nodes of the tree; the assignment having the highest probability was the best reconstruction at the site. The lysozyme c sequences of six mammals were analyzed by using the likelihood and parsimony methods. The new likelihood-based method was found to be superior to the parsimony method. The probability that the amino acids for all interior nodes at a site reconstructed by the new method are correct was calculated to be 0.91, 0.86, and 0.73 for all, variable, and parsimony-informative sites, respectively, whereas the corresponding probabilities for the parsimony method were 0.84, 0.76, and 0.51, respectively. The probability that an amino acid in an ancestral sequence is correctly reconstructed by the likelihood analysis ranged from 91.3 to 98.7% for the four ancestral sequences.
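The idea of comparing posterior probabilities of character states at an interior node can be sketched for the simplest possible case. The example below makes several assumptions that are not taken from the abstract: a star tree with a single interior node, the Jukes-Cantor (JC69) nucleotide model with uniform base frequencies, and branch lengths that are fixed by hand rather than estimated by maximum likelihood. The function names are hypothetical.

```python
import math

BASES = "ACGT"


def jc69_prob(x, y, t):
    """JC69 probability of ending in base y after branch length t,
    starting from base x (t in expected substitutions per site)."""
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if x == y else 0.25 - 0.25 * e


def node_posterior(leaves):
    """Posterior over the state of the single interior node of a star tree.

    `leaves` is a list of (observed_base, branch_length) pairs. The branch
    lengths are assumed known here; in a real analysis they would be
    estimated from the data.
    """
    joint = {}
    for x in BASES:
        value = 0.25  # uniform prior on the ancestral base under JC69
        for base, t in leaves:
            value *= jc69_prob(x, base, t)
        joint[x] = value
    total = sum(joint.values())
    return {x: v / total for x, v in joint.items()}


# Hypothetical site: three descendants observed as A, A and G.
post = node_posterior([("A", 0.1), ("A", 0.1), ("G", 0.1)])
best = max(post, key=post.get)  # state with the highest posterior
```

For a tree with more than one interior node the same quantity would be obtained by summing over the states of the other nodes (for example with Felsenstein's pruning algorithm), which this sketch deliberately leaves out.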

14.
The mitochondrial 16S ribosomal RNA (rRNA) gene sequences from 93 cyprinid fishes were examined to reconstruct the phylogenetic relationships within the diverse and economically important subfamily Cyprininae. Within the subfamily a biased nucleotide composition (A>T, C>G) was observed in the loop regions of the gene, and in stem regions apparent selective pressures of base pairing showed a bias in favor of G over C and T over A. The bias may be associated with transition-transversion bias. Rates of nucleotide substitution were lower in stems than in loops. Analysis of compensatory substitutions across these taxa demonstrates 68% covariation in the gene and a logical weighting factor to account for dependence in mutations for phylogenetic inference should be 0.66. Comparisons of varied stem-loop weighting schemes indicate that the down-weightings for stem regions could improve the phylogenetic analysis and the degree of non-independence of stem substitutions was not as important as expected. Bayesian inference under four models of nucleotide substitution indicated that likelihood-based phylogenetic analyses were more effective in improving the phylogenetic performance than was weighted parsimony analysis. In Bayesian analyses, the resolution of phylogenies under the 16-state models for paired regions, incorporating GTR + G + I models for unpaired regions was better than those under other models. The subfamily Cyprininae was resolved as a monophyletic group, as well as tribe Labein and several genera. However, the monophyly of the currently recognized tribes, such as Schizothoracin, Barbin, Cyprinion + Onychostoma lineages, and some genera was rejected. Furthermore, comparisons of the parsimony and Bayesian analyses and results of variable length bootstrap analysis indicate that the mitochondrial 16S rRNA gene should contain important character variation to recover well-supported phylogeny of cyprinid taxa whose divergences occurred within the recent 8 MY, but could not provide resolution power for deep phylogenies spanning 10-19 MYA.

15.
Palaeobiogeographic reconstructions are underpinned by phylogenies, divergence times and ancestral area reconstructions, which together yield ancestral area chronograms that provide a basis for proposing and testing hypotheses of dispersal and vicariance. Methods for area coding include multi-state coding with a single character, binary coding with multiple characters and string coding. Ancestral reconstruction methods are divided into parsimony versus Bayesian/likelihood approaches. We compared nine methods for reconstructing ancestral areas for placental mammals. Ambiguous reconstructions were a problem for all methods. Important differences resulted from coding areas based on the geographical ranges of extant species versus the geographical provenance of the oldest fossil for each lineage. Africa and South America were reconstructed as the ancestral areas for Afrotheria and Xenarthra, respectively. Most methods reconstructed Eurasia as the ancestral area for Boreoeutheria, Euarchontoglires and Laurasiatheria. The coincidence of molecular dates for the separation of Afrotheria and Xenarthra at approximately 100 Ma with the plate tectonic sundering of Africa and South America hints at the importance of vicariance in the early history of Placentalia. Dispersal has also been important including the origins of Madagascar's endemic mammal fauna. Further studies will benefit from increased taxon sampling and the application of new ancestral area reconstruction methods.

16.
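Phylogenetic relationships within Anthropoidea were reconstructed by analysing the progesterone receptor gene of selected anthropoid species. The coding region of the progesterone receptor gene was amplified and sequenced for anthropoid species from 14 genera, and phylogenetic relationships were reconstructed from these sequence data using the neighbor-joining, maximum parsimony and maximum likelihood methods. Except within Platyrrhini, the trees built by the three methods had similar topologies with high support at each node. The reconstructed relationships within Hominoidea and Cercopithecoidea support the classification accepted by most authors. This study provides evidence for a sister-group relationship between chimpanzees and humans, suggesting that chimpanzees are closer to humans than gorillas or other apes and monkeys are. Within Platyrrhini, the phylogenetic relationships among Atelidae, Cebidae and Pitheciidae were not well resolved in this study.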

17.
An enigmatic acrochaetioid alga was collected from Niangziguan spring in Shanxi Province, northern China. Morphological data indicated that this alga reproduces exclusively asexually by monosporangia and its morphological characteristics suggested that it might be referred to Audouinella heterospora. To ascertain its phylogenetic position, phylogenetic trees were reconstructed using partial sequences of the plastid‐encoded gene (rbcL) and the nuclear‐encoded gene (SSU rDNA) applying Bayesian inference (BI), maximum parsimony (MP) and maximum likelihood (ML). However, phylogenetic reconstructions showed that this acrochaetioid alga does not belong in a clade with the genus Audouinella, but forms a clade with Thorea hispida (Thoreales). Based on this analysis it is concluded that A. heterospora represents the ‘chantransia’ stage of T. hispida.

18.
Lagenophora (Astereae, Asteraceae) has 14 species in New Zealand, Australia, Asia, southern South America, Gough Island and Tristan da Cunha. Phylogenetic relationships in Lagenophora were inferred using nuclear and plastid DNA regions. Reconstruction of spatio‐temporal evolution was estimated using parsimony, Bayesian inference and likelihood methods, a Bayesian relaxed molecular clock and ancestral area and habitat reconstructions. Our results support a narrow taxonomic concept of Lagenophora including only a core group of species with one clade diversifying in New Zealand and another in South America. The split between the New Zealand and South American Lagenophora dates from 11.2 Mya [6.1–17.4 95% highest posterior density (HPD)]. The inferred ancestral habitats were openings in beech forest and subalpine tussockland. The biogeographical analyses infer a complex ancestral area for Lagenophora involving New Zealand and southern South America. Thus, the estimated divergence times and biogeographical reconstructions provide circumstantial evidence that Antarctica may have served as a corridor for migration until the expansion of the continental ice during the late Cenozoic. The extant distribution of Lagenophora reflects a complex history that could also have involved direct long‐distance dispersal across southern oceans. © 2014 The Linnean Society of London, Botanical Journal of the Linnean Society, 2015, 177, 78–95.

19.
Understanding the proximate and ultimate causes underlying the evolution of nucleotide composition in mammalian genomes is of fundamental interest to the study of molecular evolution. Comparative genomics studies have revealed that many more substitutions occur from G and C nucleotides to A and T nucleotides than the reverse, suggesting that mammalian genomes are not at equilibrium for base composition. Analysis of human polymorphism data suggests that mutations that increase GC-content tend to be at much higher frequencies than those that decrease or preserve GC-content when the ancestral allele is inferred via parsimony using the chimpanzee genome. These observations have been interpreted as evidence for a fixation bias in favor of G and C alleles due to either positive natural selection or biased gene conversion. Here, we test the robustness of this interpretation to violations of the parsimony assumption using a data set of 21,488 noncoding single nucleotide polymorphisms (SNPs) discovered by the National Institute of Environmental Health Sciences (NIEHS) SNPs project via direct resequencing of n = 95 individuals. Applying standard nonparametric and parametric population genetic approaches, we replicate the signatures of a fixation bias in favor of G and C alleles when the ancestral base is assumed to be the base found in the chimpanzee outgroup. However, upon taking into account the probability of misidentifying the ancestral state of each SNP using a context-dependent mutation model, the corrected distribution of SNP frequencies for GC-content increasing SNPs is nearly indistinguishable from the patterns observed for other types of mutations, suggesting that the signature of fixation bias is a spurious artifact of the parsimony assumption.

20.
Mitochondrial DNA sequences can be used to estimate phylogenetic relationships among animal taxa and for molecular phylogenetic evolution analysis. With the development of sequencing technology, more and more mitochondrial sequences have been made available in public databases, including whole mitochondrial DNA sequences. These data have been used for phylogenetic analysis of animal species, and for studies of evolutionary processes. We made phylogenetic analyses of 19 species of Cervidae, with Bos taurus as the outgroup. We used neighbor joining, maximum likelihood, maximum parsimony, and Bayesian inference methods on whole mitochondrial genome sequences. The consensus phylogenetic trees supported monophyly of the family Cervidae; it was divided into two subfamilies, Plesiometacarpalia and Telemetacarpalia, and four tribes, Cervinae, Muntiacinae, Hydropotinae, and Odocoileinae. The divergence times in these families were estimated by phylogenetic analysis using the Bayesian method with a relaxed molecular clock method; the results were consistent with those of previous studies. We concluded that the evolutionary structure of the family Cervidae can be reconstructed by phylogenetic analysis based on whole mitochondrial genomes; this method could be used broadly in phylogenetic evolutionary analysis of animal taxa.
