首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 421 毫秒
1.
A model of nucleotide substitution that allows the transition/transversion rate bias to vary across sites was constructed. We examined the fit of this model using likelihood-ratio tests by analyzing 13 protein coding genes and 1 pseudogene. Likelihood-ratio testing indicated that a model that allows variation in the transition/transversion rate bias across sites provided a significant improvement in fit for most protein coding genes but not for the pseudogene. When the analysis was repeated with parameters estimated separately for first, second, and third codon positions, strong heterogeneity was uncovered for the first and second codon positions; the variation in the transition/transversion rate was generally weaker at the third codon position. The transition rate bias and branch lengths are underestimated when variation in the transition/transversion rate was not accommodated, suggesting that it may be important to accommodate variation in the pattern of nucleotide substitution for accurate estimation of evolutionary parameters. Received: 4 November 1997 / Accepted: 19 May 1998  相似文献   

2.
We present a further application of the stochastic model previously described (Lanave et al., 1984, 1985) for measuring the nucleotide substitution rate in the mammalian evolution of the mitochondrial DNA (mtDNA). The applicability of this method depends on the validity of "stationarity conditions" (equal nucleotide frequencies at first, second and third silent codon positions in homologous protein coding genes). In the comparison of homologous sequences satisfying the stationarity condition at the silent sites, only the four codon families (quartets) for which both transitions and transversions are silent at the third position are considered here. This has allowed us to estimate the transition and transversion rates for any pair of species. We have analyzed the third silent codon position of the triplet rat-mouse-cow, of a series of slightly divergent primates and of two Drosophila species. In terms of two external dating input we have then determined the phylogenetic trees for rat, mouse, and cow as well as for a number of primates including man. The phylogenetic tree that we have derived for the triplet rat, mouse and cow agrees with that we had previously determined by analyzing the first, second and third silent codon positions (in both duets and quartets) of mt genes (Lanave et al., 1985). For primates our method leads to the following branching order from the oldest to the most recent: Gibbon, Orangutan, Gorilla, Chimpanzee and Man. In absolute time, fixing the distance Chimpanzee-Man as 5 million years (Myr) we estimate the dating of the divergence nodes as: Gorilla 7 Myr; Orangutan 16 Myr; Gibbon 20 Myr. In all cases analyzed, the transition rate has been found to be substantially higher than the transversion rate. Moreover we have found that the transition/transversion ratio is different in the various lineages. We suggest that this fact is probably related to the nucleotide frequencies at the third silent codon position.  相似文献   

3.
Bielawski JP  Dunn KA  Yang Z 《Genetics》2000,156(3):1299-1308
Rates and patterns of synonymous and nonsynonymous substitutions have important implications for the origin and maintenance of mammalian isochores and the effectiveness of selection at synonymous sites. Previous studies of mammalian nuclear genes largely employed approximate methods to estimate rates of nonsynonymous and synonymous substitutions. Because these methods did not account for major features of DNA sequence evolution such as transition/transversion rate bias and unequal codon usage, they might not have produced reliable results. To evaluate the impact of the estimation method, we analyzed a sample of 82 nuclear genes from the mammalian orders Artiodactyla, Primates, and Rodentia using both approximate and maximum-likelihood methods. Maximum-likelihood analysis indicated that synonymous substitution rates were positively correlated with GC content at the third codon positions, but independent of nonsynonymous substitution rates. Approximate methods, however, indicated that synonymous substitution rates were independent of GC content at the third codon positions, but were positively correlated with nonsynonymous rates. Failure to properly account for transition/transversion rate bias and unequal codon usage appears to have caused substantial biases in approximate estimates of substitution rates.  相似文献   

4.
Phylogenetic relationships were inferred using nucleotide sequences of the chloroplast gene matK for members of Cornales, a well-supported monophyletic group comprising Cornaceae and close relatives. The shortest trees resulting from this analysis were highly concordant with those based on previous phylogenetic analysis of rbcL sequences. Analysis of a combined matK and rbcL sequence data set (a total of 2652 bp [base pairs]) provided greater resolution of relationships and higher internal support for clades compared to the individual data sets. Four major clades (most inclusive monophyletic groups) of Cornales are indicated by both sets of genes: (1) Cornus-Alangium, (2) nyssoids (Nyssa-Davidia-Camptotheca)- mastixioids (Mastixia, Diplopanax), (3) Curtisia, and (4) Hydrangeaceae-Loasaceae. The combined evidence indicates that clades 2 and 3 are sisters, with clade 4 sister to the remainder of Cornales. These relationships are also supported by other lines of evidence, including synapomorphies in fruit and pollen morphology and gynoecial vasculature. Comparisons of matK and rbcL sequences based on one of the most parsimonious rbcL-matK trees indicate that matK has a much higher A-T content (66.9% in matK vs. 55.8% in rbcL) and a lower transition:transversion ratio (1.23 in matK vs. 2.21 in rbcL). The total number of nucleotide substitutions per site for matK is 2.1 times that of rbcL in Cornales. These findings are similar to recent comparisons of matK and rbcL in other dicots. Variable sites of matK are almost evenly distributed among the three codon positions (1.0:1.0:1.3), whereas variable sites of rbcL are mostly at the third position (1.8:1.0 :7.5). Among- lineages rates of nucleotide substitutions in rbcL are basically homogeneous throughout Cornales, but are more heterogeneous in matK.  相似文献   

5.
Summary Selective constraints on DNA sequence change were incorporated into a model of DNA divergence by restricting substitutions to a subset of nucleotide positions. A simple model showed that both mutation rate and the fraction of nucleotide positions free to vary are strong determinants of DNA divergence over time.When divergence between two species approaches the fraction of positions free to vary, standard methods that correct for multiple mutations yield severe underestimates of the number of substitutions per site. A modified method appropriate for use with DNA sequence, restriction site, or thermal renaturation data is derived taking this fraction into account. The model also showed that the ratio of divergence in two gene classes (e.g., nuclear and mitochondrial) may vary widely over time even if the ratio of mutation rates remains constant.DNA sequence divergence data are used increasingly to detect differences in rates of molecular evolution. Often, variation in divergence rate is assumed to represent variation in mutation rate. The present model suggests that differing divergence rates among comparisons (either among gene classes or taxa) should be interpreted cautiously. Differences in the fraction of nucleotide positions free to vary can serve as an important alternative hypothesis to explain differences in DNA divergence rates.  相似文献   

6.
The hepatitis B virus (HBV) has a circular DNA genome of about 3,200 base pairs. Economical use of the genome with overlapping reading frames may have led to severe constraints on nucleotide substitutions along the genome and to highly variable rates of substitution among nucleotide sites. Nucleotide sequences from 13 complete HBV genomes were compared to examine such variability of substitution rates among sites and to examine the phylogenetic relationships among the HBV variants. The maximum likelihood method was employed to fit models of DNA sequence evolution that can account for the complexity of the pattern of nucleotide substitution. Comparison of the models suggests that the rates of substitution are different in different genes and codon positions; for example, the third codon position changes at a rate over ten times higher than the second position. Furthermore, substantial variation of substitution rates was detected even after the effects of genes and codon positions were corrected; that is, rates are different at different sites of the same gene or at the same codon position. Such rates after the correction were also found to be positively correlated at adjacent sites, which indicated the existence of conserved and variable domains in the proteins encoded by the viral genome. A multiparameter model validates the earlier finding that the variation in nucleotide conservation is not random around the HBV genome. The test for the existence of a molecular clock suggests that substitution rates are more or less constant among lineages. The phylogenetic relationships among the viral variants were examined. Although the data do not seem to contain sufficient information to resolve the details of the phylogeny, it appears quite certain that the serotypes of the viral variants do not reflect their genetic relatedness. Correspondence to: Z. Yang  相似文献   

7.
S. Kumar 《Genetics》1996,143(1):537-548
Maximum likelihood methods were used to study the differences in substitution rates among the four nucleotides and among different nucleotide sites in mitochondrial protein-coding genes of vertebrates. In the 1st+2nd codon position data, the frequency of nucleotide G is negatively correlated with evolutionary rates of genes, substitution rates vary substantially among sites, and the transition/transversion rate bias (R) is two to five times larger than that expected at random. Generally, largest transition biases and greatest differences in substitution rates among sites are found in the highly conserved genes. The 3rd positions in placental mammal genes exhibit strong nucleotide composition biases and the transitional rates exceed transversional rates by one to two orders of magnitude. Tamura-Nei and Hasegawa-Kishino-Yano models with gamma distributed variable rates among sites (gamma parameter, α) adequately describe the nucleotide substitution process in 1st+2nd position data. In these data, ignoring differences in substitution rates among sites leads to largest biases while estimating substitution rates. Kimura's two-parameter model with variable-rates among sites performs satisfactorily in likelihood estimation of R, α, and overall amount of evolution for 1st+2nd position data. It can also be used to estimate pairwise distances with appropriate values of α for a majority of genes.  相似文献   

8.
王乐  柴建华 《遗传学报》1996,23(5):329-337
DNA分子进化中,对核苷酸替换的选择可呈选择中性或选择倾向性。为研究载脂蛋白基因进化过程中对核苷酸变化的选择方式,本文建立了基因的密码子空间分析方法。密码子空间是由密码子3个位置上核苷酸出现机率所组成的矩阵。对该空间中核苷酸分布的非随机性度量可以反映进化过程中核苷酸替换的选择方式。应用该法,我们发现载脂蛋白基因密码子空间第一及第三位的核苷酸分布呈高度非随机性。进一步研究表明:这种核苷酸的非随机分布可能与腺苷酸、胸苷酸对密码子位置的非中性选择有关。此外,还研究了同义密码子的选择使用与分支种系发生的关系。结果显示:载脂蛋白分子演化中存在着同义密码子使用的分子进化钟。这些研究提示密码子空间中核苷酸替换的非随机选择可能是载脂蛋白基因进化的一种特征。  相似文献   

9.
X Liu  H Liu  W Guo  K Yu 《Gene》2012,509(1):136-141
Codon models are now widely used to draw evolutionary inferences from alignments of homologous sequence data. Incorporating physicochemical properties of amino acids into codon models, two novel codon substitution models describing the evolution of protein-coding DNA sequences are presented based on the similarity scores of amino acids. To describe substitutions between codons a continue-time Markov process is used. Transition/transversion rate bias and nonsynonymous codon usage bias are allowed in the models. In our implementation, the parameters are estimated by maximum-likelihood (ML) method as in previous studies. Furthermore, instantaneous mutations involving more than one nucleotide position of a codon are considered in the second model. Then the two suggested models are applied to five real data sets. The analytic results indicate that the new codon models considering physicochemical properties of amino acids can provide a better fit to the data comparing with existing codon models, and then produce more reliable estimates of certain biologically important measures than existing methods.  相似文献   

10.
The phylogeny of Greya Busck (Lepidoptera: Prodoxidae) was inferred from nucleotide sequence variation across a 765-bp region in the cytochrome oxidase I and II genes of the mitochondrial genome. Most parsimonious relationships of 25 haplotypes from 16 Greya species and two outgroup genera (Tetragma and Prodoxus) showed substantial congruence with the species relationships indicated by morphological variation. Differences between mitochondrial and morphological trees were found primarily in the positions of two species, G. variabilis and G. pectinifera, and in the branching order of the three major species groups in the genus. Conflicts between the data sets were examined by comparing levels of homoplasy in characters supporting alternative hypotheses. The phylogeny of Greya species suggests that host-plant association at the family level and larval feeding mode are conservative characters. Transition/transversion ratios estimated by reconstruction of nucleotide substitutions on the phylogeny had a range of 2.0-9.3, when different subsets of the phylogeny were used. The decline of this ratio with the increase in maximum sequence divergence among taxa indicates that transitions are masked by transversions along deeper internodes or long branches of the phylogeny. Among transitions, substitutions of A-->G and T-->C outnumbered their reciprocal substitutions by 2-6 times, presumably because of the approximately 4:1 (77%) A+T-bias in nucleotide base composition. Of all transversions, 73%-80% were A<-->T substitutions, 85% of which occurred at third positions of codons; these estimates did not decrease with an increase in maximum sequence divergence of taxa included in the analysis. The high frequency of A<-->T substitutions is either a reflection or an explanation of the 92% A+T bias at third codon positions.   相似文献   

11.
In analyzing the silent nucleotide substitutions in some mammalian mitochondrial mRNA coding genes, we had found that the frequency of each of the four nucleotides in rat, mouse, and cow, but not in humans, is the same in the silent third codon position (Lanave C, Preparata G, Saccone C, Serio G (1984) J Mol Evol 20:86-93). Because our findings for these three species were compatible with a stationary Markov process for the evolution of nucleotide sequences, we applied such a model to calculate the effective evolutionary silent substitution rate (vs) and the divergence times among the species. In this paper we have analyzed the first and second codon positions in the same mammalian mitochondrial genes. We found that in the first and second codon positions the human mitochondrial genes satisfy the stationarity conditions. This has allowed us to use the stochastic model mentioned above to calculate the divergence times among mouse, rat, cow, and human. Furthermore, we have analyzed the silent substitution rate in one nuclear gene for these four mammals. We found that in this gene the effective silent substitution rate is about 3 times lower than in mitochondrial genes, and that humans are in this case stationary with respect to the other three mammals in the third codon position as well. Application of our Markov model to this latter gene yields divergence times consistent with our previous determinations.  相似文献   

12.
13.
Summary Based on the rates of synonymous substitution in 42 protein-codin gene pairs from rat and human, a correlation is shown to exist between the frequency of the nucleotides in all positions of the codon and the synonymous substitution rate. The correlation coefficients were positive for A and T and negative for C and G. This means that AT-rich genes accumulate more synonymous substitutions than GC-rich genes. Biased patterns of mutation could not account for this phenomenon. Thus, the variation in synonymous substitution rates and the resulting unequal codon usage must be the consequence of selection against A and T in synonymous positions. Most of the varition in rates of synonymous substitution can be explained by the nucleotide composition in synonymous positions. Codon-anticodon interactions, dinucleotide frequencies, and contextual factors influence neither the rates of synonymous substitution nor codon usage. Interestingly, the nucleotide in the second position of codons (always a nonsynonymous position) was found to affect the rate of synonymous substitution. This finding links the rate of nonsynonymous substitution with the synonymous rate. Consequently, highly conservative proteins are expected to be encoded by genes that evolve slowly in terms of synonymous substitutions, and are consequently highly biased in their codon usage.  相似文献   

14.
Summary In analyzing the silent nucleotide substitutions in some mammalian mitochondrial mRNA coding genes, we had found that the frequency of each of the four nucleotides in rat, mouse, and cow, but not in humans, is the same in the silent third codon position (Lanave C, Preparata G, Saccone C, Serio G (1984) J Mol Evol 20:86-93). Because our findings for these three species were compatible with a stationary Markov process for the evolution of nucleotide sequences, we applied such a model to calculate the effective evolutionary silent substitution rate (vs) and the divergence times among the species. In this paper we have analyzed the first and second codon positions in the same mammalian mitochondrial genes. We found that in the first and second codon positions the human mitochondrial genes satisfy the stationarity conditions. This has allowed us to use the stochastic model mentioned above to calculate the divergence times among mouse, rat, cow, and human. Furthermore, we have analyzed the silent substitution rate in one nuclear gene for these four mammals. We found that in this gene the effective silent substitution rate is about 3 times lower than in mitochondrial genes, and that humans are in this case stationary with respect to the other three mammals in the third codon position as well. Application of our Markov model to this latter gene yields divergence times consistent with our previous determinations.  相似文献   

15.
We completed phylogenetic analysis of the major non-coding region of the mitochondrial DNA (mtDNA) from 159 animals of eight Euro-American and six East Asian domesticated pig breeds and 164 Japanese and five European wild boars. A total of 62 mtDNA haplotypes were detected. Alignment of these regions revealed nucleotide variations (including gaps) at 73 positions, including 58 sites with transition nucleotide substitutions, and two transversion substitutions. Phylogenetic analysis of the sequences could not organize domestic pig breeds into discrete clusters. In addition, many of the haplotypes found in members of diverged clustering groups were found primarily in Euro-American pig breeds, indicating extensive introgression of Asian domestic pigs into European breeds. Furthermore, phylogenetic analysis allocated the DNA sequences of non-coding regions into two different groups, and the deepest branchpoint of this porcine phylogeny corresponded to 86 000-136 000 years before present. This time of divergence would predate the historical period when the pig is thought to have been domesticated from the wild boar.  相似文献   

16.
We examine the complete nucleotide sequences of the mitochondrial cytochrome oxidase II gene of 13 species of insects, representing 10 orders. The genes range from 673 to 690 bp in length, encoding 226 to 229 amino acids. Several insertion or deletion events, each involving one or two codons, can be observed. The 3' end of the gene is extremely variable in both length and sequence, making alignment of the ends unreliable. Using the first 639 nucleotide positions, for which unambiguous alignments could be obtained, we examine the neighbor-joining trees based on nucleotide divergences and based on conserved subsets of that data, including transversion and amino acid and second codon position divergences. Each of these subsets produces different trees, none of which can be easily reconciled with trees constructed using morphology and the fossil record. Bootstrap analysis using second codon positions strongly supports affinities between the order Blatteria (cockroaches) and the order Isoptera (termites) and between a wasp and the published honeybee sequence (Order Hymenoptera). The divergence of insect orders is very ancient and may have occurred too rapidly for easy resolution using mitochondrial protein sequences. Unambiguous resolution of insect orders will probably require analysis of many additional taxa, using the COII gene and other conserved sequences.  相似文献   

17.
A model of evolutionary base substitutions that can incorporate different substitutional rates between the four bases and that takes into account unequal composition of bases in DNA sequences is proposed. Using this model, we derived formulae that enable us to estimate the evolutionary distances in terms of the number of nucleotide substitutions through comparative studies of nucleotide sequences. In order to check the validity of various formulae, Monte Carlo experiments were performed. These formulae were applied to analyze data on DNA sequences from diverse organisms. Particular attention was paid to problems concerning a globin pseudogene in the mouse and the time of its origin through duplication. We obtained a result suggesting that the evolutionary rates of substitution in the first and second codon positions of the pseudogene were roughly 10 times faster than those in the normal globin genes; whereas, the rate in the third position remained almost unchanged. Application of our formulae to histone genes H2B and H3 of the sea urchin showed that, in each of these genes, the rate in the third codon position is tremendously higher than that in the second position. All of these observations can easily and consistently be interpreted by the neutral theory of molecular evolution.  相似文献   

18.
A codon-based model of nucleotide substitution for protein-coding DNA sequences   总被引:34,自引:23,他引:11  
A codon-based model for the evolution of protein-coding DNA sequences is presented for use in phylogenetic estimation. A Markov process is used to describe substitutions between codons. Transition/transversion rate bias and codon usage bias are allowed in the model, and selective restraints at the protein level are accommodated using physicochemical distances between the amino acids coded for by the codons. Analyses of two data sets suggest that the new codon-based model can provide a better fit to data than can nucleotide-based models and can produce more reliable estimates of certain biologically important measures such as the transition/transversion rate ratio and the synonymous/nonsynonymous substitution rate ratio.   相似文献   

19.
Variation in chloroplastrbcL sequences was studied in representative species of four different lineages: the tribeRubieae (Rubiaceae), and the generaDrosera (Droseraceae),Nothofagus (Nothofagaceae) andIlex (Aquifoliaceae). Each lineage has its particular non-overlapping set ofrbcL polymorphic sites, indicating that common unconstrainedrbcL sites are not shared. Large differences in the rate and pattern of nucleotide substitution are observed among the four lineages. The genusIlex has the lowest rate of substitution, the lowest transition/transversion ratio, the lowest synonymous/replacement ratio and the lowest number of substitutions at the third codon position. An apparent relationship of these measures to the age of the lineages is observed. The A + T content and codon use among the four lineages are very similar and, apparently, cannot account for the observed differences in patterns of nucleotide substitution. However, the A + T content of the two bases immediately flanking the polymorphic sites is higher inIlex than in the other lineages. This could be correlated with the transversion/transition bias observed inIlex. The particularly low synonymous/replacement ratio found inIlex could also be explained by the small population sizes of species in this genus.  相似文献   

20.
The Escherichia coli mutator alleles, mutL and mutS, produced transversion as well as transition base-pair substitutions with the trpA reversion system. Transversions, however, were generally mutator-induced at a lower level than transitions and the specific type of transversion and its nucleotide position appeared to strongly affect its level of enhancement. These results are interpreted to mean that mutL- and mutS-dependent mismatch correction is generally more effective at correcting transition mispairings than transversion mispairings. Correction of transversion mispairings is probably dependent upon site of occurrence and type of mismatch.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号