首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
We examined 103 nucleotide sequences of the HIV-1 env gene, sampled from 35 countries and tested: I) the random (neutral) distribution of the number of nucleotide changes; II) the proportion of bases at molecular equilibrium; III) the neutral expected homogeneity of the distribution of new fixated bases; IV) the hypothesis of the neighbor influence on the mutation rates in a site. The expected random number of fixations per site was estimated by Bose-Einstein statistics, and the expected frequencies of bases by matrices of mutation-fixation rates. The homogeneity of new fixations was analyzed using χ2 and trinomial tests for homogeneity. Fixations of the central base in trinucleotides were used to test the neighbor influence on base substitutions. Neither the number of fixations nor the frequencies of bases fitted the expected neutral distribution. There was a highly significant heterogeneity in the distribution of new fixations, and several sites showed more transversions than transitions, showing that each nucleotide site has its own pattern of change. These three independent results make the neutral theory, the nearly neutral and the neighbor influence hypotheses untenable and indicate that evolution of env is rather highly selective.  相似文献   

2.
Analysis for the homogeneity of the distribution of the second base of dinucleotides in relation to the first, whose bases are separated by 0, 1, 2,... 21 nucleotide sites, was performed with the VIH-1 genome (cDNA), the Drosophila mtDNA, the Drosophila Torso gene and the human p-globin gene. These four DNA segments showed highly significant heterogeneities of base distributions that cannot be accounted for by neutral or nearly neutral evolution or by the "neighbor influence" of nucleotides on mutation rates. High correlations are found in the bases of dinucleotides separated by 0, 1 and more number of sites. A periodicity of three consecutive significance values (measured by the x29) was found only in Drosophila mtDNA. This periodicity may be due to an unknown structure or organization of mtDNA. This non-random distribution of the two bases of dinucleotides widespread throughout these DNA segments is rather compatible with panselective evolution and generalized internucleotide co-adaptation.  相似文献   

3.
Neutral evolution results from random recurrent mutation and genetic drift. A small part of random evolution, that which is related to protein or DNA polymorphisms, is the subject of the Neutral Theory of Evolution. One of the foundations of this theory is the demonstration that the mutation rate (m) is equal to the substitution rate. Since both rates are independent of population size, they are independent of drift, which is dependent upon population size. Neutralists have erroneously equated the substitution rate with the fixation rate, despite the fact that they are antithetical conceptions. The neutralists then applied the random walk stochastic model to justify alleles or bases that were fixated or eliminated. In this model, once the allele or base frequencies reach the monomorphic states (values of 1.0 or 0.0), the absorbing barriers, they can no longer return to the polymorphic state. This operates in a pure mathematical model. If recurrent mutation occurs (as in biotic real systems) fixation and elimination are impossible. A population of bacteria in which m = 10(-8) base mutation (or substitution)/site/generation and the reproduction rate is 1000 cell cycle/year should replace all its genome bases in approximately 100,000 years. The expected situation for all sites is polymorphism for the four bases rather than monomorphism at 1.0 or 0.0 frequencies. If fixation and elimination of a base for more than 500,000 years are impossible, then most of the neutral theory is untenable. A new complete neutral model, which allows for recurrent substitutions, is proposed here based on recurrent mutation or substitution and drift alone. The model fits a binomial or Poisson distribution and not a geometric one, as does neutral theory.  相似文献   

4.
Retroviruses favor target-DNA (tDNA) distortion and particular bases at sites of integration, but the mechanism underlying HIV-1 selectivity is unknown. Crystal structures revealed a network of prototype foamy virus (PFV) integrase residues that distort tDNA: Ala188 and Arg329 interact with tDNA bases, while Arg362 contacts the phosphodiester backbone. HIV-1 integrase residues Ser119, Arg231, and Lys258 were identified here as analogs of PFV integrase residues Ala188, Arg329 and Arg362, respectively. Thirteen integrase mutations were analyzed for effects on integrase activity in vitro and during virus infection, yielding a total of 1610 unique HIV-1 integration sites. Purine (R)/pyrimidine (Y) dinucleotide sequence analysis revealed HIV-1 prefers the tDNA signature (0)RYXRY(4), which accordingly favors overlapping flexible dinucleotides at the center of the integration site. Consistent with roles for Arg231 and Lys258 in sequence specific and non-specific binding, respectively, the R231E mutation altered integration site nucleotide preferences while K258E had no effect. S119A and S119T integrase mutations significantly altered base preferences at positions −3 and 7 from the site of viral DNA joining. The S119A preference moreover mimicked wild-type PFV selectivity at these positions. We conclude that HIV-1 IN residue Ser119 and PFV IN residue Ala188 contact analogous tDNA bases to effect virus integration.  相似文献   

5.
Sequence saturation mutagenesis (SeSaM) is a conceptually novel and practically simple method that truly randomizes a target sequence at every single nucleotide position. A SeSaM experiment can be accomplished within 2–3 days and comprises four steps: generating a pool of DNA fragments with random length, ‘tailing’ the DNA fragments with universal base using terminal transferase at 3′-termini, elongating DNA fragments in a PCR to the full-length genes using a single-stranded template and replacing the universal bases by standard nucleotides. Random mutations are created at universal sites due to the promiscuous base-pairing property of universal bases. Using enhanced green fluorescence protein as the model system and deoxyinosine as the universal base, we proved by sequencing 100 genes the concept of the SeSaM method and achieved a random distribution of mutations with the mutational bias expected for deoxyinosine.  相似文献   

6.
Phylogenetic analyses frequently rely on models of sequence evolution that detail nucleotide substitution rates, nucleotide frequencies, and site-to-site rate heterogeneity. These models can influence hypothesis testing and can affect the accuracy of phylogenetic inferences. Maximum likelihood methods of simultaneously constructing phylogenetic tree topologies and estimating model parameters are computationally intensive, and are not feasible for sample sizes of 25 or greater using personal computers. Techniques that initially construct a tree topology and then use this non-maximized topology to estimate ML substitution rates, however, can quickly arrive at a model of sequence evolution. The accuracy of this two-step estimation technique was tested using simulated data sets with known model parameters. The results showed that for a star-like topology, as is often seen in human immunodeficiency virus type 1 (HIV-1) subtype B sequences, a random starting topology could produce nucleotide substitution rates that were not statistically different than the true rates. Samples were isolated from 100 HIV-1 subtype B infected individuals from the United States and a 620 nt region of the env gene was sequenced for each sample. The sequence data were used to obtain a substitution model of sequence evolution specific for HIV-1 subtype B env by estimating nucleotide substitution rates and the site-to-site heterogeneity in 100 individuals from the United States. The method of estimating the model should provide users of large data sets with a way to quickly compute a model of sequence evolution, while the nucleotide substitution model we identified should prove useful in the phylogenetic analysis of HIV-1 subtype B env sequences. Received: 4 October 2000 / Accepted: 1 March 2001  相似文献   

7.
The genetic variability and evolution of the satellite RNA (satRNA) of cucumber mosaic virus (CMV) was analyzed. Twenty-five CMV-satRNAs compared clustered into three main groups, and no correlation was found between genetic proximity and other characteristics (pathogenicity, geographical origin) of the satRNAs. Values for the number of nucleotide substitutions per site between any two satRNAs suggest that divergence is checked by functional constraints. The analysis of mutations relative to an ancestral sequence, and the number of substitutions per site at first, second and third positions of codons in putative open reading frames, show that the variation of CMV-satRNAs does not follow a pattern typical of coding sequences, and indicates that preservation of the sequence of encoded products is not a constraint to evolution. On the other hand, when the observed variation was analyzed relative to a secondary structure model proposed for CMV-satRNAs, several lines of evidence indicated that the maintenance of the secondary structure is a constraint to evolution: the number of substitutions per site, the number of point insertions and deletions and the number of base substitutions that would disrupt base-pairing were significantly higher for unpaired than for base-paired positions. Also, compensatory mutations at base-paired positions occurred more frequently than expected from random. The results suggest that CMV-satRNAs are non-coding, functional RNAs whose biology would be determined by their direct interaction with components of the host and/or the helper virus.  相似文献   

8.
Apolipoprotein B mRNA-editing, enzyme-catalytic, polypeptide-like 3G (i.e., APOBEC3G or A3G) is an evolutionarily conserved cytosine deaminase that potently restricts human immunodeficiency virus type 1 (HIV-1), retrotransposons and other viruses. A3G has a nucleotide target site specificity for cytosine dinucleotides, though only certain cytosine dinucleotides are ‘hotspots’ for cytosine deamination, and others experience little or no editing by A3G. The factors that define these critical A3G hotspots are not fully understood. To investigate how A3G hotspots are defined, we used an in vitro fluorescence resonance energy transfer-based oligonucleotide assay to probe the site specificity of A3G. Our findings strongly suggest that the target single-stranded DNA (ssDNA) secondary structure as well as the bases directly 3′ and 5′ of the cytosine dinucleotide are critically important A3G recognition. For instance, A3G cannot readily deaminate a cytosine dinucleotide in ssDNA stem structures or in nucleotide base loops composed of three bases. Single-stranded nucleotide loops up to seven bases in length were poor targets for A3G activity unless cytosine residues flanked the cytosine dinucleotide. Furthermore, we observed that A3G favors adenines, cytosines and thymines flanking the cytosine dinucleotide target in unstructured regions of ssDNA. Low cytosine deaminase activity was detected when guanines flanked the cytosine dinucleotide. Taken together, our findings provide the first demonstration that A3G cytosine deamination hotspots are defined by both the sequence context of the cytosine dinucleotide target as well as the ssDNA secondary structure. This knowledge can be used to better trace the origins of mutations to A3G activity, and illuminate its impact on processes such as HIV-1 genetic variation.  相似文献   

9.
The evolution of human immunodeficiency virus (HIV) type 1 nef quasispecies in a patient clonally infected with a contaminated batch of blood clotting factor IX was monitored. nef sequences were derived at 11, 25, and 41 months postinfection from infected peripheral blood mononuclear cells after molecular cloning of PCR-amplified proviral DNA. The phylogenetic relationships among a total of 41 informative sequences were established by split decomposition analysis and used as a basis to establish a substitution matrix and to score synonymous (s) and nonsynonymous (ns) substitutions. The number of observed in-phase stop codons within the nef sequences was comparable to that expected on a random basis. Similarly, the numbers of observed s and ns substitutions did not differ significantly from expected values. No codon position was preferentially mutated. The maximum sequence divergence increased in a linear manner, with approximately 4.4 nucleotide and approximately 3.2 amino acid changes per year. It appears that stochastic processes strongly influence short-term HIV nef quasispecies evolution in vivo.  相似文献   

10.
Zika virus (ZIKV) is a mosquito‐transmitted flavivirus, linked to microcephaly and fetal death in humans. Here, we investigate whether host‐mediated RNA editing of adenosines (ADAR) plays a role in the molecular evolution of ZIKV. Using complete coding sequences for the ZIKV polyprotein, we show that potential ADAR substitutions are underrepresented at the ADAR‐resistant GA dinucleotides of both the positive and negative strands, that these changes are spatially and temporally clustered (as expected of ADAR editing) for three branches of the viral phylogeny, and that ADAR mutagenesis can be linked to its codon usage. Furthermore, resistant GA dinucleotides are enriched on the positive (but not negative) strand, indicating that the former is under stronger purifying selection than the latter. ADAR editing also affects the evolution of the rhabdovirus sigma. Our study now documents that host ADAR editing is a mutation and evolutionary force of positive‐ as well as negative‐strand RNA viruses.  相似文献   

11.
Protein evolution has occurred by successive fixation of individual mutations. The probability of fixation depends on the fitness of the mutation, and the arising variant can be deleterious, neutral, or beneficial. Despite its relevance, only few studies have estimated the distribution of fitness effects caused by random single mutations on protein function. The human immunodeficiency virus type 1 (HIV-1) protease was chosen as a model protein to quantify protein's tolerability to random single mutations. After determining the enzymatic activity of 107 single random mutants, we found that 86% of single mutations were deleterious for the enzyme catalytic efficiency and 54% lethal. Only 2% of the mutations significantly increased the catalytic efficiency of the enzyme. These data demonstrate the vulnerability of HIV-1 protease to single random mutations. When a second random mutagenesis library was constructed from an HIV-1 protease carrying a highly deleterious single mutation (D30N), a higher proportion of mutations with neutral or beneficial effect were found, 26% and 9%, respectively. Importantly, antagonist epistasis was observed between deleterious mutations. In particular, the mutation N88D, lethal for the wild-type protease, restored the wild-type catalytic efficiency when combined with the highly deleterious mutation D30N. The low tolerability to single random substitutions shown here for the wild-type HIV-1 protease contrasts with its in vivo ability to generate an adaptive variation. Thus, the antagonist epistasis between deleterious or lethal mutations may be responsible for increasing the protein mutational robustness and evolvability.  相似文献   

12.
Mammalian gene evolution: Nucleotide sequence divergence between mouse and rat   总被引:16,自引:0,他引:16  
As a paradigm of mammalian gene evolution, the nature and extent of DNA sequence divergence between homologous protein-coding genes from mouse and rat have been investigated. The data set examined includes 363 genes totalling 411 kilobases, making this by far the largest comparison conducted between a single pair of species. Mouse and rat genes are on average 93.4% identical in nucleotide sequence and 93.9% identical in amino acid sequence. Individual genes vary substantially in the extent of nonsynonymous nucleotide substitution, as expected from protein evolution studies; here the variation is characterized. The extent of synonymous (or silent) substitution also varies considerably among genes, though the coefficient of variation is about four times smaller than for nonsynonymous substitutions. A small number of genes mapped to the X-chromosome have a slower rate of molecular evolution than average, as predicted if molecular evolution is male-driven. Base composition at silent sites varies from 33% to 95% G + C in different genes; mouse and rat homologues differ on average by only 1.7% in silent-site G + C, but it is shown that this is not necessarily due to any selective constraint on their base composition. Synonymous substitution rates and silent site base composition appear to be related (genes at intermediate G + C have on average higher rates), but the relationship is not as strong as in our earlier analyses. Rates of synonymous and nonsynonymous substitution are correlated, apparently because of an excess of substitutions involving adjacent pairs of nucleotides. Several factors suggest that synonymous codon usage in rodent genes is not subject to selection.  相似文献   

13.
Summary Computer simulation of protein evolution is based on a simple model consisting of random fixation of allowed codons (RFAC). Random replacement of single nucleotides occurs in a DNA sequence. If this results in any of the synonomous codons for allowed amino acids the mutation is fixed, if not, there is no change in the DNA and the cycle is repeated. Multiple fixations at the same nucleotide site, back mutations, degenerate fixations and coincidental identity of amino acids all occur. RFAC simulation begins with a single DNA sequence and follows a phylogeny based on the fossil record. The rate of fixation at the level of DNA is constant. The model upon which RFAC simulation is based is the same as the neutral theory of molecular evolution. The simulation is therefore a test of this theory. The results of simulated and real evolution are compared for fibrinopeptides A in mammals and cytochromes C and hemoglobin and chains in vertebrates. In each case the allowed variation at each site has been set equal to that observed, twice that observed and all protein amino acids. Rates of fixation vary from 2.4 × 10–10 to 10–5 accepted nucleotide fixations per codon per year. There is some, although never excellent, agreement between real and simulated evolution, the better fits are obtained in the cases of fibrinopeptides A and cytochromes C. The major source of discrepancy between real evolution and simulation is irregularities in the rates of real evolution. RFAC simulation is compared with the random evolutionary hit (REH) model, augmented maximum parsimony and the accepted point mutations (PAM) approach.  相似文献   

14.
We describe a mathematical model and Monte Carlo (MC) simulation of viral evolution during acute infection. We consider both synchronous and asynchronous processes of viral infection of new target cells. The model enables an assessment of the expected sequence diversity in new HIV-1 infections originating from a single transmitted viral strain, estimation of the most recent common ancestor (MRCA) of the transmitted viral lineage, and estimation of the time to coalesce back to the MRCA. We also calculate the probability of the MRCA being the transmitted virus or an evolved variant. Excluding insertions and deletions, we assume HIV-1 evolves by base substitution without selection pressure during the earliest phase of HIV-1 infection prior to the immune response. Unlike phylogenetic methods that follow a lineage backwards to coalescence, we compare the observed data to a model of the diversification of a viral population forward in time. To illustrate the application of these methods, we provide detailed comparisons of the model and simulations results to 306 envelope sequences obtained from eight newly infected subjects at a single time point. The data from patients were in good agreement with model predictions, and hence compatible with a single-strain infection evolving under no selection pressure. The diversity of the samples from the other two patients was too great to be explained by the model, suggesting multiple HIV-1-strains were transmitted. The model can also be applied to longitudinal patient data to estimate within-host viral evolutionary parameters.  相似文献   

15.
Asymmetrical distribution of CpG in an 'average' mammalian gene.   总被引:24,自引:7,他引:17       下载免费PDF全文
The frequency and distribution of the rare dinucleotide CpG was examined in 15 mammalian genes. CpG is highly methylated at cytosine in mammalian DNA (1,2) and 5-methylcytosine (5mC) is thought to undergo a transition mutation via deamination to produce thymine (3). This would result in the accumulation of TpG and CpA and depletion of CpG during evolution (4). Consistent with this hypothesis, the gene sample of 26,541 dinucleotides contained CpG at 40% the frequency expected by base composition and the CpG transition products, TpG+CpA, were significantly elevated at 124% of expected random frequency. However, because CpG occurs at only 25% of expected random frequency in the genome, the sampled genes were considerably enriched in this dinucleotide. CpGs were asymmetrically distributed in sequences flanking the genes. 5'-flanking sequences were enriched in CpG at 135% of the frequency expected assuming a symmetrical distribution of all the CpGs in the sampled genes (p less than 0.01), while 3'-flanking regions were depleted in CpG at 40% of expected values (p less than 0.0001). This asymmetry may reflect the role of 5-methylcytosine in gene expression. In contrast the frequencies of GpC and GpT+ ApC did not differ significantly from that predicted by base composition and these dinucleotides were not asymmetrically distributed.  相似文献   

16.
M Hasegawa  T A Yano 《Origins of life》1975,6(1-2):219-227
The entropy of the amino acid sequences coded by DNA is considered as a measure of diversity of variety of proteins, and is taken as a measure of evolution. The DNA or m-RNA sequence is considered as a stationary second-order Markov chain composed of four kinds of bases. Because of the biased nature of the genetic code table, increase of entropy of amino acid sequences is possible with biased nucleotide sequence. Thus the biased DNA base composition and the extreme rarity of the base doublet CpG of higher organisms are explained. It is expected that the amino acid composition was highly biased at the days of the origin of the genetic code table, and the more frequent amino acids have tended to get rarer, and the rarer ones more frequent. This tendency is observed in the evolution of hemoglobin, cytochrome C, fibrinopeptide, immunoglobulin and lysozyme, and protein as a whole.  相似文献   

17.
We found a consistent 3-site periodicity of the X29 values for the heterogeneity of the distribution of the second base in relation to the first base of dinucleotides separated by 0 (contiguous), 1, 2, 3 ... 17 (K) nucleotide sites in Drosophila mtDNA. Triplets of X29 values were found where the first was over 300 and the second and third ranged between 37 and 114 (previous studies). In this study, the periodicity was significant until separation of 2011K, and a structure of deviations from randomness among dinucleotides was found. The most deviant dinucleotides were G-G, G-C and C-G for the first, second and third element of the triplet, respectively. In these three cases there were more dinucleotides observed than expected. This inter-bases correlation and periodicity may be related to the tertiary structure of circular DNA, like that of prokaryotes and mitochondria, to protect and preserve it. The mtDNA with 19.517 bp was divided into four equal segments of 4.879 bp. The fourth sub-segment presented a very low proportion of G and C, the internucleotide interaction was weaker in this sub-segment and no periodicity was found. The maintenance of this mtDNA structure and organization for millions of generations, in spite of a high recurrent mutation rate, does not support the notion of neutralism or near neutralism. The high level of internucleotide interaction and periodicity indicate that every nucleotide is co-adapted with the residual genome.  相似文献   

18.
Summary The maintenance of a proper distribution of charged amino acid residues might be expected to be an important factor in protein evolution. We therefore compared the inferred changes in charge during the evolution of 43 protein families with the changes expected on the basis of random base substitutions. It was found that certain proteins, like the eye lens crystallins and most histones, display an extreme avoidance of changes in charge. Other proteins, like phospholipase A2 and ferredoxin, apparently have sustained more charged replacements than expected, suggesting a positive selection for changes in charge. Depending on function and structure of a protein, charged residues apparently can be important targets for selective forces in protein evolution. It appears that actual biased codon usage tends to decrease the proportion of charged amino acid replacements. The influence of nonrandomness of mutations is more equivocal. Genes that use the mitochondrial instead of the universal code lower the probability that charge changes will occur in the encoded proteins.  相似文献   

19.
20.
Replication of HIV-1 under selective pressure frequently results in the evolution of virus variants that replicate more efficiently under the applied conditions. For example, in patients on antiretroviral therapy, such evolution can result in variants that are resistant to the HIV-1 inhibitors, thus frustrating the therapy. On the other hand, virus evolution can help us to understand the molecular mechanisms that underlie HIV-1 replication. For example, evolution of a defective virus mutant can result in variants that overcome the introduced defect by restoration of the original sequence or by the introduction of additional mutations in the viral genome. Analysis of the evolution pathway can reveal the requirements of the element under study and help to understand its function. Analysis of the escape routes may generate new insight in the viral life cycle and result in the identification of unexpected biological mechanisms. We have developed in vitro HIV-1 evolution into a systematic research tool that allows the study of different aspects of the viral replication cycle. We will briefly review this method of forced virus evolution and provide several examples that illustrate the power of this approach.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号