首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Parsch J  Braverman JM  Stephan W 《Genetics》2000,154(2):909-921
A novel method of RNA secondary structure prediction based on a comparison of nucleotide sequences is described. This method correctly predicts nearly all evolutionarily conserved secondary structures of five different RNAs: tRNA, 5S rRNA, bacterial ribonuclease P (RNase P) RNA, eukaryotic small subunit rRNA, and the 3' untranslated region (UTR) of the Drosophila bicoid (bcd) mRNA. Furthermore, covariations occurring in the helices of these conserved RNA structures are analyzed. Two physical parameters are found to be important determinants of the evolution of compensatory mutations: the length of a helix and the distance between base-pairing nucleotides. For the helices of bcd 3' UTR mRNA and RNase P RNA, a positive correlation between the rate of compensatory evolution and helix length is found. The analysis of Drosophila bcd 3' UTR mRNA further revealed that the rate of compensatory evolution decreases with the physical distance between base-pairing residues. This result is in qualitative agreement with Kimura's model of compensatory fitness interactions, which assumes that mutations occurring in RNA helices are individually deleterious but become neutral in appropriate combinations.  相似文献   

2.
Baines JF  Parsch J  Stephan W 《Genetics》2004,166(1):237-242
Recent advances in experimental analyses of the evolution of RNA secondary structures suggest a more complex scenario than that typically considered by Kimura's classical model of compensatory evolution. In this study, we examine one such case in more detail. Previous experimental analysis of long-range compensatory interactions between the two ends of Drosophila Adh mRNA failed to fit the classical model of compensatory evolution. To further investigate and verify long-range pairing in Drosophila Adh with respect to models of compensatory evolution and its potential functional role, we introduced site-directed mutations in the Drosophila melanogaster Adh gene. We explore two alternative hypotheses for why previous analysis of long-range compensatory interactions failed to fit the classical model. Specifically, we investigate whether the disruption of a conserved short-range pairing within Adh exon 2 has an effect on Adh expression or if there is a dual functional role of a conserved sequence in the 3'-UTR in both long-range pairing and the negative regulation of Adh expression. We find that a classical result was not observed due to the pleiotropic effect of changing a nucleotide involved in both long-range base pairing and the negative regulation of gene expression.  相似文献   

3.
RNA secondary structure and compensatory evolution   总被引:6,自引:0,他引:6  
The classic concept of epistatic fitness interactions between genes has been extended to study interactions within gene regions, especially between nucleotides that are important in maintaining pre-mRNA/mRNA secondary structures. It is shown that the majority of linkage disequilibria found within the Drosophila Adh gene are likely to be caused by epistatic selection operating on RNA secondary structures. A recently proposed method of RNA secondary structure prediction based on DNA sequence comparisons is reviewed and applied to several types of RNAs, including tRNA, rRNA, and mRNA. The patterns of covariation in these RNAs are analyzed based on Kimura's compensatory evolution model. The results suggest that this model describes the substitution process in the pairing regions (helices) of RNA secondary structures well when the helices are evolutionarily conserved and thermodynamically stable, but fails in some other cases. Epistatic selection maintaining pre-mRNA/mRNA secondary structures is compared to weak selective forces that determine features such as base composition and synonymous codon usage. The relationships among these forces and their relative strengths are addressed. Finally, our mutagenesis experiments using the Drosophila Adh locus are reviewed. These experiments analyze long-range compensatory interactions between the 5' and 3' ends of Adh mRNA, the different constraints on secondary structures in introns and exons, and the possible role of secondary structures in RNA splicing.  相似文献   

4.
The alcohol dehydrogenase (Adh) region of Drosophila pseudoobscura, which includes the two genes Adh and Adh-Dup, was used to examine the pattern and organization of linkage disequilibrium among pairs of segregating nucleotide sites. A collection of 99 strains from the geographic range of D. pseudoobscura were nucleotide-sequenced with polymerase chain reaction-mediated techniques. All pairs of the 359 polymorphic sites in the 3.5-kb Adh region were tested for significant linkage disequilibrium with Fisher's exact test. Of the 74,278 pairwise comparisons of segregating sites, 127 were in significant linkage disequilibrium at the 5% level. The distribution of five linkage disequilibrium estimators D(ij), D(2), r(ij), r(2) and Dij were compared to theoretical distributions. The observed distributions of D(ij), D(2), r(ij) and r(2) were consistent with the theoretical distribution given an infinite sites model. The observed distribution of Dij differed from the theoretical distribution because of an excess of values at -1 and 1. No spatial pattern was observed in the linkage disequilibrium pattern in the Adh region except for two clusters of sites nonrandomly associated in the adult intron and intron 2 of Adh. The magnitude of linkage disequilibrium decreases significantly as nucleotide distance increases, or a distance effect. Adh-Dup had a larger estimate of the recombination parameter, 4Nc, than Adh, where N is the effective population size and c is the recombination rate. A comparison of the mutation and recombination parameters shows that 7-17 recombination events occur for each mutation event. The heterogeneous estimates of the recombination parameter and the inverse relationship between linkage disequilibrium and nucleotide distance are no longer significant when the two clusters of Adh intron sites are excluded from analyses. The most likely explanation for the two clusters of linkage disequilibria is epistatic selection between sites in the cluster to maintain pre-mRNA secondary structure.  相似文献   

5.
6.
In RNA fitness landscapes with interconnected networks of neutral mutations, neutral precursor mutations can play an important role in facilitating the accessibility of epistatic adaptive mutant combinations. I use an exhaustively surveyed fitness landscape model based on short sequence RNA genotypes (and their secondary structure phenotypes) to calculate the minimum rate at which mutants initially appearing as neutral are incorporated into an adaptive evolutionary walk. I show first, that incorporating neutral mutations significantly increases the number of point mutations in a given evolutionary walk when compared to estimates from previous adaptive walk models. Second, that incorporating neutral mutants into such a walk significantly increases the final fitness encountered on that walk - indeed evolutionary walks including neutral steps often reach the global optimum in this model. Third, and perhaps most importantly, evolutionary paths of this kind are often extremely winding in their nature and have the potential to undergo multiple mutations at a given sequence position within a single walk; the potential of these winding paths to mislead phylogenetic reconstruction is briefly considered.  相似文献   

7.

Background  

The secondary structure of an RNA must be known before the relationship between its structure and function can be determined. One way to predict the secondary structure of an RNA is to identify covarying residues that maintain the pairings (Watson-Crick, Wobble and non-canonical pairings). This "comparative approach" consists of identifying mutations from homologous sequence alignments. The sequences must covary enough for compensatory mutations to be revealed, but comparison is difficult if they are too different. Thus the choice of homologous sequences is critical. While many possible combinations of homologous sequences may be used for prediction, only a few will give good structure predictions. This can be due to poor quality alignment in stems or to the variability of certain sequences. This problem of sequence selection is currently unsolved.  相似文献   

8.
A distance constrained secondary structural model of the ≈10 kb RNA genome of the HIV-1 has been predicted but higher-order structures, involving long distance interactions, are currently unknown. We present the first global RNA secondary structure model for the HIV-1 genome, which integrates both comparative structure analysis and information from experimental data in a full-length prediction without distance constraints. Besides recovering known structural elements, we predict several novel structural elements that are conserved in HIV-1 evolution. Our results also indicate that the structure of the HIV-1 genome is highly variable in most regions, with a limited number of stable and conserved RNA secondary structures. Most interesting, a set of long distance interactions form a core organizing structure (COS) that organize the genome into three major structural domains. Despite overlapping protein-coding regions the COS is supported by a particular high frequency of compensatory base changes, suggesting functional importance for this element. This new structural element potentially organizes the whole genome into three major domains protruding from a conserved core structure with potential roles in replication and evolution for the virus.  相似文献   

9.
Bacterial ribonuclease P (RNase P), an endonuclease involved in tRNA maturation, is a ribonucleoprotein containing a catalytic RNA. The secondary structure of this ribozyme is well established, but comparatively little is understood about its 3-D structure. In this analysis, orientation and distance constraints between elements within the Escherichia coli RNase P RNA-pre-tRNA complex were determined by intra- and intermolecular crosslinking experiments. A molecular mechanics-based RNA structure refinement protocol was used to incorporate the distance constraints indicated by crosslinking, along with the known secondary structure of RNase P RNA and the tertiary structure of tRNA, into molecular models. Seven different structures that satisfy the constraints equally well were generated and compared by superposition to estimate helix positions and orientations. Manual refinement within the range of conformations indicated by the molecular mechanics analysis was used to derive a model of RNase P RNA with bound substrate pre-tRNA that is consistent with the crosslinking results and the available phylogenetic comparisons.  相似文献   

10.
Messenger RNA sequences often have to preserve functional secondary structure elements in addition to coding for proteins. We present a statistical analysis of retroviral mRNA which supports the hypothesis that the natural genetic code is adapted to such complementary coding. These sequences are still able to explore efficiently the space of possible proteins by point mutations. This is borne out by the observation that, in stem regions of retroviral mRNA foldings, silent mutations on one strand are preferentially accompanied by conservative mutations on the other. Distances between amino acids based on physicochemical properties are used to quantify the conservation of protein function under the constraint of maintained RNA secondary structure. We find that preservation of RNA secondary structure by compensatory mutations is evolutionary compatible with the efficient search for new variants on the protein level. Received: 4 June 1999 / Accepted: 12 October 1999  相似文献   

11.
A Bar-Shira  A Panet    A Honigman 《Journal of virology》1991,65(10):5165-5173
Sequence analysis of the human T-cell leukemia virus type I (HTLV-I) long terminal repeat (LTR) does not reveal a polyadenylation consensus sequence, AAUAAA, close to the polyadenylation site at the 3' end of the viral RNA. Using site-directed mutagenesis, we demonstrated that two cis-acting signals are required for efficient RNA processing in HTLV-I LTR: (i) a remote AAUAAA hexamer at a distance of 276 nucleotides upstream of the polyadenylation site, and (ii) the 20-nucleotide GU-rich sequence immediately downstream from the poly(A) site. It has been postulated that the folding of RNA into a secondary structure juxtaposes the AAUAAA sequence, in a noncontiguous manner, to within 14 nucleotides of the polyadenylation site. To test this hypothesis, we introduced deletions and point mutations within the U3 and R regions of the LTR. RNA 3'-end processing occurred efficiently at the authentic HTLV-I poly(A) site after deletion of the sequences predicted to form the secondary structure. Thus, the genetic analysis supports the hypothesis that folding of the HTLV-I RNA in the U3 and R regions juxtaposes the AAUAAA sequence and the poly(A) site to the correct functional distance. This unique arrangement of RNA-processing signals is also found in the related retroviruses HTLV-II and bovine leukemia virus.  相似文献   

12.

Background  

RNAMute is an interactive Java application that calculates the secondary structure of all single point mutations, given an RNA sequence, and organizes them into categories according to their similarity with respect to the wild type predicted structure. The secondary structure predictions are performed using the Vienna RNA package. Several alternatives are used for the categorization of single point mutations: Vienna's RNAdistance based on dot-bracket representation, as well as tree edit distance and second eigenvalue of the Laplacian matrix based on Shapiro's coarse grain tree graph representation.  相似文献   

13.
Freeling M 《Genetics》1976,83(4):701-717
The ability to stain mature pollen grains for the presence of alcohol dehydrogenase (ADH) activity permits the quantitation of ADH( +) gametophytes at frequencies below 10(-6). This resolution allows reversion and genetic fine structure analyses. The rationale of pollen analysis follows Nelson's prototype studies with waxy. As with the waxy gene, revertant frequencies for seven Adh1-deficient ( Adh1(-)) alleles appear to be in excess of microbially derived expectations. Each of the seven Adh1(-) alleles were derived from one of three naturally occurring isoalleles. Based on Schwartz's protein level characterizations of the mutants' products, it was anticipated that the seven Adh1(-) alleles should recombine to yield ADH(+) cistrons in certain pairwise combinations. This expectation was not met. The parental "wild-type" isoalleles from which the mutants were derived appear to be structurally divergent. The discussion interprets these data in view of understanding naturally occurring cistronic variation.  相似文献   

14.
A statistical reference for RNA secondary structures with minimum free energies is computed by folding large ensembles of random RNA sequences. Four nucleotide alphabets are used: two binary alphabets, AU and GC, the biophysical AUGC and the synthetic GCXK alphabet. RNA secondary structures are made of structural elements, such as stacks, loops, joints, and free ends. Statistical properties of these elements are computed for small RNA molecules of chain lengths up to 100. The results of RNA structure statistics depend strongly on the particular alphabet chosen. The statistical reference is compared with the data derived from natural RNA molecules with similar base frequencies. Secondary structures are represented as trees. Tree editing provides a quantitative measure for the distance dt, between two structures. We compute a structure density surface as the conditional probability of two structures having distance t given that their sequences have distance h. This surface indicates that the vast majority of possible minimum free energy secondary structures occur within a fairly small neighborhood of any typical (random) sequence. Correlation lengths for secondary structures in their tree representations are computed from probability densities. They are appropriate measures for the complexity of the sequence-structure relation. The correlation length also provides a quantitative estimate for the mean sensitivity of structures to point mutations. © 1993 John Wiley & Sons, Inc.  相似文献   

15.
N-Ethyl-N-nitrosourea (ENU) was used to induce mutations in the Drosophila melanogaster, alcohol dehydrogenase (Adh) gene. Flies were treated with ENU and mated to homozygous intragenic Adh null mutants; Adh null mutations were selected by exposure of the F1 generation to 1-penten-3-ol. Fourteen Adh null mutations were recovered which included 11 from spermatozoa, 2 from oocytes and 1 from a premeiotic spermatocyte. 2 mutations from spermatozoa and 1 of the mutations from oocytes were multilocus deficiencies which included the Adh locus as determined by complementation tests. The remaining 11 intragenic Adh null mutations were sequenced using the Sanger dideoxy method. One Adh null mutation induced in an oocyte was an AT to TA transversion and the mutation induced in a premeiotic spermatocyte was a GC to AT transition, both of which resulted in a single amino acid substitution. The 11 null mutations induced in spermatozoa were a data set in which both the dose of ENU and the treated germ-cell stage were held constant; therefore, only these 11 mutations were used to calculate the mutation frequency and compare the mutations at the Adh locus with those recovered in other studies. The dose of ENU induced a sex-linked recessive lethal frequency approximately 300 times that of the spontaneous frequency; therefore, these mutations were assumed to have been induced by ENU. 2 of the 11 mutations induced in spermatozoa were multilocus deficiencies and 9 were intragenic mutations. 7 of the 9 intragenic mutations were GC to AT transitions which resulted in 5 single amino acid substitutions, 1 premature translation termination codon, and 1 splice site mutation.(ABSTRACT TRUNCATED AT 400 WORDS)  相似文献   

16.
The Rate of Compensatory Evolution   总被引:8,自引:1,他引:7       下载免费PDF全文
W. Stephan 《Genetics》1996,144(1):419-426
A two-locus model is presented to analyze the evolution of compensatory mutations occurring in stems of RNA secondary structures. Single mutations are assumed to be deleterious but harmless (neutral) in appropriate combinations. In proceeding under mutation pressure, natural selection and genetic drift from one fitness peak to another one, a population must therefore pass through a valley of intermediate deleterious states of individual fitness. The expected time for this transition is calculated using diffusion theory. The rate of compensatory evolution, k(c), is then defined as the inverse of the expected transition time. When selection against deleterious single mutations is strong, k(c) depends on the recombination fraction r between the two loci. Recombination generally reduces the rate of compensatory evolution because it breaks up favorable combinations of double mutants. For complete linkage, k(c) is given by the rate at which favorable combinations of double mutants are produced by compensatory mutation. For r>0, k(c) decreases exponentially with r. In contrast, k(c) becomes independent of r for weak selection. We discuss the dynamics of evolutionary substitutions of compensatory mutants in relation to WRIGHT's shifting balance theory of evolution and use our results to analyze the substitution process in helices of mRNA secondary structures.  相似文献   

17.
The frequency of X-ray-induced (null-enzyme) mutations at the alcohol dehydrogenase locus in Drosophila melanogaster was measured. The rate of recovery of chromosomes that fail to direct the synthesis of a functional Adh protein is 3 x 10(-8) per R for chromosomes that do not include large chromosome rearrangements. However, this analysis excludes a larger number of chromosomes that are "null-enzyme mutations" because thye are deleted for the region of the Adh locus. The dose of X-rays required to induce a frequency of non-deletion null-enzyme mutants equal to the spontaneous frequency is about 73 rad calculated from the data reported in this communication.  相似文献   

18.
In vitro selection can generate functional sequence variants of an RNA structural motif that are useful for comparative analysis. The technique is particularly valuable in cases where natural variation is unavailable or non-existent. We report the extension of this approach to a new extreme--the identification of a 112 nt ribozyme secondary structure imbedded within a 186 nt RNA. A pool of 10(14) variants of an RNA ligase ribozyme was generated using combinatorial chemical synthesis coupled with combinatorial enzymatic ligation such that 172 of the 186 relevant positions were partially mutagenized. Active variants of this pool were enriched using an in vitro selection scheme that retains the sequence variability at positions very close to the ligation junction. Ligases isolated after four rounds of selection catalyzed self-ligation up to 700 times faster than the starting sequence. Comparative analysis of the isolates indicated that when complexed with substrate RNAs the ligase forms a nested, double pseudo-knot secondary structure with seven stems and several important joining segments. Comparative analysis also suggested the identity of mutations that account for the increased activity of the selected ligase variants; designed constructs incorporating combinations of these changes were more active than any of the individual ligase isolates.  相似文献   

19.
The evolution and adaptation of molecular populations is constrained by the diversity accessible through mutational processes. RNA is a paradigmatic example of biopolymer where genotype (sequence) and phenotype (approximated by the secondary structure fold) are identified in a single molecule. The extreme redundancy of the genotype-phenotype map leads to large ensembles of RNA sequences that fold into the same secondary structure and can be connected through single-point mutations. These ensembles define neutral networks of phenotypes in sequence space. Here we analyze the topological properties of neutral networks formed by 12-nucleotides RNA sequences, obtained through the exhaustive folding of sequence space. A total of 4(12) sequences fragments into 645 subnetworks that correspond to 57 different secondary structures. The topological analysis reveals that each subnetwork is far from being random: it has a degree distribution with a well-defined average and a small dispersion, a high clustering coefficient, and an average shortest path between nodes close to its minimum possible value, i.e. the Hamming distance between sequences. RNA neutral networks are assortative due to the correlation in the composition of neighboring sequences, a feature that together with the symmetries inherent to the folding process explains the existence of communities. Several topological relationships can be analytically derived attending to structural restrictions and generic properties of the folding process. The average degree of these phenotypic networks grows logarithmically with their size, such that abundant phenotypes have the additional advantage of being more robust to mutations. This property prevents fragmentation of neutral networks and thus enhances the navigability of sequence space. In summary, RNA neutral networks show unique topological properties, unknown to other networks previously described.  相似文献   

20.
Parameters influencing the efficiency of expression of the human immune interferon (IFN-gamma) gene in E. coli were studied by comparing a series of eight in vitro-derived gene variants. These contained all possible combinations of silent mutations in the first three codons of the mature IFN-gamma polypeptide coding sequence. Expression levels varied up to 50-fold among the different constructions. Comparison of messenger RNA secondary structure models for each variant suggested that the presence of stem-loop structures blocking the translation initiation signals could drastically decrease the efficiency of IFN-gamma synthesis. With variants displaying no stable mRNA secondary structure in the region, a C----U transition at position +11 after the AUG resulted in a 5-fold increase in expression indicating that RNA primary structure also plays an important role in expression. In addition we demonstrate that, in this system, a spacing of 8 nucleotides between the Shine-Dalgarno region and AUG was optimal for gene expression and that the steady-state production level of IFN-gamma rose exponentially with increasing rate of synthesis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号