首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We have determined the DNA sequence of intron 1 and flanking exons in the mitochondrial apocytochrome b gene of the Neurospora laboratory strain 74A and the natural isolate North Africa. In contrast to a previous report, we find that this intron contains an open reading frame (ORF) of 951 bases in frame with the upstream exon. The putative intron-encoded protein resembles those of other intron ORFs with respect to length, calculated isoelectric point, and proportion of basic, acidic, polar, and non-polar amino acids; however, no amino acid sequences resembling the "decapeptides" characteristic of maturase-like ORFs were found. Coupled with the previous finding that this intron is capable of self-splicing in vitro in the absence of proteins, the observations discussed here raise the possibility that other introns with long, in-frame ORFs may also be capable of RNA-catalyzed splicing in vitro.  相似文献   

2.
Can Codon Usage Bias Explain Intron Phase Distributions and Exon Symmetry?   总被引:1,自引:0,他引:1  
More introns exist between codons (phase 0) than between the first and the second bases (phase 1) or between the second and the third base (phase 2) within the codon. Many explanations have been suggested for this excess of phase 0. It has, for example, been argued to reflect an ancient utility for introns in separating exons that code for separate protein modules. There may, however, be a simple, alternative explanation. Introns typically require, for correct splicing, particular nucleotides immediately 5 in exons (typically a G) and immediately 3 in the following exon (also often a G). Introns therefore tend to be found between particular nucleotide pairs (e.g., G|G pairs) in the coding sequence. If, owing to bias in usage of different codons, these pairs are especially common at phase 0, then intron phase biases may have a trivial explanation. Here we take codon usage frequencies for a variety of eukaryotes and use these to generate random sequences. We then ask about the phase of putative intron insertion sites. Importantly, in all simulated data sets intron phase distribution is biased in favor of phase 0. In many cases the bias is of the magnitude observed in real data and can be attributed to codon usage bias. It is also known that exons may carry either the same phase (symmetric) or different phases (asymmetric) at the opposite ends. We simulated a distribution of different types of exons using frequencies of introns observed in real genes assuming random combination of intron phases at the opposite sides of exons. Surprisingly the simulated pattern was quite similar to that observed. In the simulants we typically observe a prevalence of symmetric exons carrying phase 0 at both ends, which is common for eukaryotic genes. However, at least in some species, the extent of the bias in favor of symmetric (0,0) exons is not as great in simulants as in real genes. These results emphasize the need to construct a biologically relevant null model of successful intron insertion.Reviewing Editor: Dr. Manyuan Long  相似文献   

3.
Mobile group I introns sometimes contain an open reading frame (ORF) possibly encoding a site-specific DNA endonuclease. However, previous phylogenetic studies have not clearly deduced the evolutionary roles of the group I intron ORFs. In this paper, we examined the phylogeny of group IA2 introns inserted in the position identical to that of the chloroplast-encoded rbcL coding region (rbcL-462 introns) and their ORFs from 13 strains of five genera (Volvox, Pleodorina, Volvulina, Astrephomene, and Gonium) of the colonial Volvocales (Chlorophyceae) and a related unicellular green alga, Vitreochlamys. The rbcL-462 introns contained an intact or degenerate ORF of various sizes except for the Gonium multicoccum rbcL-462 intron. Partial amino acid sequences of some rbcL-462 intron ORFs exhibited possible homology to the endo/excinuclease amino acid terminal domain. The distribution of the rbcL-462 introns is sporadic in the phylogenetic trees of the colonial Volvocales based on the five chloroplast exon sequences (6021 bp). Phylogenetic analyses of the conserved intron sequences resolved that the G. multicoccum rbcL-462 intron had a phylogenetic position separate from those of other colonial volvocalean rbcL-462 introns, indicating the recent horizontal transmission of the intron in the G. multicoccum lineage. However, the combined data set from conserved intron sequences and ORFs from most of the rbcL-462 introns resolved robust phylogenetic relationships of the introns that were consistent with those of the host organisms. Therefore, most of the extant rbcL-462 introns may have been vertically inherited from the common ancestor of their host organisms, whereas such introns may have been lost in other lineages during evolution of the colonial Volvocales. In addition, apparently higher synonymous substitutions than nonsynonymous substitutions in the rbcL-462 intron ORFs indicated that the ORFs might evolve under functional constraint, which could result in homing of the rbcL-462 intron in cases of spontaneous intron loss. On the other hand, the presence of intact to largely degenerate ORFs of the rbcL-462 introns within the three isolates of Gonium viridistellatum and the rare occurrence of the ORF-lacking rbcL-462 intron suggested that the ORFs might degenerate to result in the spontaneous intron loss during a very short evolutionary time following the loss of the ORF function. Thus, the sporadic distribution of the rbcL-462 introns within the colonial Volvocales can be largely explained by an equilibrium between maintenance of the introns by the intron ORF and spontaneous loss of introns when the introns do not have a functional ORF.  相似文献   

4.
5.
The translation of human triosephosphate isomerase (TPI) mRNA normally terminates at codon 249 within exon 7, the final exon. Frameshift and nonsense mutations of the type that cause translation to terminate prematurely at or upstream of codon 189 within exon 6 reduce the level of nuclear TPI mRNA to 20 to 30% of normal by a mechanism that is not a function of the distance of the nonsense codon from either the translation initiation or termination codon. In contrast, frameshift and nonsense mutations of another type that cause translation to terminate prematurely at or downstream of codon 208, also within exon 6, have no effect on the level of nuclear TPI mRNA. In this work, quantitations of RNA that derived from TPI alleles in which nonsense codons had been generated between codons 189 and 208 revealed that the boundary between the two types of nonsense codons resides between codons 192 and 195. The analysis of TPI gene insertions and deletions indicated that the positional feature differentiating the two types of nonsense codons is the distance of the nonsense codon upstream of intron 6. For example, the movement of intron 6 to a position downstream of its normal location resulted in a concomitant downstream movement of the boundary between the two types of nonsense codons. The analysis of intron 6 mutations indicated that the intron 6 effect is stipulated by the 88 nucleotides residing between the 5' and 3' splice sites. Since the deletion of intron 6 resulted in only partial abrogation of the nonsense codon-mediated reduction in the level of TPI mRNA, other sequences within TPI pre-mRNA must function in the effect. One of these sequences may be intron 2, since the deletion of intron 2 also resulted in partial abrogation of the effect. In experiments that switched introns 2 and 6, the replacement of intron 6 with intron 2 was of no consequence to the effect of a nonsense codon within either exon 1 or exon 6. In contrast, the replacement of intron 2 with intron 6 was inconsequential to the effect of a nonsense codon in exon 6 but resulted in partial abrogation of a nonsense codon in exon 1.  相似文献   

6.
The overlapping ND4L and ND5 genes of Neurospora crassa mitochondria are interrupted by one and two intervening sequences, respectively, of about 1,490, 1,408 and 1,135 bp in length. All three intervening sequences are class I introns and as such have the potential to fold into the conserved secondary structure that has been proposed for the majority of fungal mitochondrial introns. They contain long open reading frames (ORFs; from 306 to 425 codons long) that are continuous and in frame with the upstream exon sequences. These ORFs contain the conserved decapeptide-encoding sequences that are characteristic of the ORFs present in most class I introns. Extensive homology exists among the ORFs encoded by the ND4L intron, ND5 intron 1, and the second intron of the N. crassa oli2 gene. Also, internal repeats of about 130 amino acid residues are present twice in each of these three ORFs, suggesting that a duplication event may have occurred in the formation of these ORFs. The ND4L intron shares extensive homology (at the levels of both primary and proposed secondary structures) with the self-splicing intervening sequence present in the Tetrahymena nuclear rRNA gene. This homology includes but is not limited to the core secondary structure, as peripheral structural elements are also conserved in the two introns.  相似文献   

7.
We report the nucleotide sequence of the chloroplast psbA gene encoding the 32 kilodalton protein of photosystem II from Chlamydomonas moewusii. Like its land plant homologues, this green algal protein consists of 353 amino acids. The C. moewusii psbA gene is composed of three exons containing 252, 11 and 90 codons and of two group I introns containing 2363 and 1807 nucleotides. Each of the introns features an internal open reading frame (ORF) that potentially encodes a basic protein of more than 300 residues. The primary sequences of the putative intron-encoded proteins are unrelated and none of them shares conserved elements with any of the proteins predicted from the group I intron sequences published so far. The first C. moewusii intron is inserted at the same position as the fourth intron of the psbA gene from Chlamydomonas reinhardtii; the second intron lies at a novel site downstream of this position. On the basis of their RNA secondary structures, the C. moewusii introns 1 and 2 can be assigned to subgroups IA and IB, respectively. However, intron 1 is not typical of subgroup IA introns, its most unusual feature being the location of the ORF in the "loop L5" region. To our knowledge, this is the first time that an ORF is located in this region of the group I intron structure.  相似文献   

8.
9.
Statistics of the exon-intron structure and splicing sites of several diverse eukaryotes was studied. The yeast exon-intron structures have a number of unique features. A yeast gene usually have at most one intron. The branch site is strongly conserved, whereas the polypirimidine tract is short. Long yeast introns tend to have stronger acceptor sites. In other species the branch site is less conserved and often cannot be determined. In non-yeast samples there is an almost universal correlation between lengths of neighboring exons (all samples excluding protists) and correlation between lengths of neighboring introns (human, drosophila, protists). On the average first introns are longer, and anomalously long introns are usually first introns in a gene. There is a universal preference for exons and exon pairs with the (total) length divisible by 3. Introns positioned between codons are preferred, whereas those positioned between the first and second positions in codon are avoided. The choice of A or G at the third position of intron (the donor splice sites generally prefer purines at this position) is correlated with the overall GC-composition of the gene. In all samples dinucleotide AG is avoided in the region preceding the acceptor site.  相似文献   

10.
B Wissinger  W Schuster  A Brennicke 《Cell》1991,65(3):473-482
The complete NADH dehydrogenase subunit 1 (nad1) ORF in Oenothera mitochondria is encoded by five exons. These exons are located in three distant locations of the mitochondrial genome. One genomic region encodes exon a, the second encodes exons b and c, and the third specifies exons d and e. Cis-splicing group II introns separate exons b and c and d and e, while trans-splicing reactions are required to link exons a and b and c and d. The two parts of the group II intron sequences involved in these trans-splicing events can be aligned in domain IV. Exon sequences and the maturase-related ORF in intron d/e are edited by numerous C to U alterations in the mRNA. Two RNA editing events in the trans-splicing intron a/b improve conservation of the secondary structure in the stem of domain VI. RNA editing in intron sequences may thus be required for the trans-splicing reaction.  相似文献   

11.
12.
13.
14.
The trnK intron of plants encodes the matK open reading frame (ORF), which has been used extensively as a phylogenetic marker for classification of plants. Here we examined the evolution of the trnK intron itself as a model for group II intron evolution in plants. Representative trnK intron sequences were compiled from species spanning algae to angiosperms, and four introns were newly sequenced. Phylogenetic analyses showed that the matK ORFs belong to the ML (mitochondrial-like) subclass of group II intron ORFs, indicating that they were derived from a mobile group II intron of the class. RNA structures of the introns were folded and analyzed, which revealed progressive RNA structural deviations and degenerations throughout plant evolution. The data support a model in which plant organellar group II introns were derived from bacterial-like introns that had "standard" RNA structures and were competent for self-splicing and mobility and that subsequently the ribozyme structures degenerated to ultimately become dependent upon host-splicing factors. We propose that the patterns of RNA structure evolution seen for the trnK intron will apply to the other group II introns in plants.  相似文献   

15.
16.
拟南芥和线虫基因序列及剪切位点的理论预测   总被引:5,自引:1,他引:5  
将拟南芥(A.thaliana)和线虫(C.elegans)基因组按外显子、内含子及基因间序列区分为3类。分别选取64、40、20种三联体的概率作为信号参数构建离散源,根据离散增量预测序列所属类型。结果表明:拟南芥各条染色体标准集总预测成功率达到82.19%,检验集为87.95%;线虫各条染色体标准集总预测成功率达到79.67%,检验集达到81,93%。另外,将两种基因序列中的外显子分别划分成3类,用外显子剪切位点、翻译起始和结束位点附近的三联体的3个位点作为3条子链,以各条子链的12个参数构建离散源,用离散增量对3种序列类型进行预测,预测成功率都达80%以上。  相似文献   

17.
A genomic DNA fragment (gCORE-1), encoding a portion of the cartilage proteoglycan core protein, has been isolated from a phage library using cDNA as a probe. The genomic insert is about 17 kilobase pairs; two BamHI fragments of the insert (1.3 and 4.8 kilobase pairs) contain most of the hybridizable sequences found in the cDNA. Sequence analysis of these fragments shows that they contain a total of five exons that encompass 216 amino acid residues, all of which are identical to those of the corresponding cDNA sequence. Three of the exons, which are adjacent to one another, are very similar to the corresponding exons in the gene of a rat hepatic lectin as well as to an exon in the gene of human pulmonary surfactant-associated protein. There is a strong degree of conservation of amino acid sequences encoded in the three genes, although there is no similarity between their introns. The sizes of the five exons in gCORE-1, except for one (which is indeterminate because only a partial cDNA sequence is available), are less than 184 base pairs, whereas the sizes of the introns range from 218 to greater than 2629 base pairs. Four of the introns interrupt an exon codon at either their donor or acceptor sites, between the first and second nucleotides. Only one intron does not split a codon. Intron and exon boundary sites are in agreement with known consensus sequences for introns. The dispersed distribution and relatively small size of the exons, if representative of the entire gene, suggest that the complete gene which codes for the core protein may be quite sizable.  相似文献   

18.
Computer analyses of the entire GenBank database were conducted to examine correlation between splicing sites and codon positions in reading frames. Intron insertion patterns (i.e., splicing site locations with respect to codon positions) have been analyzed for all of the 74 codons of all the eukaryote taxonomic groups: primates, rodents mammals, vertebrates, invertebrates, and plants. We found that reading frames are interrupted by an intron at a codon boundary (as opposed to the middle of a codon) significantly more often than expected. This observation is consistent with the exon shuffling hypothesis, because exons that end at codon boundaries can be concatenated without causing a frame shift and thus are evolutionarily advantageous. On the other hand, when introns interrupt at the middles of codons, they exist in between the first and second bases much more frequently than between the second and third bases, despite the fact that boundaries between the first and second bases of codons are generally far more important than those between the second and third bases. The reason for this is not clear and yet to be explained. We also show that the length of an exon is a multiple of 3 more frequently than expected. Furthermore, the total length of two consecutive exons is also more frequently a multiple of 3. All the observations above are consistent with results recently published by Long, Rosenberg, and Gilbert (1995).   相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号