首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
3.
Most research concerning the evolution of introns has largely considered introns within coding sequences (CDSs), without regard for introns located within untranslated regions (UTRs) of genes. Here, we directly determined intron size, abundance, and distribution in UTRs of genes using full-length cDNA libraries and complete genome sequences for four species, Arabidopsis thaliana, Drosophila melanogaster, human, and mouse. Overall intron occupancy (introns/exon kbp) is lower in 5' UTRs than CDSs, but intron density (intron occupancy in regions containing introns) tends to be higher in 5' UTRs than in CDSs. Introns in 5' UTRs are roughly twice as large as introns in CDSs, and there is a sharp drop in intron size at the 5' UTR-CDS boundary. We propose a mechanistic explanation for the existence of selection for larger intron size in 5' UTRs, and outline several implications of this hypothesis. We found introns to be randomly distributed within 5' UTRs, so long as a minimum required exon size was assumed. Introns in 3' UTRs were much less abundant than in 5' UTRs. Though this was expected for human and mouse that have intron-dependent nonsense-mediated decay (NMD) pathways that discourage the presence of introns within the 3' UTR, it was also true for A. thaliana and D. melanogaster, which may lack intron-dependent NMD. Our findings have several implications for theories of intron evolution and genome evolution in general.  相似文献   

4.
Dinoflagellate genomes present unique challenges including large size, modified DNA bases, lack of nucleosomes, and condensed chromosomes. EST sequencing has shown that many genes are found as many slightly different variants implying that many copies are present in the genome. As a preliminary survey of the genome our goal was to obtain genomic sequences for 47 genes from the dinoflagellate Amphidinium carterae. A PCR approach was used to avoid problems with large insert libraries. One primer set was oriented inward to amplify the genomic complement of the cDNA and a second primer set would amplify outward between tandem repeats of the same gene. Each gene was also tested for a spliced leader using cDNA as template. Almost all (14/15) of the highly expressed genes (i.e. those with high representation in the cDNA pool) were shown to be in tandem arrays with short intergenic spacers, and most were trans-spliced. Only two moderately expressed genes were found in tandem arrays. A polyadenylation signal was found in genomic copies containing the sequence AAAAG/C at the exact polyadenylation site and was conserved between species. Four genes were found to have a high intron density (>5 introns) while most either lacked introns, or had only one to three. Actin was selected for deeper sequencing of both genomic and cDNA copies. Two clusters of actin copies were found, separated from each other by many non-coding features such as intron size and sequence. One intron-rich gene was selected for genomic walking using inverse PCR, and was not shown to be in a tandem repeat. The first glimpse of dinoflagellate genome indicates two general categories of genes in dinoflagellates, a highly expressed tandem repeat class and an intron rich less expressed class. This combination of features appears to be unique among eukaryotes.  相似文献   

5.
6.
The nucleotide sequence of Korean ginseng (Panax schinseng Nees) chloroplast genome has been completed (AY582139). The circular double-stranded DNA, which consists of 156,318 bp, contains a pair of inverted repeat regions (IRa and IRb) with 26,071 bp each, which are separated by small and large single copy regions of 86,106 bp and 18,070 bp, respectively. The inverted repeat region is further extended into a large single copy region which includes the 5' parts of the rpsl9 gene. Four short inversions associated with short palindromic sequences that form stem-loop structures were also observed in the chloroplast genome of P. schinseng compared to that of Nicotiana tabacum. The genome content and the relative positions of 114 genes (75 peptide-encoding genes, 30 tRNA genes, 4 rRNA genes, and 5 conserved open reading frames [ycfs]), however, are identical with the chloroplast DNA of N. tabacum. Sixteen genes contain one intron while two genes have two introns. Of these introns, only one (trnL-UAA) belongs to the self-splicing group I; all remaining introns have the characteristics of six domains belonging to group II. Eighteen simple sequence repeats have been identified from the chloroplast genome of Korean ginseng. Several of these SSR loci show infra-specific variations. A detailed comparison of 17 known completed chloroplast genomes from the vascular plants allowed the identification of evolutionary modes of coding segments and intron sequences, as well as the evaluation of the phylogenetic utilities of chloroplast genes. Furthermore, through the detailed comparisons of several chloroplast genomes, evolutionary hotspots predominated by the inversion end points, indel mutation events, and high frequencies of base substitutions were identified. Large-sized indels were often associated with direct repeats at the end of the sequences facilitating intra-molecular recombination.  相似文献   

7.
U12-dependent introns are found in small numbers in most eukaryotic genomes, but their scarcity makes accurate characterisation of their properties challenging. A computational search for U12-dependent introns was performed using the draft version of the human genome sequence. Human expressed sequences confirmed 404 U12-dependent introns within the human genome, a 6-fold increase over the total number of non-redundant U12-dependent introns previously identified in all genomes. Although most of these introns had AT-AC or GT-AG terminal dinucleotides, small numbers of introns with a surprising diversity of termini were found, suggesting that many of the non-canonical introns found in the human genome may be variants of U12-dependent introns and, thus, spliced by the minor spliceosome. Comparisons with U2-dependent introns revealed that the U12-dependent intron set lacks the ‘short intron’ peak characteristic of U2-dependent introns. Analysis of this U12-dependent intron set confirmed reports of a biased distribution of U12-dependent introns in the genome and allowed the identification of several alternative splicing events as well as a surprising number of apparent splicing errors. This new larger reference set of U12-dependent introns will serve as a resource for future studies of both the properties and evolution of the U12 spliceosome.  相似文献   

8.
Group I and group II introns are different catalytic self-splicing and mobile RNA elements that contribute to genome dynamics. In this study, we have analyzed their distribution and evolution in 29 sequenced genomes from the Bacillus cereus group of bacteria. Introns were of different structural classes and evolutionary origins, and a large number of nearly identical elements are shared between multiple strains of different sources, suggesting recent lateral transfers and/or that introns are under a strong selection pressure. Altogether, 73 group I introns were identified, inserted in essential genes from the chromosome or newly described prophages, including the first elements found within phages in bacterial plasmids. Notably, bacteriophages are an important source for spreading group I introns between strains. Furthermore, 77 group II introns were found within a diverse set of chromosomal and plasmidic genes. Unusual findings include elements located within conserved DNA metabolism and repair genes and one intron inserted within a novel retroelement. Group II introns are mainly disseminated via plasmids and can subsequently invade the host genome, in particular by coupling mobility with host cell replication. This study reveals a very high diversity and variability of mobile introns in B. cereus group strains.  相似文献   

9.
A part of eukaryotic tRNA genes harbor an intron at one nucleotide 3' to the anticodon, so that removal of the intron is an essential processing step for tRNA maturation. While some tRNA introns have important roles in modification of certain nucleotides, essentiality of the tRNA intron in eukaryotes has not been tested extensively. This is partly because most of the eukaryotic genomes have multiple genes encoding an isoacceptor tRNA. Here, we examined whether the intron of tRNA-Trp(CCA) genes, six copies of which are scattered on the genome of yeast, Saccharomyces cerevisiae, is essential for growth or translation of the yeast in vivo. We devised a procedure to remove all of the tRNA introns from the yeast genome iteratively with marker cassettes containing both positive and negative markers. Using this procedure, we removed all the introns from the six tRNA-Trp(CCA) genes, and found that the intronless strain grew normally and expressed tRNA-Trp(CCA) in an amount similar to that of the wild-type genes. Neither incorporation of (35)S-labeled amino acids into a TCA-insoluble fraction nor the major protein pattern on SDS-PAGE/2D gel were affected by complete removal of the intron, while expression levels of some proteins were marginally affected. Therefore, the tRNA-Trp(CCA) intron is dispensable for growth and bulk translation of the yeast. This raises the possibility that some mechanism other than selective pressure from translational efficiency maintains the tRNA intron on the yeast genome.  相似文献   

10.
We identified the single gene for nuclear lamin in the genome draft of the sea anemone Nematostella vectensis, a member of the cnidaria, a very old metazoan phylum. The gene consists of 10 exons and 9 introns. Strikingly all 9 intron positions are conserved in the human lamin B genes, which have only 1 (lamin B1) or 2 (lamin B2) additional introns. Using the information on neighboring genes we propose that the human lamin B1 gene on chromosome 5 is the true homolog of the Nematostella lamin gene, while the lamin B2 gene on chromosome 19 arose during vertebrate evolution. In marked contrast to this conservation of gene structure are the results in the rapidly evolving genomes of Drosophila and Caenorhabditis elegans. Here the lamin genes have much fewer introns and these occur often at novel positions. In the single nematode lamin gene and the Drosophila lamin Dmo gene no intron position coincides with an intron in the sea anemone lamin gene.  相似文献   

11.
CpG islands in vertebrate genomes   总被引:120,自引:0,他引:120  
  相似文献   

12.
13.
Ⅱ组内含子(group Ⅱ intron)存在于原生生物、真菌、藻类、植物细胞器以及细菌和古细菌基因组中.在体内,Ⅱ组内含子可通过两步连续的转酯反应从前体RNA中自剪接,并连接两 侧外显子.许多Ⅱ组内含子的剪接反应是由蛋白质辅助完成的,这种蛋白质有的是由内含子编码,有的是由宿主基因编码.Ⅱ组内含子能够有效地归巢进入无内含子的等位基因,也能 够以低频率逆转座进入非等位基因.转座过程依赖内含子RNA和内含子编码的蛋白质(内切核酸酶活性和逆转录酶活性).本论文在总结Ⅱ组内含子最新研究成果的基础上,分析Ⅱ组内含子可能的起源和进化途径  相似文献   

14.
15.
16.

Background

Insertions of spliceosomal introns are very rare events during evolution of vertebrates and the mechanisms governing creation of novel intron(s) remain obscure. Largely, gene structures of melanocortin (MC) receptors are characterized by intron-less architecture. However, recently a few exceptions have been reported in some fishes. This warrants a systematic survey of MC receptors for understanding intron insertion events during vertebrate evolution.

Methodology/Principal Findings

We have compiled an extended list of MC receptors from different vertebrate genomes with variations in fishes. Notably, the closely linked MC2Rs and MC5Rs from a group of ray-finned fishes have three and one intron insertion(s), respectively, with conserved positions and intron phase. In both genes, one novel insertion was in the highly conserved DRY motif at the end of helix TM3. Further, the proto-splice site MAG↑R is maintained at intron insertion sites in these two genes. However, the orthologs of these receptors from zebrafish and tetrapods are intron-less, suggesting these introns are simultaneously created in selected fishes. Surprisingly, these novel introns are traceable only in four fish genomes. We found that these fish genomes are severely compacted after the separation from zebrafish. Furthermore, we also report novel intron insertions in P2Y receptors and in CHRM3. Finally, we report ultrasmall introns in MC2R genes from selected fishes.

Conclusions/Significance

The current repository of MC receptors illustrates that fishes have no MC3R ortholog. MC2R, MC5R, P2Y receptors and CHRM3 have novel intron insertions only in ray-finned fishes that underwent genome compaction. These receptors share one intron at an identical position suggestive of being inserted contemporaneously. In addition to repetitive elements, genome compaction is now believed to be a new hallmark that promotes intron insertions, as it requires rapid DNA breakage and subsequent repair processes to gain back normal functionality.  相似文献   

17.
18.
We examined the gene structure of a set of 2563 Arabidopsis thaliana paralogous pairs that were duplicated simultaneously 20-60 MYA by tetraploidy. Out of a total of 23,164 introns in these genes, we found that 10,004 pairs have been conserved and 578 introns have been inserted or deleted in the time since the duplication event. This intron insertion/deletion rate of 2.7 x 10(-3) to 9.1 x 10(-4) per site per million years is high in comparison to previous studies. At least 56 introns were gained and 39 lost based on parsimony analysis of the phylogenetic distribution of these introns. We found weak evidence that genes undergoing intron gain and loss are biased with respect to gene ontology terms. Gene pairs that experienced at least 2 intron insertions or deletions show evidence of enrichment for membrane location and transport and transporter activity function. We do not find any relationship of intron flux to expression level or G + C content of the gene. Detection of a bias in the location of intron gains and losses within a gene depends on the method of measurement: an intragene method indicates that events (specifically intron losses) are biased toward the 3' end of the gene. Despite the relatively recent acquisition of these introns, we found only one case where we could identify the mechanism of intron origin--the TOUCH3 gene has experienced 2 tandem, partial, internal gene duplications that duplicated a preexisting intron and also created a novel, alternatively spliced intron that makes use of a duplicated pair of cryptic splice sites.  相似文献   

19.
Irimia M  Roy SW 《PLoS genetics》2008,4(8):e1000148
The presence of spliceosomal introns in eukaryotes raises a range of questions about genomic evolution. Along with the fundamental mysteries of introns' initial proliferation and persistence, the evolutionary forces acting on intron sequences remain largely mysterious. Intron number varies across species from a few introns per genome to several introns per gene, and the elements of intron sequences directly implicated in splicing vary from degenerate to strict consensus motifs. We report a 50-species comparative genomic study of intron sequences across most eukaryotic groups. We find two broad and striking patterns. First, we find that some highly intron-poor lineages have undergone evolutionary convergence to strong 3' consensus intron structures. This finding holds for both branch point sequence and distance between the branch point and the 3' splice site. Interestingly, this difference appears to exist within the genomes of green alga of the genus Ostreococcus, which exhibit highly constrained intron sequences through most of the intron-poor genome, but not in one much more intron-dense genomic region. Second, we find evidence that ancestral genomes contained highly variable branch point sequences, similar to more complex modern intron-rich eukaryotic lineages. In addition, ancestral structures are likely to have included polyT tails similar to those in metazoans and plants, which we found in a variety of protist lineages. Intriguingly, intron structure evolution appears to be quite different across lineages experiencing different types of genome reduction: whereas lineages with very few introns tend towards highly regular intronic sequences, lineages with very short introns tend towards highly degenerate sequences. Together, these results attest to the complex nature of ancestral eukaryotic splicing, the qualitatively different evolutionary forces acting on intron structures across modern lineages, and the impressive evolutionary malleability of eukaryotic gene structures.  相似文献   

20.
Mitochondria are descendants of the endosymbiotic α-proteobacterium most likely engulfed by the ancestral eukaryotic cells, and the proto-mitochondrial genome should have been severely streamlined in terms of both genome size and gene repertoire. In addition, mitochondrial (mt) sequence data indicated that frequent intron gain/loss events contributed to shaping the modern mt genome organizations, resulting in the homologous introns being shared between two distantly related mt genomes. Unfortunately, the bulk of mt sequence data currently available are of phylogenetically restricted lineages, i.e., metazoans, fungi, and land plants, and are insufficient to elucidate the entire picture of intron evolution in mt genomes. In this work, we sequenced a 12 kbp-fragment of the mt genome of the katablepharid Leucocryptos marina. Among nine protein-coding genes included in the mt genome fragment, the genes encoding cytochrome b and cytochrome c oxidase subunit I (cob and cox1) were interrupted by group I introns. We further identified that the cob and cox1 introns host open reading frames for homing endonucleases (HEs) belonging to distantly related superfamilies. Phylogenetic analyses recovered an affinity between the HE in the Leucocryptos cob intron and two green algal HEs, and that between the HE in the Leucocryptos cox1 intron and a fungal HE, suggesting that the Leucocryptos cob and cox1 introns possess distinct evolutionary origins. Although the current intron (and intronic HE) data are insufficient to infer how the homologous introns were distributed to distantly related mt genomes, the results presented here successfully expanded the evolutionary dynamism of group I introns in mt genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号