Similar literature
 20 similar articles found (search time: 187 ms)
1.
2.
Intron-exon structures of eukaryotic model organisms.   Total citations: 28 (self-citations: 1; citations by others: 27)
To investigate the distribution of intron-exon structures of eukaryotic genes, we have constructed a general exon database comprising all available intron-containing genes and exon databases from 10 eukaryotic model organisms: Homo sapiens, Mus musculus, Gallus gallus, Rattus norvegicus, Arabidopsis thaliana, Zea mays, Schizosaccharomyces pombe, Aspergillus, Caenorhabditis elegans and Drosophila. We purged redundant genes to avoid the possible bias brought about by redundancy in the databases. After discarding those questionable introns that do not contain correct splice sites, the final database contained 17 102 introns, 21 019 exons and 2903 independent or quasi-independent genes. On average, a eukaryotic gene contains 3.7 introns per kb protein coding region. The exon distribution peaks around 30-40 residues and most introns are 40-125 nt long. The variable intron-exon structures of the 10 model organisms reveal two interesting statistical phenomena, which cast light on some previous speculations. (i) Genome size seems to be correlated with total intron length per gene. For example, invertebrate introns are smaller than those of human genes, while yeast introns are shorter than invertebrate introns. However, this correlation is weak, suggesting that other factors besides genome size may also affect intron size. (ii) Introns smaller than 50 nt are significantly less frequent than longer introns, possibly resulting from a minimum intron size requirement for intron splicing.

3.
Introns in gene evolution   Total citations: 23 (self-citations: 0; citations by others: 23)
Fedorova L, Fedorov A. Genetica, 2003, 118(2-3): 123-131

4.
Longer first introns are a general property of eukaryotic gene structure   Total citations: 1 (self-citations: 0; citations by others: 1)
Bradnam KR, Korf I. PLoS ONE, 2008, 3(8): e3093

5.
The complete nucleotide sequence and exon/intron structure of the rat embryonic skeletal muscle myosin heavy chain (MHC) gene has been determined. This gene comprises 24 × 10^3 bases of DNA and is split into 41 exons. The exons encode a 6035 nucleotide (nt) long mRNA consisting of 90 nt of 5' untranslated, 5820 nt of protein coding and 125 nt of 3' untranslated sequence. The rat embryonic MHC polypeptide is encoded by exons 3 to 41 and contains 1939 amino acid residues with a calculated Mr of 223,900. Its amino acid sequence displays the structural features typical for all sarcomeric MHCs, i.e. an amino-terminal "globular" head region and a carboxy-terminal alpha-helical rod portion that shows the characteristics of a coiled coil with a superimposed 28-residue repeat pattern interrupted at only four positions by "skip" residues. The complex structure of the rat embryonic MHC gene and the conservation of intron locations in this and other MHC genes are indicative of a highly split ancestral sarcomeric MHC gene. Introns in the rat embryonic gene interrupt the coding sequence at the boundaries separating the proteolytic subfragments of the head, but not at the head/rod junction or between the 28-residue repeats present within the rod. Therefore, there is little evidence for exon shuffling and intron-dependent evolution by gene duplication as a mechanism for the generation of the ancestral MHC gene. Rather, intron insertion into a previously non-split ancestral MHC rod gene consisting of multiple tandemly arranged 28-residue-encoding repeats, or convergent evolution of an originally non-repetitive ancestral MHC rod gene must account for the observed structure of the rod-encoding portion of present-day MHC genes.
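
The length figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable and function names are hypothetical, and it assumes that the 5820 nt coding region includes a single stop codon, which the abstract does not state explicitly.

```python
# Figures quoted in the abstract for the rat embryonic MHC mRNA.
UTR5_NT = 90        # 5' untranslated region
CODING_NT = 5820    # protein-coding region
UTR3_NT = 125       # 3' untranslated region
MRNA_NT = 6035      # reported total mRNA length
RESIDUES = 1939     # reported polypeptide length


def codon_count(coding_length_nt: int) -> int:
    """Return the number of codons in a coding region of the given length."""
    if coding_length_nt % 3 != 0:
        raise ValueError("coding length is not a whole number of codons")
    return coding_length_nt // 3


# The three segments should add up to the reported mRNA length.
total_nt = UTR5_NT + CODING_NT + UTR3_NT

# Assumption: exactly one codon of the coding region is the stop codon.
n_codons = codon_count(CODING_NT)
encoded_residues = n_codons - 1

print(total_nt == MRNA_NT, encoded_residues == RESIDUES)
```

Under that assumption the segment lengths and the residue count are mutually consistent.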

6.
Intron definition in splicing of small Drosophila introns.   Total citations: 4 (self-citations: 1; citations by others: 3)
Approximately half of the introns in Drosophila melanogaster are too small to function in a vertebrate and often lack the pyrimidine tract associated with vertebrate 3' splice sites. Here, we report the splicing and spliceosome assembly properties of two such introns: one with a pyrimidine-poor 3' splice site and one with a pyrimidine-rich 3' splice site. The pyrimidine-poor intron was absolutely dependent on its small size for in vivo and in vitro splicing and assembly. As such, it had properties reminiscent of those of yeast introns. The pyrimidine-rich intron had properties intermediate between those of yeasts and vertebrates. This 3' splice site directed assembly of ATP-dependent complexes when present as either an intron or exon and supported low levels of in vivo splicing of a moderate-length intron. We propose that splice sites can be recognized as pairs across either exons or introns, depending on which distance is shorter, and that a pyrimidine-rich region upstream of the 3' splice site facilitates the exon mode.

7.
8.
Intron loss and gain in Drosophila   Total citations: 1 (self-citations: 0; citations by others: 1)
Although introns were first discovered almost 30 years ago, their evolutionary origin remains elusive. In this work, we used multispecies whole-genome alignments to map Drosophila melanogaster introns onto 10 other fully sequenced Drosophila genomes. We were able to find 1,944 sites where an intron was missing in one or more species. We show that for most (>80%) of these cases, there is no leftover intronic sequence or any missing exonic sequence, indicating exact intron loss or gain events. We used parsimony to classify these differences as 1,754 intron loss events and 213 gain events. We show that lost and gained introns are significantly shorter than average and flanked by longer than average exons. They also display quite distinct phase distributions and show greater than average similarity between the 5' splice site and its 3' partner splice site. Introns that have been lost in one or more species evolve faster than other introns, occur in slowly evolving genes, and are found adjacent to each other more often than would be expected for independent single losses. Our results support the cDNA recombination mechanism of intron loss, suggest that selective pressures affect site-specific loss rates, and show conclusively that intron gain has occurred within the Drosophila lineage, solidifying the "introns-middle" hypothesis and providing some hints about the gain mechanism.

9.
Jenne D, Stanley KK. Biochemistry, 1987, 26(21): 6735-6742
The S-protein/vitronectin gene was isolated from a human genomic DNA library, and its sequence of about 5.3 kilobases including the adjacent 5' and 3' flanking regions was established. Alignment of the genomic DNA nucleotide sequence and the cDNA sequence indicated that the gene consisted of eight exons and seven introns. The intron positions in the S-protein gene and their phase type were compared to those in the hemopexin gene which shares amino acid sequence homologies with transin and the S-protein. Three introns have been found at equivalent positions; two other introns are very close to these positions and are interpreted as cases of intron sliding. Introns 3-7 occur at a conserved glycine residue within repeating peptide segments, whereas introns 1 and 2 are at the boundaries of the Somatomedin B domain of S-protein. The analysis of the exon structure in relation to repeating peptide motifs within the S-protein strongly suggests that it contains only seven repeats, one less than the hemopexin molecule. A repeat pattern very similar to that in hemopexin is also shown to be present in two other related proteins, transin and interstitial collagenase. An evolutionary model for the generation of the repeat pattern in the S-protein and the other members of this novel "pexin" gene family is proposed, and the sequence modifications for some of the repeats during divergent evolution are discussed in relation to known unique functional properties of hemopexin and S-protein.

10.
The exon-intron structure of human, insect (Drosophila sp.), and dicot plant (Arabidopsis thaliana) genes was considered. In each genome there exists a characteristic intron length. Anomalously long introns were usually the first introns in genes. In each sample there are correlations between the lengths of neighboring exons and between exon lengths and closeness to the consensus of the sites at exon boundaries. Exons and exon pairs containing an integer number of triplets are preferred. These results are relevant to the study of splicing mechanism and evolution of introns, as well as construction of gene recognition algorithms.
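
The "integer number of triplets" observation in this abstract can be tested on any list of exon lengths. The following is a minimal sketch, not the authors' method: the function names and the example lengths are hypothetical, and the baseline of one third assumes exon lengths are uniformly distributed modulo 3.

```python
from typing import Sequence


def is_whole_triplets(length_nt: int) -> bool:
    """True if a length in nucleotides is an integer number of triplets."""
    return length_nt % 3 == 0


def fraction_whole_triplet_exons(exon_lengths: Sequence[int]) -> float:
    """Fraction of exons whose length is a multiple of three."""
    if not exon_lengths:
        raise ValueError("no exon lengths given")
    hits = sum(is_whole_triplets(n) for n in exon_lengths)
    return hits / len(exon_lengths)


def fraction_whole_triplet_pairs(exon_lengths: Sequence[int]) -> float:
    """Fraction of neighboring exon pairs whose combined length is a multiple of three."""
    pairs = list(zip(exon_lengths, exon_lengths[1:]))
    if not pairs:
        raise ValueError("at least two exons are needed to form a pair")
    hits = sum(is_whole_triplets(a + b) for a, b in pairs)
    return hits / len(pairs)


# Hypothetical exon lengths (nt) for one gene, in transcript order.
example_lengths = [120, 97, 83, 150, 61]

exon_fraction = fraction_whole_triplet_exons(example_lengths)
pair_fraction = fraction_whole_triplet_pairs(example_lengths)
print(exon_fraction, pair_fraction)
```

A preference of the kind the abstract reports would show up as fractions clearly above one third over a large sample; a single gene, as in this example, says nothing on its own.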

11.
12.
13.
14.
15.
Quirk SM, Bell-Pedersen D, Belfort M. Cell, 1989, 56(3): 455-465
Intron mobility in the T-even phages has been demonstrated. Efficient nonreciprocal conversion of intron minus (In-) alleles to intron plus (In+) occurred for the td and sunY genes, but not for nrdB. Conversion to In+ was absolutely dependent on expression of the respective intron open reading frame (ORF). Introns were inserted at their cognate sites in an intronless phage genome via an RNA-independent, DNA-based, duplicative recombination event that was stimulated by exon homology. The td intron ORF product directs the endonucleolytic cleavage of DNA, targeting the site of intron integration. A 21 nucleotide deletion of the integration site abolished high frequency intron inheritance. These experiments provide a novel example of gene conversion in prokaryotes, while suggesting a molecular rationale for the inconsistent distribution of introns within highly conserved exon contexts of the T-even phage genomes.

16.
17.
Introns are flanked by a partially conserved coding sequence that forms the immediate exon junction sequence following intron removal from pre-mRNA. Phylogenetic evidence indicates that these sequences have been targeted by numerous intron insertions during evolution, but little is known about this process. Here, we test the prediction that exon junction sequences were functional splice sites that existed in the coding sequence of genes prior to the insertion of introns. To do this, we experimentally identified nine cryptic splice sites within the coding sequence of actin genes from humans, Arabidopsis, and Physarum by inactivating their normal intron splice sites. We found that seven of these cryptic splice sites correspond exactly to the positions of exon junctions in actin genes from other species. Because actin genes are highly conserved, we could conclude that at least seven actin introns are flanked by cryptic splice sites, and from the phylogenetic evidence, we could also conclude that actin introns were inserted into these cryptic splice sites during evolution. Furthermore, our results indicate that these insertion events were dependent upon the splicing machinery. Because most introns are flanked by similar sequences, our results are likely to be of general relevance.

18.
As part of the exploratory sequencing program Génolevures, visual scrutinisation and bioinformatic tools were used to detect spliceosomal introns in seven hemiascomycetous yeast species. A total of 153 putative novel introns were identified. Introns are rare in yeast nuclear genes (<5% have an intron), mainly located at the 5′ end of ORFs, and not highly conserved in sequence. They all share a clear non-random vocabulary: conserved splice sites and conserved nucleotide contexts around splice sites. Homologues of metazoan snRNAs and putative homologues of SR splicing factors were identified, confirming that the spliceosomal machinery is highly conserved in eukaryotes. Several introns’ features were tested as possible markers for phylogenetic analysis. We found that intron sizes vary widely within each genome, and according to the phylogenetic position of the yeast species. The evolutionary origin of spliceosomal introns was examined by analysing the degree of conservation of intron positions in homologous yeast genes. Most introns appeared to exist in the last common ancestor of present day yeast species, and then to have been differentially lost during speciation. However, in some cases, it is difficult to exclude a possible sliding event affecting a pre-existing intron or a gain of a novel intron. Taken together, our results indicate that the origin of spliceosomal introns is complex within a given genome, and that present day introns may have resulted from a dynamic flux between intron conservation, intron loss and intron gain during the evolution of hemiascomycetous yeasts.

19.
Introns within introns (twintrons) are known only from the Euglena chloroplast genome. Twintrons are group II or III introns, into which another group II or III intron has been transposed. In this paper we describe a non-Euglena twintron structure within a plastid-encoded chaperone gene (cpn60) of the cryptomonad alga Pyrenomonas salina. In addition, the evolutionary relationships between members of the Cpn60 protein family are determined. Our findings permit the inclusion of cryptomonad plastomes in phylogenetic studies of intron evolution and present further evidence for the origin of modern plastids from a cyanobacterial ancestor.

20.
We have sequenced 14 introns from the ciliate Tetrahymena thermophila and include these in an analysis of the 27 intron sequences available from seven T. thermophila protein-encoding genes. Consensus 5' and 3' splice junctions were determined and found to resemble the junctions of other nuclear pre-mRNA introns. Unique features are noted and discussed. Overall the introns have a mean A + T content of 85% (21% higher than neighbouring exons) with smaller introns tending towards a higher A + T content. Approximately half of the introns are less than 100 bp. Introns from other organisms (approximately 30 of each) were also examined. The introns of Dictyostelium discoideum, Caenorhabditis elegans and Drosophila melanogaster, like those of T. thermophila, have a much higher mean A + T content than their neighbouring exons (greater than 20%). Introns from plants, Neurospora crassa and Schizosaccharomyces pombe also have a significantly higher A + T content (10%-20%). Since a high A + T content is required for intron splicing in plants (58), the elevated A + T content in the introns of these other organisms may also be functionally significant. The introns of yeast (Saccharomyces cerevisiae) and mammals (humans) appear to lack this trait and thus in some aspects may be atypical. The polypyrimidine tract, so distinctive of vertebrate introns, is not a trait of the introns in the non-vertebrate organisms examined in this study.
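
The A + T comparison described in this abstract reduces to a base count per sequence. The sketch below is one possible way to compute it and is not taken from the paper: the function names and the two short sequences are hypothetical, ambiguous bases such as N are ignored, and the intron-versus-exon difference is assumed to be expressed in percentage points.

```python
def at_content(sequence: str) -> float:
    """Fraction of A and T among the unambiguous bases (A, C, G, T) of a sequence."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(b in "AT" for b in bases) / len(bases)


def at_excess(intron: str, exon: str) -> float:
    """A + T fraction of the intron minus that of a neighbouring exon."""
    return at_content(intron) - at_content(exon)


# Hypothetical toy sequences, far shorter than real introns and exons.
toy_intron = "AATTAATTAG"
toy_exon = "ATGGCCGCTA"

intron_at = at_content(toy_intron)
exon_at = at_content(toy_exon)
excess_points = 100 * at_excess(toy_intron, toy_exon)
print(f"intron {intron_at:.0%}, exon {exon_at:.0%}, excess {excess_points:.0f} points")
```

In practice the means reported in the abstract would be obtained by averaging `at_content` over all introns and over their neighbouring exons before taking the difference.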
