Similar articles
 20 similar articles found
1.
A database called the eukaryotic intron database (EID) was developed based on data from GenBank. Statistical analysis of EID shows that it contains 103,848 genes, 478,484 introns, and 582,332 exons, with an average of 4.61 introns and 5.61 exons per gene. Introns of 40-120 nt in length were abundant in the database. Statistical analysis of the data from nine model species showed that, in eukaryotes, higher species do not necessarily have more introns or exons per gene than lower species. Furthermore, characteristics of EID such as intron phase, the distribution of different splice sites, and the relationship between genome size and intron proportion or intron density have been studied.

2.
A database called the eukaryotic intron database (EID) was developed based on data from GenBank. Statistical analysis of EID shows that it contains 103,848 genes, 478,484 introns, and 582,332 exons, with an average of 4.61 introns and 5.61 exons per gene. Introns of 40–120 nt in length were abundant in the database. Statistical analysis of the data from nine model species showed that, in eukaryotes, higher species do not necessarily have more introns or exons per gene than lower species. Furthermore, characteristics of EID such as intron phase, the distribution of different splice sites, and the relationship between genome size and intron proportion or intron density have been studied. Translated from Acta Scientiarum Naturalium Universitatis Sunyatseni, 2005, 44(6): 79–82 [original: 中山大学学报, 2005, 44(6): 79–82]
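
The counts quoted in the abstract above are internally consistent, which a few lines of arithmetic can confirm. The sketch below is an illustration only: the variable names are our own, and the figures are taken directly from the abstract.

```python
# Figures quoted in the EID abstract.
genes = 103_848
introns = 478_484
exons = 582_332

# Per-gene averages reported in the abstract (4.61 and 5.61).
introns_per_gene = introns / genes
exons_per_gene = exons / genes

# A gene with n introns has n + 1 exons, so summed over all genes
# the exon count should exceed the intron count by the gene count.
exon_surplus = exons - introns

print(f"introns per gene: {introns_per_gene:.2f}")
print(f"exons per gene:   {exons_per_gene:.2f}")
print(f"exons - introns == genes: {exon_surplus == genes}")
```

The last check holds exactly, which is what one would expect if every gene in the database is counted with all of its exons and introns.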

3.
Advances in the Exon-Intron Database (EID)   Cited by: 3 (self-citations: 0, citations by others: 3)

4.
Alternative splicing contributes to both gene regulation and protein diversity. To discover broad relationships between regulation of alternative splicing and sequence conservation, we applied a systems approach, using oligonucleotide microarrays designed to capture splicing information across the mouse genome. In a set of 22 adult tissues, we observe differential expression of RNA containing at least two alternative splice junctions for about 40% of the 6,216 alternative events we could detect. Statistical comparisons identify 171 cassette exons whose inclusion or skipping is different in brain relative to other tissues and another 28 exons whose splicing is different in muscle. A subset of these exons is associated with unusual blocks of intron sequence whose conservation in vertebrates rivals that of protein-coding exons. By focusing on sets of exons with similar regulatory patterns, we have identified new sequence motifs implicated in brain and muscle splicing regulation. Of note is a motif that is strikingly similar to the branchpoint consensus but is located downstream of the 5′ splice site of exons included in muscle. Analysis of three paralogous membrane-associated guanylate kinase genes reveals that each contains a paralogous tissue-regulated exon with a similar tissue inclusion pattern. While the intron sequences flanking these exons remain highly conserved among mammalian orthologs, the paralogous flanking intron sequences have diverged considerably, suggesting unusually complex evolution of the regulation of alternative splicing in multigene families.

5.
We have determined the complete sequence of the mitochondrial gene coding for cytochrome b in Saccharomyces douglasii. The gene is 6310 base-pairs long and is interrupted by four introns. The first one (1311 base-pairs) belongs to the group ID of secondary structure, contains a fragment open reading frame with a characteristic GIY ... YIG motif, is absent from Saccharomyces cerevisiae and is inserted in the same site in which introns 1 and 2 are inserted in Neurospora crassa and Podospora anserina, respectively. The next three S. douglasii introns are homologous to the first three introns of S. cerevisiae, are inserted at the same positions and display various degrees of similarity ranging from an almost complete identity (introns 2 and 4) to a moderate one (intron 3). We have compared secondary structures of intron RNAs, and nucleotide and amino acid sequences of cytochrome b exons and intron open reading frames in the two Saccharomyces species. The rules that govern fixation of mutations in exon and intron open reading frames are different: the relative proportion of mutations occurring in synonymous codons is low in some introns and high in exons. The overall frequency of mutations in cytochrome b exons is much smaller than in nuclear genes of yeasts, contrary to what has been found in vertebrates, where mitochondrial mutations are more frequent. The divergence of the cytochrome b gene is modular: various parts of the gene have changed with a different mode and tempo of evolution.

6.
Intron-exon structures of eukaryotic model organisms.   Cited by: 27 (self-citations: 1, citations by others: 27)
To investigate the distribution of intron-exon structures of eukaryotic genes, we have constructed a general exon database comprising all available intron-containing genes and exon databases from 10 eukaryotic model organisms: Homo sapiens, Mus musculus, Gallus gallus, Rattus norvegicus, Arabidopsis thaliana, Zea mays, Schizosaccharomyces pombe, Aspergillus, Caenorhabditis elegans and Drosophila. We purged redundant genes to avoid the possible bias brought about by redundancy in the databases. After discarding those questionable introns that do not contain correct splice sites, the final database contained 17,102 introns, 21,019 exons and 2,903 independent or quasi-independent genes. On average, a eukaryotic gene contains 3.7 introns per kb protein coding region. The exon distribution peaks around 30-40 residues and most introns are 40-125 nt long. The variable intron-exon structures of the 10 model organisms reveal two interesting statistical phenomena, which cast light on some previous speculations. (i) Genome size seems to be correlated with total intron length per gene. For example, invertebrate introns are smaller than those of human genes, while yeast introns are shorter than invertebrate introns. However, this correlation is weak, suggesting that other factors besides genome size may also affect intron size. (ii) Introns smaller than 50 nt are significantly less frequent than longer introns, possibly resulting from a minimum intron size requirement for intron splicing.
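
The abstract above mentions discarding introns that lack correct splice sites but does not state the criterion used. As a minimal sketch, assuming the canonical GT...AG rule (our assumption, not the authors' documented filter), such a purge might look like the following. The function names and the example sequences are hypothetical.

```python
def has_canonical_splice_sites(intron: str) -> bool:
    """Return True if the intron starts with GT and ends with AG.

    Assumption: only the canonical GT...AG boundary is accepted;
    a real pipeline might also allow rarer classes such as GC...AG.
    """
    seq = intron.upper()
    return len(seq) >= 4 and seq.startswith("GT") and seq.endswith("AG")


def filter_introns(introns: list[str]) -> list[str]:
    """Keep only introns whose boundaries pass the canonical check."""
    return [seq for seq in introns if has_canonical_splice_sites(seq)]


# Made-up example sequences, for illustration only.
candidates = ["GTAAGTTTTCAG", "ATAAGTTTTCAC", "gtccag"]
kept = filter_introns(candidates)
print(kept)
```

Case is normalised before the check, so lower-case (for example soft-masked) sequence is treated the same as upper-case sequence.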

7.
Human red and green visual pigment genes are X-linked duplicate genes. To study their evolutionary history, introns 2 and 4 (1,987 and 1,552 bp, respectively) of human red and green pigment genes were sequenced. Surprisingly, we found that intron 4 sequences of these two genes are identical and that the intron 2 sequences differ by only 0.3%. The low divergences are unexpected because the duplication event producing the two genes is believed to have occurred before the separation of the human and Old World monkey (OWM) lineages. Indeed, the divergences in the two introns are significantly lower than both the synonymous divergence (3.2% ± 1.1%) and the nonsynonymous divergence (2.0% ± 0.5%) in the coding sequences (exons 1-6). A comparison of partial sequences of exons 4 and 5 of human and OWM red and green pigment genes supports the hypothesis that the gene duplication occurred before the human-OWM split. In conclusion, the high similarities in the two intron sequences might be due to very recent gene conversion, probably during evolution of the human lineage.

8.
9.
10.
From a genomic library of Xenopus laevis, two genes coding for different preprocaeruleins have been isolated and sequenced. These correspond to the type I and type III precursors analyzed previously at the cDNA level [Richter, K., Egger, R. and Kreil, G. (1986) J. Biol. Chem. 261, 3676-3680]. The type III gene comprises eight exons; the type I apparently contains eight exons as well, of which six have been sequenced. The genetic information for the decapeptide caerulein is present on small exons of 45 base pairs. The two genes are highly homologous in their 5'-flanking region, the exon/intron boundaries, and long stretches of intron sequences. A possible scheme for the evolution of this small family of genes through exon and gene duplications is presented. In the type I gene, in place of one of the caerulein exons, a potential exon with conserved splice sites was discovered. If expressed in some frog cells, this exon would code for a new peptide 60% homologous to caerulein.

11.
12.
We have cloned the thyroglobulin (Tg) gene of normal goats and goitrous goats which have a Tg synthesis defect. At the 5'-end of the gene, we studied cosmid clones covering a region from 20 kilobases (kb) upstream from the Tg gene to 42 kb into it. Electron microscopy and restriction mapping show that this part of the gene contains 20 exons of 90-1190 bp, in total 4.9 kb of exonic information (56% of the mRNA) split by 19 introns of 150-9100 bp. The exons comprise 12% of the 5' sequences cloned. At the 3'-end, 55 kb were cloned, containing 10 kb of the gene which comprises only 3 exons of 550 bp in total. Sequence analysis of the 3'-end of the normal and abnormal Tg genes has revealed one transition mutation 3' to the reading frame in a stem-loop structure region of the last exon near the poly(A) addition site. Analysis of the promoter site and the first 5 exons has revealed only one difference between the normal and goitrous Tg genes: a Ser→Leu transition in exon 5. We also found an insertion in the fifth intron of the abnormal gene.

13.
ADAM is a recently discovered gene family that encodes proteins with a disintegrin and metalloproteinase. ADAMTS-1 is a gene encoding a new member protein of the ADAM family with the thrombospondin (TSP) type I motif, the expression of which is associated with inflammatory processes. In the present study, we have characterized the exon/intron organization of the mouse ADAMTS-1 gene. The ADAMTS-1 gene is composed of nine exons, all of which are present within the 9.2-kb genomic region. Among the nine exons, exons 1, 5, and 6 encode a proprotein domain, a disintegrin-like domain, and a TSP type I motif, respectively, of the ADAMTS-1 protein, suggesting that there is a correlation between exon/intron organization and functional domains. In addition, the exon/intron organization of the ADAMTS-1 gene is very different from that of the metalloproteinase-like/disintegrin-like/cysteine-rich protein gene (MDC) (ADAM11), suggesting that the genomic structure of ADAM family genes is not necessarily conserved. Furthermore, fluorescence in situ hybridization revealed that the ADAMTS-1 gene is located in region C3–C5 of chromosome 16, to which none of the previously identified ADAM genes have been mapped.

14.
15.
Concerted and divergent evolution within the rat gamma-crystallin gene family   Cited by: 11 (self-citations: 0, citations by others: 11)
The nucleotide sequences of six rat gamma-crystallin genes have been determined. All genes have the same mosaic structure: the first exons contain a relatively short (25 to 44 base-pair) 5' non-coding region and the first nine base-pairs of the coding sequence, the second exons encode protein motifs I and II, while protein motifs III and IV are encoded by the third exons. The third exons also contain a 60 to 67-base-pair long 3' non-coding region. In the gamma 1-2 gene, the splice acceptor site of the third exon has been shifted three base-pairs upstream. Hence, the protein product of this gene is one amino acid residue longer. The first introns, though varying in length from 85 to 100 base-pairs, are conserved in sequence. The second introns vary considerably in length (0.9 × 10^3 to 1.9 × 10^3 base-pairs) and sequence. The second exons of the genes show concerted evolution and have undergone multiple gene conversions. In contrast, the third exons show divergent evolution. From the sequences of the third exons, an evolutionary tree of the gene family was constructed. This tree suggests that three of the present genes derive directly from the genes that originated from a tandem duplication of a two-gene cluster. Two duplications of the last gene of the four-gene cluster then yielded the other three genes. Region a' of the third exon, encoding protein motif III, is variable, while the region encoding protein motif IV (b') is constant. We postulate that this variability in region a' is due to a period of radiation after each gene duplication. A comparison of the rat sequences with those of orthologous sequences from other species shows that the variation in region a' is now preserved. Hence, it might specify the specific functional property of each gamma-crystallin protein within the lens.

16.
The sequences encoding the 5'-ends of three chicken fast-white myosin heavy chain (MHC) genes have been determined. When compared with the sequences of two other MHC genes it is apparent that both the exon and intron positions are conserved. All exon sequences are highly conserved; there is absolute amino acid conservation in the second and third exons. In addition, while the first and third introns diverge among the genes, the second intron is highly conserved between the five. This intron contains a 24-bp sequence that is repeated twice in one of the introns and once in the other four. Analyses indicate that this sequence, which is partially homologous to 7SL RNA, appears to be largely restricted to the MHC gene family. Analysis of the 5'-flanking sequences shows that while small homologies are present between some of the genes, they have extensively diverged in this region.

17.
H. H. Lin, D. K. Ann. Genomics, 1991, 10(1): 102-113

18.
The mitochondrial cob-box gene coding for apocytochrome b in yeast has five introns and six exons or two introns and three exons depending on the wild-type strain considered. Some intron mutations in this gene affect not only its expression but also that of another mitochondrial gene: oxi3. To understand better the function of introns in gene expression, we have constructed a series of new strains that differ only by the presence or absence of one of the five wild-type introns in the cytochrome b gene, the rest of the mitochondrial and nuclear genome remaining unchanged. All constructions result from in vivo recombination events between rho- donor and rho+ recipient mtDNA. The following genes have been constructed: [see text]. Interestingly, all the genes lead to the synthesis of cytochrome b, while only the genes having the intron bI4 allow the expression of oxi3. A nuclear gene, when mutated, can compensate for the absence of the intron bI4.

19.
20.