Similar literature
 20 similar articles found (search time: 15 ms)
1.
Genes of Arabidopsis thaliana, Oryza sativa, Caenorhabditis elegans, and Homo sapiens were studied by computer analysis. The average intron and exon lengths in genes of these organisms decrease as the number of introns per gene increases. In A. thaliana and O. sativa genes, intron and exon lengths change with intron number with a high correlation coefficient. The sum of exon lengths depends linearly on intron number, increasing in proportion to the number of introns in a gene. In human, the average lengths of introns and genes depend on the density of genes in DNA.

2.
The exon-intron structure of human, insect (Drosophila sp.), and dicot plant (Arabidopsis thaliana) genes was considered. In each genome there exists a characteristic intron length. Anomalously long introns were usually the first introns in genes. In each sample there are correlations between the lengths of neighboring exons and between exon lengths and closeness to the consensus of the sites at exon boundaries. Exons and exon pairs containing an integer number of triplets are preferred. These results are relevant to the study of the splicing mechanism and the evolution of introns, as well as to the construction of gene recognition algorithms.

3.
Szalai C  Toth S  Falus A 《Gene》2000,243(1-2):161-166
The exon-intron organization and sequences of the exon-intron boundaries of the human gp130 transmembrane receptor gene have been determined using genomic DNAs as samples. The gp130 gene comprises 17 exons and 16 introns. The positions of the exon-intron boundaries show good correlation to the functional/homology regions of gp130. Exons 3-17 code for the gp130 protein, and each subdomain of the receptor is encoded by a set of exons. The coding potential of exons and the intron phasing of the human gp130 gene conform to the patterns observed previously for other cytokine receptor genes. This supports the notion that the gp130 gene evolved from the same ancestral gene that gave rise to other members of the cytokine receptor family.

4.
Statistics of the exon-intron structure and splicing sites of several diverse eukaryotes were studied. The yeast exon-intron structures have a number of unique features. A yeast gene usually has at most one intron. The branch site is strongly conserved, whereas the polypyrimidine tract is short. Long yeast introns tend to have stronger acceptor sites. In other species the branch site is less conserved and often cannot be determined. In non-yeast samples there is an almost universal correlation between lengths of neighboring exons (all samples excluding protists) and a correlation between lengths of neighboring introns (human, Drosophila, protists). On average, first introns are longer, and anomalously long introns are usually first introns in a gene. There is a universal preference for exons and exon pairs with a (total) length divisible by 3. Introns positioned between codons are preferred, whereas those positioned between the first and second positions in a codon are avoided. The choice of A or G at the third position of the intron (the donor splice sites generally prefer purines at this position) is correlated with the overall GC composition of the gene. In all samples the dinucleotide AG is avoided in the region preceding the acceptor site.
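The notions of intron position relative to codons and of exon lengths divisible by 3, as used in the abstract above, can be illustrated with a minimal Python sketch. It is not taken from the study; the function names and the exon lengths are hypothetical, and it assumes that the lengths cover only the coding part of each exon and that the first coding exon starts at the first base of the start codon.

```python
def intron_phases(coding_exon_lengths):
    """Return the phase of each intron from the ordered coding-exon lengths.

    Phase 0: the intron lies between two codons.
    Phase 1: the intron lies between the first and second base of a codon.
    Phase 2: the intron lies between the second and third base of a codon.
    Assumption: the first exon begins at the first base of the start codon.
    """
    phases = []
    coding_bases = 0
    # No intron follows the last exon, so it is excluded from the loop.
    for length in coding_exon_lengths[:-1]:
        coding_bases += length
        phases.append(coding_bases % 3)
    return phases


def is_multiple_of_three(length):
    """True if a length (one exon, or the sum of an exon pair) is divisible by 3."""
    return length % 3 == 0


# Hypothetical coding-exon lengths in bp, for illustration only.
exon_lengths = [120, 91, 89, 150]
print(intron_phases(exon_lengths))                          # [0, 1, 0]
print(is_multiple_of_three(exon_lengths[1]))                # False
print(is_multiple_of_three(exon_lengths[1] + exon_lengths[2]))  # True
```

In this toy gene the second and third exons are not individually divisible by 3, but the pair is, which is the kind of exon pair the abstract describes as preferred.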

5.
6.
7.
The human erythrocyte alpha-spectrin gene which spans 80 kbp has been cloned from human genomic DNA as overlapping lambda recombinants. The exon-intron junctions were identified and the exons mapped. The gene is encoded by 52 exons whose sizes range from 684 bp to the smallest of 18 bp. The donor and acceptor splice site sequences match the splice site consensus sequences, with the exception of one splice site where a donor sequence begins with -GC. The size and location of exons do not correlate with the 106-amino-acid repeat, except in three locations where the surrounding codons are conserved as well. The lack of correspondence between exons and 106-amino-acid repeat is interpreted to reflect the appearance of a spectrin-like gene from a minigene early in the evolution of eukaryotes. Since current evidence indicates that introns were present in genes before the divergence of prokaryotes and eukaryotes, it is possible that the original distribution of introns within the minigene has been lost by the random deletion of introns from the spectrin gene.

8.
Structure and expression of the gene coding for the human serpin hLS2   Total citations: 3, self-citations: 0, citations by others: 3
We have analyzed genomic clones encoding human leuserpin 2 (hLS2). The gene covers about 14.5 kilobases and consists of 5 exons and 4 introns. The genes coding for hLS2, alpha 1-antitrypsin, alpha 1-antichymotrypsin, and rat angiotensinogen share an equivalent exon-intron structure and therefore constitute a distinct subgroup within the serpin gene family, which otherwise displays a highly variable exon-intron pattern. With the exception of a segment in the second exon, the sequence similarity of the genes coding for hLS2 and alpha 1-antitrypsin extends to all exons including one encoding the 5'-untranslated sequences. The implications of these findings with respect to the genesis of the amino-terminal heterogeneity in the serpin family are discussed.

9.
The complete sequence of 12,851 nucleotides of the mouse lactate dehydrogenase-A (LDH-A) gene has been determined. It includes eight exons, seven introns, and promoter and regulatory regions. The B1 repetitive elements present in introns III and VI are in opposite orientations, and they share 72% sequence homology. The exon-intron organization of the mouse LDH-A gene is compared with the organizations of other dehydrogenase genes, and the molecular evolution of the nicotinamide adenine dinucleotide binding domains is discussed.

10.
Rab11a, Rab11b, and Rab25 in mammals are thought to comprise a subfamily of Rab proteins, although Rab25 has two amino acid differences in its effector domain. We have isolated and characterized the genomic sequences of murine Rab11a and Rab25 and compared them with those of previously characterized mammalian Rab genes. The Rab11a gene spans 29 kb and Rab25 spans 9 kb. The genes have TATA-less promoters, but contain GC-rich areas in their upstream 5' regions. Both genes have 5 exons, with the introns containing characteristic repeats. Rab11a has an unusually long 8.5-kb fourth intron. The Rab11a and Rab25 genes are localized to chromosomes 9C and 3E3/F1, respectively. The overall organization of the Rab11a, Rab11b, and Rab25 genes is similar, with homologous exon-intron boundaries, and differs markedly from those of Rab3A and Rab1A. These results confirm that Rab11a, Rab11b, and Rab25 represent a closely related gene family.

11.
L M Sabatini  Y Z He  E A Azen 《Gene》1990,89(2):245-251
Human statherin (STT) is a low-Mr (43 amino acids) acidic phosphoprotein secreted mainly by salivary glands. It acts as an inhibitor of precipitation of calcium phosphate salts in the oral cavity. DNA (12.2 kb) was isolated from human genomic phage lambda libraries as a series of overlapping clones, and the nucleotide sequence of the STT-encoding gene (STT) was determined. The transcribed region spans 6.5 kb and contains six exons and five introns. Upstream DNA (1.6 kb) was also sequenced and a number of possible regulatory elements were identified. The exon-intron boundaries of the STT gene roughly coincide with the protein-coding regions of the mRNA and with the functional domains of STT. This pattern of organization has been seen in a variety of eukaryotic genes and is consistent with the domain theory of gene evolution.

12.
A 13 kb rat Cu/ZnSOD genomic clone has been purified from a rat liver genomic library and completely characterized by restriction mapping, detailed sequencing and Southern blot analysis. This gene spans approximately 6 kb and contains five exons and four introns. Comparison of rat, mouse, and human Cu/ZnSOD genes reveals a high conservation in genomic organization and exon-intron junctions, including an unusual 5'GC donor sequence at the first intron. The gene contains a TATA box as well as an inverted CCAAT box, a feature common to both the mouse and human genes. Furthermore, several repeats were identified in the 5' promoter region of this gene, and these regulatory elements are also strikingly conserved in these three species.

13.
The organization of two closely clustered genes, Fer1HCH and Fer2LCH, encoding the heavy-chain homolog (HCH) and the light-chain homolog (LCH) subunits of Drosophila melanogaster ferritin is reported here. The 5019-bp sequence of the cluster was assembled from genomic fragments obtained by polymerase chain reaction (PCR) amplification of genomic DNA and from sequences obtained from the Berkeley Drosophila Genome Project (BDGP) (http://www.fruitfly.org). These genes, located at position 99F1, have different exon-intron structures (Fer1HCH has three introns and Fer2LCH has two introns) and are divergently transcribed. Computer analysis of the possibly shared promoter regions revealed the presence of putative metal regulatory elements (MREs), a finding consistent with the upregulation of these genes by iron, and putative NF-kappaB-like binding sites. The structure of two other invertebrate ferritin genes, from the nematode Caenorhabditis elegans (located on chromosomes I and V), was also analyzed. Both nematode genes have two introns, lack iron-responsive elements (IREs), and encode ferritin subunits similar to vertebrate H chains. These findings, along with comparisons of ferritin genes from invertebrates, vertebrates, and plants, suggest that the specialization of ferritin H and L type chains, the complex exon-intron organization of plant and vertebrate genes, and the use of the IRE/iron regulatory protein (IRP) mechanism for regulation of ferritin synthesis are recent evolutionary acquisitions.

14.
We have isolated and sequenced the gene encoding the human embryonic/atrial myosin alkali light chain isoform (MLC-1emb/A). The gene is split into seven exons by six introns; the last exon, as in all MLC isoform genes sequenced to date, is completely 3' untranslated sequence. Comparison of the MLC-1emb/A isoform gene with the other MLC-1 genes showed that the exon-intron arrangement of the human MLC-1emb/A isoform gene is analogous to that of the other MLC-1 type isoform genes. We have also mapped the human MLC-1emb/A isoform gene to the long arm of chromosome 17; the corresponding mouse gene has been mapped to chromosome 11. This gene, together with a number of others such as the collagen(I) alpha 1, galactokinase, and thymidine kinase genes, is part of the largest syntenic group between mouse and man.

15.
During evolution segments of homeothermic genomes underwent a GC content increase. Our analyses reveal that two exon-intron architectures have evolved from an ancestral state of low GC content exons flanked by short introns with a lower GC content. One group underwent a GC content elevation that abolished the differential exon-intron GC content, with introns remaining short. The other group retained the overall low GC content as well as the differential exon-intron GC content, and is associated with longer introns. We show that differential exon-intron GC content regulates exon inclusion level in this group, in which disease-associated mutations often lead to exon skipping. This group's exons also display higher nucleosome occupancy compared to flanking introns and exons of the other group, thus "marking" them for spliceosomal recognition. Collectively, our results reveal that differential exon-intron GC content is a previously unidentified determinant of exon selection and argue that the two GC content architectures reflect the two mechanisms by which splicing signals are recognized: exon definition and intron definition.
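As a rough illustration of what a differential exon-intron GC content could look like in practice, the Python sketch below compares the GC fraction of an exon with that of its flanking intronic sequence. The abstract does not state how the authors measured this quantity (window sizes, weighting, or which flanks were used), so the definition here, the function names, and the toy sequences are all assumptions.

```python
def gc_content(seq):
    """Fraction of G and C bases in a DNA sequence (case-insensitive)."""
    if not seq:
        raise ValueError("sequence must not be empty")
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def differential_gc(exon, upstream_intron, downstream_intron):
    """Exon GC content minus the GC content of the pooled flanking introns.

    Assumed definition for illustration only: a positive value means the
    exon is more GC-rich than the intronic sequence around it.
    """
    flanks = upstream_intron + downstream_intron
    return gc_content(exon) - gc_content(flanks)


# Toy sequences, not real genomic data.
print(differential_gc("GGCC", "ATAT", "ATGC"))  # 0.75
```

Under this assumed definition, a value near zero would correspond to the group in which the exon-intron GC difference was abolished, and a clearly positive value to the group that retained it.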

16.
Correlation of gene and protein structure of rat and human lipocortin I   Total citations: 5, self-citations: 0, citations by others: 5
Lipocortins (annexins) are a family of calcium-dependent phospholipid-binding proteins with phospholipase A2 inhibitory activity. The characteristic primary structure of members of this family consists of a core structure of four or eight repeated domains, which have been implicated in calcium-dependent phospholipid binding. In two lipocortins (I and II) a short amino-terminal sequence distinct from the core structure has potential regulatory functions which are dependent on its phosphorylation state. We have isolated the rat and the human lipocortin I genes and found that they both consist of 13 exons with a striking conservation of their exon-intron structure and their promoter and amino acid sequences. Both lipocortin I genes are at least 19 kbp in length with exons ranging from 57 to 123 bp interrupted by introns as large as 5 kbp. Each of the four repeat units of lipocortin I is encoded by two consecutive exons, while individual exons code for the highly conserved putative calcium-binding domains. The promoter sequences in the rat and in human genes are highly conserved and contain nucleotide sequences characterized as enhancer sequences in other genes. The structure of the lipocortin I gene lends support to the hypothesis that the lipocortin genes arose by a duplication of a single domain.

17.
Human DNA topoisomerase II is essential for chromosome segregation and is the target for several clinically important anticancer agents. It is expressed as genetically distinct alpha and beta isoforms encoded by the TOP2alpha and TOP2beta genes that map to chromosomes 17q21-22 and 3p24, respectively. The genes display different patterns of cell cycle- and tissue-specific expression, with the alpha isoform markedly upregulated in proliferating cells. In addition to the fundamental role of TOP2alpha and TOP2beta genes in cell growth and development, altered expression and rearrangement of both genes are implicated in anticancer drug resistance. Here, we report the complete structure of the human topoisomerase IIalpha gene, which consists of 35 exons spanning 27.5 kb. Sequence data for the exon-intron boundaries were determined and examined in the context of topoisomerase IIalpha protein structure comprising three functional domains associated with energy transduction, DNA breakage-reunion activity and nuclear localization. The organization of the 3' half of human TOP2beta, including sequence specifying the C-terminal nuclear localization domain, was also elucidated. Of the 15 introns identified in this 20 kb region of TOP2beta, the first nine and the last intron align in identical positions and display the same phases as introns in TOP2alpha. Though their extreme 3' ends differ, the striking conservation suggests the two genes diverged recently in evolutionary terms consistent with a gene duplication event. Access to TOP2alpha and TOP2beta gene structures should aid studies of mutations and gene rearrangements associated with anticancer drug resistance.

18.
Regions with a quasiperiodic arrangement of exon-intron sites have been found in the loci of the genes for type I and type VII collagens (with a total length of exons of more than 15% of the entire size of the locus). The periods observed are similar to periods typical for the nucleosomal level of chromatin organization. It was shown that the sites consisting of successively arranged exons and introns form groups involving two to five such regions of the same length. The groups encoding the fibrillar regions of the gene product contain more than 50% of exons. The regions are on average 165 nt long, which is close to the minimal nucleosomal repeat length observed in some regions of the eukaryotic genome. In the nonfibrillar region of the type VII collagen gene, groups of several exon-intron pairs with an average length of 227 nt were identified. The change in the length of exon-intron sites on going from the nonfibrillar to the fibrillar moiety occurs in a jump, which is clearly seen on a periodogram of the locus.

19.
We have cloned a new species of rat Gi alpha (Gi3 alpha) cDNA and genomic DNAs for three distinct human Gi alpha proteins (Gi1 alpha, Gi2 alpha, and Gi3 alpha). Gi3 alpha cDNA codes for a protein of 354 amino acids (Mr 40,522) whose sequence is closely related but distinct from that of the previously isolated rat Gi alpha (Gi2 alpha). By screening the human genomic libraries with the two rat Gi alpha cDNAs as probes, clones encoding human Gi1 alpha, Gi2 alpha, and Gi3 alpha were isolated. The human Gi2 alpha and Gi3 alpha genes are composed of eight coding exons and seven introns and possess a completely identical exon-intron organization. Southern blot analysis indicates that a single copy of each Gi alpha gene is present per haploid human genome.

20.
We conducted a multi-genome analysis correlating protein domain organization with the exon-intron structure of genes in nine eukaryotic genomes. We observed a significant correlation between the borders of exons and domains on a genomic scale for both invertebrates and vertebrates. In addition, we found that the more complex organisms displayed consistently stronger exon-domain correlation, with substantially more significant correlations detected in vertebrates compared with invertebrates. Our observations concur with the principles of exon shuffling theory, including the prediction of predominantly symmetric phase of introns flanking the borders of correlating exons. These results suggest that extensive exon shuffling events during evolution significantly contributed to the shaping of eukaryotic proteomes.
