首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Concerted and divergent evolution within the rat gamma-crystallin gene family   总被引:11,自引:0,他引:11  
The nucleotide sequences of six rat gamma-crystallin genes have been determined. All genes have the same mosaic structure: the first exons contain a relatively short (25 to 44 base-pair) 5' non-coding region and the first nine base-pairs of the coding sequence, the second exons encode protein motifs I and II, while protein motifs III and IV are encoded by the third exons. The third exons also contain a 60 to 67-base-pair long 3' non-coding region. In the gamma 1-2 gene, the splice acceptor site of the third exon has been shifted three base-pairs upstream. Hence, the protein product of this gene is one amino acid residue longer. The first introns, though varying in length from 85 to 100 base-pairs, are conserved in sequence. The second introns vary considerably in length (0.9 X 10(3) to 1.9 X 10(3) base-pairs) and sequence. The second exons of the genes show concerted evolution and have undergone multiple gene conversions. In contrast, the third exons show divergent evolution. From the sequences of the third exons, an evolutionary tree of the gene family was constructed. This tree suggests that three of the present genes derive directly from the genes that originated from a tandem duplication of a two-gene cluster. Two duplications of the last gene of the four-gene cluster then yielded the other three genes. Region a' of the third exon, encoding protein motif III, is variable, while the region encoding protein motif IV (b') is constant. We postulate that this variability in region a' is due to a period of radiation after each gene duplication. A comparison of the rat sequences with those of orthologous sequences from other species shows that the variation in region a' is now preserved. Hence, it might specify the specific functional property of each gamma-crystallin protein within the lens.  相似文献   

2.
Structure and evolution of the bovine prothrombin gene   总被引:6,自引:0,他引:6  
The cloned bovine prothrombin gene has been characterized by partial DNA sequence analysis, including the 5' and 3' flanking sequences and all the intron-exon junctions. The gene is approximately 15.4 x 10(3) base-pairs in length and comprises 14 exons interrupted by 13 introns. The exons coding for the prepro-leader peptide and the gamma-carboxyglutamic acid-containing region are similar in organization to the corresponding exons in the factor IX and protein C genes. This region has probably evolved as a result of recent gene duplication and exon shuffling events. The exons coding for the kringles and the serine protease region of the prothrombin gene are different in organization from the homologous regions in other genes, suggesting that introns have been inserted into these regions after the initial gene duplication events.  相似文献   

3.
Nucleotide sequence of the gene for human factor IX (antihemophilic factor B)   总被引:97,自引:0,他引:97  
Two different human genomic DNA libraries were screened for the gene for blood coagulation factor IX by employing a cDNA for the human protein as a hybridization probe. Five overlapping lambda phages were identified that contained the gene for factor IX. The complete DNA sequence of about 38 kilobases for the gene and the adjacent 5' and 3' flanking regions was established by the dideoxy chain termination and chemical degradation methods. The gene contained about 33.5 kilobases of DNA, including seven introns and eight exons within the coding and 3' noncoding regions of the gene. The eight exons code for a prepro leader sequence and 415 amino acids that make up the mature protein circulating in plasma. The intervening sequences range in size from 188 to 9473 nucleotides and contain four Alu repetitive sequences, including one in intron A and three in intron F. A fifth Alu repetitive sequence was found immediately flanking the 3' end of the gene. A 50 base pair insert in intron A was found in a clone from one of the genomic libraries but was absent in clones from the other library. Intron A as well as the 3' noncoding region of the gene also contained alternating purine-pyrimidine sequences that provide potential left-handed helical DNA or Z-DNA structures for the gene. KpnI repetitive sequences were identified in intron D and the region flanking the 5' end of the gene. The 5' flanking region also contained a 1.9-kb HindIII subfamily repeat. The seven introns in the gene for factor IX were located in essentially the same position as the seven introns in the gene for human protein C, while the first three were found in positions identical with those in the gene for human prothrombin.  相似文献   

4.
5.
Structure and sequence of the human gene for tyrosine aminotransferase (TAT) was determined by analysis of cDNA and genomic clones. The gene extends over 10.9 kbl and consists of 12 exons giving rise to a 2,754 nucleotide long mRNA (excluding the poly(A)tail). The human TAT gene is predicted to code for a 454 amino acid protein of molecular weight 50,399 dalton. The overall sequence identity within the coding region of the human and the previously characterized rat TAT genes is 87% at the nucleotide and 92% at the protein level. A minor human TAT mRNA results from the use of an alternative polyadenylation signal in the 3' exon which is present but not used at the corresponding position in the rat TAT gene. The non-coding region of the 3' exon contains a complete Alu element which is absent in the rat TAT gene but present in apes and old world monkeys. Two functional glucocorticoid response elements (GREs) reside 2.5 kb upstream of the rat TAT gene. The DNA sequence of the corresponding region of the human TAT gene shows the distal GRE mutated and the proximal GRE replaced by Alu elements.  相似文献   

6.
Human blood-coagulation factor X (hBCFX) is a serine protease zymogen which participates in the middle phase of blood coagulation. Recently, we and others have reported the cDNA sequences. At present, partial hBCFX gene structure is available. In this paper, we report the isolation of two genomic clones, the X-emb lambda phage clone encoding exons 1 and 2 of the hBCFX gene, and the X-cos cosmid clone encoding exons 2-8. The restriction map of the hBCFX region spans 55 kb. The gene itself was found to be 27 kb long. The sequence of the 5' region of the hBCFX gene has been determined and reveals an ATTTG pentanucleotide, which also occurs in a similar location in the genes encoding factors VII, IX, protein CC and C prothrombin, suggesting that this motif might be of importance in the regulation of these genes.  相似文献   

7.
The eye lens contains a structural protein, alpha crystallin, composed of two homologous primary gene products alpha A2 and alpha B2. In certain rodents, still another alpha crystallin polypeptide, alpha AIns, occurs, which is identical to alpha A2 except that it contains an insertion peptide between residues 63 and 64. In this paper we describe the complete alpha A crystallin gene that has been cloned from DNA isolated from Syrian golden hamster. Evidence is provided that the alpha A gene is present as a single copy in the hamster genome. The detailed organization of the gene has been established by means of DNA sequence analysis and S1 nuclease mapping, revealing that the gene consists of four exons. The first exon contains the information for the 68 base-pair long 5' non-coding region as well as the coding information for the first 63 amino acids. The second exon encodes the 23 amino acid insertion sequence, the third exon codes for amino acid 87 to 127 of the alpha AIns chain, whereas the last exon encodes the C-terminal 69 amino acids and contains the information for the 523 base-pair long 3' non-coding region. The second exon is bordered by a 3' splice junction (A X G/G X C), which deviates from the consensus for donor splice sites (A X G/G X T). This deviation is found in both hamster and mouse. An internal duplication was detected in the first exon by using a DIAGON-generated matrix for comparison. By means of similar DIAGON-generated matrices it was confirmed that the amino acids coded for by the third and fourth exons are homologous to the small heat-shock proteins of Drosophila, Caenorhabditis and soyabean. The implications of the differential splicing and the evolutionary aspects of the detected homologies are discussed.  相似文献   

8.
9.
10.
11.
The complete nucleotide sequence and exon/intron structure of the rat embryonic skeletal muscle myosin heavy chain (MHC) gene has been determined. This gene comprises 24 X 10(3) bases of DNA and is split into 41 exons. The exons encode a 6035 nucleotide (nt) long mRNA consisting of 90 nt of 5' untranslated, 5820 nt of protein coding and 125 nt of 3' untranslated sequence. The rat embryonic MHC polypeptide is encoded by exons 3 to 41 and contains 1939 amino acid residues with a calculated Mr of 223,900. Its amino acid sequence displays the structural features typical for all sarcomeric MHCs, i.e. an amino-terminal "globular" head region and a carboxy-terminal alpha-helical rod portion that shows the characteristics of a coiled coil with a superimposed 28-residue repeat pattern interrupted at only four positions by "skip" residues. The complex structure of the rat embryonic MHC gene and the conservation of intron locations in this and other MHC genes are indicative of a highly split ancestral sarcomeric MHC gene. Introns in the rat embryonic gene interrupt the coding sequence at the boundaries separating the proteolytic subfragments of the head, but not at the head/rod junction or between the 28-residue repeats present within the rod. Therefore, there is little evidence for exon shuffling and intron-dependent evolution by gene duplication as a mechanism for the generation of the ancestral MHC gene. Rather, intron insertion into a previously non-split ancestral MHC rod gene consisting of multiple tandemly arranged 28-residue-encoding repeats, or convergent evolution of an originally non-repetitive ancestral MHC rod gene must account for the observed structure of the rod-encoding portion of present-day MHC genes.  相似文献   

12.
13.
14.
Most eukaryotic cells encode principally a 2.5-kilobase (kb) transforming growth factor (TGF)-beta 1 mRNA. However, we have found two major TGF-beta 1 RNA species, 3.5 and 2.5 kb long, in porcine tissues. The 3.5-kb species has a longer 3'-untranslated sequence generated by the selection of an alternate polyadenylation site. There is a 117-nucleotide sequence within this unique 3' region, which is similar to the PRE-1 repetitive sequence of unknown function, reported earlier in the porcine genome. We have also cloned and characterized an alternately spliced mRNA species specific for the TGF-beta 1 gene, in which exons IV and V of the corresponding human TGF-beta 1 gene are deleted. The nucleotide sequence of this cDNA clone predicts a putative precursor protein of 256 amino acids; the N-terminal 211 amino acids of this putative protein are identical to the TGF-beta 1 precursor protein (exons I, II, and III of the human TGF-beta 1 gene), but the C-terminal 45 amino acids are distinct, due to a frameshift in the translation of exons VI and VII. In addition we provide data for the existence of other mRNA species generated in a tissue-specific manner either by alternate splicing or by heterogeneous 5' leader sequences.  相似文献   

15.
The complete nucleotide sequence of the rat aldolase A isozyme gene, including the 5' and 3' flanking sequences, was determined. The gene comprises ten exons, spans 4827 base-pairs and occurs in a single copy per haploid rat genome. The genomic DNA sequence was compared with those of three species of rat aldolase A mRNA (mRNAs I, II and III) that have been found to differ from each other only in the 5' non-coding region and to be expressed tissue-specifically. It revealed that the first exon (exon M1) encodes the 5' non-coding sequence of mRNA I, while the second exon (exon AH1) encodes those of mRNAs II and III and the following eight exons (exons 2 to 9) are shared commonly by all the mRNA species. These results allowed us to conclude that mRNA I and mRNAs II, III were generated from a single aldolase A gene by alternative usage of exon M1 or exon AH1 in addition to exons 2 to 9. S1 nuclease mapping of the 5' ends of their precursor RNAs suggested that these three mRNA species were transcribed from three different initiation sites on the single gene.  相似文献   

16.
M J Smith 《DNA sequence》1992,2(4):235-240
The gene encoding a C. elegans homologue of the mammalian reticuloplasmin, calreticulin, was cloned and sequenced and the amino-acid sequence of its product deduced. The coding region of the gene comprises three exons separated by introns of 95 and 55 nucleotides, followed by either 158 or 279 bases of 3' non-coding sequence before putative polyadenylation signals. The precursor protein of 395 residues includes an N-terminal signal sequence of 13 residues. The C-terminus has the ER retention signal HDEL preceded by a polyacidic zone similar to known mammalian calreticulins. The sequence shows a 61% identity with mouse calreticulin, increasing to 82% in the proline-rich region of the molecule. Comparison of the C. elegans sequence with the calreticulin-related antigen RAL-1 of Oncocerca volvulus shows 73% identity, excluding the calreticulin C-terminal region. The sequence of this region differs markedly from RAL-1 where the parasite protein has a polybasic stretch and no ER retention signal. The C. elegans gene described here and designated crt-1 was mapped to a region towards the left-hand end of Chromosome V on the physical map of the genome. Southern blotting of genomic DNA indicates that in C. elegans the calreticulin homologue exists in only one form as the product of a single gene.  相似文献   

17.
18.
19.
A Carrier  M D Devignes  M F Rosier  C Auffray 《Gene》1992,116(2):173-179
An NGF cDNA containing the 5' exons of the nerve growth factor (NGF) messenger was obtained from chicken heart mRNA using the anchored polymerase chain reaction technique. Alignment of the chicken with the corresponding murine and human sequences reveals interspecies similarities. A sequence corresponding to an exon found only in the NGF messenger, which is abundant in the submaxillary gland of the male mouse, is present in the chicken NGF cDNA. The first non-coding exons of the NGF gene are much less conserved between chicken and mouse or human than the region of the last exon encoding the mature protein. After the cloning of the chicken NGF gene from a cosmid library, the chicken NGF exons have been located within 20 kb of DNA. The chicken NGF gene is therefore shorter than its murine counterpart which spans more than 43 kb. Furthermore, the organization of the chicken and murine NGF genes markedly differs in their 5' portion.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号