首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Nucleotide sequence of the gene for the b subunit of human factor XIII   总被引:9,自引:0,他引:9  
R E Bottenus  A Ichinose  E W Davie 《Biochemistry》1990,29(51):11195-11209
Factor XIII (Mr 320,000) is a blood coagulation factor that stabilizes and strengthens the fibrin clot. It circulates in blood as a tetramer composed of two a subunits (Mr 75,000 each) and two b subunits (Mr 80,000 each). The b subunit consists of 641 amino acids and includes 10 tandem repeats of 60 amino acids known as GP-I structures, short consensus repeats (SCR), or sushi domains. In the present study, the human gene for the b subunit has been isolated from three different genomic libraries prepared in lambda phage. Fifteen independent phage with inserts coding for the entire gene were isolated and characterized by restriction mapping, Southern blotting, and DNA sequencing. The gene was found to be 28 kilobases in length and consisted of 12 exons (I-XII) separated by 11 intervening sequences. The leader sequence was encoded by exon I, while the carbonyl-terminal region of the protein was encoded by exon XII. Exons II-XI each coded for a single sushi domain, suggesting that the gene evolved through exon shuffling and duplication. The 12 exons in the gene ranged in size from 64 to 222 base pairs, while the introns ranged in size from 87 to 9970 nucleotides and made up 92% of the gene. The introns contained four Alu repetitive sequences, one each in introns A, E, I, and J. A fifth Alu repeat was present in the flanking 3' end of the gene. Two partial KpnI repeats were also found in the introns, including one in intron I and one in intron J. The KpnI repeat in intron J was 89% homologous to a sequence of approximately 2200 nucleotides flanking the gene coding for human beta globin and approximately 3800 nucleotides from the L1 insertion present in the gene for human factor VIII. Intron H also contained an "O" family repeat, while two potential regions for Z-DNA were identified within introns G and J. One nucleotide change was found in the coding region of the gene when its sequence was compared to that of the cDNA. This difference, however, did not result in a change in the amino acid sequence of the protein.  相似文献   

2.
Structure of the murine complement factor H gene   总被引:3,自引:0,他引:3  
Factor H is a regulatory protein of the alternative pathway of complement activation comprised of 20 tandem repeating units of 60 amino acids each. A factor H cDNA clone was used to identify 17 genomic clones from a cosmid library. Four clones were selected for analysis of intron/exon junctions and 5' and 3' regions of the gene and for mapping of the exons. The factor H gene was found to be comprised of 22 exons. Each repeating unit is encoded by one exon, except the second repeat, which is coded by two exons; the leader sequence is encoded by a separate exon. The exons range in size from 77 to 210 base pairs (bp) and average 178 bp. They span a region of approximately 100 kilobases (kb) on chromosome 1. The leader sequence exon is 26 kb upstream of the first repeat exon, representing the largest intron. The other introns range in size from 86 bp to 12.9 kb, and the average intron size is 4.7 kb. Analysis of the genomic organization of the factor H gene has provided insight into the protein structure and will enable the construction of deletion mutants for functional studies.  相似文献   

3.
4.
W. Li  R. K. Herman    J. E. Shaw 《Genetics》1992,132(3):675-689
Mutations in the unc-33 gene of the nematode Caenorhabditis elegans lead to severely uncoordinated movement, abnormalities in the guidance and outgrowth of the axons of many neurons, and a superabundance of microtubules in neuronal processes. We have cloned unc-33 by tagging the gene with the transposable element Tc4. Three unc-33 messages, which are transcribed from a genomic region of at least 10 kb, were identified and characterized. The three messages have common 3' ends and identical reading frames. The largest (3.8-kb) message consists of the 22-nucleotide trans-spliced leader SL1 and 10 exons (I-X); the intermediate-size (3.3-kb) message begins with SL1 spliced to the 5' end of exon V and includes exons V-X; and the smallest (2.8-kb) message begins within exon VII and also includes exons VIII-X. A gamma-ray-induced deletion mutation situated within exon VIII reduces the sizes of all three messages by 0.5 kb. The three putative polypeptides encoded by the three messages overlap in C-terminal sequence but differ by the positions at which their N termini begin; none has significant similarity to any other known protein. A Tc4 insertion in exon VII leads to alterations in splicing that result in three approximately wild-type-size messages: the Tc4 sequence and 28 additional nucleotides are spliced out of the two larger messages; the Tc4 sequence is trans-spliced off the smallest message such that SL1 is added 13 nucleotides upstream of the normal 5' end of the smallest message.  相似文献   

5.
6.
7.
Concerted and divergent evolution within the rat gamma-crystallin gene family   总被引:11,自引:0,他引:11  
The nucleotide sequences of six rat gamma-crystallin genes have been determined. All genes have the same mosaic structure: the first exons contain a relatively short (25 to 44 base-pair) 5' non-coding region and the first nine base-pairs of the coding sequence, the second exons encode protein motifs I and II, while protein motifs III and IV are encoded by the third exons. The third exons also contain a 60 to 67-base-pair long 3' non-coding region. In the gamma 1-2 gene, the splice acceptor site of the third exon has been shifted three base-pairs upstream. Hence, the protein product of this gene is one amino acid residue longer. The first introns, though varying in length from 85 to 100 base-pairs, are conserved in sequence. The second introns vary considerably in length (0.9 X 10(3) to 1.9 X 10(3) base-pairs) and sequence. The second exons of the genes show concerted evolution and have undergone multiple gene conversions. In contrast, the third exons show divergent evolution. From the sequences of the third exons, an evolutionary tree of the gene family was constructed. This tree suggests that three of the present genes derive directly from the genes that originated from a tandem duplication of a two-gene cluster. Two duplications of the last gene of the four-gene cluster then yielded the other three genes. Region a' of the third exon, encoding protein motif III, is variable, while the region encoding protein motif IV (b') is constant. We postulate that this variability in region a' is due to a period of radiation after each gene duplication. A comparison of the rat sequences with those of orthologous sequences from other species shows that the variation in region a' is now preserved. Hence, it might specify the specific functional property of each gamma-crystallin protein within the lens.  相似文献   

8.
The rat alpha- and bovine alpha s1-casein genes have been isolated and their 5' sequences determined. The rat alpha-, beta-, gamma- and bovine alpha s1-casein genes contain similar 5' exon arrangements in which the 5' noncoding, signal peptide and casein kinase phosphorylation sequences are each encoded by separate exons. These findings support the hypothesis that during evolution, the family of casein genes arose by a process involving exon recruitment followed by intragenic and intergenic duplication of a primordial gene. Several highly conserved regions in the first 200 base pairs of the 5' flanking DNA have been identified. Additional sequence homology extending up to 550 base pairs upstream of the CAP site has been found between the rat alpha- and bovine alpha s1-casein sequences. Unexpectedly, the 5' flanking promoter regions are conserved to a greater extent than both the entire mature coding and intron regions of these genes. These conserved 5' flanking sequences may contain potential cis regulatory elements which are responsible for the coordinate expression of the functionally-related casein genes during mammary gland development.  相似文献   

9.
10.
Structure of the gene for human coagulation factor V.   总被引:22,自引:0,他引:22  
L D Cripe  K D Moore  W H Kane 《Biochemistry》1992,31(15):3777-3785
Activated factor V (Va) serves as an essential protein cofactor for the conversion of prothrombin to thrombin by factor Xa. Analysis of the factor V cDNA indicates that the protein contains several types of internal repeats with the following domain structure: A1-A2-B-A3-C1-C2. In this report we describe the isolation and characterization of genomic DNA coding for human factor V. The factor V gene contains 25 exons which range in size from 72 to 2820 bp. The structure of the gene for factor V is similar to the previously characterized gene for factor VIII. Based on the aligned amino acid sequences of the two proteins, 21 of the 24 intron-exon boundaries in the factor V gene occur at the same location as in the factor VIII gene. In both genes, the junctions of the A1-A2 and A2-A3 domains are each encoded by a single exon. In contrast, the boundaries between domains A3-C1 and C1-C2 occur at intron-exon boundaries, which is consistent with evolution through domain duplication and exon shuffling. The connecting region or B domain of factor V is encoded by a single large exon of 2820 bp. The corresponding exon of the factor VIII gene contains 3106 bp. The 5' and 3' ends of both of these exons encode sequences homologous to the carboxyl-terminal end of domain A2 and the amino-terminal end of domain A3 in ceruloplasmin. There is otherwise no homology between the B domain exons.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

11.
12.
13.
A complementary DNA clone for bovine osteonectin was used to isolate the osteonectin gene from two libraries of bovine genomic DNA fragments. Two overlapping clones were obtained whose relationship was determined by restriction mapping and sequence analysis. The two clones contain the entire osteonectin coding region spanning approximately 11 kilobases of genomic DNA. The coding region of the gene was determined, by electron microscopy and DNA sequencing, to reside in nine exons. In addition, there is at least one 5' exon interrupted by an intron in the 5'-nontranslated sequence of the gene. Excluding this 5' exon and the 3'-terminal exon, the exons are small and approximately uniform in size, averaging 130 +/- 17 base pairs. Three of the exons at the 5' end of the gene were sequenced and appear to encode discrete protein domains. For example, the putative exon 2 contains the coding region for the leader peptide of the molecule. The amino-terminal protein sequence was determined for osteonectin extracted from human, rabbit, and chicken bone and compared with those for bovine, mouse, and pig osteonectin. These data suggest that osteonectin is highly conserved between species, interspecies changes being seen primarily at the amino terminus of the protein and specifically in the region encoded by putative exon 3 in the bovine gene.  相似文献   

14.
The complete nucleotide sequence and exon/intron structure of the rat embryonic skeletal muscle myosin heavy chain (MHC) gene has been determined. This gene comprises 24 X 10(3) bases of DNA and is split into 41 exons. The exons encode a 6035 nucleotide (nt) long mRNA consisting of 90 nt of 5' untranslated, 5820 nt of protein coding and 125 nt of 3' untranslated sequence. The rat embryonic MHC polypeptide is encoded by exons 3 to 41 and contains 1939 amino acid residues with a calculated Mr of 223,900. Its amino acid sequence displays the structural features typical for all sarcomeric MHCs, i.e. an amino-terminal "globular" head region and a carboxy-terminal alpha-helical rod portion that shows the characteristics of a coiled coil with a superimposed 28-residue repeat pattern interrupted at only four positions by "skip" residues. The complex structure of the rat embryonic MHC gene and the conservation of intron locations in this and other MHC genes are indicative of a highly split ancestral sarcomeric MHC gene. Introns in the rat embryonic gene interrupt the coding sequence at the boundaries separating the proteolytic subfragments of the head, but not at the head/rod junction or between the 28-residue repeats present within the rod. Therefore, there is little evidence for exon shuffling and intron-dependent evolution by gene duplication as a mechanism for the generation of the ancestral MHC gene. Rather, intron insertion into a previously non-split ancestral MHC rod gene consisting of multiple tandemly arranged 28-residue-encoding repeats, or convergent evolution of an originally non-repetitive ancestral MHC rod gene must account for the observed structure of the rod-encoding portion of present-day MHC genes.  相似文献   

15.
16.
17.
18.
Ten genomic DNA clones encoding the human leukocyte common Ag (LCA, CD45) gene were isolated by screening human genomic DNA libraries with LCA cDNA probes. One genomic DNA clone contains the promoter region and the first two exons, as determined by primer extension analyses and S1 nuclease protection studies as well as nucleotide sequence determination. The first exon does not encode a peptide, while the second exon contains the initiation ATG codon and encodes the signal peptide. The other nine genomic DNA clones, which are separated from the first genomic clone by an unknown distance, are connected and span a total of 73 kb. The nine connected genomic clones encode a total of 31 exons. The 33 exons encoded by these 10 genomic clones account for the entire cDNA sequences including the 5' and 3' untranslated sequences. Exon 3 and exons 7 through 15 encode the extracellular domain sequences that are common to all LCA isoforms. Differential usage of exons 4, 5, and 6, generates at least five distinct LCA isoforms. Exon 16 encodes the transmembrane peptide. The cytoplasmic region of the leukocyte common antigens is composed of two homologous domains. Exons 17 through 24 encode the first domain, and exons 25 through 32 encode the second domain. The comparison of these exons indicated that the homologous domains were generated by duplication of several exons. The most 3' exon (exon 33) encodes the carboxy terminus of the LCA molecules and includes the entire 3' untranslated sequence.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号