首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 27 毫秒
1.
A recombinant phage, SpC3, containing a 17 kb genomic DNA insert representing approximately 60% of the 3' portion of the sheep collagen alpha 2 gene, was evaluated by electron microscopic R loop analysis. A minimum of 17 intervening sequences (introns) and 18 alpha 2 coding sequences (exons) were mapped. With the exception of the 850 base pair exon located at the extreme 3' end of the insert, all exons contained 250 base pairs or less. The total length of all the exons in SpC3 was 3,014 base pairs. The length distribution of the 17 introns ranged from 300 to 1600 base pairs; together, all of the introns comprised 14,070 base pairs of SpC3 DNA. Thus, the DNA region required for coding the interspersed 3 kb of alpha 2 collagen genetic information was 5.6 fold longer than the corresponding alpha 2 mRNA coding sequences.  相似文献   

2.
3.
Nucleotide sequence of the gene for the b subunit of human factor XIII   总被引:9,自引:0,他引:9  
R E Bottenus  A Ichinose  E W Davie 《Biochemistry》1990,29(51):11195-11209
Factor XIII (Mr 320,000) is a blood coagulation factor that stabilizes and strengthens the fibrin clot. It circulates in blood as a tetramer composed of two a subunits (Mr 75,000 each) and two b subunits (Mr 80,000 each). The b subunit consists of 641 amino acids and includes 10 tandem repeats of 60 amino acids known as GP-I structures, short consensus repeats (SCR), or sushi domains. In the present study, the human gene for the b subunit has been isolated from three different genomic libraries prepared in lambda phage. Fifteen independent phage with inserts coding for the entire gene were isolated and characterized by restriction mapping, Southern blotting, and DNA sequencing. The gene was found to be 28 kilobases in length and consisted of 12 exons (I-XII) separated by 11 intervening sequences. The leader sequence was encoded by exon I, while the carbonyl-terminal region of the protein was encoded by exon XII. Exons II-XI each coded for a single sushi domain, suggesting that the gene evolved through exon shuffling and duplication. The 12 exons in the gene ranged in size from 64 to 222 base pairs, while the introns ranged in size from 87 to 9970 nucleotides and made up 92% of the gene. The introns contained four Alu repetitive sequences, one each in introns A, E, I, and J. A fifth Alu repeat was present in the flanking 3' end of the gene. Two partial KpnI repeats were also found in the introns, including one in intron I and one in intron J. The KpnI repeat in intron J was 89% homologous to a sequence of approximately 2200 nucleotides flanking the gene coding for human beta globin and approximately 3800 nucleotides from the L1 insertion present in the gene for human factor VIII. Intron H also contained an "O" family repeat, while two potential regions for Z-DNA were identified within introns G and J. One nucleotide change was found in the coding region of the gene when its sequence was compared to that of the cDNA. This difference, however, did not result in a change in the amino acid sequence of the protein.  相似文献   

4.
The bovine prothrombin gene was characterized by Southern blot analysis of bovine genomic DNA using bovine prothrombin cDNA fragments as hybridization probes. These analyses suggested that the bovine genome contains a single prothrombin gene that is at least 10 kilobase pairs (kbp) in size. To characterize the gene more thoroughly, two bovine genomic phage libraries were screened by using prothrombin cDNAs as hybridization probes. Heteroduplex analysis of the cloned genomic DNA and cDNA showed that the prothrombin gene is 14.9 kbp in size and contains at least 14 exons interrupted by 13 introns. The exons vary in size from 28 to 317 base pairs (bp), while the introns vary in size from less than 100 to 6940 bp. Regions of self-complementarity were observed within some of the introns, suggesting the presence of inverted repeat sequences. The bovine prothrombin gene shows similarities in structure to both the human prothrombin gene and the human factor IX gene.  相似文献   

5.
Nucleotide sequence of the gene for human factor IX (antihemophilic factor B)   总被引:97,自引:0,他引:97  
Two different human genomic DNA libraries were screened for the gene for blood coagulation factor IX by employing a cDNA for the human protein as a hybridization probe. Five overlapping lambda phages were identified that contained the gene for factor IX. The complete DNA sequence of about 38 kilobases for the gene and the adjacent 5' and 3' flanking regions was established by the dideoxy chain termination and chemical degradation methods. The gene contained about 33.5 kilobases of DNA, including seven introns and eight exons within the coding and 3' noncoding regions of the gene. The eight exons code for a prepro leader sequence and 415 amino acids that make up the mature protein circulating in plasma. The intervening sequences range in size from 188 to 9473 nucleotides and contain four Alu repetitive sequences, including one in intron A and three in intron F. A fifth Alu repetitive sequence was found immediately flanking the 3' end of the gene. A 50 base pair insert in intron A was found in a clone from one of the genomic libraries but was absent in clones from the other library. Intron A as well as the 3' noncoding region of the gene also contained alternating purine-pyrimidine sequences that provide potential left-handed helical DNA or Z-DNA structures for the gene. KpnI repetitive sequences were identified in intron D and the region flanking the 5' end of the gene. The 5' flanking region also contained a 1.9-kb HindIII subfamily repeat. The seven introns in the gene for factor IX were located in essentially the same position as the seven introns in the gene for human protein C, while the first three were found in positions identical with those in the gene for human prothrombin.  相似文献   

6.
We have determined the nucleotide sequence of 4508 base pairs of human genomic DNA which contain the human serine esterase gene from cytotoxic T lymphocytes (SECT) (equivalent to the 1-3E cDNA clone) and include 879 bp of 5' flanking DNA and 393 bp of 3' flanking DNA. The gene consists of five exons of 88, 148, 136, 261, and 257 nucleotides separated by four introns of 1043, 455, 205, and 643 nucleotides. The location of introns with respect to protein coding sequences in the SECT gene is identical to that of the human cathepsin G and murine granzyme B genes. Comparison of SECT gene exonic sequences to murine granzyme B-F cDNA sequences indicates similarities of 75 and 72% for granzymes B and C and 61, 59, and 61% for granzymes D, E, and F, respectively. The 5' flanking sequence of the SECT gene showed similarity only to the 5' flanking sequence of the murine granzyme B gene, indicating that these genes are homologous. Comparison of the SECT gene sequence to the human cathepsin G sequence indicated no similarity in the 5' flanking DNA although the exonic sequences show 64% sequence similarity overall and 45% sequence similarity in the respective 3' untranslated regions. These similarities suggest that the SECT and cathepsin G genes are members of the same family of serine protease genes. Evidence from high and low stringency Southern transfer analysis of human genomic DNA indicates the presence of another gene of at least 85% sequence similarity to the SECT gene.  相似文献   

7.
8.
Structure and evolution of the bovine prothrombin gene   总被引:6,自引:0,他引:6  
The cloned bovine prothrombin gene has been characterized by partial DNA sequence analysis, including the 5' and 3' flanking sequences and all the intron-exon junctions. The gene is approximately 15.4 x 10(3) base-pairs in length and comprises 14 exons interrupted by 13 introns. The exons coding for the prepro-leader peptide and the gamma-carboxyglutamic acid-containing region are similar in organization to the corresponding exons in the factor IX and protein C genes. This region has probably evolved as a result of recent gene duplication and exon shuffling events. The exons coding for the kringles and the serine protease region of the prothrombin gene are different in organization from the homologous regions in other genes, suggesting that introns have been inserted into these regions after the initial gene duplication events.  相似文献   

9.
A genomic DNA fragment (gCORE-1), encoding a portion of the cartilage proteoglycan core protein, has been isolated from a phage library using cDNA as a probe. The genomic insert is about 17 kilobase pairs; two BamHI fragments of the insert (1.3 and 4.8 kilobase pairs) contain most of the hybridizable sequences found in the cDNA. Sequence analysis of these fragments shows that they contain a total of five exons that encompass 216 amino acid residues, all of which are identical to those of the corresponding cDNA sequence. Three of the exons, which are adjacent to one another, are very similar to the corresponding exons in the gene of a rat hepatic lectin as well as to an exon in the gene of human pulmonary surfactant-associated protein. There is a strong degree of conservation of amino acid sequences encoded in the three genes, although there is no similarity between their introns. The sizes of the five exons in gCORE-1, except for one (which is indeterminate because only a partial cDNA sequence is available), are less than 184 base pairs, whereas the sizes of the introns range from 218 to greater than 2629 base pairs. Four of the introns interrupt an exon codon at either their donor or acceptor sites, between the first and second nucleotides. Only one intron does not split a codon. Intron and exon boundary sites are in agreement with known consensus sequences for introns. The dispersed distribution and relatively small size of the exons, if representative of the entire gene, suggest that the complete gene which codes for the core protein may be quite sizable.  相似文献   

10.
The human tissue plasminogen activator gene   总被引:28,自引:0,他引:28  
  相似文献   

11.
12.
13.
Structure of the human gene for the proliferating cell nuclear antigen   总被引:35,自引:0,他引:35  
The proliferating cell nuclear antigen (PCNA, cyclin) was originally defined as a nuclear protein whose appearance correlated with the proliferative state of the cell. It is now known to be a co-factor of DNA polymerase delta and to be necessary for DNA synthesis and cell cycle progression. cDNA clones of human PCNA have been isolated and, using one of these cDNA, we have now obtained from a lambda phage library a clone containing the entire human PCNA gene and flanking sequences. The human PCNA gene is a unique copy gene and has 6 exons. It spans, from the cap site to the poly(A) signal 4961 base pairs. We have identified, in the 5'-flanking sequence, a region with promoter activity, a well as other structural elements common to other promoters. An interesting feature of the PCNA gene is the presence of extensive sequence similarities among introns and between introns and exons.  相似文献   

14.
S Han  L A Stuart  S J Degen 《Biochemistry》1991,30(40):9768-9780
A human genomic DNA library was screened by using conditions of reduced stringency with a bovine cDNA probe coding for the kringle domains in prothrombin in order to isolate the human prothrombin gene. Twelve positives were identified, three of which coded for prothrombin (Degen & Davie, 1987). Phage L5 was characterized in more detail because of its strong hybridization to the cDNA probe and its unique restriction map compared to the gene coding for human prothrombin. The gene in L5 was sequenced and found to code for a kringle-containing protein. A human liver cDNA library was screened by using a genomic probe from the gene in L5. cDNAs were isolated that contained sequence identical with regions in the gene in L5. Comparison of the cDNA with the gene indicated that the gene in L5 was composed of 18 exons separated by 17 intervening sequences and is 4690 bp in length. Exons ranged in size from 36 to 242 bp in length while intervening sequences ranged from 77 to 697 bp in length. The putative protein encoded by the gene in L5 contains four kringle domains followed by a serine protease-like domain. This domain structure is identical with that found in hepatocyte growth factor (HGF), although the two proteins are only about 50% identical. On the basis of the similarity of the protein encoded by L5 and HGF, we propose that the putative L5 protein be tentatively called HGF-like protein until a function is identified. The DNA sequence of the gene and cDNA and its translated amino acid sequence were compared against GenBank and NBRF databases. Sequences homologous to DNF15S1 and DNF15S2, human DNF15S2 lung mRNA, and rat acyl-peptide hydrolase were identified in exon 17 to the 3' end of the characterized sequence for the gene. From our results, it is apparent that the gene coding for human HGF-like protein is located at the DNF15S2 locus on human chromosome 3 (3p21). The gene for acyl-peptide hydrolase is 444 bp downstream of the gene coding for HGF-like protein, but on the complementary strand. The DNF15S2 locus has been proposed to code for one or more tumor suppressor genes since this locus is deleted in DNA from small cell lung carcinoma, other lung cancers, renal cell carcinoma, and von Hippel-Lindau syndrome.  相似文献   

15.
16.
17.
Ubiquitin coding sequences were isolated from a human genomic library and two cDNA libraries. One human ubiquitin gene consists of 2055 nucleotides and codes for a polyprotein consisting of 685 amino acid residues. The polyprotein contains nine direct repeats of the ubiquitin amino acid sequence and the last ubiquitin sequence is extended with an additional valyl residue at the C-terminal end. No spacer sequences separate the ubiquitin repeats and the coding regions are not interrupted by intervening sequences. This particular gene is transcribed since cDNAs corresponding to the genomic sequence have been isolated. At least two more types of ubiquitin genes are encoded in the human genome, one coding for an ubiquitin monomer while another presumably codes for three or four direct repeats of the ubiquitin sequence. Human DNA contains many copies of the ubiquitin sequence. Ubiquitin is therefore encoded in the human genome as a multigene family.  相似文献   

18.
19.
The histidine tRNA genes of yeast   总被引:9,自引:0,他引:9  
Yeast has at least seven nuclear histidine tRNA genes although there is a single tRNAHis. We have sequenced three of the histidine tRNA genes. The genes have identical coding sequences and the DNA anti-codon sequence GTG corresponds to the GUG anti-codon in tRNAHis. None of the three yeast histidine tRNA genes has an intervening sequence. Two of the three genes contain repeated DNA elements in the region adjacent to the 5' end of the histidine tRNA gene. One of the elements, sigma, is 18 base pairs (bp) from the 5' end of each of these genes, sigma elements are highly conserved and flanked by 5-bp repeats. The other element, delta, is at variable distances from the tRNA gene; one is 439 bp from a histidine tRNA gene and the other is 52 bp from a histidine tRNA gene. These solo delta elements are quite divergent when compared with delta s associated with transposon yeast elements and are not flanked by 5-bp repeats.  相似文献   

20.
The human alpha-fetoprotein gene spans 19,489 base pairs from the putative "Cap" site to the polyadenylation site. It is composed of 15 exons separated by 14 introns, which are symmetrically placed within the three domains of alpha-fetoprotein. In the 5' region, a putative TATAAA box is at position -21, and a variant sequence, CCAAC, of the common CAT box is at -65. Enhancer core sequences GTGGTTTAAAG are found in introns 3 and 4, and several copies of glucocorticoid response sequences AGATACAGTA are found on the template strand of the gene. There are six polymorphic sites within 4690 base pairs of contiguous DNA derived from two allelic alpha-fetoprotein genes. This amounts to a measured polymorphic frequency of 0.13%, or 6.4 X 10(-4)/site, which is about 5-10 times lower than values estimated from studies on polymorphic restriction sites in other regions of the human genome. There are four types of repetitive sequence elements in the introns and flanking regions of the human alpha-fetoprotein gene. At least one of these is apparently a novel structure (designated Xba) and is found as a pair of direct repeats, with one copy in intron 7 and the other in intron 8. It is conceivable that within the last 2 million years the copy in intron 8 gave rise to the repeat in intron 7. Their present location on both sides of exon 8 gives these sequences a potential for disrupting the functional integrity of the gene in the event of an unequal crossover between them. There are three Alu elements, one of which is in intron 4; the others are located in the 3' flanking region. A solitary Kpn repeat is found in intron 3. The Xba and Kpn repeats were only detected by complete sequencing of the introns. Neither X, Xba, nor Kpn elements are present in the related human albumin gene, whereas Alu's are present in different positions. From phylogenetic evidence, it appears that Alu elements were inserted into the alpha-fetoprotein gene at some time postdating the mammalian radiation 85 million years ago.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号