首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
The nucleotide sequence of 6225 base pairs (bp) of Euglena gracilis chloroplast DNA including the complete DNA sequence of the chloroplast-encoded ribulose-1,5-bisphosphate carboxylase large subunit gene along with the flanking DNA sequences is presented. The gene is greater than 5.5 kilobase pairs in length and is organized as 10 exons coding for 475 amino acids, separated by 9 introns. The exons range in size from 45 to 438 bp, while the introns range in size from 382 to 568 bp. The introns have highly conserved boundary sequences with the consensus, 5'-N GTGTGGATTT...(intron)...TTAATTTTAT N-3'. The introns are 82-85 mol% AT, with a pronounced T greater than A greater than G greater than C base bias in the RNA-like strand. They do not appear to encode any polypeptides. In addition, the introns have a conserved sequence 30-50 bp from their 3'-ends with the consensus, 5'-TACAGTTTGAAAATGA-3'. The 5'-TACA sequence bears some homology to the 5'-end of the TACTAACA sequence found in a similar location in yeast nuclear mRNA introns. The conserved sequences of the Euglena rbcL introns may be indicative of a splicing mechanism similar to that of eucaryotic nuclear mRNA introns and group II mitochondrial introns.  相似文献   

2.
The complete exon size and distribution pattern in the gene for the alpha 1 chain of human type IV collagen was determined. Clones covering 145 kilobases (kb) of genomic DNA including 100 kb of the gene itself as well as 25 kb upstream and 20 kb downstream of the gene sequences, respectively, were isolated from lambda phage and cosmid libraries. The overall gene structure was determined by endonuclease restriction mapping and R-loop analyses and all exon sizes by nucleotide sequencing. The characterized clones contained all the coding sequences except for exon 2 whose sequence was determined after its amplification by the polymerase chain reaction. There were four gaps in the intron sequences; the exact size of the gene is unknown. The entire gene is at least 100 kb in size and contains 52 exons whose size distribution is completely different from that of the genes for fibrillar collagens. In the -Gly-X-Y- coding region there are three exons of 99, 90, and 45 base pairs (bp) each and two exons of 27, 36, 42, 51, 54, 63, and 84 bp each. The rest of the exons have sizes between 71 and 192 bp in the collagenous region. About one-half of the -Gly-X-Y- repeat coding exons start with the second base for the codon of glycine, whereas the other half starts (with two exceptions) with a complete glycine codon. The distribution of split versus unsplit codons is uneven in that the first 19 exons of the gene start with a complete codon. The gene contains repetitive sequences in several regions. A 185-nucleotide segment containing 40 copies of CCT flanked by poly(C) and poly(T) sequences was shown to be located adjacent to an exon. The gene has previously been shown to be located head-to-head to the alpha 2(IV) collagen gene at the distal end of the long arm of chromosome 13, such that the first exons of the two genes are separated by as little as 42 bp (P?schl, E., Pollner, R., and Kühn, K. (1988) EMBOJ. 7,2687-2695; Soininen, R., Huotari, M., Hostikka, S. L., Prockop, D. J., and Tryggvason, K. (1988) J. Biol. Chem. 263, 17217-17220). The results demonstrate that the human alpha 1(IV) collagen gene has a structure distinctly different from the genes for fibrillar collagens and also that it is considerably larger than any collagen gene characterized to date.  相似文献   

3.
4.
A lambda gt11 cDNA library containing DNA inserts prepared from human liver mRNA has been screened with an antibody to human alpha 2-thiol proteinase inhibitor that was isolated from fresh plasma. Eighteen positive clones were isolated from one million phage, and each was plaque purified. The cDNA insert of one of these phage was sequenced and shown to code for alpha 2-thiol proteinase inhibitor as identified by a partial amino acid sequence of the light chain of alpha 2-thiol proteinase inhibitor. This cDNA insert contained 1529 base pairs coding for the complete alpha 2-thiol proteinase inhibitor. It included 45 base pairs of 5' noncoding sequence, 1281 base pairs that code for pre alpha 2-thiol proteinase inhibitor, a stop codon, 160 base pairs of 3' noncoding sequence, and 40 base pairs of poly(A) tail. The noncoding sequence on the 3' end contained a potential recognition site (AATAAA) for processing and polyadenylation of precursor messenger RNA. The amino acid sequence of alpha 2-thiol proteinase inhibitor deduced from the cDNA showed a striking similarity (overall homology at 74%) to that of bovine low molecular weight (LMW) kininogen, including two internally repeated sequences and a nonapeptide sequence of bradykinin. These data clearly indicated that alpha 2-thiol proteinase inhibitor and LMW kininogen are identical. This was further supported by immunological cross-reactivity between alpha 2-thiol proteinase inhibitor and LMW kininogen.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

5.
Nucleotide sequence of the gene for human prothrombin   总被引:23,自引:0,他引:23  
S J Degen  E W Davie 《Biochemistry》1987,26(19):6165-6177
A human genomic DNA library was screened for the gene coding for human prothrombin with a cDNA coding for the human protein. Eighty-one positive lambda phage were identified, and three were chosen for further characterization. These three phage hybridized with 5' and/or 3' probes prepared from the prothrombin cDNA. The complete DNA sequence of 21 kilobases of the human prothrombin gene was determined and included a 4.9-kilobase region that was previously sequenced. The gene for human prothrombin contains 14 exons separated by 13 intervening sequences. The exons range in size from 25 to 315 base pairs, while the introns range from 84 to 9447 base pairs. Ninety percent of the gene is composed of intervening sequence. All the intron splice junctions are consistent with sequences found in other eukaryotic genes, except for the presence of GC rather than GT on the 5' end of intervening sequence L. Thirty copies of Alu repetitive DNA and two copies of partial KpnI repeats were identified in clusters within several of the intervening sequences, and these repeats represent 40% of the DNA sequence of the gene. The size, distribution, and sequence homology of the introns within the gene were then compared to those of the genes for the other vitamin K dependent proteins and several other serine proteases.  相似文献   

6.
A genomic DNA fragment (gCORE-1), encoding a portion of the cartilage proteoglycan core protein, has been isolated from a phage library using cDNA as a probe. The genomic insert is about 17 kilobase pairs; two BamHI fragments of the insert (1.3 and 4.8 kilobase pairs) contain most of the hybridizable sequences found in the cDNA. Sequence analysis of these fragments shows that they contain a total of five exons that encompass 216 amino acid residues, all of which are identical to those of the corresponding cDNA sequence. Three of the exons, which are adjacent to one another, are very similar to the corresponding exons in the gene of a rat hepatic lectin as well as to an exon in the gene of human pulmonary surfactant-associated protein. There is a strong degree of conservation of amino acid sequences encoded in the three genes, although there is no similarity between their introns. The sizes of the five exons in gCORE-1, except for one (which is indeterminate because only a partial cDNA sequence is available), are less than 184 base pairs, whereas the sizes of the introns range from 218 to greater than 2629 base pairs. Four of the introns interrupt an exon codon at either their donor or acceptor sites, between the first and second nucleotides. Only one intron does not split a codon. Intron and exon boundary sites are in agreement with known consensus sequences for introns. The dispersed distribution and relatively small size of the exons, if representative of the entire gene, suggest that the complete gene which codes for the core protein may be quite sizable.  相似文献   

7.
A bovine cDNA library constructed from fetal cartilage RNA was screened with a pro alpha 1(II) collagen specific chicken cDNA. A recombinant clone (Bc 7), with an insert of 1 kb, was identified and shown to contain sequences exhibiting 85% homology with the chicken pro alpha 1(II) collagen C-propeptide. Interspecies comparison strongly suggested that one potential glycosylation site present in the avian C-propeptide is not utilized, since this site is absent in the bovine chain. In addition, two overlapping genomic clones (Pal 3 and Pal 4) were isolated and partially characterized. These clones span 23 kb of DNA and contain approximately 17 kb of the pro alpha 1(II) calf gene. Sequencing of exon 1 has determined the length of the 3' untranslated region and the exact location of the polyadenylation attachment site.  相似文献   

8.
The complete primary structure of HLA-Bw58   总被引:12,自引:0,他引:12  
Serological studies indicate that HLA-B17 molecules are unusually cross-reactive with products of the HLA-A locus. In particular, a mouse monoclonal antibody MA2.1 defines an epitope that is shared by HLA-A2 and the two subtypes (Bw57 and Bw58) of B17. To investigate these relationships at the structural level, we have isolated a gene coding for Bw58 from the WT49 B cell line. The gene was transfected into mouse L cells and its protein product was characterized with a panel of monoclonal anti-HLA antibodies. The nucleotide sequence of 3520 base pairs of DNA encompassing the seven exons coding for Bw58 and associated introns was determined. The deduced protein sequences for Bw58 and eight other HLA-A,B,C molecules were compared. In the first polymorphic domain (alpha 1), Bw58 is unusual in that it is as homologous to HLA-A locus products as to HLA-B locus products. In the second polymorphic domain (alpha 2), Bw58 has greater homology to B locus products. In the alpha 1 domain of Bw58, small segments of amino acid and nucleotide sequence homology with A2 (residues 62-65) and with Aw24 (residues 75-83) are found in the major region of polymorphic diversity (residues 62-83). These similarities provide structural correlates for the serological relationships between Bw58 and A locus molecules, with residues 62-65 possibly being involved in the MA2.1 epitope. From comparisons of four HLA-A and four HLA-B sequences, there is a difference in the patterns of variation for A and B locus molecules. For B locus molecules there is greater variation in the alpha 1 domain than in the alpha 2 domain. For A locus molecules, variation in the two domains is similar and like that for B locus alpha 2 domains. In comparison to other HLA-A,B,C genes, novel inverted repeat sequences were found in the nucleotide sequence of HLA-Bw58. These sequences flank the putative RNA splicing sites at the 3' end of the exons encoding the alpha 2 and alpha 3 protein domains.  相似文献   

9.
10.
Structure of two in tandem human 17 beta-hydroxysteroid dehydrogenase genes   总被引:4,自引:0,他引:4  
Two human 17 beta-hydroxysteroid dehydrogenase (17 beta-HSD) genes (h17 beta-HSDI and h17 beta-HSDII) included in tandem within an approximately 13 kilobase pair fragment were isolated from a genomic lambda EMBL3 DNA library using cDNA encoding human 17 beta-HSD (hpE2DH216) as probe. We have determined the complete exon and intron sequences of the two genes as well as their 5' and 3'-flanking regions. Human 17 beta-HSDII contains six exons and five short introns for a total length of 3250 base pairs. The exon sequence of h17 beta-HSDII is identical to the previously reported hpE2DH216 cDNA while the overlapping nucleotide sequences of the corresponding exons and introns of h17 beta-HSDI and h17 beta-HSDII show 89% homology. In addition, we have used the hpE2DH216 cDNA to demonstrate the widespread expression of 17 beta-HSD mRNAs in steroidogenic and peripheral target tissues. These new findings provide the basis for a better understanding of the molecular mechanisms involved in 17 beta-HSD deficiency and peripheral sex steroid metabolism.  相似文献   

11.
12.
Structure of the human type I DNA topoisomerase gene   总被引:7,自引:0,他引:7  
We describe the molecular organization of the human gene coding for type I DNA topoisomerase. The coding sequence is split into 21 exons distributed over at least 85 kilobase pairs (kb) of human genomic DNA. The sizes of the 20 introns vary widely between 0.2 and at least 30 kb and all contain the sequence elements known to be required for pre-mRNA splicing. Several of the intron sequences separate exons encoding parts of the enzyme that are highly conserved between human and yeast suggesting that at least some of the exons may code for individual, structurally, or functionally important domains of the enzyme. We also describe the promoter sequence of the human topoisomerase I gene and show that it is composed of distinct functional elements.  相似文献   

13.
Nucleotide sequence of the gene for human factor IX (antihemophilic factor B)   总被引:97,自引:0,他引:97  
Two different human genomic DNA libraries were screened for the gene for blood coagulation factor IX by employing a cDNA for the human protein as a hybridization probe. Five overlapping lambda phages were identified that contained the gene for factor IX. The complete DNA sequence of about 38 kilobases for the gene and the adjacent 5' and 3' flanking regions was established by the dideoxy chain termination and chemical degradation methods. The gene contained about 33.5 kilobases of DNA, including seven introns and eight exons within the coding and 3' noncoding regions of the gene. The eight exons code for a prepro leader sequence and 415 amino acids that make up the mature protein circulating in plasma. The intervening sequences range in size from 188 to 9473 nucleotides and contain four Alu repetitive sequences, including one in intron A and three in intron F. A fifth Alu repetitive sequence was found immediately flanking the 3' end of the gene. A 50 base pair insert in intron A was found in a clone from one of the genomic libraries but was absent in clones from the other library. Intron A as well as the 3' noncoding region of the gene also contained alternating purine-pyrimidine sequences that provide potential left-handed helical DNA or Z-DNA structures for the gene. KpnI repetitive sequences were identified in intron D and the region flanking the 5' end of the gene. The 5' flanking region also contained a 1.9-kb HindIII subfamily repeat. The seven introns in the gene for factor IX were located in essentially the same position as the seven introns in the gene for human protein C, while the first three were found in positions identical with those in the gene for human prothrombin.  相似文献   

14.
15.
16.
Syrian hamster DDT-1 cells are derived from smooth muscle of the ductus deferens. DDT-1 cell growth is increased by the addition of testosterone (T). Acidic fibroblast growth factor (aFGF) or basic fibroblast growth factor (bFGF) also known as heparin binding growth factor I and II (HBGF-I and HBGF-II) can replace T in the stimulation of growth in these cells. This phenomenon is correlated with testosterone's ability to elevate aFGF/HBGF-I mRNA. The increase steady-state levels of aFGF/HBGF-I mRNA were documented by northern blots and by in situ hybridization. Using a 520 bp human aFGF/HBGF-I cDNA probe, a genomic clone with a 38 kb DNA insert was isolated from a cosmid library. By restriction enzyme analysis and southern hybridization, it was determined that there are three coding exons. DNA sequence analysis showed all of the coding region and 3' noncoding sequences were on this clone. A 5' noncoding exon not in the 38 kb insert is indicated, based on the cDNA sequences and genomic sequences of aFGF/HBGF-I's from hamster DDT-1 cells and several other species. The cDNA for hamster aFGF/HBGF-I was isolated from a DDT-1 lambda gt11 library and sequenced. Comparison of the coding region of aFGF/HBGF-I from four species shows a greater than 90% conservation of amino acid sequence.  相似文献   

17.
18.
Structure of the human ornithine transcarbamylase gene   总被引:21,自引:0,他引:21  
Complementary and genomic DNA clones corresponding to the human ornithine transcarbamylase (OTC) [EC 2.1.3.3]mRNA have been isolated and analyzed. The OTC gene is about 73 kilobase pairs (kb) long and contains 10 exons interrupted by 9 introns of highly variable sizes. The smallest intron is 80 base pairs and the largest, 21.7 kb. The 5'- and 3'-flanking regions, entire exons and all the exon/intron boundaries were sequenced. The nucleotide and deduced amino acid sequences of isolated OTC cDNAs as well as the corresponding regions of the genomic DNA were compared with those of human OTC cDNA (Horwich, A.L., Fenton, W.A., Williams, K.R., Kalousek, F., Kraus, J.P., Doolittle, R.F., Koningsberg, W., & Rosenberg, L.E. (1984) Science 224, 1068-1074). We found 20 nucleotide substitutions among these sequences, of which 6 related to amino acid changes. The nature of these nucleotide substitutions is discussed.  相似文献   

19.
20.
Organization of the human protein S genes   总被引:6,自引:0,他引:6  
Human genomic clones that span the entire protein S expressed gene (PS alpha) and the 3' two-thirds of the protein S pseudogene (PS beta) have been isolated and characterized. The PS alpha gene is greater than 80 kilobases in length and contains 14 introns and 15 exons, as well as 6 repetitive "Alu" sequences. Exons I and XV contain 112 and 1139 bp 5' and 3' noncoding segments in addition to the amino and carboxyl termini, respectively. Exons I-VIII encode protein segments that are homologous to the vitamin K dependent clotting proteins and are bounded by introns whose position and type are identical with other members of this protein family. Exons IX-XV encode protein segments homologous to sex hormone binding globulin (SHBG) and are bounded by introns of identical type and position as in the SHBG gene. Genomic clones for the PS beta gene cover a distance of greater than 55 kilobases and contain segments corresponding to amino acids 46-635 of the mature protein and the 1.1-kb 3' noncoding region of the cDNA. The presence of multiple base changes in the coding portions of this gene, resulting in termination codons and frame shifts, suggests that it is a pseudogene. Comparison of DNA sequences for the two genes reveals 97% identity for coding and 3' noncoding, and 95.4% for intronic regions, suggesting divergence of the two genes is a relatively recent event.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号