Similar articles
 Found 20 similar articles; search took 15 ms
2.
Five independent clones containing the natural chicken ovomucoid gene have been isolated from a chicken gene library. One of these clones, CL21, contains the complete ovomucoid gene and includes more than 3 kb of DNA sequences flanking both termini of the gene. Restriction endonuclease mapping, electron microscopy and direct DNA sequencing analyses of this clone have revealed that the ovomucoid gene is 5.6 kb long and codes for a messenger RNA of 821 nucleotides. The structural gene sequence coding for the mature messenger RNA is split into at least eight segments by a minimum of seven intervening sequences of various sizes. The shortest structural gene segment is only 20 nucleotides long. All seven intervening sequences are located within the peptide coding region of the gene, and the sequences at the 5' and 3' untranslated regions of the mRNA are not interrupted by intervening sequences. The DNA sequences of the regions flanking the 5' and 3' termini of the gene have been determined. Thirty nucleotides before the start of the messenger RNA coding sequence is the heptanucleotide TATATAT, which is also present in a similar location relative to the chicken ovalbumin gene and other unique sequence eucaryotic genes. This sequence resembles that of the Pribnow box in procaryotic genes where a promoter function has been implicated. Seven nucleotides past the 3' end of the gene is the tetranucleotide TTGT, a sequence found to be present at identical locations as either TTTT or TTGT in other eucaryotic genes that have been sequenced. These conserved DNA sequences flanking eucaryotic genes may serve some regulatory function in the expression of these genes.
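The positional claim about the TATATAT heptanucleotide can be illustrated with a small motif scan. This is only a sketch: the function name `find_motif_upstream`, the 40-nucleotide window, and the toy sequence are all hypothetical and are not taken from the ovomucoid gene; only the motif and the 30-nucleotide distance come from the abstract.

```python
def find_motif_upstream(sequence, start_index, motif, window=40):
    """Return the distance (in nt) from start_index back to the first base
    of each motif occurrence lying fully within `window` nt upstream."""
    lo = max(0, start_index - window)
    region = sequence[lo:start_index]
    distances = []
    pos = region.find(motif)
    while pos != -1:
        distances.append(start_index - (lo + pos))
        pos = region.find(motif, pos + 1)
    return distances

# Toy sequence (hypothetical, not real ovomucoid DNA): the motif is placed
# so that it begins 30 nt before the assumed start of the mRNA coding sequence.
upstream = "G" * 20 + "TATATAT" + "C" * 23
toy_sequence = upstream + "ATGGCC"
mrna_start = len(upstream)

print(find_motif_upstream(toy_sequence, mrna_start, "TATATAT"))
```

On a real clone, the same scan would be run against the sequenced 5' flanking region, with `mrna_start` set to the mapped start of the messenger RNA coding sequence.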

3.
Sequence of the cDNA and gene for angiogenin, a human angiogenesis factor   Total citations: 29 (self-citations: 0, citations by others: 29)
Human cDNAs coding for angiogenin, a human tumor-derived angiogenesis factor, were isolated from a cDNA library prepared from human liver poly(A) mRNA employing a synthetic oligonucleotide as a hybridization probe. The largest cDNA insert (697 base pairs) contained a short 5'-noncoding sequence followed by a sequence coding for a signal peptide of 24 (or 22) amino acids, 369 nucleotides coding for the mature protein of 123 amino acids, a stop codon, a 3'-noncoding sequence of 175 nucleotides, and a poly(A) tail. The gene coding for human angiogenin was then isolated from a genomic lambda Charon 4A bacteriophage library employing the cDNA as a probe. The nucleotide sequence of the gene and the adjacent 5'- and 3'-flanking regions (4688 base pairs) was then determined. The coding and 3'-noncoding regions of the gene for human angiogenin were found to be free of introns, and the DNA sequence for the gene agreed well with that of the cDNA. The gene contained a potential TATA box in the 5' end in addition to two Alu repetitive sequences immediately flanking the 5' and 3' ends of the gene. A third Alu sequence was also found about 500 nucleotides downstream from the Alu sequence at the 3' end of the gene. The amino acid sequence of human angiogenin as predicted from the gene sequence was in complete agreement with that determined by amino acid sequence analysis. It is about 35% homologous with human pancreatic ribonuclease, and the amino acid residues that are essential for the activity of ribonuclease are also conserved in angiogenin. This provocative finding is thought to have important physiological implications.

4.
Three cloned apolipoprotein A-II genes were isolated from a human genomic cosmid library constructed in our laboratory. An approximately 3-kilobase HindIII insert containing the entire gene was analyzed by RNA:DNA hybridization and electron microscopy. The apo-A-II gene was found to consist of 4 exons and 3 intervening sequences (IVS), and the lengths of each exon and IVS were estimated by direct observation of the hybrids. The entire approximately 3-kilobase HindIII insert was sequenced. The 5' end of the gene was determined by primer extension. The DNA sequence confirms the presence of 4 exons and 3 IVS: exon 1, 34 nucleotides; exon 2, 76 nucleotides; exon 3, 133 nucleotides; exon 4, 230 nucleotides; IVS-I, 169 nucleotides; IVS-II, 299 nucleotides; and IVS-III, 396 nucleotides. A "TATA box" is located at position -29 from the CAP site. A "CAT box" is present at position -78. A "TG" element consisting of (TG)19 is identified at the 3' end of IVS-III. Furthermore, an enhancer core sequence, CTTTCCA, is identified at position -355 in the 5' flanking sequence. At positions -497 to -471 upstream from the CAP site is a stretch of 27 nucleotides that shows high homology to stretches of 5' flanking sequences in the apo-C-II, apo-A-I, apo-E, and apo-C-III genes. An Alu dimer sequence is located approximately 300 nucleotides from the 3' end of the gene. Within this Alu sequence, we have identified a polymorphic MspI site. Restriction fragment length polymorphism involving this site has been previously shown to correlate with apo-A-II levels and high density lipoprotein structure. Analysis of conformation by Chou-Fasman analysis and by the helical hydrophobic moment of Eisenberg et al. (Eisenberg, D., Weiss, R. M., and Terwilliger, T. C. (1982). Nature (Lond.) 299, 371-374) indicates that in all of the 5 apolipoproteins characterized at the nucleotide level to date, i.e. apo-C-II, apo-A-II, apo-E, apo-A-I, and apo-C-III, the 2 IVS within the peptide coding regions of the gene tend to occur at regions corresponding to the surface of the polypeptide chain and divide the protein into distinct functional domains.
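The exon and IVS lengths listed for the apo-A-II gene can be added up to get the implied sizes of the spliced transcript and of the transcribed region. The sketch below only sums the figures quoted in the abstract; the variable names are illustrative, and the totals are derived here rather than stated by the authors.

```python
# Lengths in nucleotides, as quoted in the abstract.
exons = {"exon 1": 34, "exon 2": 76, "exon 3": 133, "exon 4": 230}
introns = {"IVS-I": 169, "IVS-II": 299, "IVS-III": 396}

# Spliced length excludes the intervening sequences (and any poly(A) tail).
spliced_length = sum(exons.values())
intron_length = sum(introns.values())

# Span from the first base of exon 1 to the last base of exon 4.
transcribed_span = spliced_length + intron_length

print(spliced_length, intron_length, transcribed_span)  # 473 864 1337
```

A span of about 1.3 kb fits comfortably within the approximately 3-kilobase HindIII insert described above.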

5.
The gene coding for the common alpha subunit of the bovine pituitary glycoprotein hormones was isolated from a bovine genomic library. The gene spans roughly 16.5 kbp, contains three intervening sequences, and codes for a message of approximately 730 nucleotides. The complete coding region of the gene was sequenced as well as 315 nucleotides of 5' flanking sequence and the entire intron C. Only a single base difference was found when the sequence of the gene was compared with that of the cDNA. Genomic blotting experiments suggest the presence of a single alpha subunit gene. Comparison of the bovine and human alpha subunit genes indicated that the high level of homology observed in the coding regions has been maintained throughout the 5' and 3' untranslated regions, and at least 90 nucleotides of the 5' flanking regions. Additionally, there is an 18 base pair sequence present in both the 5' flanking and 5' untranslated regions of the gene that is homologous to a region of the chick ovalbumin gene. This ovalbumin sequence has been suggested as a binding site for the progesterone receptor complex.

6.
RNA 3 of alfalfa mosaic virus (AlMV) contains information for two genes: near the 5' end an active gene coding for a 35 Kd protein and, near the 3' end, a silent gene coding for viral coat protein. We have determined a sequence of 318 nucleotides which contains the potential initiation codon for the 35 Kd protein at 258 nucleotides from the 5' end. This long leader sequence can form initiation complexes containing three 80 S ribosomes. A shorter species of RNA, corresponding to a molecule of RNA 3 lacking the cap and the first 154 nucleotides (RNA 3') has been isolated. The remaining leader sequence of 104 nucleotides in RNA 3' forms a single 80 S initiation complex with wheat germ ribosomes. The location of the regions of the leader sequence of RNA 3 involved in initiation complex formation with 80 S ribosomes is reported.
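The 104-nucleotide leader of RNA 3' follows directly from the two positions given in the abstract. A minimal check, with illustrative variable names:

```python
# Figures quoted in the abstract (nucleotides).
full_leader_nt = 258      # position of the potential initiation codon from the 5' end
deleted_5_prime_nt = 154  # nucleotides missing from the truncated species, RNA 3'

# Leader remaining in RNA 3' after the 5' truncation.
remaining_leader_nt = full_leader_nt - deleted_5_prime_nt

print(remaining_leader_nt)  # 104
```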

7.
Human spermidine synthase: cloning and primary structure   Total citations: 1 (self-citations: 0, citations by others: 1)
Using a synthetic deoxyoligonucleotide mixture constructed for a tryptic peptide of the bovine enzyme as a probe, cDNA coding for the full-length subunit of spermidine synthase was isolated from a human decidual cDNA library constructed on phage lambda gt11. After subcloning into the Eco RI site of pBR322 and propagation, both strands of the insert were sequenced using a shotgun strategy. Starting from the first start codon, which was immediately preceded by a GC-rich region including four overlapping CCGCC consensus sequences, an open reading frame for a 302-amino-acid polypeptide was resolved. This peptide had an Mr of 33,827, started with methionine, and ended with serine. The identity of the isolated cDNA was confirmed by comparison of the deduced amino acid sequence with resolved sequences of the tryptic peptides of bovine spermidine synthase. The coding strand of the cDNA revealed no special regulatory or ribosome-binding signals within 82 nucleotides preceding the start codon and no polyadenylation signal within 247 nucleotides following the stop codon. The coding region, containing a 13-nucleotide repeat close to the 5' end, was longer than, and very different from, that of the bacterial counterpart. This region seems to be of retroviral origin and shows marked homology with sequences found in a variety of human, mammalian, avian, and viral genes and mRNAs. By computer analysis, the first 200 nucleotides of the 5' end of the coding strand appear able to form a very stable secondary structure with a free energy change of -157.6 kcal/mole.(ABSTRACT TRUNCATED AT 250 WORDS)

8.
Nucleotide sequence of the yeast regulatory gene GAL80   Total citations: 20 (self-citations: 1, citations by others: 19)
The GAL80 gene in Saccharomyces cerevisiae encodes a negative regulatory protein for the set of inducible genes involved in the metabolism of galactose and melibiose. We have determined the nucleotide sequence of GAL80 and its flanking regions and assigned the 5' end of its mRNA to the sequence. The deduced coding sequence for GAL80 protein contains 1305 nucleotides and the calculated molecular weight of the peptide chain is 48309. The 5' end of the GAL80 mRNA maps about 67 nucleotides upstream from the translation initiating ATG. We have also determined the nucleotide sequence of the uninducible alleles GAL80S-0, GAL80S-1 and GAL80S-2, and found a single base substitution in each of these mutant genes which would lead to an amino acid alteration in the GAL80 protein.

9.
A lambda gt 11 library containing cDNA inserts prepared from human liver mRNA has been screened with an affinity-purified antibody to human histidine-rich glycoprotein (HRG) and then with a restriction fragment isolated from the 5' end of the largest cDNA insert obtained by antibody screening. A number of positive clones were identified and shown to code for HRG by DNA sequence analysis. A total of 2067 nucleotides were determined by sequencing 3 overlapping cDNA clones, which included 121 nucleotides of 5'-noncoding sequence, 54 nucleotides coding for a leader sequence of 18 amino acids, 1521 nucleotides coding for the mature protein of 507 amino acids, a stop codon of TAA, and 352 nucleotides of 3'-noncoding sequence followed by a poly(A) tail of 16 nucleotides. The length of the noncoding sequence of the 3' end differed in several clones, but each contained a polyadenylylation or processing sequence of AATAAA followed by a poly(A) tail. More than half of the amino acid sequence of HRG consisted of five different types of internal repeats. Within the last 3 internal repeats (type V), there were 12 tandem repetitions of a 5 amino acid segment with a consensus sequence of Gly-His-His-Pro-His. This repeated portion, referred to as a "histidine-rich region", contained 53% histidine and showed a high degree of similarity to a histidine-rich region of high molecular weight kininogen.
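The segment lengths reported for the HRG cDNA can be checked against the stated total of 2067 nucleotides and against the stated amino acid counts. The sketch below uses only numbers quoted in the abstract; the dictionary keys and variable names are illustrative.

```python
# Segment lengths in nucleotides, as quoted in the abstract.
segments = {
    "5' noncoding": 121,
    "leader peptide": 54,
    "mature protein": 1521,
    "stop codon (TAA)": 3,
    "3' noncoding": 352,
    "poly(A) tail": 16,
}

total_nt = sum(segments.values())

# Three nucleotides per codon: coding lengths should match the residue counts.
leader_residues = segments["leader peptide"] // 3
mature_residues = segments["mature protein"] // 3

print(total_nt, leader_residues, mature_residues)  # 2067 18 507
```

The segments sum exactly to the reported 2067 nucleotides, and the coding lengths match the 18-residue leader and the 507-residue mature protein.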

10.
Structure and expression of the rat apolipoprotein E gene   Total citations: 2 (self-citations: 0, citations by others: 2)

11.
The nucleotide sequence for an unusual, cloned human adenosine deaminase cDNA has been determined. Contained within a sequence of 1535 nucleotides is a coding sequence of 1089 nucleotides that encodes a protein of 40,762 daltons. The coding sequence is interrupted by a non-coding region containing 76 nucleotides. Both the 3' and 5' ends of this region have consensus sequences generally associated with splice sites. The 3' untranslated sequence contained 308 nucleotides, including a polyadenylation signal sequence 20 nucleotides from the end. The cloned cDNA appears to correspond to a nuclear mRNA precursor which contains a small intron.

12.
Plasmid clones containing cDNA coding for the B-chain of human C1q were isolated from a liver cDNA library. The longest cDNA insert isolated contained all the coding sequence for amino acid residues B1 to B226 plus a 3' non-translated region of 264 nucleotides that extended into the poly(A) tail, thus accounting for 950 nucleotides of the mRNA. The B-chain mRNA was estimated by Northern-blot analysis to be 1.46 kb (kilobases) long, which indicated that approx. 500 bases were not accounted for in the cDNA clone. A cosmid clone containing the C1q-B chain gene was isolated from a human genomic DNA library. The precise 5' limit of the gene was not established, but from the data available it appears that the gene is approx. 2.6 kb long. The coding sequence for residues B1 to B226 in the gene is interrupted by one intron, of 1.1 kb, which is located within the codon coding for glycine at position B36. This glycine residue is located in the middle of the triple-helical regions found in C1q at exactly the position where there is an unusual structural feature, i.e. a bend in each of the helical regions brought about by the interruption of the Gly-Xaa-Yaa repeating triplet sequences in the A- and C-chains and the presence of an 'extra' triplet in the B-chain. Nucleotide sequencing of the 5' end of the gene indicates the presence of a predominantly hydrophobic stretch of 29 amino acids, immediately before residue B1, which could serve as a signal peptide.
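The size estimates for the C1q B-chain cDNA and mRNA can be reproduced with simple arithmetic. This sketch assumes three nucleotides per residue for B1 to B226 and treats the 1.46 kb Northern-blot estimate as 1460 nucleotides; variable names are illustrative, and the abstract does not itemize the small gap between the computed 942 and the reported 950 nucleotides.

```python
# Figures quoted in the abstract.
residues_b1_to_b226 = 226
three_prime_nontranslated_nt = 264
reported_cdna_nt = 950
mrna_estimate_nt = 1460  # 1.46 kb from Northern-blot analysis

# Coding nucleotides for B1-B226 at three nucleotides per residue.
coding_nt = residues_b1_to_b226 * 3

# Coding plus 3' non-translated sequence; close to, but not exactly, 950.
coding_plus_3_prime_nt = coding_nt + three_prime_nontranslated_nt

# mRNA sequence not represented in the cDNA clone (reported as approx. 500).
unaccounted_nt = mrna_estimate_nt - reported_cdna_nt

print(coding_nt, coding_plus_3_prime_nt, unaccounted_nt)  # 678 942 510
```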

13.
The cDNAs corresponding to the 5' ends of the mRNAs coding for the envelope protein precursor (gPr92env) of the B77 strain and the transforming protein (pp60src) of the Prague B strain of Rous sarcoma virus were cloned into pBR322, and the nucleotide sequences surrounding the splice junctions were determined. Both mRNAs are products of single splicing events from a common donor splice site at nucleotide 398 from the 5' end of the RNA to acceptor splice sites at nucleotides 5078 and 7054 for the env and src mRNAs, respectively. These results confirm and extend previous conclusions based on peptide mapping and single-strand nuclease mapping. Compared with the sequence of the Prague C genome RNA, the B77 strain contains a 6-nucleotide deletion in the sequence corresponding to the hydrophobic portion of the signal peptide of the envelope protein precursor.

14.
The coat protein (CP) of Papaya ringspot virus (PRSV) was analyzed for presentation of an antigenic peptide of an animal virus, Canine parvovirus (CPV), in Escherichia coli (E. coli). The 45-nucleotide fragment coding for the 15-aa peptide epitope of the CPV-VP2 protein was either inserted into the PRSV cp gene at the 5' end, the 3' end, or both the 5' and 3' ends, or substituted into the 3' end of the PRSV cp gene. Each of the chimeric PRSV cp genes was cloned into the pRSET B vector under the control of the T7 promoter and transformed into E. coli. The recombinant coat proteins expressed from the different chimeric PRSV cp genes were purified and intraperitoneally injected into mice. All of the recombinant coat proteins showed strong immunogenicity and stimulated an immune response in mice. The recombinant coat proteins containing the CPV epitope insertion at the C terminus and at both the N and C termini elicited ten times higher specific antisera in immunized mice compared with the other two recombinant coat proteins, which contain the CPV epitope insertion at the N terminus and the substitution at the C terminus.

16.
Complementary DNAs (cDNA's) specific for various regions of the Moloney murine sarcoma virus (MSV) 124 RNA genome were prepared by cross-hybridization techniques. A cDNA specific for the first 1,000 nucleotides adjacent to the RNA 3' end (cDNA 3') was prepared and shown to also be complementary to the 3'-terminal 1,000 nucleotides of a related Moloney murine leukemia virus (MLV) genome. A cDNA complementary to the "MSV-specific" portion of the MSV 124 genome was prepared. This cDNA was shown not to anneal to Moloney MLV RNA and to anneal to a portion of the viral RNA of about 1,500 to 1,800 nucleotides in length, located 1,000 nucleotides from the 3' end of MSV RNA. A cDNA common to the genome of MSV and MLV was also obtained and shown to anneal to the 5'-terminal two-thirds, as well as to the 3'-terminal 1,000 nucleotides, of the MSV RNA genome. This cDNA also annealed to the RNA from MLV and mainly to the 5'-terminal half of the MLV genome. It is concluded that the 6-kilobase Moloney MSV 124 RNA genome has a sequence arrangement that includes (i) a 3' portion of about 1,000 nucleotides, which is also present at the 3' terminus of MLV; (ii) an MSV-specific region, not shared with MLV, which extends between 1,000 and 2,500 nucleotides from the 3' terminus; and (iii) a second "common" region, again shared with MLV, which extends from 2,500 nucleotides to the 5' terminus. This second common region appears to be located in the 5' half of the 10-kilobase MLV genome as well. Experiments in which a large excess of cold MLV cDNA was annealed to ³H-labeled polyadenylic acid-containing fragments of MSV RNA gave results consistent with this arrangement of the MSV genome.
