首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
In the plasmid pUC8ksgA7, the coding region of the ksgA gene is preceded by the lac promoter (Plac) and a small open reading frame (ORF). This ORF of 15 codons is composed of nucleotides derived from the lacZ gene, a multiple cloning site and the ksgA gene itself. The reading frame begins with the ATG initiation codon of lacZ and ends a few nucleotides beyond the ATG start codon of ksgA. The ksgA gene is not preceded by a Shine-Dalgarno (SD) signal. Cells transformed with pUC8ksgA7 produce active methylase, the product of the ksgA gene. Introduction of an in-phase TAA stop codon in the small ORF abolishes methylase production in transformed cells. On the plasmid pUC8ksgA5, which contains the entire ksgA region, the promoter of the ksgA gene was found to reside in a 380 base pair Bgl1-Pvu2 restriction fragment, partly overlapping the ksgA gene, by two independent methods. Cloning of this fragment in front of the galK gene in plasmid pKO1 stimulates galactokinase activity in transformants and its insertion into the expression vector pKL203 makes beta-galactosidase synthesis independent of the presence of Plac. The sequence of the Bgl1-Pvu2 fragment was determined and a putative promoter sequence identified. An SD signal could not be distinguished at a proper distance upstream from the ksgA start codon. Instead, an ORF of 13 codons starting with ATG in tandem with an SD signal and ending 4 codons ahead of the ksgA gene was identified. This suggests that translation of the ORF is required for expression of the ksgA gene.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

3.
In a lacZ expression vector (pMC1403Plac), all 64 codons were introduced immediately 3' from the AUG initiation codon. The expression of the second codon variants was measured by immunoprecipitation of the plasmid-coded fusion proteins. A 15-fold difference in expression was found among the codon variants. No distinct correlation could be made with the level of tRNA corresponding to the codons and large differences were observed between synonymous codons that use the same tRNA. Therefore the effect of the second codon is likely to be due to the influence of its composing nucleotides, presumably on the structure of the ribosomal binding site. An analysis of the known sequences of a large number of Escherichia coli genes shows that the use of codons in the second position deviates strongly from the overall codon usage in E. coli. It is proposed that codon selection at the second position is not based on requirements of the gene product (a protein) but is determined by factors governing gene regulation at the initiation step of translation.  相似文献   

4.
5.
6.
7.
The similarity of two nucleotide sequences is often expressed in terms of evolutionary distance, a measure of the amount of change needed to transform one sequence into the other. Given two sequences with a small distance between them, can their similarity be explained by their base composition alone? The nucleotide order of these sequences contributes to their similarity if the distance is much smaller than their average permutation distance, which is obtained by calculating the distances for many random permutations of these sequences. To determine whether their similarity can be explained by their dinucleotide and codon usage, random sequences must be chosen from the set of permuted sequences that preserve dinucleotide and codon usage. The problem of choosing random dinucleotide and codon-preserving permutations can be expressed in the language of graph theory as the problem of generating random Eulerian walks on a directed multigraph. An efficient algorithm for generating such walks is described. This algorithm can be used to choose random sequence permutations that preserve (1) dinucleotide usage, (2) dinucleotide and trinucleotide usage, or (3) dinucleotide and codon usage. For example, the similarity of two 60-nucleotide DNA segments from the human beta-1 interferon gene (nucleotides 196-255 and 499-558) is not just the result of their nonrandom dinucleotide and codon usage.   相似文献   

8.
The role of the translational terminator and initiator signals arrangement for two adjacent genes in polycistronic mRNA has been studied. Semisynthetic beta-galactosidase gene (lacZ) of E. coli and fragment of phage M13 DNA (with promoter PVIII, gene IX, and part of gene VIII) were used for constructing of the IX-VIII-lacZ artificial polycistronic operon. Cloning of the constructs into pBR322 vector resulted in a number of pLZ381N plasmids differing by the mutual arrangement of gene VIII translation terminator codon and SD site and initiator codon (SD-ATG-region) of lacZ gene. The mutual arrangement of gene VIII terminator codon and SDlacZ-ATG region has been altered by means of deletions and insertions that have not affected lacZ translation initiation signals. The beta-galactosidase (beta-Gal) synthesis in E. coli harbouring different types of pLZ381N plasmids has been found to depend on type of cistron coupling (gene VIII and lacZ). The overlapping of terminator and initiator codons (ATGA) for genes VIII and lacZ (type I of polycistrons) provide approximately equal translational level for both cistrons. On the other side, levels of beta-Gal synthesis in case of polycistrons type II (gene VIII stop-codon position at the beginning of SDlacZ or 10 nucleotides upstream) were 20-30 times as high as for type I. Differences in beta-Gal levels have also been found for variants of VIII-lacZ coupling in types IV and III polycistrons (the SDlacZ-ATG region in 27-50 nucleotides downstream from the proximal cistron VIII stop-codon, which, in turn, is 41 nucleotides upstream this terminator). These data cannot be explained on the basis of possible secondary structure including the SDlacZ-ATG region and other parts of polycistronic mRNA. In all these cases similarly stable stem-loop structures have been found. Therefore, the arrangement of the translation termination and initiation signals for two adjacent genes in essential for distal gene translation efficiency. One can imagine that ribosome or its 30S subpartical, stalling on the proximal gene terminator codon, affects the distal gene translation initiation.  相似文献   

9.
10.
The enzyme NAD(P)H:flavin oxidoreductase (flavin reductase) catalyzes the reduction of soluble flavins by reduced pyridine nucleotides. In Escherichia coli it is part of a multienzyme system that reduces the Fe(III) center of ribonucleotide reductase to Fe(II) and thereby sets the stage for the generation by dioxygen of a free tyrosyl radical required for enzyme activity. Similar enzymes are known in other organisms and may more generally be involved in iron metabolism. We have now isolated the gene for the E. coli flavin reductase from a lambda gt11 library. After DNA sequencing we found an open reading frame coding for a polypeptide of 233 amino acids, with a molecular weight of 26,212 and with an N-terminal segment identical to that determined by direct Edman degradation. The coding sequence is preceded by a weak ribosome binding site centered 8 nucleotides from the start codon and by a promoterlike sequence centered at a distance of 83 nucleotides. In a Kohara library the gene hybridized to position 3680 on the physical map of E. coli. A bacterial strain that overproduced the enzyme approximately 100-fold was constructed. The translated amino acid sequence contained a potential pyridine nucleotide-binding site and showed 25% identity with the C-terminal part of one subunit (protein C) of methane monooxygenase from methanotropic bacteria that reduces the iron center of a second subunit (protein A) of the oxygenase by pyridine nucleotides.  相似文献   

11.
12.
S S Fojo  S W Law  H B Brewer 《FEBS letters》1987,213(1):221-226
The complete nucleic acid sequence of human preproapolipoprotein (apo) C-II has been determined from 2 apoC-II clones isolated from 2 different human genomic DNA libraries. The cloned fragments were approx. 14 and 18 kb long, and sequence analysis established that the apoC-II gene consists of 3338 nucleotides containing 3 intervening sequences of 2391, 167, and 298 bases. The first intron is located within the 5'-untranslated region of apoC-II and contains 4 Alu type sequences. The second intron interrupts the codon specifying amino acid - 11 of the apoC-II signal peptide. The last intron, which contains a 38 bp sequence which is repeated 6 times, interrupts the codon specifying for amino acid +44 of the mature apolipoprotein.  相似文献   

13.
14.
The nucleotide sequence running from the genetic left end of bacteriophage T7 DNA to within the coding sequence of gene 4 is given, except for the internal coding sequence for the gene 1 protein, which has been determined elsewhere. The sequence presented contains nucleotides 1 to 3342 and 5654 to 12,100 of the approximately 40,000 base-pairs of T7 DNA. This sequence includes: the three strong early promoters and the termination site for Escherichia coli RNA polymerase: eight promoter sites for T7 RNA polymerase; six RNAase III cleavage sites; the primary origin of replication of T7 DNA; the complete coding sequences for 13 previously known T7 proteins, including the anti-restriction protein, protein kinase, DNA ligase, the gene 2 inhibitor of E. coli RNA polymerase, single-strand DNA binding protein, the gene 3 endonuclease, and lysozyme (which is actually an N-acetylmuramyl-l-alanine amidase); the complete coding sequences for eight potential new T7-coded proteins; and two apparently independent initiation sites that produce overlapping polypeptide chains of gene 4 primase. More than 86% of the first 12,100 base-pairs of T7 DNA appear to be devoted to specifying amino acid sequences for T7 proteins, and the arrangement of coding sequences and other genetic elements is very efficient. There is little overlap between coding sequences for different proteins, but junctions between adjacent coding sequences are typically close, the termination codon for one protein often overlapping the initiation codon for the next. For almost half of the potential T7 proteins, the sequence in the messenger RNA that can interact with 16 S ribosomal RNA in initiation of protein synthesis is part of the coding sequence for the preceding protein. The longest non-coding region, about 900 base-pairs, is at the left end of the DNA. The right half of this region contains the strong early promoters for E. coli RNA polymerase and the first RNAase III cleavage site. The left end contains the terminal repetition (nucleotides 1 to 160), followed by a striking array of repeated sequences (nucleotides 175 to 340) that might have some role in packaging the DNA into phage particles, and an A · T-rich region (nucleotides 356 to 492) that contains a promoter for T7 RNA polymerase, and which might function as a replication origin.  相似文献   

15.
16.
17.
The globulin storage protein genes of cotton are found to exist as gene tandems that contain a gene from each of the 2 globulin subfamilies separated by a spacer region of about 2700 or 3400 base pairs. Three different tandems have been identified by restriction endonuclease mapping of genomic DNA. A cDNA that is different from the genes of the tandems in map sites and/or in nucleotide sequence indicates that a fourth tandem probably exists in the cotton genome. Since the species of cotton used here (Gossypium hirsutum) is an amphidiploid, it is likely that two of the tandems are contributed from each genome.Considerable divergence in nucleotide sequence (18%) and in derived amino acid sequence (28%) is found when the 2 genes of a sequenced tandem are compared. The sequence of the cDNA closely resembles one of the genes in the tandem showing only a 4% divergence in nucleotides and a 4.2% divergence in amino acids. Thus the 2 genes of each tandem represent a relatively ancient gene duplication that has given rise to the two globulin subfamilies of cotton. Only one subfamily has a glycosylation site and the glycosylation of its derived proteins gives rise to the 2 molecular weight sets of globulins seen on gel electrophoresis.Other basic features of these genes and their derived proteins are presented.  相似文献   

18.
The nucleotide sequence of a 9937 base-pair portion of human chromosome 9, which contains two complete leukocyte interferon genes (LeIF-L and J), the complete intergenic region, and part of a third related possible pseudogene (LeIF-M), has been determined. The coding regions of the L and J genes are separated by 4363 nucleotides. The coding regions for the putative L and J interferons are 96% homologous and are each surrounded by about 3500 nucleotides of flanking sequences, which are also highly homologous. The L and J genes and their respective flanking sequences comprise a 4000 nucleotide leukocyte interferon gene repeat unit; the L gene repeat unit contains two major insertions not present in the J gene repeat unit. The J gene repeat unit is flanked by sequence features reminiscent of those found surrounding transposable elements. Both the L and J gene repeat units are embedded within sequences that are highly repeated in the human genome. Structural features identified within this portion of chromosome 9 may have been important for the generation of this interferon gene cluster.  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号