Similar articles
Found 20 similar articles.
1.
The yeast nuclear gene PET111 is required specifically for translation of the mitochondrion-coded mRNA for cytochrome c oxidase subunit II. We have determined the nucleotide sequence of a 3-kilobase segment of DNA that carries PET111. The sequence contains a single long open reading frame that predicts a basic protein of 718 amino acids. The PET111 gene product is a mitochondrial protein, since a hybrid protein which includes the amino-terminal 154 amino acids of PET111 fused to beta-galactosidase is specifically associated with mitochondria. PET111 is translated from a 2.9-kilobase mRNA which, interestingly, has an extended 5'-leader sequence containing four short open reading frames upstream of the long open reading frame. These open reading frames exhibit an interesting pattern of overlap with each other and with the PET111 reading frame.

2.
The pheST operon codes for the two subunits of phenylalanyl-tRNA synthetase, and its expression is controlled by attenuation in a way similar to many amino acid biosynthetic operons. The nucleotide sequence of the control regions of the operon indicates the presence of several open reading frames besides that of the leader peptide. One of these open reading frames, called the alternative leader peptide, starts at about the same place as the leader peptide and ends after the terminator of the attenuator. Another open reading frame, called the terminator peptide, starts after the terminator and covers about half the distance to pheS, the first structural gene of the operon. The present report shows that, in fact, the only open reading frame to be translated efficiently is the leader peptide itself. The alternative leader peptide and the terminator peptide are both translated at a negligible rate.

3.
A Spirochaeta aurantia DNA fragment containing the trpE gene and flanking chromosomal DNA was cloned, and the sequence of the trpE structural gene plus 870 bp upstream and 1,257 bp downstream of trpE was determined. The S. aurantia trpE gene codes for a polypeptide of 482 amino acid residues with a predicted molecular weight of 53,629 that showed sequence similarity to TrpE proteins from other organisms. The S. aurantia TrpE polypeptide is not more closely related to the other published spirochete TrpE sequence (that of Leptospira biflexa) than to TrpE polypeptides of other bacteria. Two additional complete open reading frames and one partial open reading frame were identified in the sequenced DNA. One of the complete open reading frames and the partial open reading frame are upstream of trpE and are encoded on the DNA strand opposite that containing trpE. The other open reading frame is downstream of trpE and on the same DNA strand as trpE. On the basis of the results of a protein sequence data base search, it appears that trpE is the only tryptophan biosynthesis gene in the sequenced DNA. This is in contrast to L. biflexa, in which trpE is separated from trpG by only 64 bp.

4.
We have determined the complete nucleotide sequence of an infectious cloned genome of ground squirrel hepatitis virus (GSHV), a nonpathogenic member of the hepadnavirus group. The genome is 3,311 base pairs long and contains the major open reading frames described for the related human and woodchuck hepatitis B viruses (HBV and WHV, respectively). These reading frames include genes for the major structural proteins (the surface and core antigens), unassigned open reading frames (A and B), the longer of which is presumed to encode the viral DNA polymerase, and an open reading frame preceding and continuous with the surface antigen gene. The arrangement of these open reading frames is similar to that encountered in the genomes of HBV and WHV: all of the reading frames are encoded on the same strand, they are positioned in the same fashion with respect to each other, and a large portion (at least 51%) of the genome can be translated in two reading frames. Comparisons of the predicted translational products of the three mammalian hepadnaviruses reveal 78% amino acid homology between the proteins of GSHV and WHV and 43% homology between those of GSHV and HBV. In addition, a perfect direct repeat of 10 to 11 base pairs, separated by ca. 46 to 223 base pairs, is present in the three mammalian viruses and in duck hepatitis B virus; the position of the repeats near the 5' termini of the two strands of virion DNA suggests a role in viral replication.

5.
The genome structure of a densovirus from a silkworm was determined by sequencing more than 85% of the complete genome DNA. This is the first report of the genome organization of an insect parvovirus deduced from the DNA sequence. In the viral genome, two large open reading frames designated 1 and 2 and one smaller open reading frame designated 3 were identified. The first two open reading frames shared the same strand, while the third was found in the complementary sequence. Computer analysis suggested that open reading frame 2 may encode all four structural proteins. The genome organization and a part of the nucleotide sequence were conserved among the insect densovirus, rodent parvoviruses, and a human dependovirus. These viruses may have diverged from a common ancestor.

6.
The bacterial DNA sequences in the GenBank database were divided into coding and noncoding regions and examined for the base-trimer distribution in every triplet frame on the sense and antisense strands. The results revealed that for the noncoding region, both strands have very similar base-trimer distributions and have no frame specificity; that is, DNA is symmetric in the noncoding region. For the coding region, on the other hand, the symmetry is broken only in the triplet framework, and we found a special triplet-frame-specific symmetry which appears when the two complementary strands of the coding region are read from their 5' ends. In addition, the following frame specificity was also observed in the distribution of stop codons on the antisense strand of the coding region. When the antisense sequences of the open reading frames (ORFs) in the database are read in the three reading frames, the same reading frame as the corresponding ORF contains a significantly larger number of long open frames without stop codons (i.e., nonstop frames [NSFs]) than expected, while the number of NSFs in the other two reading frames is similar to the expected number. That is, NSFs as well as ORFs are maintained in a frame-specific manner, and in this sense, DNA becomes symmetrical even in the coding region. These two kinds of frame-specific symmetries indicate that only an ORF and its complementary triplets are specifically recognized and maintained in DNA. We suppose that the antisense strands as well as the sense strands in the coding region may be transcribed, thereby producing various kinds of proteins corresponding to NSFs, though their amount may not be large. The presence of these proteins should have some benefits for living organisms, and therefore we propose that these proteins are upcoming enzymes having novel functions. Correspondence to: I. Urabe
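The frame-by-frame trimer counting described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' program: the function names and the 18-nt toy sequence are hypothetical, and real input would be annotated coding and noncoding regions from a sequence database.

```python
from collections import Counter

# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the antisense strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def trimer_counts(seq: str, frame: int) -> Counter:
    """Count non-overlapping base trimers starting at offset `frame` (0, 1, or 2)."""
    return Counter(seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))


def frame_table(coding_seq: str) -> dict:
    """Trimer counts for every (strand, frame) pair of one coding sequence."""
    strands = {
        "sense": coding_seq,
        "antisense": reverse_complement(coding_seq),
    }
    return {
        (name, frame): trimer_counts(strand, frame)
        for name, strand in strands.items()
        for frame in range(3)
    }


# Hypothetical toy ORF (ATG GCT GCT AAA GGT TAA), used only for illustration.
toy_orf = "ATGGCTGCTAAAGGTTAA"
table = frame_table(toy_orf)
```

Because the toy ORF length is a multiple of three, frame 0 of the antisense strand read from its 5' end pairs each triplet with the reverse complement of a sense codon (here `GCT` on the sense strand and `AGC` on the antisense strand each occur twice). Comparing such tables across many sequences is one way to look for the frame-specific symmetry the abstract reports.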

7.
8.
Mutations in fii or tolA of the fii-tolA-tolB gene cluster at 17 min on the Escherichia coli map render cells tolerant to high concentrations of the E colicins and do not allow the DNA of infecting single-stranded filamentous bacteriophages to enter the bacterial cytoplasm. The nucleotide sequence of a 1,854-base-pair DNA fragment carrying the fii region was determined. This sequence predicts three open reading frames sequentially coding for proteins of 134, 230, and 142 amino acids, followed by the potential start of the tolA gene. Oligonucleotide mutagenesis of each open reading frame and maxicell analysis demonstrated that all open reading frames are expressed in vivo. Sequence analysis of mutant fii genes identified the 230-amino acid protein as the fii gene product. Chromosomal insertion mutations were constructed in each of the two remaining open reading frames. The phenotype resulting from an insertion of the chloramphenicol gene into the gene coding for the 142-amino acid protein is identical to that of mutations in fii and tolA. This gene is located between fii and tolA, and we propose the designation of tolQRA for this cluster in which tolQ is the former fii gene and tolR is the new open reading frame. The protein products of this gene cluster play an important role in the transport of large molecules such as the E colicins and filamentous phage DNA into the bacterium.

9.
A system for the computer analysis of nucleic acid and protein sequences ("Helix") is described. The DNA sequence format is EMBL-compatible, and sequences can easily be annotated with the help of convenient menus. "Helix" also offers the following capabilities: effective alignment of gel reading data and assembly of the final sequence; simple construction of recombinant molecules "in calcular"; calculation of nucleotide and dinucleotide distributions along the sequence; searching for coding frames; calculation of the percentages of codons and amino acids in coding frames; searching for direct and inverted repeats; sequence alignment; protein secondary structure prediction; restriction mapping; and DNA-to-protein translation. "Helix" also contains programs for RNA structure prediction, homology searches throughout the EMBL bank, selection of optimal probe sequences, and promoter searching. All the programs are written in FORTRAN-77 and automatically translated into FORTRAN-4. "Helix" requires only 64 kbytes.
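As an illustration of what "looking for coding frames" can involve, here is a minimal Python sketch that reports ATG-to-stop runs in the three forward frames. It is not the Helix algorithm (which was written in FORTRAN and is not described in the abstract); the function name, the `min_codons` parameter, and the toy sequence are assumptions made for this example.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq: str, min_codons: int = 2):
    """Yield (frame, start, end) for ATG...stop runs in the three forward frames.

    `start` is the index of the A in ATG and `end` is one past the stop codon.
    `min_codons` is the minimum number of codons before the stop, ATG included.
    """
    seq = seq.upper()
    for frame in range(3):
        start = None  # index of the currently open ATG in this frame, if any
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    yield frame, start, i + 3
                start = None


# Hypothetical toy sequence containing one ORF in frame 0 and one in frame 2.
toy = "CCATGAAATAGGATGCCCGGGTGA"
orfs = list(find_orfs(toy))  # [(0, 12, 24), (2, 2, 11)]
```

Scanning the reverse complement with the same function would cover the other three frames. A production tool would also need to handle ambiguous bases and alternative start codons, which this sketch ignores.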

10.
11.
12.
The complete nucleotide sequence of the circular double-stranded DNA of the genital human papillomavirus type 6b (HPV6b) comprising 7902 bp was determined and compared with the DNA sequences of human papillomavirus type 1a (HPV1a) and bovine papillomavirus type 1 (BPV1). All major open reading frames are located on one DNA strand only. Their arrangement reveals that the genomic organization of HPV6b is similar to that of HPV1a and BPV1. The putative early region includes two large open reading frames E1 and E2 with marked amino acid sequence homologies to HPV1a and BPV1 which are flanked by several smaller frames. The internal part of E2 completely overlaps with another open reading frame E4. The putative late region contains two large open reading frames L1 and L2. The L1 amino acid sequences are highly conserved among analyzed papillomavirus types. By sequence comparison, potential promoter, splicing and polyadenylation signals can be localized in HPV6b DNA suggesting possible mechanisms of genital papillomavirus gene expression.

13.
S M Thomas, R A Lamb, R G Paterson. Cell, 1988, 54(6): 891-902
The "P" gene of the paramyxovirus SV5 encodes two known proteins, P (Mr ≈ 44,000) and V (Mr ≈ 24,000). The complete nucleotide sequence of the "P" gene has been obtained and is found to contain two open reading frames, neither of which is large enough to encode the P protein. We have shown that the P and V proteins are translated from two mRNAs that differ by the presence of two nontemplated G residues in the P mRNA. These two additional nucleotides convert the two open reading frames to one of 392 amino acids. The P and V proteins are amino coterminal and have 164 amino acids in common. The unique C terminus of V consists of a cysteine-rich region that resembles a cysteine-rich metal binding domain. An open reading frame that contains this cysteine-rich region exists in all other paramyxovirus "P" gene sequences examined, which suggests that it may have important biological significance.
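The effect of two inserted G residues on the reading frame can be shown with a toy Python sketch. The 25-nt template below is invented for illustration and is not the SV5 sequence; the DNA alphabet stands in for mRNA, and the function name and edit position are assumptions.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_start(seq: str) -> str:
    """Translate from the first ATG until the first in-frame stop codon."""
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical template: a short ORF, a G run, then a second ORF in another frame.
template = "ATGAAACCCGGGTGAACTGCATTAA"
edit_site = 12  # assumed position just after the G run

# Insert two extra G residues, as in the edited (P-like) message.
edited = template[:edit_site] + "GG" + template[edit_site:]

short_protein = translate_from_start(template)  # "MKPG"
long_protein = translate_from_start(edited)     # "MKPGGELH"
```

The two products share their amino terminus and diverge after the insertion site: the unedited message stops at the first in-frame stop codon, while the two extra nucleotides shift translation into the downstream frame and extend the product.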

14.
Genetic information stored in DNA sequences is translated into protein by linking a triplet nucleotide sequence and an amino acid. Because the frames of the triplets can be configured in three ways, a total of six polypeptides, each with a different sequence, can be produced from a single double-stranded DNA molecule. We recently developed the MolCraft system [reviewed in K. Shiba, J. Mol. Catal. B 18 (2004) xxx], which enables us to make combinatorial polymers of three peptides translated from one strand of a double-stranded DNA molecule. To explore all the information that a single double-stranded DNA molecule encodes, we have now developed a new system, La-MolCraft, in which all six reading frames encoded by both strands are combinatorially polymerized using loop-mediated isothermal amplification of DNA (LAMP) [Nucl. Acids Res. 28 (2000) E63].
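The statement that one double-stranded DNA molecule encodes six polypeptides (three frames on each strand) can be sketched in Python as a six-frame translation. This is only an illustration of the counting argument, not part of MolCraft or La-MolCraft; the function names, frame labels, and 9-nt toy sequence are assumptions.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate every complete codon of `seq`; stop codons appear as '*'."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def six_frames(seq: str) -> dict:
    """Return the translations of all six reading frames, keyed '+1'..'+3', '-1'..'-3'."""
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames


# Hypothetical toy sequence used only for illustration.
frames = six_frames("ATGGCCTAA")
# {'+1': 'MA*', '-1': 'LGH', '+2': 'WP', '-2': '*A', '+3': 'GL', '-3': 'RP'}
```

Each of the six frames gives a different amino acid sequence for the same molecule, which is the information the abstract describes combining.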

15.
An 11-kbp DNA element of unknown function interrupts the nifD gene in vegetative cells of Anabaena sp. strain PCC 7120. In developing heterocysts the nifD element excises from the chromosome via site-specific recombination between short repeat sequences that flank the element. The nucleotide sequence of the nifH-proximal half of the element was determined to elucidate the genetic potential of the element. Four open reading frames with the same relative orientation as the nifD element-encoded xisA gene were identified in the sequenced region. Each of the open reading frames was preceded by a reasonable ribosome-binding site and had biased codon utilization preferences consistent with low levels of expression. Open reading frame 3 was highly homologous with three cytochrome P-450 omega-hydroxylase proteins and showed regional homology to functionally significant domains common to the cytochrome P-450 superfamily. The sequence encoding open reading frame 2 was the most highly conserved portion of the sequenced region based on heterologous hybridization experiments with three genera of heterocystous cyanobacteria.

16.
17.
We cloned and sequenced 8.3 kb of Bacillus subtilis DNA corresponding to the flaA locus involved in flagellar biosynthesis, motility, and chemotaxis. The DNA sequence revealed the presence of 10 complete and 2 incomplete open reading frames. Comparison of the deduced amino acid sequences to data banks showed similarities of nine of the deduced products to a number of proteins of Escherichia coli and Salmonella typhimurium for which a role in flagellar functioning has been directly demonstrated. In particular, the sequence data suggest that the flaA operon codes for the M-ring protein, components of the motor switch, and the distal part of the basal-body rod. The gene order is remarkably similar to that described for region III of the enterobacterial flagellar regulon. One of the open reading frames was translated into a protein with 48% amino acid identity to S. typhimurium FliI and 29% identity to the beta subunit of E. coli ATP synthase.

18.
Isolation and nucleotide sequence determination of fusaric acid-detoxification genes are described in this paper. For screening the genes, bacteria collected from soil were positively selected in a selective medium containing fusaric acid. The capability of fusaric acid-resistant isolates to detoxify the toxin was assayed by examining the survival of tomato callus cells in culture filtrates prepared from the bacterial culture, in the presence of fusaric acid. The isolate (HY-1) showing the highest detoxification was selected and identified as Klebsiella oxytoca. Chromosomal DNA of this isolate was digested with BamHI and shotgun-cloned into fusaric acid-sensitive E. coli. The DNA fragment carrying fusaric acid-detoxification genes was further shortened by enzyme digestion, and the open reading frames in the fragment were analyzed by determining the complete nucleotide sequence of the fragment. Finally, three open reading frames were shown to be essential for expressing the detoxification of fusaric acid. These frames possessed a single promoter sequence in the upstream region of the first open reading frame. Northern blot analysis showed that these genes were polycistronically transcribed to express the fusaric acid detoxification, strongly supporting the results of DNA sequence analysis.

19.
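The scanning model and the leaky scanning model are the two main mechanisms of translation initiation for eukaryotic mRNA, but exceptions remain; for example, the mechanism of alternative translation initiation on mRNAs with a polycistronic structure is still unclear. In this study, a series of recombinant expression vectors based on the open reading frame (ORF) of GFP was constructed to transcribe polycistronic mRNAs in which AUG start codons, either out of frame or in the same reading frame, lie in different sequence contexts and are separated by different distances. The translation initiation patterns of these polycistronic mRNAs were studied by transfecting the human Bel 7402 cell line. The results show that, with out-of-frame start codons, the polycistronic mRNA can be translated into the corresponding distinct proteins, whereas with in-frame start codons, only the first of the multiple AUG codons in the GFP reading frame is functional. This suggests that while ribosomes begin translation at the first start codon, some ribosomes may continue scanning downstream and recognize downstream start codons, and that the efficiency of this alternative initiation depends mainly on factors such as the sequence context of the codons and the distance between them.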

20.
In viruses an increased coding ability is provided by overlapping genes, in which two alternative open reading frames (ORFs) may be translated to yield two distinct proteins. The identification of signature sequences in overlapping genes is a topic of particular interest, since additional out-of-frame coding regions can be nested within known genes. In this work, a novel feature peculiar to overlapping coding regions is presented. It was detected by analysis of a sample set of 21 virus genomic sequences and consisted of the repeated occurrence of a cluster of basic amino acid residues, encoded by one frame, combined with a stretch of acidic residues, encoded by the corresponding overlapping frame. A computer scan of an additional set of virus sequences demonstrated that this feature is common to several other known overlapping ORFs and led to prediction of a novel overlapping gene in hepatitis G virus (HGV). The occurrence of a bifunctional coding region in HGV was also supported by its markedly lower rate of synonymous nucleotide substitutions compared to that observed in the other gene regions of the HGV genome. Analysis of the amino acid sequence that was deduced from the putative overlapping gene revealed a high content of basic residues and the presence of a nuclear targeting signal; these characteristics suggest that a core-like protein may be expressed by this novel ORF. Received: 21 July 1999 / Accepted: 26 October 1999
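The signature described in this abstract, a basic cluster in one frame paired with an acidic stretch in the overlapping frame, can be illustrated with a small Python scan. This is a sketch under stated assumptions, not the authors' method: it compares frame 0 with the +1 frame of the same strand only, treats K and R as basic and D and E as acidic, and uses an invented 18-nt sequence; the window size and threshold are arbitrary.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
BASIC = set("KR")   # assumption: histidine is not counted as basic
ACIDIC = set("DE")


def translate(seq: str) -> str:
    """Translate every complete codon of `seq`; stop codons appear as '*'."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def charge_clusters(seq: str, window: int = 3, min_hits: int = 3) -> list:
    """Return nucleotide offsets where a window of `window` codons is basic in
    frame 0 and the same span, shifted by one nucleotide, is acidic in frame +1."""
    span = 3 * window
    hits = []
    for i in range(0, len(seq) - span, 3):
        frame0 = translate(seq[i:i + span])
        frame1 = translate(seq[i + 1:i + 1 + span])
        n_basic = sum(aa in BASIC for aa in frame0)
        n_acidic = sum(aa in ACIDIC for aa in frame1)
        if n_basic >= min_hits and n_acidic >= min_hits:
            hits.append(i)
    return hits


# Hypothetical toy sequence: AGA repeats read as Arg in frame 0 and as GAA (Glu) in frame +1.
toy = "ATGAGAAGAAGAAGATAA"
hits = charge_clusters(toy)  # [3, 6]
```

The toy case shows why such pairing can arise at the nucleotide level: a purine-rich repeat such as AGA encodes arginine in one frame and glutamate one nucleotide later. Scanning the opposite strand as well would need the reverse complement, which this sketch omits.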
