首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The complete nucleotide sequence of the circular double-stranded DNA of the genital human papillomavirus type 6b (HPV6b) comprising 7902 bp was determined and compared with the DNA sequences of human papillomavirus type 1a (HPV1a) and bovine papillomavirus type 1 (BPV1). All major open reading frames are located on one DNA strand only. Their arrangement reveals that the genomic organization of HPV6b is similar to that of HPV1a and BPV1. The putative early region includes two large open reading frames E1 and E2 with marked amino acid sequence homologies to HPV1a and BPV1 which are flanked by several smaller frames. The internal part of E2 completely overlaps with another open reading frame E4. The putative late region contains two large open reading frames L1 and L2. The L1 amino acid sequences are highly conserved among analyzed papillomavirus types. By sequence comparison, potential promoter, splicing and polyadenylation signals can be localized in HPV6b DNA suggesting possible mechanisms of genital papillomavirus gene expression.  相似文献   

2.
Genes encoding the core proteins of adenovirus type 2   总被引:7,自引:0,他引:7  
The nucleotide sequence of the HindIII-D fragment of adenovirus type 2 has been determined. The sequence, which is located between coordinates 41.8 and 51.0, covers most of the L2 cotermination family. It includes three major open translational reading frames encoding the carboxyl-terminal part of the penton base as well as the major core polypeptides V and VII. An additional minor open translational reading frame encoding a highly basic polypeptide was detected in the sequence. The L2 region has a very compact organization with very short distances between the different genes, although no overlapping coding sequences were found. The predicted amino acid sequences of core proteins V and VII reveal that they are highly basic proteins and polypeptide VII resembles the arginine-rich H4 histones in its amino acid composition, but no striking similarities are apparent at the amino acid sequence level. The candidate polypeptide encoded by the newly discovered translational reading frame contains 29% basic residues and includes a hypothetical recognition sequence for the adenovirus-encoded endopeptidase. In conjunction with previously published sequences and those reported in accompanying papers (Akusj?rvi, G., Alestr?m, P., Pettersson, M., Lager, M., J?rnvall, H., and Pettersson, U. (1984) J. Biol. Chem. 259, 13976-13979; Roberts, R. J., O'Neill, K. E., and Yen, C. E. (1984) J. Biol. Chem. 259, 13965-13975) a complete sequence can now be reconstructed for the 35,937-base pairs adenovirus type 2 genome.  相似文献   

3.
A Kato  I Sato  T Ihara  S Ueda  A Ishihama  K Hirai 《Gene》1989,84(2):399-405
The genomes of two avian herpesviruses, Marek's disease virus type 1 (MDV1) and herpesvirus of turkey (HVT), share close homology only within certain DNA regions. One such homologous region of HVT DNA was cloned and sequenced. Two open reading frames (ORFs) were found in the long unique region, ORF1 encoding the glycoprotein A (gA), and ORF2 encoding a still unidentified protein. These two HVT-ORFs are located at almost the same positions as the homologous MDV1-ORFs. The nucleotide sequence homologies between HVT and MDV1 were 73% and 68% for ORF1 and ORF2, respectively. Both the 5'- and 3'-noncoding regions, however, are less conserved. The third letter within every codon of ORF1 and ORF2 showed a mismatch of greater than 50% between the two viruses. The amino acid (aa) sequence homologies between the corresponding putative viral proteins are 83% and 80% for ORF1 (gA) and ORF2, respectively. More than 90% homology was observed in the C-terminal region of ORF1 (gA). Furthermore, the deduced aa sequences for both of the ORFs in these two viruses showed considerable homology to two adjoining genes in herpes simplex virus type 1, the glycoprotein C and UL45 genes.  相似文献   

4.
A 2830-bp segment of the mitochondrial genome of the fungus Aspergillus nidulans was sequenced and shown to contain two unidentified reading frames (URFs). These reading frames are 352 and 488 codons in length, and would specify unmodified proteins of mol. wts. 39,000 and 54,000, respectively. The derived amino acid sequences indicate that these genes are equivalent to the human mitochondrial URFs 1 and 4, with 39% amino acid homology for URF1 and 26% for URF4. Both URFs were shown by secondary structure predictions to code for predominantly beta-sheeted proteins with strong structural conservation between the fungal and human homologues. Counterparts of mammalian URFs have not previously been identified in non-mammalian genomes, and the discovery that A. nidulans possesses reading frames so closely homologous with URF1 and URF4 shows that these genes are of general functional importance in the mitochondria of diverse species.  相似文献   

5.
6.
A 6.3 kb fragment of E.coli RFL57 DNA coding for the type IV restriction-modification system Eco57I was cloned and expressed in E.coli RR1. A 5775 bp region of the cloned fragment was sequenced which contains three open reading frames (ORF). The methylase gene is 1623 bp long, corresponding to a protein of 543 amino acids (62 kDa); the endonuclease gene is 2991 bp in length (997 amino acids, 117 kDa). The two genes are transcribed convergently from different strands with their 3'-ends separated by 69 bp. The third short open reading frame (186 bp, 62 amino acids) has been identified, that precedes and overlaps by 7 nucleotides the ORF encoding the methylase. Comparison of the deduced Eco57I endonuclease and methylase amino acid sequences revealed three regions of significant similarity. Two of them resemble the conserved sequence motifs characteristic of the DNA[adenine-N6] methylases. The third one shares similarity with corresponding regions of the PaeR7I, TaqI, CviBIII, PstI, BamHI and HincII methylases. Homologs of this sequence are also found within the sequences of the PaeR7I, PstI and BamHI restriction endonucleases. This is the first example of a family of cognate restriction endonucleases and methylases sharing homologous regions. Analysis of the structural relationship suggests that the type IV enzymes represent an intermediate in the evolutionary pathway between the type III and type II enzymes.  相似文献   

7.
J A Engler  M S Hoppe  M P van Bree 《Gene》1983,21(1-2):145-159
The nucleotide sequence of a cloned DNA segment encoding the early region 2b from the group B human adenovirus Ad7 has been determined. When compared to Ad2, a group C adenovirus, these sequences were found to be approx. 80% homologous within the l-strand gene-coding regions. Most changes are transitions or transversions, although several deletions/insertions also occur within the N-terminal domain of one of the coding regions. The substantial nucleotide homology results in a high degree of amino acid conservation in the predicted polypeptides encoded by the early region 2b genes. Two major open reading frames, corresponding to the Mr 87000 and Mr 140000 polypeptides of Ad2, are found in the l strand of Ad7 between genome coordinates 28.5 to 23.1 and 13.8, respectively. The r strand of the DNA in this region encodes the three leader segments joined to the 5' end of the most late viral mRNAs, and also encodes the i-leader segment found between the second and third leaders on some mRNAs. The positions of the donor and acceptor splice sites of the three leaders are conserved and can be identified by homology to Ad2. Only two of the unidentified open reading frames (URF) in Ad2 (Gingeras et al., J. Biol. Chem., in press) can be found in Ad7. URF1, encoding an Mr 13500 polypeptide at genome coordinate 17, is predominantly conserved in nucleotide and amino acid sequence, but contains one half as many arginine amino acids as does URF1 of Ad2. URF2, encoding an Mr 13600 protein which lies within the i-leader region, is not well conserved in either nucleotide or amino acid sequence.  相似文献   

8.
A long L1 repetitive sequence (3.6 kilobase pairs) was found in the third intron of the human thymidylate synthase gene. This L1 family sequence is unique in that it possesses the longest open reading frame (1.7 kilobase pairs) of all L1 family members identified in sequences associated with specific genes that have been cloned thus far. Furthermore, the amino acid sequence deduced from the open reading frame of the L1 sequence was found to be highly homologous (90%) to that encoded by a known human teratocarcinoma L1 RNA species, and to contain several blocks of sequences homologous to ones in RNA-dependent DNA polymerases of various origins.  相似文献   

9.
M Ono  H Toh  T Miyata    T Awaya 《Journal of virology》1985,55(2):387-394
We determined the complete nucleotide sequence of the intracisternal A-particle gene, IAP-H18, cloned from the normal Syrian hamster liver DNA. IAP-H18 was 7,951 base pairs in length with two identical long terminal repeats of 376 base pairs at both ends. On the coding strand, imperfect open reading frames corresponding to gag and pol of the retrovirus genome were observed, whereas many stop codons were present in the region corresponding to env. The putative H18 gag gene (809 amino acids) had a sequence homologous to the N-terminal half of the mouse mammary tumor virus gag gene and locally to the Rous sarcoma virus gag gene. The putative H18 pol gene (900 residues) was homologous to the Rous sarcoma virus pol gene almost throughout the entire region. Two conserved regions among the retrovirus pol genes have been reported. One presumably corresponds to the DNA polymerase and the RNase H domain, and the other corresponds to the DNA endonuclease domain of the multifunctional protein pol. By the comparison of the deduced amino acid sequences of the putative endonuclease domain of six representative oncovirus genomes, a phylogenetic tree of the oncovirus genomes was constructed, and the intracisternal A-particle (type A) genome was found to be more closely related to the mouse mammary tumor virus (type B) and squirrel monkey retrovirus (type D) genomes.  相似文献   

10.
11.
The shufflon is a DNA region that undergoes complex rearrangement mediated by the product of a putative site-specific recombinase gene, rci. The DNA sequences of the shufflon region and the rci gene of IncI2 plasmid R721 were determined. The R721 shufflon consists of three invertible DNA segments that are homologous to the shufflon segments found in IncI1 plasmid R64. Structural analysis of open reading frames indicated that the R721 shufflon possibly functions as a biological switch for selecting one of the six pilV genes in which the N-terminal region is constant and the C-terminal region is variable. The R721 rci gene was shown to encode a basic protein of 374 amino acid residues.  相似文献   

12.
A chitinase gene of Bacillus circulans WL-12 was cloned into Escherichia coli by transforming HB101 cells with a recombinant plasmid composed of chromosomal DNA fragments prepared from B. circulans WL-12 and the plasmid vector pKK223-3. DNA sequencing analysis revealed that the region necessary for the normal expression of chitinase activity contained one open reading frame of 2097 base pairs which codes for the precursor of chitinase A1. The precursor of chitinase A1 contained a long signal sequence of 41 amino acids with an extremely long N-terminal hydrophilic segment of 15 amino acids. Cloned chitinase produced in E. coli had at its N terminus an additional 8 amino acids that were not found in B. circulans mature chitinase A1. The N-terminal two-thirds of the deduced amino acid sequence of chitinase A1 showed a 33% amino acid match to chitinase A of Serratia marcescens. This region of chitinase A1 is immediately followed by tandemly repeating 95-amino acid segments that are 70% homologous to each other. Statistical analysis revealed that these repeating segments are homologous to the type III homology units of fibronectin, a multifunctional extracellular matrix and plasma protein of higher eukaryotes. This observation indicates that type III homology units originated prior to the emergence of eukaryotes and may be distributed in a wide range of organisms.  相似文献   

13.
We have molecularly cloned and sequenced a portion of the simian foamy virus type 1 (SFV-1); open reading frames representing the endonuclease domain of the polymerase (pol) and the envelope (env) genes were identified by comparison with the human foamy virus (HFV). Unlike the HFV genomic organization, the SFV-1 pol gene overlaps the env gene; thus, the open reading frames reported for HFV between pol and env is not present in SFV-1. Comparisons of predicted amino acid sequences of HFV and SFV-1 reveal that the endonuclease domains of the pol genes are about 84% related. The region predicted to encode the SFV-1 extracellular env domain is 569 codons; SFV-1 and HFV have 64% amino acid similarity in this env domain. The predicted hydrophobic transmembrane env proteins of both HFV and SFV-1 show about 73% similarity. A total of 16 potential glycosylation sites are found in SFV-1 env, and 15 are found in HFV; 11 are shared. SFV-1 has 25 cysteine residues, and HFV has 23 residues; all 23 cysteine residues of HFV are conserved in SFV-1. This sequence analysis reveals that the human and simian foamy viruses are highly related.  相似文献   

14.
We determined the nucleotide sequence of a 3.5-kb region of the bovine herpesvirus 1 (BHV-1) genome which contained the complete BHV-1 homologs of the herpes simplex virus type 1 (HSV-1) UL26 and UL26.5 genes. In HSV-1, the UL26 and UL26.5 open reading frames encode scaffold proteins upon which viral capsids are assembled. The UL26-encoded protein is also a proteinase and specifically cleaves both itself and the UL26.5-encoded protein. The overall BHV-1-encoded amino acid sequence showed only 41% identity to the HSV-1 sequences and was most divergent in the regions defined to be involved in the scaffolding function. We substituted the proteins encoded by the BHV-1 homologs of the UL26 and UL26.5 open reading frames, expressed in baculovirus, for the corresponding HSV-1 proteins in an in vitro HSV-1 capsid assembly system. The proteins expressed from the BHV-1 UL26 and UL26.5 homologs facilitated the formation of hybrid type B capsids indistinguishable from those formed entirely with HSV-1-encoded proteins.  相似文献   

15.
The DNA sequence of approximately 80% of the transcribed region of the kinetoplast maxicircle DNA of Leishmania tarentolae was obtained, and structural genes were localized by comparison of the translated amino acid sequences with those of known mitochondrial genes from other organisms. By this method, the genes for cytochrome oxidase subunits I, II, and III, cytochrome b, and human mitochondrial unidentified reading frames 4 and 5 were identified. By comparing the amino acid sequences of the putative L. tarentolae genes with those of known genes, we conclude that TGA codes for tryptophan, as in most other mitochondrial systems. This is the only apparent change from the universal genetic code. The six identified structural genes show various degrees of divergence from the homologous genes in other species, with cytochrome oxidase subunit I being the most conserved and cytochrome oxidase subunit III being the least conserved. A comparison of the cytochrome b genes from L. tarentolae and Trypanosoma brucei showed that the ratio of transversions to transitions is 1:1, suggesting that these species diverged from each other more than 80 X 10(6) years ago. Several as yet unidentified open reading frames were also present in the maxicircle sequence. These data confirm that maxicircle DNA has a coding potential which typifies other mitochondrial systems.  相似文献   

16.
Approximately 10,000 nucleotides were sequenced in the oriC region of the Bacillus subtilis chromosome. The first replicating DNA strands are hybridized with a SalI-EcoRI fragment (nucleotide #1206-2954) in one direction (left to right) and an EcoRI-PstI fragment (#2949-4233) in the other. Seven open reading frames (ORF) accompanied with Shine-Dalgarno (SD) sequences were identified. ORF638 and ORF821 were identified as gyrB and gyrA genes respectively based on genetic evidences and amino acid sequence data. Comparison of amino acid sequences revealed that ORF44, ORF446, ORF378 and ORF323 are homologous with rpmH, dnaA, dnaN and recF of Escherichia coli, respectively. Thus, the organization of the ORFs from ORF44 to ORF638 resembles the organization of genes in the rpmH-gyrB region of the E. coli chromosome. Two non-coding regions characteristic for oriC signals were found near the site of initiation of the first replicating DNA. They are composed of repeating sequences whose consensus sequence TTAT(C/A)CACA is identical to that of 4 repeating sequences in the oriC of E. coli.  相似文献   

17.
18.
A 3.5-kb HindIII DNA fragment containing the secY gene of Bacillus subtilis has been cloned into plasmid pUC13 using the Escherichia coli secY gene as a probe. The complete nucleotide sequence of the cloned DNA indicated that it contained five open reading frames, and their order in the region, given by the gene product, was suggested to be L30-L15-SecY-Adk-Map by their similarity to the products of the E. coli genes. The region was similar to a part of the spc operon of the E. coli chromosome, although the genes for Adk and Map were not included. The gene product of the B. subtilis secY homologue was composed of 423 amino acids and its molecular weight was calculated to be 46,300. The distribution of hydrophobic amino acids in the gene product suggested that the protein is a membrane integrated protein with ten transmembrane segments. The total deduced amino acid sequence of the B. subtilis SecY homologue shows 41.3% homology with that of E. coli SecY, but remarkably higher homologous regions (more than 80% identity) are present in the four cytoplasmic domains.  相似文献   

19.
Identification of functional open reading frames in chloroplast genomes   总被引:7,自引:0,他引:7  
K H Wolfe  P M Sharp 《Gene》1988,66(2):215-222
We have used a rapid computer dot-matrix comparison method to identify all DNA regions which have been evolutionarily conserved between the completely sequenced chloroplast genomes of tobacco and a liverwort. Analysis of these regions reveals 74 homologous open reading frames (ORFs) which have been conserved as to length and amino acid sequence; these ORFs also have an excess of nucleotide substitutions at silent sites of codons. Since the nonfunctional parts of these genomes have become saturated with mutations and show no sequence similarity whatsoever, the homologous ORFs are almost certainly functional. A further four pairs of ORFs show homology limited to only a short part of their putative gene products. Amino acid sequence identities range between 50 and 99%; some chloroplast proteins are seen to be among the most slowly evolving of all known proteins. A search of the nucleotide and amino acid sequence databanks has revealed several previously unidentified genes in chloroplast sequences from other species, but no new homologies to prokaryotic genes.  相似文献   

20.
Abstract The P39 antigen is a specific, highly conserved, and immunogenic protein of Lyne disease spirochetes, Borrelia burgdorferi sensu lato. The nucleotide sequence of the gene encoding this protein was determined and found to be the first of two tandemly arranged open reading frames located on the spirochete's chromosome. These two open reading frames were designated bmpA for the gene encoding P39 and bmpB for the gene encoding the putative protein ORF2 encoded by the second open reading frame. The nucleic acid sequence identity for the two open reading frames was 62% while their deduced amino acid sequences were 52% identical. Comparison to sequence data bases demonstrated that the deduced amino acid sequences of both P39 and ORF2 were homologous to TmpC, a putative outer or cytoplasmic membrane lipoprotein of the syphilis spirochete, Treponema pallidum .  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号