首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
A highly repetitive long interspersed sequence from rat DNA has been isolated and partly characterized. This sequence comprises at least a 1300 base-pair and a 2400 base-pair EcoRI fragment and probably additional elements. The 2400 base-pair segment has been analyzed in detail. It appears to be part of the chromosomal DNA in rat cells. The 2400 base-pair repeat is likely to be distributed over several regions in the rat genome. The 2400 base-pair segment has been cloned, mapped for restriction sites, and part of its nucleotide sequence has been determined. The 2400 base-pair sequence is a member of a typical highly repetitive long interspersed sequence with high copy number and restriction site polymorphism. There are sequence homologies to mouse and human DNA. A striking homology has been detected to the flanking sequences of a repetitive mouse DNA sequence that has been described to be located adjacent to one of the kappa-immunoglobulin variable genes. Elements in the 2400 base-pair rat repeat are transcribed in cells from most rat organs and from several continuous rat cell lines. This RNA from rat cell lines was found polyadenylated or not polyadenylated. The nucleotide sequence of parts of the 2400 base-pair DNA segment revealed open reading frames for polypeptide sequences. Such open reading frames have been detected in two different segments of the 2400 base-pair DNA repeat. Open reading frames exist in the two complementary strands in the same DNA segment. The hypothetical polypeptide whose sequence has been determined in toto has a length of 190 amino acid residues and is enriched in hydrophobic amino acids, reminiscent of the amino acid composition in membrane proteins. Hence, it is conceivable that the 2400 base-pair repeat sequence from rat DNA, at least in part, encodes messenger RNAs that might be translated into functional proteins.  相似文献   

2.
We have determined the complete nucleotide sequence of an infectious cloned genome of ground squirrel hepatitis virus (GSHV), a nonpathogenic member of the hepadnavirus group. The genome is 3,311 base pairs long and contains the major open reading frames described for the related human and woodchuck hepatitis B viruses (HBV and WHV, respectively). These reading frames include genes for the major structural proteins (the surface and core antigens), unassigned open reading frames (A and B), the longer of which is presumed to encode the viral DNA polymerase, and an open reading frame preceding and continuous with the surface antigen gene. The arrangement of these open reading frames is similar to that encountered in the genomes of HBV and WHV: all of the reading frames are encoded on the same strand, they are positioned in the same fashion with respect to each other, and a large portion (at least 51%) of the genome can be translated in two reading frames. Comparisons of the predicted translational products of the three mammalian hepadnaviruses reveal 78% amino acid homology between the proteins of GSHV and WHV and 43% homology between those of GSHV and HBV. In addition, a perfect direct repeat of 10 to 11 base pairs, separated by ca. 46 to 223 base pairs, is present in the three mammalian viruses and in duck hepatitis B virus; the position of the repeats near the 5' termini of the two strands of virion DNA suggests a role in viral replication.  相似文献   

3.
The 78 101 base pair long sequence of a cluster of 22-kDa alpha zein genes in the maize inbred BSSS53 was determined. Each zein gene is contained within a repeat unit that varies in length. If such a repeat, or amplicon, is aligned along the entire sequence, a 10.5-fold sequence amplification is delineated. Because of insertions and deletions in intergenic regions, many of the zein genes are spaced over different distances. Only three out of 10 zein-related sequences have an intact open reading frame, indicating an unusual large number of genes unable to contribute to the accumulation of normal-size 22-kDa zein proteins. It is proposed that the seven remaining zein-related sequences be considered gene reserves because of their potential to be restored by gene conversion. Intergenic insertions in the cluster range from 1098 to 14 896 base pairs. Although they are composed of transposable element sequences, they also contain additional open reading frames, two of them showing homology to rice cDNA sequences. The average amplicon is 4423 base pairs long, with the sequence surrounding each zein gene more than 90 % conserved. Coincidently, the size of the amplicon is equivalent to the average gene density (one gene within 4640 bp) in the Arabidopsis thaliana genome, one of the smallest in plants. Successive steps of amplification and insertion of DNA might explain to a certain degree how genome size variation has been generated in plants.  相似文献   

4.
The mitochondrial genome of the peronosporomycete water mold Saprolegnia ferax has been characterized as a 46 930 bp circle containing an 8618 bp large inverted repeat (LIR). Eighteen reading frames encode identified subunits of respiratory complexes I, III, IV and V; 16 encode polypeptides of small and large mitoribosome subunits; and one encodes a subunit of the sec-independent protein translocation pathway. Of four additional putative reading frames three are homologues of those found in the related Phytophthora infestans genome. Protein encoding loci in the tightly compacted genome typically are arranged in operon-like clusters including three abutting and two overlapping pairs of reading frames. Translational RNAs include the mitochondrial small and large subunit rRNAs and 25 tRNA species. No tRNAs are encoded to enable translation of any threonine or the arginine CGR codons. The LIR separates the molecule into 19 274 bp large and 10 420 bp small single copy regions, and it encodes intact duplicate copies of four reading frames encoding known proteins, both rRNAs, and five tRNAs. Partial 3' sequences of three additional reading frames are duplicated at single copy sequence junctions. Active recombination between LIR elements generates two distinctive gene orders and uses the duplicated 3' sequences to maintain intact copies of the partially duplicated loci.  相似文献   

5.
The complete nucleotide sequence of human papillomavirus type 1a (7811 nucleotides) has been established. The overall organization of the viral genome is different from that of other related papovaviruses (SV40, BKV, polyoma). Firstly, genetic information seems to be coded by one strand. Secondly, no significant homology is found with SV40 or polyoma coding sequence for either DNA or deducted protein sequences. The relatedness of human and bovine papillomaviruses is revealed by a conserved coding sequence in the two species. Two regions can be defined on the viral genome: the putative early region contains two large open reading frames of 1446 and 966 nucleotides, together with several split ones, and corresponds to the transforming part of the bovine papillomavirus type 1 genome, and the remaining sequences, which include two open reading frames likely to encode structural polypeptide(s). The DNA sequence is analysed and putative signals for regulation of gene expression, and homologies with the Alu family of human ubiquitous repeats and the SV40 72-bp repeat are outlines.  相似文献   

6.
The proteins expressed by insertion sequence IS911, a member of the widespread IS3 family of elements, have been analyzed. The results indicate that three major species are produced from two consecutive reading frames. A protein of Mr 11,500, ORFA, is synthesized from an upstream reading frame. A larger protein, ORFAB, uses the same initiation codon and is produced by a -1 programmed translational frameshift between orfA and a downstream frame, orfB, whose amino acid sequence shows significant homology with retroviral integrase proteins. The orfB frame is also expressed independently in two alternative forms: the first uses a rare AUU initiation codon in the orfB phase whereas the second appears to initiate in the orfA phase and is produced by a -1 frameshift mechanism similar to that used in ORFAB expression. A specific IS911 integration reaction using a minimal active junction composed of 51 base-pairs of the right inverted repeat and a flanking phase lambda sequence resembling a second end in inverted orientation has been developed to analyze the functions of these proteins by transcomplementation in vivo. The orfA and orfB frames are shown to be essential and production of ORFAB is shown to stimulate integration in this system, suggesting that this fusion protein is the IS911 transposase.  相似文献   

7.
8.
L Liu  E J Hansen 《Plasmid》1999,42(2):150-153
The complete nucleotide sequence of plasmid pLQ510 from Moraxella catarrhalis strain E22 has been determined. This plasmid contained 12,082 bp with 38% GC content. Five open reading frames that encoded predicted proteins with homology to plasmid-encoded proteins from other bacteria were identified. A putative origin of replication that contained an AT-rich region followed by four direct repeats and an inverted repeat was identified.  相似文献   

9.
It is known that 5'-untranslated sequences of eukaryotic mRNA often contain AUG triplets, which may serve as the sites of translation initiation. It is thought that these leader open reading frames can fulfil the regulatory functions and encode functionally active proteins. However, they have been incompletely characterized. The article deals with the context organization of leader reading frames of eukaryotic mRNAs. It has been shown that their characteristics correlate with the position relative to the protein-encoding sequence, which may be associated with the efficiency of initiation of translation.  相似文献   

10.
We report characterisation of three copies of a novel repeat sequence isolated from a Mycobacterium bovis genomic library. The repeat occurs within open reading frames, potentially encoding a conserved tandem array of a pentapeptide sequence with the consensus X-Gly-Asn-X-Gly. The tandem array is present up to five times in M. bovis and it is proposed that they may occur in a family of genes expressing functionally related proteins. We postulate that these proteins may play a role in binding of M. bovis to host cell receptors.  相似文献   

11.
12.
We here report the complete nucleotide sequence of the 47.9 kb mitochondrial (mt) genome from the obligate aerobic yeast Yarrowia lipolytica. It encodes, all on the same strand, seven subunits of NADH: ubiquinone oxidoreductase (ND1-6, ND4L), apocytochrome b (COB), three subunits of cytochrome oxidase (COX1, 2, 3), three subunits of ATP synthetase (ATP6, 8 and 9), small and large ribosomal RNAs and an incomplete set of tRNAs. The Y. lipolytica mt genome is very similar to the Hansenula wingei mt genome, as judged from blocks of conserved gene order and from sequence homology. The extra DNA in the Y. lipolytica mt genome consists of 17 group 1 introns and stretches of A+Trich sequence, interspersed with potentially transposable GC clusters. The usual mould mt genetic code is used. Interestingly, there is no tRNA able to read CGN (arginine) codons. CGN codons could not be found in exonic open reading frames, whereas they do occur in intronic open reading frames. However, several of the intronic open reading frames have accumulated mutations and must be regarded as pseudogenes. We propose that this may have been triggered by the presence of untranslatable CGN codons. This sequence is available under EMBL Accession No. AJ307410.  相似文献   

13.
We have determined the DNA sequence (46 kilobases) of the short unique region, the short repeat, and part of the long repeat of human cytomegalovirus strain AD169. Analysis of the sequence has revealed at least 38 possible regions that may code for protein. Many of these open reading frames show homology to each other, and five groups of homologous reading frames are identified. Half of the predicted translation products appear to be membrane proteins, and fall into two distinct classes; those that have potential signal and anchor sequences, and those that have seven potential membrane-spanning regions and appear to be integral membrane proteins. A number of the former class contain sites for N-linked glycosylation and may therefore be glycoproteins. None of the 38 open reading frames shows homology to other known herpesvirus proteins.  相似文献   

14.
The flanking regions and the end of the chloroplast ribosomal unit of Chlamydomonas reinhardii have been sequenced. The upstream region of the ribosomal unit contains three open reading frames coding for 111, 117 and 124 amino acids, respectively. The latter polypeptide is partially related to the ribosomal protein L16 of E. coli. Two of the open reading frames overlap each other and are oriented in opposite direction. The region between these open reading frames and the 5' end of the 16S rRNA gene contains numerous short direct and inverted repeats which can be folded into large stem-loop structures. Sequence elements that resemble prokaryotic promoters are found in the same region. Several of the repeated elements are distributed throughout the non-coding regions of the chloroplast inverted repeat. Sequence comparison between the 5S rRNA and its gene does not reveal any significant sequence heterogeneity between the chloroplast 5S rRNA genes.  相似文献   

15.
Abstract The complete nucleotide sequence of pNS1 (3879 bp), a tetracycline-resistance (TcR) plasmid drived from staphylococcal plasmid pTP5, has been determined and compared with that of the staphylococcal TcR plasmid pT181 [6]. The nucleotide sequences of the 2 plasmids are in agreement, except for 18 nucleotides, but these differences are significant in that they give rise to new open reading frames (ORFs). A short ORF-D is found in the copy control region, and the TcR region contains a single large ORF-A, that encodes the Tet protein (50 kDa). The upstream region of ORF-A contains 3 inverted repeat sequences, which can generate structures very similar in conformation of the structure of the control region of the inducible erythromycin-resistance gene of pE194.  相似文献   

16.
We analyzed 17 B-cell lineages cloned from two patients with infectious mononucleosis and found that different B-cell lineages exhibited notable variation in the length of the fused Epstein-Barr virus (EBV) terminal region on intracellular EBV episomes. EBV termini in different B-cell clones from the same person differed by as many as 15 to 20 reiterations of the ca. 500-base-pair terminal repeat sequence. In contrast, analysis of seven B-cell lineages cloned from a patient with a fatal, oligoclonal lymphoma revealed that three of the cell clones had the same-sized EBV terminal region. These three clones had previously been shown, by immunoglobulin gene analysis, to be metastatic daughter cells descended from a common progenitor. Similarity of the EBV terminal regions in the three daughter clones suggested that EBV infected the progenitor cell before proliferation and metastasis. Individual, EBV-infected cells from a single individual showed sufficient heterogeneity in their EBV termini to allow use of terminal fragment size as a clonal marker in studies addressing the contribution of EBV to the clonal pathogenesis of tumors with which this virus has been associated.  相似文献   

17.
Unique purine-rich mRNA sequences embedded in the coding sequences of a distinct group of gammaherpesvirus maintenance proteins underlie the ability of the latently infected cell to minimize immune recognition. The Epstein-Barr virus nuclear antigen, EBNA1, a well characterized lymphocryptovirus maintenance protein has been shown to inhibit in cis antigen presentation, due in part to a large internal repeat domain encoding glycine and alanine residues (GAr) encoded by a purine-rich mRNA sequence. Recent studies have suggested that it is the purine-rich mRNA sequence of this repeat region rather than the encoded GAr polypeptide that directly inhibits EBNA1 self-synthesis and contributes to immune evasion. To test this hypothesis, we generated a series of EBNA1 internal repeat frameshift constructs and assessed their effects on cis-translation and endogenous antigen presentation. Diverse peptide sequences resulting from alternative repeat reading frames did not alleviate the translational inhibition characteristic of EBNA1 self-synthesis or the ensuing reduced surface presentation of EBNA1-specific peptide-MHC class I complexes. Human cells expressing the EBNA1 frameshift variants were also poorly recognized by antigen-specific T-cells. Furthermore, a comparative analysis of the mRNA sequences of the corresponding repeat regions of different viral maintenance homologues highlights the high degree of identity between the nucleotide sequences despite very little homology in the encoded amino acid sequences. Based on these combined observations, we propose that the cis-translational inhibitory effect of the EBNA1 internal repeat sequence operates mechanistically at the nucleotide level, potentially through RNA secondary structural elements, and is unlikely to be mediated through the GAr polypeptide. The demonstration that the EBNA1 repeat mRNA sequence and not the encoded protein sequence underlies immune evasion in this class of virus suggests a novel approach to therapeutic development through the use of anti-sense strategies or small molecules targeting EBNA1 mRNA structure.  相似文献   

18.
A repetitive element (IS986), previously isolated from Mycobacterium tuberculosis and shown to detect multiple restriction fragment-length polymorphisms (RFLPs), has been sequenced. It consists of a potential insertion sequence of 1358bp, with 30-bp inverted repeat ends. IS986 has four potentially significant open reading frames (ORFs): ORFa1, ORFa2 and ORFb on one strand and ORFc on the complementary strand. The sequences of the potential translated products identify IS986 as a member of the IS3 family, with an apparent frameshift between ORFa1 and ORFa2. IS986 has potential as a highly specific probe for detection and typing of M. tuberculosis, as well as for transposon mutagenesis of mycobacteria. The sequence of IS986 is virtually identical to that of another recently described element, IS6110 (Thierry et al., 1990).  相似文献   

19.
20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号