首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
We have used highly methylation tolerant host strains to clone hyper- and hypo-methylated genomic elements from different regions of the same family of long interspersed repetitive elements from human DNA, specifically the 1.8 kilobase (kb) and 1.2kb KpnI fragments from members of the L1 family of transposable elements in which respectively some 18% and 2.7% of cytosines are methylated in vivo in human spleen DNA. The consensus of the DNA sequences of the ends of 13 clones from the hypomethylated region of human L1 agreed exactly with the consensus derived previously from clones made using conventional host strains. However the sequences of 18 of our clones from the 5' end of the hypermethylated region differed significantly from the sequences of clones made using conventional hosts (P less than 0.0001). The 5' region of the 1.8kb L1 region is a CpG island which, in human somatic tissue, appears to be maintained in a highly methylated state, including methylation at sites other than CpG dinucleotides. The consensus sequence of this region also has features suggestive of a previously unrecognized open reading frame.  相似文献   

3.
It has previously been shown that the L1 family in the mouse (L1Md) contains two alternative 5' ends called the A- and F-type sequences (1,2). We show here that the F-type element is a major class of murine L1 elements and report on the details of organization of the 5' motif of these F-type elements. Although the A- and F-type 5' sequences share no detectable sequence homology the organization of an F-type 5' end is strikingly similar to that of an A-type. That is, the F-type 5' sequences consist of a tandem array of a small number of 206 bp monomers while the A-type 5' motif consists of a tandem array of 208 bp monomers. All of the A-type elements characterized to date have a truncated monomer at the 5' end of the array. Many of the F-type elements are also terminated at the 5' end by a truncated copy but unlike the A-type elements some F-type elements terminate with a monomer which is within a few nucleotides of being complete. In addition the F-type consensus sequence, in contrast to the A-type sequence, shows homology (70%) to the body of the L1Md starting at the position where the monomer joins the rest of the L1 element.  相似文献   

4.
We have identified a new family of interspersed, moderately repetitive DNA elements, termed the RSg-1 family, in the genome of the rainbow trout. Two of the elements examined here are situated upstream of sequences which code for trout nuclear proteins; a protamine gene (p101) and the clustered histone H4 gene. Sequence comparison of various RSg-1 elements indicated a high degree of nucleotide sequence homology between different members of the family. These repetitive elements exhibit well defined 3' ends which contain poly(A) segments preceded by the consensus polyadenylation signal AATAAA. Sequences flanking the 3' end of the poly(A) tract also conform to a consensus sequence. A similar sequence is also found flanking the 5' terminus of the element in the protamine clone p101, and thus may represent a target-site duplication generated upon insertion of the element into the genome. These characteristics, together with the heterogeneous nature of the 5' ends of the elements, are reminiscent of processed pseudogenes and retroposons such as the mammalian L1 family of interspersed repetitive elements.  相似文献   

5.
We have deduced the sequence of a composite long interspersed repeated DNA in primates and herein describe its relationship to a complex repeat element (L1Heg) located in the interval linking the human epsilon- and G gamma-globin genes. The main element of L1Heg is 3' truncated and interrupted by the insertion of the 3' end of a second L1 element. Transposition of L1Heg into this intergenic locus generated a 62-bp duplication of flanking sequences. In contrast, insertion of the second repeat may have been mediated by homology between donor and target sequences. The main repeat represents a novel class of abundant elements whose sequences have diverged from other rodent and primate LINES approximately 1.3 kb downstream from the 5' terminus of L1Heg. Comparison of L1Heg with the sequences of two other related L1 members revealed a complex set of rearrangements confined within a region that resembles the long terminal repeats of other types of retroposons. The boundaries of conversion-like events were defined on the basis of the clustering of nucleotide sequence variants common to two or more nonallelic 3' L1H elements. Several of these events are apparently initiated or resolved within a common 150-bp region that coincides with the 3' terminus of a pan-mammalian open reading frame. This analysis showed that concerted genetic interactions and random drift both contribute appreciably to sequence variation within this set of L1H members.  相似文献   

6.
Tnr1 (235 bp long) is a transposable element in rice. Polymerase chain reactions (PCRs) done with a primer(s) that hybridizes to terminal inverted repeat sequences (TIRs) of Tnr1 detected new Tnr1 members with one or two insertions in rice genomes. Six identified insertion sequences (Tnr4, Tnr5, Tnr11, Tnr12, Tnr13 and RIRE9) did not have extensive homology to known transposable elements, rather they had structural features characteristic of transposable elements. Tnr4 (1767 bp long) had imperfect 64-bp TIRs and appeared to generate duplication of a 9-bp sequence at the target site. However, the TIR sequences were not homologous to those of known transposable elements, indicative that Tnr4 is a new transposable element. Tnr5 (209 bp long) had imperfect 46-bp TIRs and appeared to generate duplication of sequence TTA like that of some elements of the Tourist family. Tnr11 (811 bp long) had 73-bp TIRs with significant homology to those of Tnr1 and Stowaway and appeared to generate duplication of sequence TA, indicative that Tnr11 is a transposable element of the Tnr1/Stowaway family. Tnr12 (2426 bp long) carried perfect 9-bp TIRs, which began with 5'-CACTA- -3' from both ends and appeared to generate duplication of a 3-bp target sequence, indicative that Tnr12 is a transposable element of the En/Spm family. Tnr13 (347 bp long) had 31-bp TIRs and appeared to generate duplication of an 8-bp target sequence. Two sequences, one the transposon-like element Crackle, had partial homology in the Tnr13 ends. All five insertions appear to be defective elements derived from autonomous ones encoding the transposase gene. All had characteristic tandem repeat sequences which may be recognized by transposase. The sixth insertion sequence, named RIRE9 (3852 bp long), which begins with 5'-TG- -3' and ends with 5'- -CA-3', appeared to generate duplication of a 5-bp target sequence. These and other structural features indicate that this insertion is a solo LTR (long terminal repeat) of a retrotransposon. The transposable elements described above could be identified as insertions into Tnr1, which do not deleteriously affect the growth of rice cells.  相似文献   

7.
8.
9.
L1P_MA2 is a primate-specific subfamily of L1 retrotransposons. The consensus sequence of this element differs from the canonical L1 consensus by the presence of a 3800-bp region in 5' (L1M1_5). Part of this region has been proposed to be involved in a dystrophin mutation affecting the correct splicing of the gene and causing an X-linked dilated cardiomyopathy. In consideration of the potential involvement in splicing regulation of this element and also because of its atypical structure, we investigated its evolutionary history by analyzing the inter- and intraspecific divergence of L1P_MA2 sequences in various species of primates. The resulting phylogenetic trees show long terminal branches and short basal internodes, as expected for a rapid event of diversification that occurred in the past. The phylogenetic analysis and the intraspecific divergence estimates revealed a pattern of evolution for this element similar in all primates with the exception of lemurs, thus suggesting that the major wave of expansion of L1P_MA2 in primate genomes occurred after the divergence between Prosimiae and Anthropoidea. These results clearly indicate that a phylogenetic approach is more appropriate than methods based on sequence data from a single species, when investigating time and mode of evolution of retro-elements.  相似文献   

10.
11.
The complete nucleotide sequence of L1Md-A13, a 6372 base-pair (bp) member of the L1Md repetitive family isolated from a BALB/c mouse genomic DNA library, is reported. The nucleotide sequence of 4331 bp from the 5' end of L1Md-9, which is located in the beta-globin complex of the C57BL/10 mouse, is also reported. Parsimony analysis of these sequences plus two previously reported L1Md sequences allows the determination of an ancestral L1Md sequence. Analysis of the L1Md population indicates that this ancestral sequence is likely to represent a functional L1 sequence. This ancestral sequence confirms that the length (1137 bp and 3900 bp) and relationship (14 bp overlap) of the two large open reading frames previously reported are conserved features of the L1Md family. It also allows the determination of an ancestral amino acid sequence for these two open reading frames. Full-length L1Md elements have one of two sequences tandemly repeated at the 5' end. These two monomers are called A-type and F-type. Our data define the 5' end of A-type full-length L1Md elements. L1Md elements of the A-type have varying numbers of tandemly repeated 208 bp monomers, but each element ends about 78 bp from the 5' end of the terminal 208 bp monomer.  相似文献   

12.
13.
The 5' structure of mRNA transcribed from the dihydrofolate reductase-thymidylate synthase (DHFR-TS) gene of the protozoan parasite Leishmania major has been characterized. S1 nuclease mapping identifies a heterogenous 5' structure which is not affected by growth phase or developmental stage. The DNA sequence of the 5' region of the DHFR-TS gene does not reveal homology with other trypanosomatid genes, eukaryotic consensus genetic elements, or the mammalian DHFR promoter element. This latter finding is especially significant as we show that the 5' region of the E. coli DHFR gene exhibits homology to the mammalian DHFR promoter element, despite their greater evolutionary distance.  相似文献   

14.
Retrotransposons replicate via a complex mechanism which depends on, among other things, the presence of long terminal repeats (LTRs) and a tRNA binding site just 3' of the 5' LTR. The LINES 1 (L1) family of sequences, which similar to retrotransposons in many other properties, represents a new class of retroposon which does not possess LTRs. However, we show here that the repetitive 5' motif associated with murine L1 elements contains a tRNA-like sequence in a location analogous to the position of the retro-transposon tRNA binding site. Although the repetition of such a 5' motif has only been found associated with murine L1 elements, we have found an analogous tRNA-like sequence near the 5' ends of the L1 elements from each of the other analyzed species for which the L1 family has been characterized, that is rat (L1Rr), human (L1Hs), drosophila (I element) and trypanosome (INGI). The conservation of this tRNA-like sequence near the 5' terminus of L1 elements from such diverse species suggests that it plays a functional role in the life of the L1 class of retroposon.  相似文献   

15.
16.
17.
18.
A Kudo  F Melchers 《The EMBO journal》1987,6(8):2267-2272
The murine gene lambda 5 is selectively expressed in pre-B lymphocytes. Of the three exons encoding lambda 5, exons II and III show strong homologies to immunoglobulin lambda light (L) chain gene segments, i.e. to J lambda intron and exon, and C lambda exon sequences respectively. We have now found, 4.6 kb upstream of lambda 5, another gene composed of two exons which is selectively expressed in pre-B cell lines as a 0.85 kb mRNA potentially coding for a protein of 142 amino acids including a 19 amino acid-long signal peptide. The 5' sequences of this gene show homologies to sequences encoding the variable regions of kappa and lambda L chains and of heavy (H) chains. The deduced amino acid sequence contains the consensus cysteine residues as well as other consensus amino acids at positions which characterize immunoglobulin (Ig) domains. We call the second gene VpreB. The 3' end of VpreB encoding the 26 carboxyl terminal amino acids shows no homology to any known nucleotide sequence. The putative protein encoded by VpreB is a potential candidate for association with the putative protein encoded by lambda 5, and thereby a candidate for association with H chains in pre-B cells. Southern blot analysis of DNA from liver (germ line) and 70Z/3 pre-B cell lines reveals two genes which hybridize to the VpreB gene. We call VpreB1 the gene which is found 5' of lambda 5. The other gene, called VpreB2, which has not yet been located within the genome, shows 97% nucleotide sequence homology to VpreB1 in an area of 1 kb which covers the coding region of the gene.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

19.
20.
We report the isolation of the complete genes encoding nucleolin from rat and hamster. The DNA clones were obtained from partial genomic libraries by probing with a genomic DNA fragment containing the leader and promoter regions of the mouse nucleolin gene. We have determined the complete nucleotide sequence of the 5'-terminal region for the three rodent species. The sequenced regions extend over 1 kb downstream and upstream from the cap sites and include a conserved CpG island 1500 nucleotides (nt) long. The 5' end of the CpG island in each species has maintained a long alternating purine-pyrimidine sequence which could adopt a Z-DNA conformation. By sequence comparison, 42 blocks of homology are defined in the 5'-terminal region, of which 36 appear in the CpG island and contain numerous conserved CpG dinucleotides. Two blocks, 110 and 49 nt long, encompassing the cap sites and the region immediately upstream, respectively, present features characteristic of regulated genes: a possible TATA box (ATTA), two pyrimidine-rich nucleotide stretches and two inverted juxtaposed CCAAT-like boxes (GGTTGG). Furthermore, the adjacent upstream conserved region presents features characteristic of housekeeping genes: four G/C boxes, embedded in a high G + C-content sequence, among them one presenting a perfect consensus Sp 1-binding site (GCCCCGCCCC). Among unusual features, we report numerous large G + C-rich conserved sequences located in the first intron. One of these sequences contains two G/C boxes which border a sequence presenting a dyad symmetry (GCGCACGTGCTC). Our findings shed some light on the putative role of the CpG island. We show that CpG-rich sequence motifs are under strong selective pressure over the whole 5'-terminal region and are presumably involved in regulatory mechanisms.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号