Similar articles
 Found 20 similar articles (search time: 15 ms)
1.
2.
K Noma, H Ohtsubo, E Ohtsubo. DNA Research 2001, 8(6): 291-299
The Arabidopsis thaliana genome has about 250 copies of LINEs (here called ATLNs). Of these, some, called ATLN-Ls, have an extra sequence of about 2 kb in the region downstream of two consecutive open reading frames, orf1 and orf2. Interestingly, the extra sequences in these ATLN-L members have another open reading frame, designated as orf3. Each member is flanked by direct repeats of a target site sequence, showing that ATLN-L members with the three open reading frames have retrotransposed as a unit. The ATLN-L members are also distinct from other ATLN members: orf1 terminates with TAA (or TAG) and is located in the same frame as orf2, and the ATG initiation codon of orf2 is not present in the proximal region. A sequence that may form a pseudoknot structure in ATLN-L mRNA was present in the proximal region of orf2; therefore, the TAA (or TAG) termination codon of orf1 is assumed to be suppressed to produce an Orf1-Orf2 transframe protein during the translation of the ATLN-L mRNA. The region between orf2 and orf3 is several hundred bp long, suggesting that orf3 expression is independent of orf1-orf2. The amino acid sequences of the proteins Orf1 and Orf3 are highly homologous in their N-terminal half regions that have a retroviral zinc-finger motif for RNA binding. Orf3, however, has a leucine-zipper motif in addition to the zinc-finger motif. The C-terminal regions of the Orf1 and Orf3 proteins have poor homology, but seem to have nuclear localization signals, suggesting that these proteins are involved in the transfer of ATLN-L mRNA to nuclei. A phylogenetic tree shows that Orf3 proteins form a branch distinct from the branches of the Orf1 proteins encoded by ATLN-L members. This indicates that an ancestor element of ATLN-Ls has incorporated the orf1 frame carried by another ATLN member into its distal region to orf1-orf2 during evolution.

3.
4.
5.
A computer-aided homology search of databases found that the nucleotide sequences flanking ATLN44, a non-LTR retrotransposon (LINE) from Arabidopsis thaliana, are repeated in the A. thaliana genome. These sequences are homologous to flanking sequences of 664 bp with terminal inverted repeat sequences of about 70 bp. The 664-bp sequence and most of the 14 homologues identified were flanked by direct repeat sequences of 9 bp. These findings indicate that the repeated sequence, named Tnat1, is a transposable element that duplicates a 9-bp sequence at the target site on transposition and that ATLN44 is inserted in one Tnat1 member. Interestingly, all of the Tnat1 members had tandem repeats composed of several units of a 60-bp sequence, the number of repeats differing among Tnat1 members. Of the Tnat1 members identified, one was inserted into another sequence repeated in the A. thaliana genome: that sequence is about 770 bp long and has terminal inverted repeat sequences of about 110 bp. The sequence is flanked by direct repeats of a 9-bp sequence, indicating that it is another transposable element, named Tnat2, from A. thaliana. Moreover, Tnat2 members had a tandem repeat about 240 bp long. Tnat1 and Tnat2 with tandem repeats in their internal regions show no homology to each other or to any of the elements identified previously; therefore they appear to be novel transposable elements.
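The two structural signatures described in this abstract (terminal inverted repeats at the element ends and a 9-bp direct repeat of the target site on both flanks) can be checked with a few lines of code. The following is only a minimal sketch, not the authors' method: the function names, the exact-match default, and the coordinate convention (`locus[start:end]` is the element) are all assumptions made for illustration.

```python
# Hypothetical helpers; names and conventions are illustrative assumptions.

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def has_terminal_inverted_repeats(element: str, tir_len: int,
                                  max_mismatches: int = 0) -> bool:
    """True if the first tir_len bases match the reverse complement of the
    last tir_len bases, allowing up to max_mismatches differences."""
    left = element[:tir_len].upper()
    right_rc = reverse_complement(element[-tir_len:])
    mismatches = sum(a != b for a, b in zip(left, right_rc))
    return mismatches <= max_mismatches


def has_target_site_duplication(locus: str, start: int, end: int,
                                tsd_len: int = 9) -> bool:
    """True if the tsd_len bases immediately upstream of locus[start:end]
    are identical to the tsd_len bases immediately downstream."""
    if start < tsd_len or end + tsd_len > len(locus):
        return False  # not enough flanking sequence to compare
    upstream = locus[start - tsd_len:start].upper()
    downstream = locus[end:end + tsd_len].upper()
    return upstream == downstream
```

Real elements usually carry imperfect inverted repeats, so in practice `max_mismatches` would be set above zero or an alignment-based comparison would be used instead.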

6.
7.
Long interspersed elements (LINEs) are transposable elements that proliferate within eukaryotic genomes, having a large impact on eukaryotic genome evolution. LINEs mobilize via a process called retrotransposition. Although the role of the LINE-encoded protein(s) in retrotransposition has been extensively investigated, the participation of host-encoded factors in retrotransposition remains unclear. To address this issue, we examined retrotransposition frequencies of two structurally different LINEs (zebrafish ZfL2-2 and human L1) in knockout chicken DT40 cell lines deficient in genes involved in the non-homologous end-joining (NHEJ) repair of DNA and in human HeLa cells treated with a drug that inhibits NHEJ. Deficiencies of NHEJ proteins decreased retrotransposition frequencies of both LINEs in these cells, suggesting that NHEJ is involved in LINE retrotransposition. More precise characterization of ZfL2-2 insertions in DT40 cells permitted us to consider the possibility of dual roles for NHEJ in LINE retrotransposition, namely to ensure efficient integration of LINEs and to restrict their full-length formation.

8.
9.
The preTa subfamily of long interspersed elements (LINEs) is characterized by a three base-pair "ACG" sequence in the 3' untranslated region and contains approximately 400 members in the human genome. It has a low level of nucleotide divergence, with an estimated average age of 2.34 million years, suggesting that expansion of the L1 preTa subfamily occurred just after the divergence of humans and African apes. We have identified 362 preTa L1 elements from the draft human genomic sequence, investigated the genomic characteristics of preTa L1 insertions, and screened individual elements across diverse human populations and various non-human primate species using polymerase chain reaction (PCR) assays to determine the phylogenetic origin and levels of human genomic diversity associated with the L1 elements. All of the preTa L1 elements analyzed by PCR were absent from the orthologous positions in non-human primate genomes, with 33 (14%) of the L1 elements being polymorphic with respect to insertion presence or absence in the human genome. The newly identified L1 insertion polymorphisms will prove useful as identical-by-descent genetic markers for the study of human population genetics. We provide evidence that preTa L1 elements show an integration site preference for genomic regions with low GC content. Computational analysis of the preTa L1 elements revealed that 29% of the elements amenable to complete sequence analysis have apparently escaped 5' truncation and are essentially full-length (approximately 6 kb). In all, 29 have two intact open reading frames and may be capable of retrotransposition.

10.
Some previously unidentified short interspersed repetitive elements (SINEs) and long interspersed repetitive elements (LINEs) were isolated from various higher elasmobranchs (sharks, skates, and rays) and characterized. These SINEs, members of the HE1 SINE family, were tRNA-derived and were widespread in higher elasmobranchs. The 3'-tail region of this SINE family was strongly conserved among elasmobranchs. The LINEs, members of the HER1 LINE family, encoded an amino acid sequence similar to that encoded by the chicken CR1 LINE family, and they contained a strongly conserved 3'-tail region in the 3' untranslated region. This tail region of the HER1 LINE family was almost identical to that of the HE1 SINE family. Thus, the HE1 SINE family and the HER1 LINE family provide a clear example of a pair of SINEs and LINEs that share the same tail region. Conservation of the secondary structures of the tail regions, as well as of the nucleotide sequences, between the HE1 SINE family and HER1 LINE family during evolution suggests that SINEs utilize the enzymatic machinery for retroposition of LINEs through the recognition of higher-order structures of the conserved 3'-tail region. A discussion is presented of the parasitism of SINEs on LINEs during the evolution of these retroposons.

11.
12.
13.
The Ta (transcribed, subset a) subfamily of L1 LINEs (long interspersed elements) is characterized by a 3-bp ACA sequence in the 3' untranslated region and contains approximately 520 members in the human genome. Here, we have extracted 468 Ta L1Hs (L1 human specific) elements from the draft human genomic sequence and screened individual elements using polymerase-chain-reaction (PCR) assays to determine their phylogenetic origin and levels of human genomic diversity. One hundred twenty-four of the elements amenable to complete sequence analysis were full length (approximately 6 kb) and have apparently escaped any 5' truncation. Forty-four of these full-length elements have two intact open reading frames and may be capable of retrotransposition. Sequence analysis of the Ta L1 elements showed a low level of nucleotide divergence with an estimated age of 1.99 million years, suggesting that expansion of the L1 Ta subfamily occurred after the divergence of humans and African apes. A total of 262 Ta L1 elements were screened with PCR-based assays to determine their phylogenetic origin and the level of human genomic variation associated with each element. All of the Ta L1 elements analyzed by PCR were absent from the orthologous positions in nonhuman primate genomes, except for a single element (L1HS72) that was also present in the common (Pan troglodytes) and pygmy (P. paniscus) chimpanzee genomes. Sequence analysis revealed that this single exception is the product of a gene conversion event involving an older preexisting L1 element. One hundred fifteen (45%) of the Ta L1 elements were polymorphic with respect to insertion presence or absence and will serve as identical-by-descent markers for the study of human evolution.

14.
The 5' and 3' untranslated regions of eukaryotic mRNAs may play a crucial role in the regulation of gene expression, controlling mRNA localization, stability and translational efficiency. For this reason we developed UTRdb, a specialized database of 5' and 3' untranslated sequences of eukaryotic mRNAs cleaned of redundancy. UTRdb entries are enriched with specialized information not present in the primary databases, including the presence of nucleotide sequence patterns already demonstrated by experimental analysis to have some functional role. All these patterns have been collected in the UTRsite database so that it is possible to search any input sequence for the presence of annotated functional motifs. Furthermore, UTRdb entries have been annotated for the presence of repetitive elements. All internet resources implemented for retrieval and functional analysis of 5' and 3' untranslated regions of eukaryotic mRNAs are accessible at http://bigarea.area.ba.cnr.it:8000/EmbIT/UTRHome/

15.
16.
17.
The genomic RNAs of flaviviruses such as dengue virus (DEN) have a 5' m7GpppN cap like those of cellular mRNAs but lack a 3' poly(A) tail. We have studied the contributions to translational expression of 5'- and 3'-terminal regions of the DEN serotype 2 genome by using luciferase reporter mRNAs transfected into Vero cells. DCLD RNA contained the entire DEN 5' and 3' untranslated regions (UTRs), as well as the first 36 codons of the capsid coding region fused to the luciferase reporter gene. Capped DCLD RNA was as efficiently translated in Vero cells as capped GLGpA RNA, a reporter with UTRs from the highly expressed alpha-globin mRNA and a 72-residue poly(A) tail. Analogous reporter RNAs with regulatory sequences from West Nile and Sindbis viruses were also strongly expressed. Although capped DCLD RNA was expressed much more efficiently than its uncapped form, uncapped DCLD RNA was translated 6 to 12 times more efficiently than uncapped RNAs with UTRs from globin mRNA. The 5' cap and DEN 3' UTR were the main sources of the translational efficiency of DCLD RNA, and they acted synergistically in enhancing translation. The DEN 3' UTR increased mRNA stability, although this effect was considerably weaker than the enhancement of translational efficiency. The DEN 3' UTR thus has translational regulatory properties similar to those of a poly(A) tail. Its translation-enhancing effect was observed for RNAs with globin or DEN 5' sequences, indicating no codependency between viral 5' and 3' sequences. Deletion studies showed that translational enhancement provided by the DEN 3' UTR is attributable to the cumulative contributions of several conserved elements, as well as a nonconserved domain adjacent to the stop codon. One of the conserved elements was the conserved sequence (CS) CS1 that is complementary to cCS1 present in the 5' end of the DEN polyprotein open reading frame. Complementarity between CS1 and cCS1 was not required for efficient translation.

18.
The complete nucleotide sequence of pRGO1, a cryptic plasmid from Propionibacterium acidipropionici E214, was determined. pRGO1 is 6,868 bp long, and its G+C content is 65.0%. Frame analysis of the sequence revealed six open reading frames, which were designated Orf1 to Orf6. The deduced amino acid sequences of Orf1 and Orf2 showed extensive similarities to an initiator of plasmid replication, the Rep protein, of various plasmids of gram-positive bacteria. The amino acid sequence of the putative translation product of orf3 exhibited a high degree of similarity to the amino acid sequences of DNA invertase in several bacteria. For the putative translation products of orf4, orf5, and orf6, on the other hand, no homologous sequences were found. The function of these open reading frames was studied by deletion analysis. A shuttle vector, pPK705, was constructed for shuttling between Escherichia coli and a Propionibacterium strain containing orf1 (repA), orf2 (repB), orf5, and orf6 from pRGO1, pUC18, and the hygromycin B resistance gene as a drug marker. Shuttle vector pPK705 successfully transformed Propionibacterium freudenreichii subsp. shermanii IFO12426 by electroporation at an efficiency of 8 x 10^6 CFU/µg of DNA under optimized conditions. Transformation of various species of propionibacteria with pPK705 was also performed at efficiencies of about 10^4 to 10^7 CFU/µg of DNA. The vector was stably maintained in strains of P. freudenreichii subsp. shermanii, P. freudenreichii, P. pentosaceum, and P. freudenreichii subsp. freudenreichii grown under nonselective conditions. Successful manipulation of a host-vector system in propionibacteria should facilitate genetic studies and lead to creation of genes that are useful industrially.
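The G+C content reported for pRGO1 is the percentage of bases that are G or C. A minimal sketch of that calculation follows; the function name `gc_content` and the decision to ignore ambiguous bases such as N are assumptions made here, not details taken from the abstract.

```python
def gc_content(seq: str) -> float:
    """Return the G+C percentage of a DNA sequence.

    Only unambiguous bases (A, C, G, T) are counted; anything else,
    such as N, is ignored (an illustrative choice).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)
```

Applied to the full 6,868-bp plasmid sequence, a function like this would be expected to return a value close to the 65.0% stated above.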

19.
20.
We have determined the nucleotide sequence of core histone genes and flanking regions from two of approximately 11 different genomic histone clusters of the nematode Caenorhabditis elegans. Four histone genes from one cluster (H3, H4, H2B, H2A) and two histone genes from another (H4 and H2A) were analyzed. The predicted amino acid sequences of the two H4 and H2A proteins from the two clusters are identical, whereas the nucleotide sequences of the genes have diverged 9% (H2A) and 12% (H4). Flanking sequences, which are mostly not similar, were compared to identify putative regulatory elements. A conserved sequence of 34 base-pairs is present 19 to 42 nucleotides 3' of the termination codon of all the genes. Within the conserved sequence is a 16-base dyad sequence homologous to the one typically found at the 3' end of histone genes from higher eukaryotes. The C. elegans core histone genes are organized as divergently transcribed pairs of H3-H4 and H2A-H2B and contain 5' conserved sequence elements in the shared spacer regions. One of the sequence elements, 5' CTCCNCCTNCCCACCNCANA 3', is located immediately upstream from the canonical TATA homology of each gene. Another sequence element, 5' CTGCGGGGACACATNT 3', is present in the spacer of each heterotypic pair. These two 5' conserved sequences are not present in the promoter region of histone genes from other organisms, where 5' conserved sequences are usually different for each histone class. They are also not found in non-histone genes of C. elegans. These putative regulatory sequences of C. elegans core histone genes are similar to the regulatory elements of both higher and lower eukaryotes. The coding regions of the genes and the 3' regulatory sequences are similar to those of higher eukaryotes, whereas the presence of common 5' sequence elements upstream from genes of different histone classes is similar to histone promoter elements in yeast.
