首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
B Chénais  A Caruso  S Hiard  N Casse 《Gene》2012,509(1):7-15
Transposable elements (TEs) are present in roughly all genomes. These mobile DNA sequences are able to invade genomes and their impact on genome evolution is substantial. The mobility of TEs can induce the appearance of deleterious mutations, gene disruption and chromosome rearrangements, but transposition activity also has positive aspects and the mutational activities of TEs contribute to the genetic diversity of organisms. This short review aims to give a brief overview of the impact TEs may have on animal and plant genome structure and expression, and the relationship between TEs and the stress response of organisms, including insecticide resistance.  相似文献   

3.
Thomas J  Sorourian M  Ray D  Baker RJ  Pritham EJ 《Gene》2011,474(1-2):52-58
Transposable elements (TEs) have the unique ability to move and replicate within the genome and therefore engender dramatic changes to genome architecture. Among different types of TEs, rolling-circle transposons (Helitrons) are well known for their ability to capture and amplify host gene fragments. Bioinformatic analysis revealed that Helitrons constitute ~3% of the Myotis lucifugus, (little brown bat) genome, while no Helitrons were found in any of the other 44+ sequenced mammalian genomes. Recently horizontal transfer has been implicated for some of the M. lucifugus Helitrons, in part explaining this disparate distribution among mammals. The purpose of this work is to determine both the distribution of Helitrons among bats and to estimate the number of independent invasions. We employed a combination of in silico, PCR and hybridization based techniques to identify Helitrons from diverse bat species belonging to ten different families. This work reveals that Helitrons invaded the vesper bat lineage, at least once. Indeed, Helitrons were not identified in the sister taxa 'Miniopterus', which suggests that the amplification of Helibat occurred (30-36 mya) only in the vesper bat lineage. The estimated age of amplification of the Helibats and the rapid radiation of vesper bats are roughly coincidental and suggest that the invasion and amplification of these elements might have influenced their evolutionary trajectory potentially contributing to phenotypic and genotypic diversity.  相似文献   

4.
5.

Background

Enterococcus faecalis and Enterococcus faecium are typical enterococcal bacterial pathogens. Antibiotic resistance means that the identification of novel E. faecalis and E. faecium phages against antibiotic-resistant Enterococcus have an important impact on public health. In this study, the E. faecalis phage IME-EF4, E. faecium phage IME-EFm1, and both their hosts were antibiotic resistant. To characterize the genome termini of these two phages, a termini analysis theory was developed to provide a wealth of terminal sequence information directly, using only high-throughput sequencing (HTS) read frequency statistics.

Results

The complete genome sequences of phages IME-EF4 and IME-EFm1 were determined, and our termini analysis theory was used to determine the genome termini of these two phages. Results showed 9 bp 3′ protruding cohesive ends in both IME-EF4 and IME-EFm1 genomes by analyzing frequencies of HTS reads. For the positive strands of their genomes, the 9 nt 3′ protruding cohesive ends are 5′-TCATCACCG-3′ (IME-EF4) and 5′-GGGTCAGCG-3′ (IME-EFm1). Further experiments confirmed these results. These experiments included mega-primer polymerase chain reaction sequencing, terminal run-off sequencing, and adaptor ligation followed by run-off sequencing.

Conclusion

Using this termini analysis theory, the termini of two newly isolated antibiotic-resistant Enterococcus phages, IME-EF4 and IME-EFm1, were identified as the byproduct of HTS. Molecular biology experiments confirmed the identification. Because it does not require time-consuming wet lab termini analysis experiments, the termini analysis theory is a fast and easy means of identifying phage DNA genome termini using HTS read frequency statistics alone. It may aid understanding of phage DNA packaging.  相似文献   

6.
Tempel S  Nicolas J  El Amrani A  Couée I 《Gene》2007,403(1-2):18-28
Helitrons are a class of prolific transposable elements in the Arabidopsis thaliana genome. Although 37 families were identified after the recent discovery of Helitrons, no systematic classification is available because of the high variability of helitronic sequences. Since transposition proteins are assumed to interact with Helitron termini, a Helitron model was formalized based on terminus characterization in order to carry out an exhaustive analysis of all possible combinations of the pairs of termini present. This combinatorics approach resulted in the discovery of a number of new Helitron elements corresponding to termini associations from distinct previously-described Helitron families. The occurrence matrix of termini combinations yielded a structure that revealed clusters of Helitron families.  相似文献   

7.
Transposable elements (TEs) constitute a substantial fraction of the eukaryotic genome and, as a result, have a complex relationship with their host that is both adversarial and dependent. To minimize damage to cellular genes, TEs possess mechanisms that target integration to sequences of low importance. However, the retrotransposon Tf1 of Schizosaccharomyces pombe integrates with a surprising bias for promoter sequences of stress-response genes. The clustering of integration in specific promoters suggests that Tf1 possesses a targeting mechanism that is important for evolutionary adaptation to changes in environment. We report here that Sap1, an essential DNA-binding protein, plays an important role in Tf1 integration. A mutation in Sap1 resulted in a 10-fold drop in Tf1 transposition, and measures of transposon intermediates support the argument that the defect occurred in the process of integration. Published ChIP-Seq data on Sap1 binding combined with high-density maps of Tf1 integration that measure independent insertions at single-nucleotide positions show that 73.4% of all integration occurs at genomic sequences bound by Sap1. This represents high selectivity because Sap1 binds just 6.8% of the genome. A genome-wide analysis of promoter sequences revealed that Sap1 binding and amounts of integration correlate strongly. More important, an alignment of the DNA-binding motif of Sap1 revealed integration clustered on both sides of the motif and showed high levels specifically at positions +19 and −9. These data indicate that Sap1 contributes to the efficiency and position of Tf1 integration.  相似文献   

8.
9.

Background

Transposable elements (TEs) are DNA sequences that are able to move from their location in the genome by cutting or copying themselves to another locus. As such, they are increasingly recognized as impacting all aspects of genome function. With the dramatic reduction in cost of DNA sequencing, it is now possible to resequence whole genomes in order to systematically characterize novel TE mobilization in a particular individual. However, this task is made difficult by the inherently repetitive nature of TE sequences, which in some eukaryotes compose over half of the genome sequence. Currently, only a few software tools dedicated to the detection of TE mobilization using next-generation-sequencing are described in the literature. They often target specific TEs for which annotation is available, and are only able to identify families of closely related TEs, rather than individual elements.

Results

We present TE-Tracker, a general and accurate computational method for the de-novo detection of germ line TE mobilization from re-sequenced genomes, as well as the identification of both their source and destination sequences. We compare our method with the two classes of existing software: specialized TE-detection tools and generic structural variant (SV) detection tools. We show that TE-Tracker, while working independently of any prior annotation, bridges the gap between these two approaches in terms of detection power. Indeed, its positive predictive value (PPV) is comparable to that of dedicated TE software while its sensitivity is typical of a generic SV detection tool. TE-Tracker demonstrates the benefit of adopting an annotation-independent, de novo approach for the detection of TE mobilization events. We use TE-Tracker to provide a comprehensive view of transposition events induced by loss of DNA methylation in Arabidopsis. TE-Tracker is freely available at http://www.genoscope.cns.fr/TE-Tracker.

Conclusions

We show that TE-Tracker accurately detects both the source and destination of novel transposition events in re-sequenced genomes. Moreover, TE-Tracker is able to detect all potential donor sequences for a given insertion, and can identify the correct one among them. Furthermore, TE-Tracker produces significantly fewer false positives than common SV detection programs, thus greatly facilitating the detection and analysis of TE mobilization events.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0377-z) contains supplementary material, which is available to authorized users.  相似文献   

10.
Throughout evolution, eukaryotic genomes have been invaded by transposable elements (TEs). Little is known about the factors leading to genomic proliferation of TEs, their preferred integration sites and the molecular mechanisms underlying their insertion. We analyzed hundreds of thousands nested TEs in the human genome, i.e. insertions of TEs into existing ones. We first discovered that most TEs insert within specific ‘hotspots’ along the targeted TE. In particular, retrotransposed Alu elements contain a non-canonical single nucleotide hotspot for insertion of other Alu sequences. We next devised a method for identification of integration sequence motifs of inserted TEs that are conserved within the targeted TEs. This method revealed novel sequences motifs characterizing insertions of various important TE families: Alu, hAT, ERV1 and MaLR. Finally, we performed a global assessment to determine the extent to which young TEs tend to nest within older transposed elements and identified a 4-fold higher tendency of TEs to insert into existing TEs than to insert within non-TE intergenic regions. Our analysis demonstrates that TEs are highly biased to insert within certain TEs, in specific orientations and within specific targeted TE positions. TE nesting events also reveal new characteristics of the molecular mechanisms underlying transposition.  相似文献   

11.

Background and Aims

Although monocotyledonous plants comprise one of the two major groups of angiosperms and include >65 000 species, comprehensive genome analysis has been focused mainly on the Poaceae (grass) family. Due to this bias, most of the conclusions that have been drawn for monocot genome evolution are based on grasses. It is not known whether these conclusions apply to many other monocots.

Methods

To extend our understanding of genome evolution in the monocots, Asparagales genomic sequence data were acquired and the structural properties of asparagus and onion genomes were analysed. Specifically, several available onion and asparagus bacterial artificial chromosomes (BACs) with contig sizes >35 kb were annotated and analysed, with a particular focus on the characterization of long terminal repeat (LTR) retrotransposons.

Key Results

The results reveal that LTR retrotransposons are the major components of the onion and garden asparagus genomes. These elements are mostly intact (i.e. with two LTRs), have mainly inserted within the past 6 million years and are piled up into nested structures. Analysis of shotgun genomic sequence data and the observation of two copies for some transposable elements (TEs) in annotated BACs indicates that some families have become particularly abundant, as high as 4–5 % (asparagus) or 3–4 % (onion) of the genome for the most abundant families, as also seen in large grass genomes such as wheat and maize.

Conclusions

Although previous annotations of contiguous genomic sequences have suggested that LTR retrotransposons were highly fragmented in these two Asparagales genomes, the results presented here show that this was largely due to the methodology used. In contrast, this current work indicates an ensemble of genomic features similar to those observed in the Poaceae.  相似文献   

12.
For large regions of many proteins, and even entire proteins, no homology to known domains or proteins can be detected. These sequences are often referred to as orphans. Surprisingly, it has been reported that the large number of orphans is sustained in spite of a rapid increase of available genomic sequences. However, it is believed that de novo creation of coding sequences is rare in comparison to mechanisms such as domain shuffling and gene duplication; hence, most sequences should have homologs in other genomes.To investigate this, the sequences of 19 complete fungi genomes were compared. By using the phylogenetic relationship between these genomes, we could identify potentially de novo created orphans in Saccharomyces cerevisiae. We found that only a small fraction, < 2%, of the S. cerevisiae proteome is orphan, which confirms that de novo creation of coding sequences is indeed rare. Furthermore, we found it necessary to compare the most closely related species to distinguish between de novo created sequences and rapidly evolving sequences where homologs are present but cannot be detected.Next, the orphan proteins (OPs) and orphan domains (ODs) were characterized. First, it was observed that both OPs and ODs are short. In addition, at least some of the OPs have been shown to be functional in experimental assays, showing that they are not pseudogenes. Furthermore, in contrast to what has been reported before and what is seen for older orphans, S. cerevisiae specific ODs and proteins are not more disordered than other proteins. This might indicate that many of the older, and earlier classified, orphans indeed are fast-evolving sequences. Finally, > 90% of the detected ODs are located at the protein termini, which suggests that these orphans could have been created by mutations that have affected the start or stop codons.  相似文献   

13.
Transposable elements (TEs) have been identified in every organism in which they have been looked for. The sequencing of large genomes, such as the human genome and those of Drosophila, Arabidopsis, Caenorhabditis, has also shown that they are a major constituent of these genomes, accounting for 15% of the genome of Drosophila, 45% of the human genome, and more than 70% in some plants and amphibians. Compared with the 1% of genomic DNA dedicated to protein-coding sequences in the human genome, this has prompted various researchers to suggest that the TEs and the other repetitive sequences that constitute the so-called "noncoding DNA", are where the most stimulating discoveries will be made in the future (Bromham, 2002). We are therefore getting further and further from the original idea that this DNA was simply "junk DNA", that owed its presence in the genome entirely to its capacity for selfish transposition. Our understanding of the structures of TEs, their distribution along the genomes, their sequence and insertion polymorphisms within genomes, and within and between populations and species, their impact on genes and on the regulatory mechanisms of genetic expression, their effects on exon shuffling and other phenomena that reshape the genome, and their impact on genome size has increased dramatically in recent years. This leads to a more general picture of the impact of TEs on genomes, though many copies are still mainly selfish or junk DNA. In this review we focus mainly on discoveries made in Drosophila, but we also use information about other genomes when this helps to elucidate the general processes involved in the organization, plasticity, and evolution of genomes.  相似文献   

14.
We have previously characterized several genomic rearrangements of Epstein-Barr virus (EBV) DNA contained in one of the defective EBV genomes harbored by the P3HR-1 (HR-1) line (H. B. Jenson, M. S. Rabson, and G. Miller, J. Virol. 58:475-486, 1986). One recombinant clone of heterogeneous DNA (het DNA) from this defective genome is an EcoRI fragment of 16 kilobase pairs (kbp) which is a palindrome. DNA digestion fragments specific for the center of this palindrome were present in cells which contained het DNA but not in cells which lacked het DNA. Thus, the palindrome was not an artifact of DNA cloning. The organization of the center of this palindrome was studied by DNA sequencing. The comparable region of the parental HR-1 genome was also studied by DNA sequencing. The central 3,495 base pairs (bp) of the palindrome were composed of sequences derived exclusively from internal repeat 1 of EBV, represented by BamHI W fragment. At each end of the central 3,495 hp was a symmetrical recombination with sequences of BamHI-Z, located more than 50 kbp away on the standard EBV genome. The central 3,495 bp were composed of an unduplicated 341 bp flanked by two perfect palindromic repeats of 1,577 bp. The 341-bp unique region was a portion of a 387-bp region of standard HR-1 BamHI-W which was identical to the central 387 bp of the palindrome. This central 387-bp region contained numerous stretches of dyad symmetry capable of forming a large stem-and-loop structure. The palindromic rearrangement had created two novel open reading frames in het DNA derived from standard HR-1 BamHI-W sequences. These two het DNA open reading frames had different amino termini but identical carboxy termini derived from the large open reading frame in standard HR-1 BamHI-W (HR-1 BWRF1). The BamHI-W sequences found in het DNA did not include either the TATA box of standard HR-1 BamHI-W or the exons which are present in the potentially polycistronic latent mRNAs encoding EBV nuclear antigens. These marked alterations in genomic structure may relate to the unique biologic properties of virus stocks containing het DNA by creation of new polypeptides or by formation or deletion of regulatory or functional signals.  相似文献   

15.
The maize genome contains a helitron insertion   总被引:8,自引:0,他引:8       下载免费PDF全文
The maize mutation sh2-7527 was isolated in a conventional maize breeding program in the 1970s. Although the mutant contains foreign sequences within the gene, the mutation is not attributable to an interchromosomal exchange or to a chromosomal inversion. Hence, the mutation was caused by an insertion. Sequences at the two Sh2 borders have not been scrambled or mutated, suggesting that the insertion is not caused by a catastrophic reshuffling of the maize genome. The insertion is large, at least 12 kb, and is highly repetitive in maize. As judged by hybridization, sorghum contains only one or a few copies of the element, whereas no hybridization was seen to the Arabidopsis genome. The insertion acts from a distance to alter the splicing of the sh2 pre-mRNA. Three distinct intron-bearing maize genes were found in the insertion. Of most significance, the insertion bears striking similarity to the recently described DNA helicase-bearing transposable elements termed HELITRONS: Like Helitrons, the inserted sequence of sh2-7527 is large, lacks terminal repeats, does not duplicate host sequences, and was inserted between a host dinucleotide AT. Like Helitrons, the maize element contains 5' TC and 3' CTRR termini as well as two short palindromic sequences near the 3' terminus that potentially can form a 20-bp hairpin. Although the maize element lacks sequence information for a DNA helicase, it does contain four exons with similarity to a plant DEAD box RNA helicase. A second Helitron insertion was found in the maize genomic database. These data strongly suggest an active Helitron in the present-day maize genome.  相似文献   

16.
真核生物转座子鉴定和分类计算方法   总被引:3,自引:0,他引:3  
Xu HE  Zhang HH  Han MJ  Shen YH  Huang XZ  Xiang ZH  Zhang Z 《遗传》2012,34(8):1009-1019
重复序列是真核生物基因组的重要组成成分,根据其序列特征及在基因组中的存在形式,可以进一步分为串联重复、片段重复和散在重复。其中,散在重复大多起源于转座子。根据转座介质的不同,转座子又可分为DNA和逆转录转座子。转座子的转座和扩增对基因的进化和基因组的稳定具有显著的影响;同时与其他类型的重复序列相比,转座子的结构和分类更为复杂多样,使得对转座子的鉴定和分类更为复杂和困难。鉴于此,文章简要概括了转座子的功能及分类,总结了真核生物转座子鉴定、分类和注释的3个步骤:(1)重复序列库的构建;(2)重复序列的校正和分类;(3)基因组注释。着重介绍了每一步骤所采用的不同计算方法,比较了不同方法的优缺点。只有把多种方法结合起来使用才能实现全基因组转座子的精确鉴定、分类和注释,这将为转座子的全基因组鉴定和分类提供借鉴意义。  相似文献   

17.

Background

Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome.

Results

We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure.

Conclusions

There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral element in the genome. Galileo shows a significant insertion preference for a 15-bp palindromic TSM.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-792) contains supplementary material, which is available to authorized users.  相似文献   

18.

Background  

Transposable elements (TEs) are mobile sequences found in nearly all eukaryotic genomes. They have the ability to move and replicate within a genome, often influencing genome evolution and gene expression. The identification of TEs is an important part of every genome project. The number of sequenced genomes is rapidly rising, and the need to identify TEs within them is also growing. The ability to do this automatically and effectively in a manner similar to the methods used for genes is of increasing importance. There exist many difficulties in identifying TEs, including their tendency to degrade over time and that many do not adhere to a conserved structure. In this work, we describe a homology-based approach for the automatic identification of high-quality consensus TEs, aimed for use in the analysis of newly sequenced genomes.  相似文献   

19.
M-MuLV-specific DNA probes were used to establish the state of integration and amplification of recombinant proviral sequences in Moloney virus-induced tumors of Balb/Mo, Balb/c and 129 mice. The somatically acquired viral sequences contain both authentic M-MuLV genomes and recombinants of M-MuLV with endogenous viral sequences. All reintegrated genomes carry long terminal repeat (LTR) sequences at both termini of their genome. In the preleukemic stage a large population of cells exhibiting a random distribution of reintegrated M-MuLV genomes are seen, but during outgrowth of the tumor, selection of cells occurs leaving one or a few clonal descendants in the outgrown tumor. In this latter stage recombinant genomes can be detected. Although these recombinants constitute a heterogeneous group of proviruses, characteristic molecular markers are conserved among many individual proviral recombinants, lending credence to the notion that a certain recombinant structure is a prerequisite for the onset of neoplasia. The structure of these recombinants shows close structural similarities to the previously described mink cell focus-inducing (MCF)-type viruses.  相似文献   

20.
Lerat E  Burlet N  Biémont C  Vieira C 《Gene》2011,473(2):100-109
Transposable elements (TEs) are indwelling components of genomes, and their dynamics have been a driving force in genome evolution. Although we now have more information concerning their amounts and characteristics in various organisms, we still have little data from overall comparisons of their sequences in very closely-related species. While the Drosophila melanogaster genome has been extensively studied, we have only limited knowledge regarding the precise TE sequences in the genomes of the related species Drosophila simulans, Drosophila sechellia and Drosophila yakuba. In this study we analyzed the number and structure of TE copies in the sequenced genomes of these four species. Our findings show that, unexpectedly, the number of TE insertions in D. simulans is greater than that in D. melanogaster, but that most of the copies in D. simulans are degraded and in small fragments, as in D. sechellia and D. yakuba. This suggests that all three species were invaded by numerous TEs a long time ago, but have since regulated their activity, as the present TE copies are degraded, with very few full-length elements. In contrast, in D. melanogaster, a recent activation of TEs has resulted in a large number of almost-identical TE copies. We have detected variants of some TEs in D. simulans and D. sechellia, that are almost identical to the reference TE sequences in D. melanogaster, suggesting that D. melanogaster has recently been invaded by active TE variants from the other species. Our results indicate that the three species D. simulans, D. sechellia, and D. yakuba seem to be at a different stage of their TE life cycle when compared to D. melanogaster. Moreover, we show that D. melanogaster has been invaded by active TE variants for several TE families likely to come from D. simulans or the ancestor of D. simulans and D. sechellia. The numerous horizontal transfer events implied to explain these results could indicate introgression events between these species.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号