首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Transposable elements (TEs) are repetitive DNA sequences that are ubiquitous, extremely abundant and dynamic components of practically all genomes. Much effort has gone into annotation of TE copies in reference genomes. The sequencing cost reduction and the newly available next-generation sequencing (NGS) data from multiple strains within a species offer an unprecedented opportunity to study population genomics of TEs in a range of organisms. Here, we present a computational pipeline (T-lex) that uses NGS data to detect the presence/absence of annotated TE copies. T-lex can use data from a large number of strains and returns estimates of population frequencies of individual TE insertions in a reasonable time. We experimentally validated the accuracy of T-lex detecting presence or absence of 768 previously identified TE copies in two resequenced Drosophila melanogaster strains. Approximately 95% of the TE insertions were detected with 100% sensitivity and 97% specificity. We show that even at low levels of coverage T-lex produces accurate results for TE copies that it can identify reliably but that the rate of 'no data' calls increases as the coverage falls below 15×. T-lex is a broadly applicable and flexible tool that can be used in any genome provided the availability of the reference genome, individual TE copy annotation and NGS data.  相似文献   

3.
Transposable elements (TEs) are mobile, repetitive DNA sequences that are almost ubiquitous in prokaryotic and eukaryotic genomes. They have a large impact on genome structure, function and evolution. With the recent development of high-throughput sequencing methods, many genome sequences have become available, making possible comparative studies of TE dynamics at an unprecedented scale. Several methods have been proposed for the de novo identification of TEs in sequenced genomes. Most begin with the detection of genomic repeats, but the subsequent steps for defining TE families differ. High-quality TE annotations are available for the Drosophila melanogaster and Arabidopsis thaliana genome sequences, providing a solid basis for the benchmarking of such methods. We compared the performance of specific algorithms for the clustering of interspersed repeats and found that only a particular combination of algorithms detected TE families with good recovery of the reference sequences. We then applied a new procedure for reconciling the different clustering results and classifying TE sequences. The whole approach was implemented in a pipeline using the REPET package. Finally, we show that our combined approach highlights the dynamics of well defined TE families by making it possible to identify structural variations among their copies. This approach makes it possible to annotate TE families and to study their diversification in a single analysis, improving our understanding of TE dynamics at the whole-genome scale and for diverse species.  相似文献   

4.
Lerat E  Burlet N  Biémont C  Vieira C 《Gene》2011,473(2):100-109
Transposable elements (TEs) are indwelling components of genomes, and their dynamics have been a driving force in genome evolution. Although we now have more information concerning their amounts and characteristics in various organisms, we still have little data from overall comparisons of their sequences in very closely-related species. While the Drosophila melanogaster genome has been extensively studied, we have only limited knowledge regarding the precise TE sequences in the genomes of the related species Drosophila simulans, Drosophila sechellia and Drosophila yakuba. In this study we analyzed the number and structure of TE copies in the sequenced genomes of these four species. Our findings show that, unexpectedly, the number of TE insertions in D. simulans is greater than that in D. melanogaster, but that most of the copies in D. simulans are degraded and in small fragments, as in D. sechellia and D. yakuba. This suggests that all three species were invaded by numerous TEs a long time ago, but have since regulated their activity, as the present TE copies are degraded, with very few full-length elements. In contrast, in D. melanogaster, a recent activation of TEs has resulted in a large number of almost-identical TE copies. We have detected variants of some TEs in D. simulans and D. sechellia, that are almost identical to the reference TE sequences in D. melanogaster, suggesting that D. melanogaster has recently been invaded by active TE variants from the other species. Our results indicate that the three species D. simulans, D. sechellia, and D. yakuba seem to be at a different stage of their TE life cycle when compared to D. melanogaster. Moreover, we show that D. melanogaster has been invaded by active TE variants for several TE families likely to come from D. simulans or the ancestor of D. simulans and D. sechellia. The numerous horizontal transfer events implied to explain these results could indicate introgression events between these species.  相似文献   

5.
Discovering and detecting transposable elements in genome sequences   总被引:2,自引:0,他引:2  
The contribution of transposable elements (TEs) to genome structure and evolution as well as their impact on genome sequencing, assembly, annotation and alignment has generated increasing interest in developing new methods for their computational analysis. Here we review the diversity of innovative approaches to identify and annotate TEs in the post-genomic era, covering both the discovery of new TE families and the detection of individual TE copies in genome sequences. These approaches span a broad spectrum in computational biology including de novo, homology-based, structure-based and comparative genomic methods. We conclude that the integration and visualization of multiple approaches and the development of new conceptual representations for TE annotation will further advance the computational analysis of this dynamic component of the genome.  相似文献   

6.
The techniques that are usually used to detect transposable elements (TEs) in nucleic acid sequences rely on sequence similarity with previously characterized elements. However, these methods are likely to miss many elements in various organisms. We tested two strategies for the detection of unknown elements. The first, which we call "TBLASTX strategy," searches for TE sequences by comparing the six-frame translations of the nucleic acid sequences of known TEs with the genomic sequence of interest. The second, "repeat-based strategy," searches genomic sequences for long repeats and clusters them in groups of similar sequences. TE copies from a given family are expected to cluster together. We tested the Drosophila melanogaster genomic sequence and the recently sequenced Anopheles gambiae genome in which most TEs remain unknown. We showed that the "TBLASTX strategy" is very efficient as it detected at least 332 new TE families in D. melanogaster and 400 in A. gambiae. This was unexpected in Drosophila as TEs of this organism have been extensively studied. The "repeat-based strategy" appeared to be very inefficient because of two problems: (i) TE copies are heavily deleted and few copies share homologous regions, and (ii) segmental duplications are frequent and it is not easy to distinguish them from TE copies.  相似文献   

7.
Miniature inverted-repeat transposable elements (MITEs) are a special type of Class 2 non-autonomous transposable element (TE) that are abundant in the non-coding regions of the genes of many plant and animal species. The accurate identification of MITEs has been a challenge for existing programs because they lack coding sequences and, as such, evolve very rapidly. Because of their importance to gene and genome evolution, we developed MITE-Hunter, a program pipeline that can identify MITEs as well as other small Class 2 non-autonomous TEs from genomic DNA data sets. The output of MITE-Hunter is composed of consensus TE sequences grouped into families that can be used as a library file for homology-based TE detection programs such as RepeatMasker. MITE-Hunter was evaluated by searching the rice genomic database and comparing the output with known rice TEs. It discovered most of the previously reported rice MITEs (97.6%), and found sixteen new elements. MITE-Hunter was also compared with two other MITE discovery programs, FINDMITE and MUST. Unlike MITE-Hunter, neither of these programs can search large genomic data sets including whole genome sequences. More importantly, MITE-Hunter is significantly more accurate than either FINDMITE or MUST as the vast majority of their outputs are false-positives.  相似文献   

8.
The Drosophila melanogaster genome contains approximately 100 distinct families of transposable elements (TEs). In the euchromatic part of the genome, each family is present in a small number of copies (5-150 copies), with individual copies of TEs often present at very low frequencies in populations. This pattern is likely to reflect a balance between the inflow of TEs by transposition and the removal of TEs by natural selection. The nature of natural selection acting against TEs remains controversial. We provide evidence that selection against chromosome abnormalities caused by ectopic recombination limits the spread of some TEs. We also demonstrate for the first time that some TE families in the Drosophila euchromatin appear to be only marginally affected by purifying selection and contain many copies at high population frequencies. We argue that TEs in these families attain high population frequencies and even reach fixation as a result of low family-wide transposition rates leading to low TE copy numbers and consequently reduced strength of selection acting on individual TE copies. Fixation of TEs in these families should provide an upward pressure on the size of intergenic sequences counterbalancing rapid DNA loss through small deletions. Copy-number-dependent selection on TE families caused by ectopic recombination may also promote diversity among TEs in the Drosophila genome.  相似文献   

9.
Although transposable elements (TEs) are known to be potent sources of mutation, their contribution to the generation of recent adaptive changes has never been systematically assessed. In this work, we conduct a genome-wide screen for adaptive TE insertions in Drosophila melanogaster that have taken place during or after the spread of this species out of Africa. We determine population frequencies of 902 of the 1,572 TEs in Release 3 of the D. melanogaster genome and identify a set of 13 putatively adaptive TEs. These 13 TEs increased in population frequency sharply after the spread out of Africa. We argue that many of these TEs are in fact adaptive by demonstrating that the regions flanking five of these TEs display signatures of partial selective sweeps. Furthermore, we show that eight out of the 13 putatively adaptive elements show population frequency heterogeneity consistent with these elements playing a role in adaptation to temperate climates. We conclude that TEs have contributed considerably to recent adaptive evolution (one TE-induced adaptation every 200-1,250 y). The majority of these adaptive insertions are likely to be involved in regulatory changes. Our results also suggest that TE-induced adaptations arise more often from standing variants than from new mutations. Such a high rate of TE-induced adaptation is inconsistent with the number of fixed TEs in the D. melanogaster genome, and we discuss possible explanations for this discrepancy.  相似文献   

10.
Transposable elements (TEs) are the primary contributors to the genome bulk in many organisms and are major players in genome evolution. A clear and thorough understanding of the population dynamics of TEs is therefore essential for full comprehension of the eukaryotic genome evolution and function. Although TEs in Drosophila melanogaster have received much attention, population dynamics of most TE families in this species remains entirely unexplored. It is not clear whether the same population processes can account for the population behaviors of all TEs in Drosophila or whether, as has been suggested previously, different orders behave according to very different rules. In this work, we analyzed population frequencies for a large number of individual TEs (755 TEs) in five North American and one sub-Saharan African D. melanogaster populations (75 strains in total). These TEs have been annotated in the reference D. melanogaster euchromatic genome and have been sampled from all three major orders (non-LTR, LTR, and TIR) and from all families with more than 20 TE copies (55 families in total). We find strong evidence that TEs in Drosophila across all orders and families are subject to purifying selection at the level of ectopic recombination. We showed that strength of this selection varies predictably with recombination rate, length of individual TEs, and copy number and length of other TEs in the same family. Importantly, these rules do not appear to vary across orders. Finally, we built a statistical model that considered only individual TE-level (such as the TE length) and family-level properties (such as the copy number) and were able to explain more than 40% of the variation in TE frequencies in D. melanogaster.  相似文献   

11.
The well-established inaccuracy of purely computational methods for annotating genome sequences necessitates an interactive tool to allow biological experts to refine these approximations by viewing and independently evaluating the data supporting each annotation. Apollo was developed to meet this need, enabling curators to inspect genome annotations closely and edit them. FlyBase biologists successfully used Apollo to annotate the Drosophila melanogaster genome and it is increasingly being used as a starting point for the development of customized annotation editing tools for other genome projects.  相似文献   

12.
Genome size varies considerably between species, and transposable elements (TEs) are known to play an important role in this variability. However, it is far from clear whether TEs are involved in genome size differences between populations within a given species. We show here that in Drosophila melanogaster and Drosophila simulans the size of the genome varies among populations and is correlated with the TE copy number on the chromosome arms. The TEs embedded within the heterochromatin do not seem to be involved directly in this phenomenon, although they may contribute to differences in genome size. Furthermore, genome size and TE content variations parallel the worldwide colonization of D. melanogaster species. No such relationship exists for the more recently dispersed D. simulans species, which indicates that a quantitative increase in the TEs in local populations and fly migration are sufficient to account for the increase in genome size, with no need for an adaptation hypothesis.  相似文献   

13.
Transposable elements (TEs) are a major source of genetic variability in genomes, creating genetic novelty and driving genome evolution. Analysis of sequenced genomes has revealed considerable diversity in TE families, copy number, and localization between different, closely related species. For instance, although the twin species Drosophila melanogaster and D. simulans share the same TE families, they display different amounts of TEs. Furthermore, previous analyses of wild type derived strains of D. simulans have revealed high polymorphism regarding TE copy number within this species. Several factors may influence the diversity and abundance of TEs in a genome, including molecular mechanisms such as epigenetic factors, which could be a source of variation in TE success. In this paper, we present the first analysis of the epigenetic status of four TE families (roo, tirant, 412 and F) in seven wild type strains of D. melanogaster and D. simulans. Our data shows intra- and inter-specific variations in the histone marks that adorn TE copies. Our results demonstrate that the chromatin state of common TEs varies among TE families, between closely related species and also between wild type strains.  相似文献   

14.
15.
16.
Patrizio Dimitri 《Genetica》1997,100(1-3):85-93
Several families of transposable elements (TEs), most of them belonging to the retrotransposon catagory, are particularly enriched in Drosophila melanogaster constitutive heterochromatin. The enrichment of TE-homologous sequences into heterochromatin is not a peculiar feature of the Drosophila genome, but appears to be widespread among higher eukaryotes. The constitutive heterochromatin of D. melanogaster contains several genetically active domains; this raises the possibility that TE-homologous sequences inserted into functional heterochromatin compartments may be expressed. In this review, I present available data on the genetic and molecular organization of D. melanogaster constitutive heterochromatin and its relationship with transposable elements. The implications of these findings on the possible impact of heterochromatic TEs on the function and evolution of the host genome are also discussed. This revised version was published online in August 2006 with corrections to the Cover Date.  相似文献   

17.
We describe an algorithm, ReAS, to recover ancestral sequences for transposable elements (TEs) from the unassembled reads of a whole genome shotgun. The main assumptions are that these TEs must exist at high copy numbers across the genome and must not be so old that they are no longer recognizable in comparison to their ancestral sequences. Tested on the japonica rice genome, ReAS was able to reconstruct all of the high copy sequences in the Repbase repository of known TEs, and increase the effectiveness of RepeatMasker in identifying TEs from genome sequences.  相似文献   

18.
Triticeae species (including wheat, barley and rye) have huge and complex genomes due to polyploidization and a high content of transposable elements (TEs). TEs are known to play a major role in the structure and evolutionary dynamics of Triticeae genomes. During the last 5 years, substantial stretches of contiguous genomic sequence from various species of Triticeae have been generated, making it necessary to update and standardize TE annotations and nomenclature. In this study we propose standard procedures for these tasks, based on structure, nucleic acid and protein sequence homologies. We report statistical analyses of TE composition and distribution in large blocks of genomic sequences from wheat and barley. Altogether, 3.8 Mb of wheat sequence available in the databases was analyzed or re-analyzed, and compared with 1.3 Mb of re-annotated genomic sequences from barley. The wheat sequences were relatively gene-rich (one gene per 23.9 kb), although wheat gene-derived sequences represented only 7.8% (159 elements) of the total, while the remainder mainly comprised coding sequences found in TEs (54.7%, 751 elements). Class I elements [mainly long terminal repeat (LTR) retrotransposons] accounted for the major proportion of TEs, in terms of sequence length as well as element number (83.6% and 498, respectively). In addition, we show that the gene-rich sequences of wheat genome A seem to have a higher TE content than those of genomes B and D, or of barley gene-rich sequences. Moreover, among the various TE groups, MITEs were most often associated with genes: 43.1% of MITEs fell into this category. Finally, the TRIM and copia elements were shown to be the most active TEs in the wheat genome. The implications of these results for the evolution of diploid and polyploid wheat species are discussed. Electronic Supplementary Material Supplementary material is available for this article at  相似文献   

19.
Transposable elements (TEs) make up around 10%-15% of the Drosophila melanogaster genome, but its sibling species Drosophila simulans carries only one third as many such repeat sequences. We do not, however, have an overall view of copy numbers of the various classes of TEs (long terminal repeat [LTR] retrotransposons, non-LTR retrotransposons, and transposons) in genomes of natural populations of both species. We analyzed 34 elements in individuals from various natural populations of these species. We show that D. melanogaster has higher average chromosomal insertion site numbers per genome than D. simulans for all TEs except five. The LTR retrotransposons gypsy, ZAM, and 1731 and the transposon bari-1 present similar low copy numbers in both species. The transposon hobo has a large number of insertion sites, with significantly more sites in D. simulans. High variation between populations in number of insertion sites of some elements of D. simulans suggests that these elements can invade the genome of the entire species starting from a local population. We propose that TEs in the D. simulans genome are being awakened and amplified as they had been a long time ago in D. melanogaster.  相似文献   

20.
Vieira C  Biémont C 《Genetica》2004,120(1-3):115-123
Transposable elements (TEs) in the two sibling species, Drosophila melanogaster and D. simulans, differ considerably in amount and dynamics, with D. simulans having a smaller amount of TEs than D. melanogaster. Several hypotheses have been proposed to explain these differences, based on the evolutionary history of the two species, and claim differences either in the effective size of the population or in genome characteristics. Recent data suggest, however, that the higher amount of TEs in D. melanogaster could be associated with the worldwide invasion of D. melanogaster a long time ago while D. simulans is still under the process of such geographical spread. Stresses due to new environmental conditions and crosses between migrating populations could explain the mobilization of TEs while the flies colonize. Colonization and TE mobilization may be strong evolutionary forces that have shaped and are still shaping the eukaryote genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号