首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.

Background

The 17 Gb bread wheat genome has massively expanded through the proliferation of transposable elements (TEs) and two recent rounds of polyploidization. The assembly of a 774 Mb reference sequence of wheat chromosome 3B provided us with the opportunity to explore the impact of TEs on the complex wheat genome structure and evolution at a resolution and scale not reached so far.

Results

We develop an automated workflow, CLARI-TE, for TE modeling in complex genomes. We delineate precisely 56,488 intact and 196,391 fragmented TEs along the 3B pseudomolecule, accounting for 85% of the sequence, and reconstruct 30,199 nested insertions. TEs have been mostly silent for the last one million years, and the 3B chromosome has been shaped by a succession of bursts that occurred between 1 to 3 million years ago. Accelerated TE elimination in the high-recombination distal regions is a driving force towards chromosome partitioning. CACTAs overrepresented in the high-recombination distal regions are significantly associated with recently duplicated genes. In addition, we identify 140 CACTA-mediated gene capture events with 17 genes potentially created by exon shuffling and show that 19 captured genes are transcribed and under selection pressure, suggesting the important role of CACTAs in the recent wheat adaptation.

Conclusion

Accurate TE modeling uncovers the dynamics of TEs in a highly complex and polyploid genome. It provides novel insights into chromosome partitioning and highlights the role of CACTA transposons in the high level of gene duplication in wheat.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0546-4) contains supplementary material, which is available to authorized users.  相似文献   

2.
3.
4.

Background

The siRNA and piRNA pathways have been shown in insects to be essential for regulation of gene expression and defence against exogenous and endogenous genetic elements (viruses and transposable elements). The vast majority of endogenous small RNAs produced by the siRNA and piRNA pathways originate from repetitive or transposable elements (TE). In D. melanogaster, TE-derived endogenous siRNAs and piRNAs are involved in genome surveillance and maintenance of genome integrity. In the medically relevant malaria mosquito Anopheles gambiae TEs constitute 12-16% of the genome size. Genetic variations induced by TE activities are known to shape the genome landscape and to alter the fitness in An. gambiae.

Results

Here, using bioinformatics approaches we analyzed the small RNA data sets from 6 libraries formally reported in a previous study and examined the expression of the mixed germline/somatic siRNAs and piRNAs produced in adult An. gambiae females. We characterized a large population of TE-derived endogenous siRNAs and piRNAs, which constitutes 56-60% of the total siRNA and piRNA reads in the analysed libraries. Moreover, we identified a number of protein coding genes producing gene-specific siRNAs and piRNAs that were generally expressed at much lower levels than the TE-associated small RNAs. Detailed sequence analysis revealed that An. gambiae piRNAs were produced by both “ping-pong” dependent (TE-associated piRNAs) and independent mechanisms (genic piRNAs). Similarly to D. melanogaster, more than 90% of the detected piRNAs were produced from TE-associated clusters in An. gambiae. We also found that biotic stress as blood feeding and infection with Plasmodium parasite, the etiological agent of malaria, modulated the expression levels of the endogenous siRNAs and piRNAs in An. gambiae.

Conclusions

We identified a large and diverse set of the endogenously derived siRNAs and piRNAs that share common and distinct aspects of small RNA expression across insect species, and inferred their impact on TE and gene activity in An. gambiae. The TE-specific small RNAs produced by both the siRNA and piRNA pathways represent an important aspect of genome stability and genetic variation, which might have a strong impact on the evolution of the genome and vector competence in the malaria mosquitoes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1436-1) contains supplementary material, which is available to authorized users.  相似文献   

5.
6.

Background

Despite having predominately deleterious fitness effects, transposable elements (TEs) are major constituents of eukaryote genomes in general and of plant genomes in particular. Although the proportion of the genome made up of TEs varies at least four-fold across plants, the relative importance of the evolutionary forces shaping variation in TE abundance and distributions across taxa remains unclear. Under several theoretical models, mating system plays an important role in governing the evolutionary dynamics of TEs. Here, we use the recently sequenced Capsella rubella reference genome and short-read whole genome sequencing of multiple individuals to quantify abundance, genome distributions, and population frequencies of TEs in three recently diverged species of differing mating system, two self-compatible species (C. rubella and C. orientalis) and their self-incompatible outcrossing relative, C. grandiflora.

Results

We detect different dynamics of TE evolution in our two self-compatible species; C. rubella shows a small increase in transposon copy number, while C. orientalis shows a substantial decrease relative to C. grandiflora. The direction of this change in copy number is genome wide and consistent across transposon classes. For insertions near genes, however, we detect the highest abundances in C. grandiflora. Finally, we also find differences in the population frequency distributions across the three species.

Conclusion

Overall, our results suggest that the evolution of selfing may have different effects on TE evolution on a short and on a long timescale. Moreover, cross-species comparisons of transposon abundance are sensitive to reference genome bias, and efforts to control for this bias are key when making comparisons across species.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-602) contains supplementary material, which is available to authorized users.  相似文献   

7.
8.
The distribution of transposable elements (TEs) in a genome reflects a balance between insertion rate and selection against new insertions. Understanding the distribution of TEs therefore provides insights into the forces shaping the organization of genomes. Past research has shown that TEs tend to accumulate in genomic regions with low gene density and low recombination rate. However, little is known about the factors modulating insertion rates across the genome and their evolutionary significance. One candidate factor is gene expression, which has been suggested to increase local insertion rate by rendering DNA more accessible. We test this hypothesis by comparing the TE density around germline- and soma-expressed genes in the euchromatin of Drosophila melanogaster. Because only insertions that occur in the germline are transmitted to the next generation, we predicted a higher density of TEs around germline-expressed genes than soma-expressed genes. We show that the rate of TE insertions is greater near germline- than soma-expressed genes. However, this effect is partly offset by stronger selection for genome compactness (against excess noncoding DNA) on germline-expressed genes. We also demonstrate that the local genome organization in clusters of coexpressed genes plays a fundamental role in the genomic distribution of TEs. Our analysis shows that—in addition to recombination rate—the distribution of TEs is shaped by the interaction of gene expression and genome organization. The important role of selection for compactness sheds a new light on the role of TEs in genome evolution. Instead of making genomes grow passively, TEs are controlled by the forces shaping genome compactness, most likely linked to the efficiency of gene expression or its complexity and possibly their interaction with mechanisms of TE silencing.  相似文献   

9.

Background

The transposable element (TE) content of the genomes of plant species varies from near zero in the genome of Utricularia gibba to more than 80 % in many species. It is not well understood whether this variation in genome composition results from common mechanisms or stochastic variation. The major obstacles to investigating mechanisms of TE evolution have been a lack of comparative genomic data sets and efficient computational methods for measuring differences in TE composition between species. In this study, we describe patterns of TE evolution in 14 species in the flowering plant family Asteraceae and 1 outgroup species in the Calyceraceae to investigate phylogenetic patterns of TE dynamics in this important group of plants.

Results

Our findings indicate that TE families in the Asteraceae exhibit distinct patterns of non-neutral evolution, and that there has been a directional increase in copy number of Gypsy retrotransposons since the origin of the Asteraceae. Specifically, there is marked increase in Gypsy abundance at the origin of the Asteraceae and at the base of the tribe Heliantheae. This latter shift in genome composition has had a significant impact on the diversity and abundance distribution of TEs in a lineage-specific manner.

Conclusions

We show that the TE-driven expansion of plant genomes can be facilitated by just a few TE families, and is likely accompanied by the modification and/or replacement of the TE community. Importantly, large shifts in TE composition may be correlated with major of phylogenetic transitions.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1830-8) contains supplementary material, which is available to authorized users.  相似文献   

10.

Background

Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size.

Results

Here we present the Rice TE database (RiTE-db) - a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies.

Conclusions

This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1762-3) contains supplementary material, which is available to authorized users.  相似文献   

11.

Summary

The classification of transposable elements (TEs) is key step towards deciphering their potential impact on the genome. However, this process is often based on manual sequence inspection by TE experts. With the wealth of genomic sequences now available, this task requires automation, making it accessible to most scientists. We propose a new tool, PASTEC, which classifies TEs by searching for structural features and similarities. This tool outperforms currently available software for TE classification. The main innovation of PASTEC is the search for HMM profiles, which is useful for inferring the classification of unknown TE on the basis of conserved functional domains of the proteins. In addition, PASTEC is the only tool providing an exhaustive spectrum of possible classifications to the order level of the Wicker hierarchical TE classification system. It can also automatically classify other repeated elements, such as SSR (Simple Sequence Repeats), rDNA or potential repeated host genes. Finally, the output of this new tool is designed to facilitate manual curation by providing to biologists with all the evidence accumulated for each TE consensus.

Availability

PASTEC is available as a REPET module or standalone software (http://urgi.versailles.inra.fr/download/repet/REPET_linux-x64-2.2.tar.gz). It requires a Unix-like system. There are two standalone versions: one of which is parallelized (requiring Sun grid Engine or Torque), and the other of which is not.  相似文献   

12.

Background

The publicly available Laccaria bicolor genome sequence has provided a considerable genomic resource allowing systematic identification of transposable elements (TEs) in this symbiotic ectomycorrhizal fungus. Using a TE-specific annotation pipeline we have characterized and analyzed TEs in the L. bicolor S238N-H82 genome.

Methodology/Principal Findings

TEs occupy 24% of the 60 Mb L. bicolor genome and represent 25,787 full-length and partial copy elements distributed within 171 families. The most abundant elements were the Copia-like. TEs are not randomly distributed across the genome, but are tightly nested or clustered. The majority of TEs exhibits signs of ancient transposition except some intact copies of terminal inverted repeats (TIRS), long terminal repeats (LTRs) and a large retrotransposon derivative (LARD) element. There were three main periods of TE expansion in L. bicolor: the first from 57 to 10 Mya, the second from 5 to 1 Mya and the most recent from 0.5 Mya ago until now. LTR retrotransposons are closely related to retrotransposons found in another basidiomycete, Coprinopsis cinerea.

Conclusions

This analysis 1) represents an initial characterization of TEs in the L. bicolor genome, 2) contributes to improve genome annotation and a greater understanding of the role TEs played in genome organization and evolution and 3) provides a valuable resource for future research on the genome evolution within the Laccaria genus.  相似文献   

13.

Background

In order to maintain genome information accurately and relevantly, original genome annotations need to be updated and evaluated regularly. Manual reannotation of genomes is important as it can significantly reduce the propagation of errors and consequently diminishes the time spent on mistaken research. For this reason, after five years from the initial submission of the Entamoeba histolytica draft genome publication, we have re-examined the original 23 Mb assembly and the annotation of the predicted genes.

Principal Findings

The evaluation of the genomic sequence led to the identification of more than one hundred artifactual tandem duplications that were eliminated by re-assembling the genome. The reannotation was done using a combination of manual and automated genome analysis. The new 20 Mb assembly contains 1,496 scaffolds and 8,201 predicted genes, of which 60% are identical to the initial annotation and the remaining 40% underwent structural changes. Functional classification of 60% of the genes was modified based on recent sequence comparisons and new experimental data. We have assigned putative function to 3,788 proteins (46% of the predicted proteome) based on the annotation of predicted gene families, and have identified 58 protein families of five or more members that share no homology with known proteins and thus could be entamoeba specific. Genome analysis also revealed new features such as the presence of segmental duplications of up to 16 kb flanked by inverted repeats, and the tight association of some gene families with transposable elements.

Significance

This new genome annotation and analysis represents a more refined and accurate blueprint of the pathogen genome, and provides an upgraded tool as reference for the study of many important aspects of E. histolytica biology, such as genome evolution and pathogenesis.  相似文献   

14.
Throughout evolution, eukaryotic genomes have been invaded by transposable elements (TEs). Little is known about the factors leading to genomic proliferation of TEs, their preferred integration sites and the molecular mechanisms underlying their insertion. We analyzed hundreds of thousands nested TEs in the human genome, i.e. insertions of TEs into existing ones. We first discovered that most TEs insert within specific ‘hotspots’ along the targeted TE. In particular, retrotransposed Alu elements contain a non-canonical single nucleotide hotspot for insertion of other Alu sequences. We next devised a method for identification of integration sequence motifs of inserted TEs that are conserved within the targeted TEs. This method revealed novel sequences motifs characterizing insertions of various important TE families: Alu, hAT, ERV1 and MaLR. Finally, we performed a global assessment to determine the extent to which young TEs tend to nest within older transposed elements and identified a 4-fold higher tendency of TEs to insert into existing TEs than to insert within non-TE intergenic regions. Our analysis demonstrates that TEs are highly biased to insert within certain TEs, in specific orientations and within specific targeted TE positions. TE nesting events also reveal new characteristics of the molecular mechanisms underlying transposition.  相似文献   

15.

Background

The movement of mobile elements among species by horizontal transposon transfer (HTT) influences the evolution of genomes through the modification of structure and function. Helitrons are a relatively new lineage of DNA-based (class II) transposable elements (TEs) that propagate by rolling-circle replication, and are capable of acquiring host DNA. The rapid spread of Helitrons among animal lineages by HTT is facilitated by shuttling in viral particles or by unknown mechanisms mediated by close organism associations (e.g. between hosts and parasites).

Results

A non-autonomous Helitron independently annotated as BmHel-2 from Bombyx mori and the MITE01 element from Ostrinia nubilalis was predicted in the genomes of 24 species in the insect Order Lepidoptera. Integrated Helitrons retained ≥ 65% sequence identity over a 250 bp consensus, and were predicted to retain secondary structures inclusive of a 3′-hairpin and a 5′-subterminal inverted repeat. Highly similar Hel-2 copies were predicted in the genomes of insects and associated viruses, which along with a previous documented case of real-time virus-insect cell line transposition suggests that this Helitron has likely propagated by HTT.

Conclusions

These findings provide evidence that insect virus may mediate the HTT of Helitron-like TEs. This movement may facilitate the shuttling of DNA elements among insect genomes. Further sampling is required to determine the putative role of HTT in insect genome evolution.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1318-6) contains supplementary material, which is available to authorized users.  相似文献   

16.
17.

Background

Aberrant DNA methylation is a hallmark of many cancers. Classically there are two types of endometrial cancer, endometrioid adenocarcinoma (EAC), or Type I, and uterine papillary serous carcinoma (UPSC), or Type II. However, the whole genome DNA methylation changes in these two classical types of endometrial cancer is still unknown.

Results

Here we described complete genome-wide DNA methylome maps of EAC, UPSC, and normal endometrium by applying a combined strategy of methylated DNA immunoprecipitation sequencing (MeDIP-seq) and methylation-sensitive restriction enzyme digestion sequencing (MRE-seq). We discovered distinct genome-wide DNA methylation patterns in EAC and UPSC: 27,009 and 15,676 recurrent differentially methylated regions (DMRs) were identified respectively, compared with normal endometrium. Over 80% of DMRs were in intergenic and intronic regions. The majority of these DMRs were not interrogated on the commonly used Infinium 450K array platform. Large-scale demethylation of chromosome X was detected in UPSC, accompanied by decreased XIST expression. Importantly, we discovered that the majority of the DMRs harbored promoter or enhancer functions and are specifically associated with genes related to uterine development and disease. Among these, abnormal methylation of transposable elements (TEs) may provide a novel mechanism to deregulate normal endometrium-specific enhancers derived from specific TEs.

Conclusions

DNA methylation changes are an important signature of endometrial cancer and regulate gene expression by affecting not only proximal promoters but also distal enhancers.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-868) contains supplementary material, which is available to authorized users.  相似文献   

18.
19.

Background

Somaclonal variation generally occurs in plants regenerated from tissue culture. However, fundamental issues regarding molecular characteristics, mutation rates and mutation spectra of plant somatic variation as well as their phenotypic relevance have been addressed only recently. Moreover, these studies have reported highly discrepant results in different plant species and even in the same plant genotype.

Methodology/principal findings

We investigated heritable genomic variation induced by tissue culture in rice by whole genome re-sequencing of an extensively selfed somaclonal line (TC-reg-2008) and its wild type (WT) donor (cv. Hitomebore). We computed the overall mutation rate, single nucleotide polymorphisms (SNPs), small scale insertions/deletions (Indels) and mobilization of transposable elements (TEs). We assessed chromosomal distribution of the various types of genomic variations, tested correlations between SNPs and Indels, and examined concomitancy between TE activity and its cytosine methylation states. We also performed gene ontology (GO) analysis of genes containing nonsynonymous mutations and large-effect mutations, and assayed effects of the genomic variations on phenotypes under both normal growing condition and several abiotic stresses. We found that heritable somaclonal genomic variation occurred extensively in rice. The genomic variations distributed non-randomly across each of the 12 rice chromosomes, and affected a large number of functional genes. The phenotypic penetrance of the genomic variations was condition-dependent.

Conclusions/significance

Tissue culture is a potent means to generate heritable genetic variations in rice, which bear distinct difference at least in space (chromosomal distribution) from those occurred under natural settings. Our findings have provided new information regarding the mutation rate and spectrum as well as chromosomal distribution pattern of somaclonal variation in rice. Our data also suggest that rice possesses a strong capacity to canalize genetic variations under normal growing conditions to maintain phenotypic robustness, which however can be released by certain abiotic stresses to generate variable phenotypes.  相似文献   

20.
Sexual reproduction allows transposable elements (TEs) to proliferate, leading to rapid divergence between populations and species. A significant outcome of divergence in the TE landscape is evident in hybrid dysgenic syndromes, a strong form of genomic incompatibility that can arise when (TE) family abundance differs between two parents. When TEs inherited from the father are absent in the mother''s genome, TEs can become activated in the progeny, causing germline damage and sterility. Studies in Drosophila indicate that dysgenesis can occur when TEs inherited paternally are not matched with a pool of corresponding TE silencing PIWI-interacting RNAs (piRNAs) provisioned by the female germline. Using the D. virilis syndrome of hybrid dysgenesis as a model, we characterize the effects that divergence in TE profile between parents has on offspring. Overall, we show that divergence in the TE landscape is associated with persisting differences in germline TE expression when comparing genetically identical females of reciprocal crosses and these differences are transmitted to the next generation. Moreover, chronic and persisting TE expression coincides with increased levels of genic piRNAs associated with reduced gene expression. Combined with these effects, we further demonstrate that gene expression is idiosyncratically influenced by differences in the genic piRNA profile of the parents that arise though polymorphic TE insertions. Overall, these results support a model in which early germline events in dysgenesis establish a chronic, stable state of both TE and gene expression in the germline that is maintained through adulthood and transmitted to the next generation. This work demonstrates that divergence in the TE profile is associated with diverse piRNA-mediated transgenerational effects on gene expression within populations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号