Similar literature
20 similar documents found
1.
2.
All genomes include gene families with very limited taxonomic distributions that potentially represent new genes and innovations in protein-coding sequence, raising questions on the origins of such genes. Some of these genes are hypothesized to have formed de novo, from noncoding sequences, and recent work has begun to elucidate the processes by which de novo gene formation can occur. A special case of de novo gene formation, overprinting, describes the origin of new genes from noncoding alternative reading frames of existing open reading frames (ORFs). We argue that additionally, out-of-frame gene fission/fusion events of alternative reading frames of ORFs and out-of-frame lateral gene transfers could contribute to the origin of new gene families. To demonstrate this, we developed an original pattern-search in sequence similarity networks, enhancing the use of these graphs, commonly used to detect in-frame remodeled genes. We applied this approach to gene families in 524 complete genomes of Escherichia coli. We identified 767 gene families whose evolutionary history likely included at least one out-of-frame remodeling event. These genes with out-of-frame components represent ∼2.5% of all genes in the E. coli pangenome, suggesting that alternative reading frames of existing ORFs can contribute to a significant proportion of de novo genes in bacteria.

3.
Dissecting the genetic mechanisms underlying dioecy (i.e., separate female and male individuals) is critical for understanding the evolution of this pervasive reproductive strategy. Nonetheless, the genetic basis of sex determination remains unclear in many cases, especially in systems where dioecy has arisen recently. Within the economically important plant genus Solanum (∼2,000 species), dioecy is thought to have evolved independently at least 4 times across roughly 20 species. Here, we generate the first genome sequence of a dioecious Solanum and use it to ascertain the genetic basis of sex determination in this species. We de novo assembled and annotated the genome of Solanum appendiculatum (assembly size: ∼750 Mb; scaffold N50: 0.92 Mb; ∼35,000 genes), identified sex-specific sequences and their locations in the genome, and inferred that males in this species are the heterogametic sex. We also analyzed gene expression patterns in floral tissues of males and females, finding approximately 100 genes that are differentially expressed between the sexes. These analyses, together with observed patterns of gene-family evolution specific to S. appendiculatum, consistently implicate a suite of genes from the regulatory network controlling pectin degradation and modification in the expression of sex. Furthermore, the genome of a species with a relatively young sex-determination system provides the foundational resources for future studies on the independent evolution of dioecy in this clade.

4.
Expansion or shrinkage of existing tandem repeats (TRs) associated with various biological processes has been actively studied in both prokaryotic and eukaryotic genomes, while their origin and biological implications remain mostly unknown. Here we describe various duplications (de novo TRs) that occurred in the coding region of a β-lactamase gene, where a conserved structure called the omega loop is encoded. These duplications, which occurred under selection with ceftazidime, extended the substrate spectrum to include the antibiotic. Under selective pressure with one of the original substrates (amoxicillin), a high level of reversion occurred in the mutant β-lactamase genes, completing a cycle back to the original substrate spectrum. The de novo TRs coupled with reversion make a genetic toggling mechanism enabling reversible switching between the two phases of the substrate spectrum of β-lactamases. This toggle exemplifies the effective adaptation of de novo TRs for enhanced bacterial survival. We found pairs of direct repeats that mediated the DNA duplication (TR formation). In addition, we found different duos of sequences that mediated the DNA duplication. These novel elements, which we named SCSs (same-strand complementary sequences), were also found associated with β-lactamase TR mutations from clinical isolates. Both direct repeats and SCSs had a high correlation with TRs in diverse bacterial genomes throughout the major phylogenetic lineages, suggesting that they comprise a fundamental mechanism shaping bacterial evolution.

5.
6.
The Red Queen hypothesis depicts evolution as the continual struggle to adapt. According to this hypothesis, new genes, especially those originating from nongenic sequences (i.e., de novo genes), are eliminated unless they evolve continually in adaptation to a changing environment. Here, we analyze two Drosophila de novo miRNAs that are expressed in a testis-specific manner with very high rates of evolution in their DNA sequence. We knocked out these miRNAs in two sibling species and investigated their contributions to different fitness components. We observed that the fitness contributions of miR-975 in Drosophila simulans seem positive, in contrast to its neutral contributions in D. melanogaster, whereas miR-983 appears to have negative contributions in both species, as the fitness of the knockout mutant increases. As predicted by the Red Queen hypothesis, the fitness difference of these de novo miRNAs indicates their different fates.

7.

Background

The animal mitochondrial genome is generally considered to be under selection for both compactness and gene order conservation. As more mitochondrial genomes are sequenced, mitochondrial duplications and gene rearrangements have been frequently identified among diverse animal groups. Although several mechanisms of gene rearrangement have been proposed thus far, more observational evidence from major taxa is needed to validate specific mechanisms. In the current study, the complete mitochondrial DNA of sixteen bird species from the family Ardeidae was sequenced and the evolution of mitochondrial gene rearrangements was investigated. The mitochondrial genomes were then used to review the phylogenies of these ardeid birds.

Results

The complete mitochondrial genome sequences of the sixteen ardeid birds exhibited four distinct mitochondrial gene orders, two of which, named “duplicate tRNAGlu–CR” and “duplicate tRNAThr–tRNAPro and CR”, were newly discovered. These gene rearrangements arose from an evolutionary process consistent with the tandem duplication–random loss (TDRL) model. Additionally, duplications in these gene orders were nearly identical in nucleotide sequence within each individual, suggesting that they evolved in concert. Phylogenetic analyses of the sixteen ardeid species supported the idea that Ardea ibis, Ardea modesta and Ardea intermedia should be classified as genus Ardea, and Ixobrychus flavicollis as genus Ixobrychus, and indicated that within the subfamily Ardeinae, Nycticorax nycticorax is closely related to genus Egretta and that Ardeola bacchus and Butorides striatus are closely related to the genus Ardea.

Conclusions

The duplicate tRNAThr–CR gene order is found in most ardeid lineages, suggesting this gene order is the ancestral pattern within these birds and persisted in most lineages via concerted evolution. In two independent lineages, when the concerted evolution stopped in some subsections due to the accumulation of numerous substitutions and deletions, the duplicate tRNAThr–CR gene order was transformed into three other gene orders. The phylogenetic trees produced from concatenated rRNA and protein coding genes have high support values in most nodes, indicating that the mitochondrial genome sequences are promising markers for resolving the phylogenetic issues of ardeid birds when more taxa are added.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-573) contains supplementary material, which is available to authorized users.

8.
9.
Cauliflower mosaic virus (CaMV) is a plant pararetrovirus with a double-stranded DNA genome. It is the type member of the genus Caulimovirus in the family Caulimoviridae. CaMV is transmitted by sap inoculation and in nature by aphids in a semi-persistent manner. To investigate the patterns and timescale of CaMV migration and evolution, we sequenced and analyzed the genomes of 67 isolates of CaMV collected mostly in Greece, Iran, Turkey, and Japan together with nine published sequences. We identified the open-reading frames (ORFs) in the genomes and inferred their phylogeny. After removing recombinant sequences, we estimated the substitution rates, divergence times, and phylogeographic patterns of the virus populations. We found that recombination has been a common feature of CaMV evolution, and that ORFs I–V have a different evolutionary history from ORF VI. The ORFs have evolved at rates between 1.71 × 10⁻⁴ and 5.81 × 10⁻⁴ substitutions/site/year, similar to those of viruses with RNA or ssDNA genomes. We found four geographically confined lineages. CaMV probably spread from a single population to other parts of the world around 400–500 years ago, and is now widely distributed among Eurasian countries. Our results revealed evidence of frequent gene flow between populations in Turkey and those of its neighboring countries, with similar patterns observed for Japan and the USA. Our study represents the first report on the spatial and temporal spread of a plant pararetrovirus.

10.
11.
12.
13.
Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var. deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5′ portion of the psbA gene and IR contraction occurred in Actinidia. The clpP gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16-taxon angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.

14.
Bumblebees are a diverse group of globally important pollinators in natural ecosystems and for agricultural food production. With both eusocial and solitary life-cycle phases, and some social parasite species, they are especially interesting models to understand social evolution, behavior, and ecology. Reports of many species in decline point to pathogen transmission, habitat loss, pesticide usage, and global climate change, as interconnected causes. These threats to bumblebee diversity make our reliance on a handful of well-studied species for agricultural pollination particularly precarious. To broadly sample bumblebee genomic and phenotypic diversity, we de novo sequenced and assembled the genomes of 17 species, representing all 15 subgenera, producing the first genus-wide quantification of genetic and genomic variation potentially underlying key ecological and behavioral traits. The species phylogeny resolves subgenera relationships, whereas incomplete lineage sorting likely drives high levels of gene tree discordance. Five chromosome-level assemblies show a stable 18-chromosome karyotype, with major rearrangements creating 25 chromosomes in social parasites. Differential transposable element activity drives changes in genome sizes, with putative domestications of repetitive sequences influencing gene coding and regulatory potential. Dynamically evolving gene families and signatures of positive selection point to genus-wide variation in processes linked to foraging, diet and metabolism, immunity and detoxification, as well as adaptations for life at high altitudes. Our study reveals how bumblebee genes and genomes have evolved across the Bombus phylogeny and identifies variations potentially linked to key ecological and behavioral traits of these important pollinators.

15.
The genus Blumea (Asteroideae, Asteraceae) comprises about 100 species, including herbs, shrubs, and small trees. Previous studies have been unable to resolve taxonomic issues and the phylogeny of the genus Blumea due to the low polymorphism of molecular markers. Therefore, suitable polymorphic regions need to be identified. Here, we de novo assembled plastomes of the three Blumea species B. oxyodonta, B. tenella, and B. balsamifera and compared them with 26 other species of Asteroideae after correction of annotations. These species have quadripartite plastomes with similar gene content, genome organization, and inverted repeat contraction and expansion comprising 113 genes, including 80 protein‐coding, 29 transfer RNA, and 4 ribosomal RNA genes. The comparative analysis of codon usage, amino acid frequency, microsatellite repeats, oligonucleotide repeats, and transition and transversion substitutions has revealed high resemblance among the newly assembled species of Blumea. We identified 10 highly polymorphic regions with nucleotide diversity above 0.02, including rps16‐trnQ, ycf1, ndhF‐rpl32, petN‐psbM, and rpl32‐trnL, and they may be suitable for the development of robust, authentic, and cost‐effective markers for barcoding and inference of the phylogeny of the genus Blumea. Among these highly polymorphic regions, five regions also co‐occurred with oligonucleotide repeats and support use of repeats as a proxy for the identification of polymorphic loci. The phylogenetic analysis revealed a close relationship between Blumea and Pluchea within the tribe Inuleae. At tribe level, our phylogeny supports a sister relationship between Astereae and Anthemideae rooted as Gnaphalieae, Calenduleae, and Senecioneae.
These results are contradictory to recent studies which reported a sister relationship between “Senecioneae and Anthemideae” and “Astereae and Gnaphalieae” or a sister relationship between Astereae and Gnaphalieae rooted as Calenduleae, Anthemideae, and then Senecioneae using nuclear genome sequences. The conflicting phylogenetic signals observed at the tribal level between plastid and nuclear genome data require further investigation.

16.
Bacteria of the genus ‘Candidatus Phytoplasma’ are uncultivated intracellular plant pathogens transmitted by phloem-feeding insects. They have small genomes lacking genes for essential metabolites, which they acquire from either plant or insect hosts. Nonetheless, some phytoplasmas, such as ‘Ca. P. solani’, have a broad plant host range and are transmitted by several polyphagous insect species. To understand better how these obligate symbionts can colonize such a wide range of hosts, the genome of ‘Ca. P. solani’ strain SA-1 was sequenced from infected periwinkle via a metagenomics approach. The de novo assembly generated a draft genome with 19 contigs totalling 821,322 bp, which corresponded to more than 80% of the estimated genome size. Further completion of the genome was challenging due to the high occurrence of repetitive sequences. The majority of repeats consisted of gene arrangements characteristic of phytoplasma potential mobile units (PMUs). These regions showed variation in gene orders intermixed with genes of unknown functions and lack of similarity to other phytoplasma genes, suggesting that they were prone to rearrangements and acquisition of new sequences via recombination. The availability of this high-quality draft genome also provided a foundation for genome-scale genotypic analysis (e.g., average nucleotide identity and average amino acid identity) and molecular phylogenetic analysis. Phylogenetic analyses provided evidence of horizontal transfer for PMU-like elements from various phytoplasmas, including distantly related ones. The ‘Ca. P. solani’ SA-1 genome also contained putative secreted protein/effector genes, including a homologue of SAP11, found in many other phytoplasma species.

17.
Assessing the contribution of promoters and coding sequences to gene evolution is an important step toward discovering the major genetic determinants of human evolution. Many specific examples have revealed the evolutionary importance of cis-regulatory regions. However, the relative contribution of regulatory and coding regions to the evolutionary process and whether systemic factors differentially influence their evolution remains unclear. To address these questions, we carried out an analysis at the genome scale to identify signatures of positive selection in human proximal promoters. Next, we examined whether genes with positively selected promoters (Prom+ genes) show systemic differences with respect to a set of genes with positively selected protein-coding regions (Cod+ genes). We found that the number of genes in each set was not significantly different (8.1% and 8.5%, respectively). Furthermore, a functional analysis showed that, in both cases, positive selection affects almost all biological processes and only a few genes of each group are located in enriched categories, indicating that promoters and coding regions are not evolutionarily specialized with respect to gene function. On the other hand, we show that the topology of the human protein network has a different influence on the molecular evolution of proximal promoters and coding regions. Notably, Prom+ genes have an unexpectedly high centrality when compared with a reference distribution (P = 0.008, for Eigenvalue centrality). Moreover, the frequency of Prom+ genes increases from the periphery to the center of the protein network (P = 0.02, for the logistic regression coefficient). This means that gene centrality does not constrain the evolution of proximal promoters, unlike the case with coding regions, and further indicates that the evolution of proximal promoters is more efficient in the center of the protein network than in the periphery. 
These results show that proximal promoters have had a systemic contribution to human evolution by increasing the participation of central genes in the evolutionary process.

18.
19.
20.
Sialyltransferases are key enzymes in the biosynthesis of sialoglycoconjugates that catalyze the transfer of a sialic acid residue from its activated form to an oligosaccharidic acceptor. β-Galactoside α2,6-sialyltransferases ST6Gal I and ST6Gal II are the two unique members of the ST6Gal family described in higher vertebrates. The availability of genome sequences enabled the identification of more distantly related invertebrates' st6gal gene sequences and allowed us to propose a scenario of their evolution. Using a phylogenomic approach, we present further evidence of an accelerated evolution of the st6gal1 genes both in their genomic regulatory sequences and in their coding sequence in reptiles, birds, and mammals known as amniotes, whereas st6gal2 genes conserve an ancestral profile of expression throughout vertebrate evolution.
