Similar literature
1.

Background

The members of the cupin superfamily exhibit large variations in their sequences, functions, organization of domains, quaternary associations and the nature of the bound metal ion, despite having a conserved β-barrel structural scaffold. Here, an attempt has been made to understand structure-function relationships among the members of this diverse superfamily and identify the principles governing functional diversity. The cupin superfamily also contains proteins for which structures are available through world-wide structural genomics initiatives but which are characterized as “hypothetical”. We have explored the feasibility of obtaining clues to the functions of such proteins by means of comparative analysis with cupins of known structure and function.

Methodology/Principal Findings

A 3-D structure-based phylogenetic approach was undertaken. Interestingly, a dendrogram generated solely on the basis of a structural dissimilarity measure at the level of domain folds was found to cluster functionally similar members. This clustering also reflects an independent evolution of the two domains in bicupins. Close examination of structural superposition of members across various functional clusters reveals structural variations in regions that not only form the active site pocket but are also involved in interaction with another domain in the same polypeptide or in the oligomer.
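As an illustration of how a dendrogram can be built from pairwise structural dissimilarities alone, the following is a minimal sketch using average-linkage agglomerative clustering. The abstract does not state which dissimilarity measure or linkage rule was used, so the linkage rule, the function name `average_linkage`, and the domain labels and distances in the usage note are all assumptions made for illustration.

```python
from itertools import combinations


def average_linkage(names, dist):
    """Agglomerative clustering (average linkage) on pairwise dissimilarities.

    names: list of domain labels (hypothetical).
    dist:  dict mapping frozenset({a, b}) -> dissimilarity for every pair.
    Returns a nested tuple recording the merge order (a crude dendrogram).
    """
    # Each cluster is (subtree, member labels); start with one per domain.
    clusters = [(name, [name]) for name in names]

    def cluster_distance(c1, c2):
        # Mean dissimilarity over all cross-cluster member pairs.
        pairs = [(a, b) for a in c1[1] for b in c2[1]]
        return sum(dist[frozenset(p)] for p in pairs) / len(pairs)

    while len(clusters) > 1:
        # Find and merge the two closest clusters.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: cluster_distance(clusters[ij[0]], clusters[ij[1]]),
        )
        merged = (
            (clusters[i][0], clusters[j][0]),
            clusters[i][1] + clusters[j][1],
        )
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]
```

With four hypothetical domains in which the two "oxidase" domains and the two "isomerase" domains are mutually close, the function groups each functional pair before joining the two groups, which is the kind of functional clustering the abstract reports.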

Conclusions/Significance

Structure-based phylogeny of cupins can inform the identification of functions for cupin-fold proteins of as yet unknown function. This approach can be extended to other proteins with a common fold that show high evolutionary divergence, and it is expected to influence function annotation in structural genomics initiatives.

2.

Background

Speciose clades usually harbor species with a broad spectrum of adaptive strategies and complex distribution patterns, and thus constitute ideal systems to disentangle biotic and abiotic causes underlying species diversification. The delimitation of such study systems to test evolutionary hypotheses is difficult because they often rely on artificial genus concepts as starting points. One of the most prominent examples is the bellflower genus Campanula with some 420 species, but up to 600 species when including all lineages to which Campanula is paraphyletic. We generated a large alignment of petD group II intron sequences to include more than 70% of described species as a reference. By comparison with partial data sets we could then assess the impact of selective taxon sampling strategies on phylogenetic reconstruction and subsequent evolutionary conclusions.

Methodology/Principal Findings

Phylogenetic analyses based on maximum parsimony (PAUP, PRAP), Bayesian inference (MrBayes), and maximum likelihood (RAxML) were first carried out on the large reference data set (D680). Parameters, including tree topology, branch support, and age estimates, were then compared to those obtained from smaller data sets resulting from “classification-guided” (D088) and “phylogeny-guided” (D101) sampling. Analyses of D088 failed to fully recover the phylogenetic diversity in Campanula, whereas D101 inferred significantly different branch support and age estimates.

Conclusions/Significance

A short genomic region with high phylogenetic utility allowed us to easily generate a comprehensive phylogenetic framework for the speciose Campanula clade. Our approach recovered 17 well-supported and circumscribed sub-lineages. Because knowing these sub-lineages will be instrumental for developing more specific evolutionary hypotheses and guiding future research, we highlight the predictive value of a mass taxon-sampling strategy as a first essential step towards illuminating the detailed evolutionary history of diverse clades.

3.

Background

Carotenoids are isoprenoid compounds synthesized by all photosynthetic organisms. Despite much research on carotenoid biosynthesis in the model plant Arabidopsis thaliana, there is a lack of information on the carotenoid pathway in Brassica rapa. To better understand its carotenoid biosynthetic pathway, we performed a systematic analysis of carotenoid biosynthetic genes at the genome level in B. rapa.

Results

We identified 67 carotenoid biosynthetic genes in B. rapa, which were orthologs of the 47 carotenoid genes in A. thaliana. A high level of synteny was observed for carotenoid biosynthetic genes between A. thaliana and B. rapa. Out of 47 carotenoid biosynthetic genes in A. thaliana, 46 were successfully mapped to the 10 B. rapa chromosomes, and most of the genes retained more than one copy in B. rapa. The gene expansion was caused by the whole-genome triplication (WGT) event experienced by Brassica species. An expression analysis of the carotenoid biosynthetic genes suggested that their expression levels differed in root, stem, leaf, flower, callus, and silique tissues. Additionally, the paralogs of each carotenoid biosynthetic gene, which were generated from the WGT in B. rapa, showed significantly different expression levels among tissues, suggesting differentiated functions for these multi-copy genes in the carotenoid pathway.

Conclusions

This first systematic study of carotenoid biosynthetic genes in B. rapa provides insights into the carotenoid metabolic mechanisms of Brassica crops. In addition, a better understanding of carotenoid biosynthetic genes in B. rapa will contribute to the development of conventional and transgenic B. rapa cultivars with enriched carotenoid levels in the future.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1655-5) contains supplementary material, which is available to authorized users.

4.
5.
6.

Background

Recent advances in DNA sequencing techniques resulted in more than forty sequenced plant genomes representing a diverse set of taxa of agricultural, energy, medicinal and ecological importance. However, gene family curation is often only inferred from DNA sequence homology and lacks insights into evolutionary processes contributing to gene family dynamics. In a comparative genomics framework, we integrated multiple lines of evidence provided by gene synteny, sequence homology and protein-based Hidden Markov Modelling to extract homologous super-clusters composed of multi-domain resistance (R)-proteins of the NB-LRR type (for NUCLEOTIDE BINDING/LEUCINE-RICH REPEATS), that are involved in plant innate immunity.

Results

To assess the diversity of R-proteins within and between species, we screened twelve eudicot plant genomes including six major crops and found a total of 2,363 NB-LRR genes. Our curated R-proteins set shows a 50% average for tandem duplicates and a 22% fraction of gene copies retained from ancient polyploidy events (ohnologs). We provide evidence for strong positive selection and show significant differences in molecular evolution rates (Ka/Ks-ratio) among tandem- (mean = 1.59), ohnolog (mean = 1.36) and singleton (mean = 1.22) R-gene duplicates. To foster the process of gene-edited plant breeding, we report species-specific presence/absence of all 140 NB-LRR genes present in the model plant Arabidopsis and describe four distinct clusters of NB-LRR “gatekeeper” loci sharing syntenic orthologs across all analyzed genomes.

Conclusion

By curating a near-complete set of multi-domain R-protein clusters on a eudicot-wide scale, our analysis offers significant insight into evolutionary dynamics underlying diversification of the plant innate immune system. Furthermore, our methods provide a blueprint for future efforts to identify and more rapidly clone functional NB-LRR genes from any plant species.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-966) contains supplementary material, which is available to authorized users.

7.

Background

The widespread popularity of genomic applications is threatened by the “bioinformatics bottleneck” resulting from uncertainty about the cost and infrastructure needed to meet increasing demands for next-generation sequence analysis. Cloud computing services have been discussed as potential new bioinformatics support systems but have not been evaluated thoroughly.

Results

We present benchmark costs and runtimes for common microbial genomics applications, including 16S rRNA analysis, microbial whole-genome shotgun (WGS) sequence assembly and annotation, WGS metagenomics and large-scale BLAST. Sequence dataset types and sizes were selected to correspond to outputs typically generated by small- to midsize facilities equipped with 454 and Illumina platforms, except for WGS metagenomics where sampling of Illumina data was used. Automated analysis pipelines, as implemented in the CloVR virtual machine, were used in order to guarantee transparency, reproducibility and portability across different operating systems, including the commercial Amazon Elastic Compute Cloud (EC2), which was used to attach real dollar costs to each analysis type. We found considerable differences in computational requirements, runtimes and costs associated with different microbial genomics applications. While all 16S analyses completed on a single-CPU desktop in under three hours, microbial genome and metagenome analyses utilized multi-CPU support of up to 120 CPUs on Amazon EC2, where each analysis completed in under 24 hours for less than $60. Representative datasets were used to estimate maximum data throughput on different cluster sizes and to compare costs between EC2 and comparable local grid servers.
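The way a dollar cost attaches to a cluster run can be sketched as simple arithmetic: number of instances times billed hours times the hourly price. The function name `estimate_cloud_cost`, the per-instance-hour price and the assumption that partial hours are rounded up to whole billed hours are illustrative assumptions, not figures or billing rules taken from the abstract.

```python
import math


def estimate_cloud_cost(instances, runtime_hours, price_per_instance_hour):
    """Rough cost of one analysis run on a rented cluster.

    Assumes (hypothetically) that every instance runs for the whole job
    and that partial hours are billed as full hours.
    """
    billed_hours = math.ceil(runtime_hours)
    return instances * billed_hours * price_per_instance_hour


# Hypothetical run: 15 instances for 5.5 hours at $0.50 per instance-hour.
print(estimate_cloud_cost(15, 5.5, 0.50))
```

Comparing such an estimate against the amortized cost of a local server for the same throughput is the kind of comparison the abstract describes, although the actual prices and cluster sizes would have to come from the paper itself.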

Conclusions

Although bioinformatics requirements for microbial genomics depend on dataset characteristics and the analysis protocols applied, our results suggest that smaller sequencing facilities (up to three Roche/454 or one Illumina GAIIx sequencer) invested in 16S rRNA amplicon sequencing, microbial single-genome and metagenomics WGS projects can achieve cost-efficient bioinformatics support using CloVR in combination with Amazon EC2 as an alternative to local computing centers.

8.
9.

Background

Classical morphological taxonomy places the approximately 1400 recognized species of Scleractinia (hard corals) into 27 families, but many aspects of coral evolution remain unclear despite the application of molecular phylogenetic methods. In part, this may be a consequence of such studies focusing on the reef-building (shallow water and zooxanthellate) Scleractinia, and largely ignoring the large number of deep-sea species. To better understand broad patterns of coral evolution, we generated molecular data for a broad and representative range of deep sea scleractinians collected off New Caledonia and Australia during the last decade, and conducted the most comprehensive molecular phylogenetic analysis to date of the order Scleractinia.

Methodology

Partial (595 bp) sequences of the mitochondrial cytochrome oxidase subunit 1 (CO1) gene were determined for 65 deep-sea (azooxanthellate) scleractinians and 11 shallow-water species. These new data were aligned with 158 published sequences, generating a 234 taxon dataset representing 25 of the 27 currently recognized scleractinian families.

Principal Findings/Conclusions

There was a striking discrepancy between the taxonomic validity of coral families consisting predominantly of deep-sea or shallow-water species. Most families composed predominantly of deep-sea azooxanthellate species were monophyletic in both maximum likelihood and Bayesian analyses but, by contrast (and consistent with previous studies), most families composed predominantly of shallow-water zooxanthellate taxa were polyphyletic, although Acroporidae, Poritidae, Pocilloporidae, and Fungiidae were exceptions to this general pattern. One factor contributing to this inconsistency may be the greater environmental stability of deep-sea environments, effectively removing taxonomic “noise” contributed by phenotypic plasticity. Our phylogenetic analyses imply that the most basal extant scleractinians are azooxanthellate solitary corals from deep water, their divergence predating that of the robust and complex corals. Deep-sea corals are likely to be critical to understanding anthozoan evolution and the origins of the Scleractinia.

10.

Background

Lateral gene transfer (LGT) is an important evolutionary process in microbial evolution. In sewage treatment plants, LGT of antibiotic resistance and xenobiotic degradation-related proteins has been suggested, but the role of LGT outside these processes is unknown. Microbial communities involved in Enhanced Biological Phosphorus Removal (EBPR) have been used to treat wastewater in the last 50 years and may provide insights into adaptation to an engineered environment. We introduce two different types of analysis to identify LGT in EBPR sewage communities, based on identifying assembled sequences with more than one strong taxonomic match, and on unusual phylogenetic patterns. We applied these methods to investigate the role of LGT in six energy-related metabolic pathways.
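The first, taxonomy-based idea (an assembled sequence with more than one strong taxonomic match is an LGT candidate) can be sketched as follows. This is only an illustration under stated assumptions: the input format of `(contig_id, taxon, score)` tuples, the function name `flag_lgt_candidates`, and the `min_score` threshold are hypothetical and are not taken from the study.

```python
from collections import defaultdict


def flag_lgt_candidates(hits, min_score=200.0):
    """Return contigs whose strong matches span more than one taxon.

    hits: iterable of (contig_id, taxon, score) tuples, e.g. parsed from a
          similarity search (the format is an assumption for this sketch).
    min_score: hypothetical cut-off defining a "strong" match.
    """
    strong_taxa = defaultdict(set)
    for contig_id, taxon, score in hits:
        if score >= min_score:
            strong_taxa[contig_id].add(taxon)
    # Keep only contigs with strong matches to at least two distinct taxa.
    return {
        contig: sorted(taxa)
        for contig, taxa in strong_taxa.items()
        if len(taxa) > 1
    }
```

A contig whose strong hits all fall in one taxon is not flagged, and weak secondary hits below the threshold are ignored; candidates flagged this way would still need the phylogenetic check described in the abstract.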

Results

The analyses identified overlapping but non-identical sets of transferred enzymes. All of these were homologous with sequences from known mobile genetic elements, and many were also in close proximity to transposases and integrases in the EBPR data set. The taxonomic method had higher sensitivity than the phylogenetic method, identifying more potential LGTs. Both analyses identified the putative transfer of five enzymes within an Australian community, two in a Danish community, and none in a US-derived culture.

Conclusions

Our methods were able to identify sequences with unusual phylogenetic or compositional properties as candidate LGT events. The association of these candidates with known mobile elements supports the hypothesis of transfer. The results of our analysis strongly suggest that LGT has influenced the development of functionally important energy-related pathways in EBPR systems, but transfers may be unique to each community due to different operating conditions or taxonomic composition.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1752-5) contains supplementary material, which is available to authorized users.

11.
Krug AZ, Jablonski D, Roy K, Beu AG. PLoS ONE 2010, 5(12): e15362

Background

The low taxonomic diversity of polar marine faunas today reflects both the failure of clades to colonize or diversify in high latitudes and regional extinctions of once-present clades. However, simple models of polar evolution are made difficult by the strikingly different faunal compositions and community structures of the two poles.

Methodology/Principal Findings

A comparison of early Cenozoic Arctic and Antarctic bivalve faunas with modern ones, within the framework of a molecular phylogeny, shows that while Arctic losses were randomly distributed across the tree, Antarctic losses were significantly concentrated in more derived families, resulting in communities dominated by basal lineages. Potential mechanisms for the phylogenetic structure to Antarctic extinctions include continental isolation, changes in primary productivity leading to turnover of both predators and prey, and the effect of glaciation on shelf habitats.

Conclusions/Significance

These results show that phylogenetic consequences of past extinctions can vary substantially among regions and thus shape regional faunal structures, even when due to similar drivers, here global cooling, and provide the first phylogenetic support for the “retrograde” hypothesis of Antarctic faunal evolution.

12.

Background

The ribosome translates mRNA to protein with the aid of a number of accessory protein factors. Translational GTPases (trGTPases) are an integral part of the ‘core set’ of essential translational factors, and are some of the most conserved proteins across life. This study takes advantage of the wealth of available genomic data, along with novel functional information that has come to light for a number of trGTPases to address the full evolutionary and functional diversity of this superfamily across all domains of life.

Results

Through sensitive sequence searching combined with phylogenetic analysis, 57 distinct subfamilies of trGTPases are identified: 14 bacterial, 7 archaeal and 35 eukaryotic (of which 21 are known or predicted to be organellar). The results uncover the functional evolution of trGTPases from before the last common ancestor of life on earth to the current day.

Conclusions

While some trGTPases are universal, others are limited to certain taxa, suggesting lineage-specific translational control mechanisms that exist on a base of core factors. These lineage-specific features may give organisms the ability to tune their translation machinery to respond to their environment. Only a fraction of the diversity of the trGTPase superfamily has been subjected to experimental analyses; this comprehensive classification brings to light novel and overlooked translation factors that are worthy of further investigation.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1289-7) contains supplementary material, which is available to authorized users.

13.

Background

Effectiveness of ART regimens strongly depends upon complex interactions between the selective pressure of drugs and the evolution of mutations that allow or restrict drug resistance.

Methods

Four clinical isolates from NRTI-exposed, NNRTI-naive subjects were passaged in increasing concentrations of NVP in combination with 1 µM 3TC and 2 µM ADV to assess selective pressures of multi-drug treatment. A novel parameter inference procedure, based on a stochastic viral growth model, was used to estimate phenotypic resistance and fitness from in vitro combination passage experiments.

Results

Newly developed mathematical methods estimated key phenotypic parameters of mutations arising through selective pressure exerted by 3TC and NVP. Concentrations of 1 µM 3TC maintained the M184V mutation, which was associated with intrinsic fitness deficits. Increasing NVP concentrations selected major NNRTI resistance mutations. The evolutionary pathway of NVP resistance was highly dependent on the viral genetic background, epistasis and stochasticity. Parameter estimation indicated that the previously unrecognized mutation L228Q was associated with NVP resistance in some isolates.

Conclusion

Serial passage of viruses in the presence of multiple drugs may resemble the selection of mutations observed among treated individuals and populations in vivo and indicate evolutionary preferences and restrictions. Phenotypic resistance estimated here “in silico” from in vitro passage experiments agreed well with previous knowledge, suggesting that the unique combination of “wet-” and “dry-lab” experimentation may improve our understanding of HIV-1 resistance evolution in the future.

14.

Background

We present a multi-locus phylogenetic analysis of the shallow water (high intertidal) barnacle genus Chthamalus, focusing on member species in the western hemisphere. Understanding the phylogeny of this group improves interpretation of classical ecological work on competition, distributional changes associated with climate change, and the morphological evolution of complex cirripede phenotypes.

Methodology and Findings

We use traditional and Bayesian phylogenetic and ‘deep coalescent’ approaches to identify a phylogeny that supports the monophyly of the mostly American ‘fissus group’ of Chthamalus, but that also supports a need for taxonomic revision of Chthamalus and Microeuraphia. Two deep phylogeographic breaks were also found within the ranges of two tropical American taxa (C. angustitergum and C. southwardorum).

Conclusions

Our data, which include two novel gene regions for phylogenetic analysis of cirripedes, suggest that much more evaluation of the morphological evolutionary history and taxonomy of chthamalid barnacles is necessary. These data and associated analyses also indicate that the radiation of species in the late Pliocene and Pleistocene was very rapid, and may provide new insights into speciation via transient allopatry or ecological barriers.

15.

Background

The early evolution and diversification of Hox-related genes in eumetazoans has been the subject of conflicting hypotheses concerning the evolutionary conservation of their role in axial patterning and the pre-bilaterian origin of the Hox and ParaHox clusters. The diversification of Hox/ParaHox genes clearly predates the origin of bilaterians. However, the existence of a “Hox code” predating the cnidarian-bilaterian ancestor and supporting the deep homology of axes is more controversial. This assumption was mainly based on the interpretation of Hox expression data from the sea anemone, but growing evidence from other cnidarian taxa calls this hypothesis into question.

Methodology/Principal Findings

Hox, ParaHox and Hox-related genes have been investigated here by phylogenetic analysis and in situ hybridisation in Clytia hemisphaerica, a hydrozoan species with medusa and polyp stages alternating in the life cycle. Our phylogenetic analyses do not support an origin of ParaHox and Hox genes by duplication of an ancestral ProtoHox cluster, and reveal a diversification of the cnidarian HOX9-14 genes into three groups called A, B and C. Among the 7 examined genes, only those belonging to the HOX9-14 and the CDX groups exhibit a restricted expression along the oral-aboral axis during development and in the planula larva, while the others are expressed in very specialised areas at the medusa stage.

Conclusions/Significance

Cross-species comparison reveals strong variability of gene expression along the oral-aboral axis and during the life cycle among cnidarian lineages. The most parsimonious interpretation is that the Hox code, collinearity and conservative role along the antero-posterior axis are bilaterian innovations.

16.

Background

A better understanding of the size and abundance of open reading frames (ORFs) in whole genomes may shed light on the factors that control genome complexity. Here we examine the statistical distributions of open reading frames (i.e. distribution of start and stop codons) in the fully sequenced genomes of 297 prokaryotes and 14 eukaryotes.

Methodology/Principal Findings

By fitting mixture models to data from whole genome sequences we show that the size-frequency distributions for ORFs are strikingly similar across prokaryotic and eukaryotic genomes. Moreover, we show that (i) a large fraction (60–80%) of ORF size-frequency distributions can be predicted a priori with a stochastic assembly model based on GC content, and that (ii) size-frequency distributions of the remaining “non-random” ORFs are well-fitted by log-normal or gamma distributions, and similar to the size distributions of annotated proteins.
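To make concrete how GC content alone can predict a size distribution for "random" ORFs, here is a minimal null-model sketch. It assumes that bases are drawn independently with P(G) = P(C) = GC/2 and P(A) = P(T) = (1 − GC)/2, so the run of sense codons before the first stop codon (TAA, TAG or TGA) follows a geometric distribution. This is a textbook simplification chosen for illustration; the abstract does not specify the details of the authors' stochastic assembly model, and the function names are hypothetical.

```python
def stop_codon_probability(gc_content):
    """Probability that a random codon is TAA, TAG or TGA."""
    at = (1.0 - gc_content) / 2.0  # P(A) = P(T)
    gc = gc_content / 2.0          # P(G) = P(C)
    # P(TAA) = at^3; P(TAG) = P(TGA) = at^2 * gc
    return at**3 + 2 * at**2 * gc


def random_orf_length_pmf(gc_content, n_codons):
    """P(exactly n_codons sense codons occur before the first stop)."""
    p = stop_codon_probability(gc_content)
    return (1.0 - p) ** n_codons * p


def expected_random_orf_length(gc_content):
    """Mean number of sense codons before the first stop (geometric mean)."""
    p = stop_codon_probability(gc_content)
    return (1.0 - p) / p
```

At 50% GC the stop probability is 3/64, matching the three stop codons among 64 equiprobable codons, and GC-rich genomes are expected to carry longer random ORFs because stop codons are AT-rich. Observed ORFs that are much longer than this null expectation would fall into the "non-random" component the abstract describes.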

Conclusions/Significance

Our findings suggest stochastic processes have played a primary role in the evolution of genome complexity, and that common processes govern the conservation and loss of functional genomics units in both prokaryotes and eukaryotes.

17.
18.

Background

Only a small fraction of the mosquito species of the genus Anopheles are able to transmit malaria, one of the biggest killer diseases of poverty, which is mostly prevalent in the tropics. This diversity has genetic, yet unknown, causes. In a further attempt to contribute to the elucidation of these variances, the international “Anopheles Genomes Cluster Consortium” project (a.k.a. “16 Anopheles genomes project”) was established, aiming at a comprehensive genomic analysis of several anopheline species, most of which are malaria vectors. In the frame of the international consortium carrying out this project our team studied the genes encoding families of non-coding RNAs (ncRNAs), concentrating on four classes: microRNA (miRNA), ribosomal RNA (rRNA), small nuclear RNA (snRNA), and in particular small nucleolar RNA (snoRNA) and, finally, transfer RNA (tRNA).

Results

Our analysis was carried out using, exclusively, computational approaches, and evaluating both the primary NGS reads as well as the respective genome assemblies produced by the consortium and stored in VectorBase; moreover, the results of RNAseq surveys in cases in which these were available and meaningful were also accessed in order to obtain supplementary data, as were “pre-genomic era” sequence data stored in nucleic acid databases. The investigation included the identification and analysis, in most species studied, of ncRNA genes belonging to several families, as well as the analysis of the evolutionary relations of some of those genes in cross-comparisons to other members of the genus Anopheles.

Conclusions

Our study led to the identification of members of these gene families in the majority of twenty different anopheline taxa. A set of tools for the study of the evolution and molecular biology of important disease vectors has, thus, been obtained.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1038) contains supplementary material, which is available to authorized users.

19.
20.

Background

As the first pathway-specific enzyme in carotenoid biosynthesis, phytoene synthase (PSY) is a prime regulatory target. This includes a number of biotechnological approaches that have successfully increased the carotenoid content in agronomically relevant non-green plant tissues through tissue-specific PSY overexpression. We investigated the differential effects of constitutive AtPSY overexpression in green and non-green cells of transgenic Arabidopsis lines. This revealed striking similarities to the situation found in orange carrot roots with respect to carotenoid amounts and sequestration mechanism.

Methodology/Principal Findings

In Arabidopsis seedlings, carotenoid content remained unaffected by increased AtPSY levels although the protein was almost quantitatively imported into plastids, as shown by western blot analyses. In contrast, non-photosynthetic calli and roots overexpressing AtPSY accumulated carotenoids 10- and 100-fold above the corresponding wild-type tissues and contained 1800 and 500 µg carotenoids per g dry weight, respectively. This increase coincided with a change in the pattern of accumulated carotenoids, as xanthophylls decreased relative to β-carotene and carotene intermediates accumulated. As shown by polarization microscopy, carotenoids were found deposited in crystals, similar to crystalline-type chromoplasts of non-green tissues present in several other taxa. In fact, orange-colored carrots showed a similar situation, with increased PSY protein as well as carotenoid levels and accumulation patterns, whereas wild white-rooted carrots were similar to Arabidopsis wild-type roots in this respect. Initiation of carotenoid crystal formation by increased PSY protein amounts was further confirmed by overexpressing crtB, a bacterial PSY gene, in white carrots, resulting in increased carotenoid amounts deposited in crystals.

Conclusions

The sequestration of carotenoids into crystals can be driven by the functional overexpression of one biosynthetic enzyme in non-green plastids, without requiring a chromoplast developmental program, which does not exist in Arabidopsis. Thus, PSY expression plays a major, rate-limiting role in the transition from white to orange-colored carrots.
