首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
With the arrival of low-cost, next-generation sequencing, a multitude of new plant genomes are being publicly released, providing unseen opportunities and challenges for comparative genomics studies. Here, we present PLAZA 2.5, a user-friendly online research environment to explore genomic information from different plants. This new release features updates to previous genome annotations and a substantial number of newly available plant genomes as well as various new interactive tools and visualizations. Currently, PLAZA hosts 25 organisms covering a broad taxonomic range, including 13 eudicots, five monocots, one lycopod, one moss, and five algae. The available data consist of structural and functional gene annotations, homologous gene families, multiple sequence alignments, phylogenetic trees, and colinear regions within and between species. A new Integrative Orthology Viewer, combining information from different orthology prediction methodologies, was developed to efficiently investigate complex orthology relationships. Cross-species expression analysis revealed that the integration of complementary data types extended the scope of complex orthology relationships, especially between more distantly related species. Finally, based on phylogenetic profiling, we propose a set of core gene families within the green plant lineage that will be instrumental to assess the gene space of draft or newly sequenced plant genomes during the assembly or annotation phase.  相似文献   

2.
3.
Unicellular algae serve as models for the study and discovery of metabolic pathways, for the functional dissection of cell biological processes such as organellar division and cell motility, and for the identification of novel genes and gene functions. The recent completion of several algal genome sequences and expressed sequence tag collections and the establishment of nuclear and organellar transformation methods has opened the way for functional genomics approaches using algal model systems. The thermo-acidophilic unicellular red alga Galdieria sulphuraria represents a particularly interesting species for a genomics approach owing to its extraordinary metabolic versatility such as heterotrophic and mixotrophic growth on more than 50 different carbon sources and its adaptation to hot acidic environments. However, the ab initio prediction of genes required for unknown metabolic pathways from genome sequences is not trivial. A compelling strategy for gene identification is the comparison of similarly sized genomes of related organisms with different physiologies. Using this approach, candidate genes were identified that are critical to the metabolic versatility of Galdieria. Expressed sequence tags and high-throughput genomic sequence reads covering >70% of the G. sulphuraria genome were compared to the genome of the unicellular, obligate photoautotrophic red alga Cyanidioschyzon merolae. More than 30% of the Galdieria sequences did not relate to any of the Cyanidioschyzon genes. A closer inspection of these sequences revealed a large number of membrane transporters and enzymes of carbohydrate metabolism that are unique to Galdieria. Based on these data, it is proposed that genes involved in the uptake of reduced carbon compounds and enzymes involved in their metabolism are crucial to the metabolic flexibility of G. sulphuraria.  相似文献   

4.
Functional genomic approaches, such as proteomics, greatly enhance the value of genome sequences by providing a global level assessment of which genes are expressed, when genes are expressed and at what cellular levels gene products are synthesized. With over 1000 complete genome sequences of different microorganisms available, and DNA sequencing for environmental samples (metagenomes) producing vast amounts of gene sequence data, there is a real opportunity and a clear need to generate associated functional genomic data to learn about the source microorganisms. In contrast to the technological advances that have led to the accelerated rate and ease at which DNA sequence data can be generated, mass spectrometry based proteomics remains a technically sophisticated and exacting science. In recognition of the need to make proteomics more accessible to a growing number of environmental microbiologists so that the 'functional genomics gap' may be bridged, this review strives to demystify proteomic technologies and describe ways in which they have been applied, and more importantly, can be applied to study the physiology and ecology of extremophiles.  相似文献   

5.
The challenge in the development of universal algal primers lies in the genetic diversity contained within the vast array of evolutionary lineages present in this informally named group of organisms. A comparative genomics approach was used previously to identify conserved primers flanking a region of the plastid genome. Our present research illustrates the feasibility of amplifying and sequencing this marker across multiple algal lineages. We present a preliminary framework of 107 novel sequences of this region from 62 red algae, 19 green algae, 14 brown algae, 8 cyanobacteria, 2 diatoms, 1 xanthophyte, and 1 euglenoid, and illustrate levels of divergence of the marker for well‐represented groups in a neighbor‐joining analysis. This ~410 nt region distinguishes most species included in the analysis. The remarkable universality of these primers suggests potential for their use in assays of environmental samples in which they could be used to simultaneously detect a number of different algal lineages.  相似文献   

6.
7.
It has been more than 10 years since the first bacterial genome sequence was published. Hundreds of bacterial genome sequences are now available for comparative genomics, and searching a given protein against more than a thousand genomes will soon be possible. The subject of this review will address a relatively straightforward question: “What have we learned from this vast amount of new genomic data?” Perhaps one of the most important lessons has been that genetic diversity, at the level of large-scale variation amongst even genomes of the same species, is far greater than was thought. The classical textbook view of evolution relying on the relatively slow accumulation of mutational events at the level of individual bases scattered throughout the genome has changed. One of the most obvious conclusions from examining the sequences from several hundred bacterial genomes is the enormous amount of diversity—even in different genomes from the same bacterial species. This diversity is generated by a variety of mechanisms, including mobile genetic elements and bacteriophages. An examination of the 20 Escherichia coli genomes sequenced so far dramatically illustrates this, with the genome size ranging from 4.6 to 5.5 Mbp; much of the variation appears to be of phage origin. This review also addresses mobile genetic elements, including pathogenicity islands and the structure of transposable elements. There are at least 20 different methods available to compare bacterial genomes. Metagenomics offers the chance to study genomic sequences found in ecosystems, including genomes of species that are difficult to culture. It has become clear that a genome sequence represents more than just a collection of gene sequences for an organism and that information concerning the environment and growth conditions for the organism are important for interpretation of the genomic data. The newly proposed Minimal Information about a Genome Sequence standard has been developed to obtain this information.  相似文献   

8.
Comparative algal genomics often relies on predicted genes from de novo assembled genomes. However, the artifacts introduced by different gene-prediction approaches, and their impact on comparative genomic analysis remain poorly understood. Here, using available genome data from six dinoflagellate species in the Symbiodiniaceae, we identified methodological biases in the published genes that were predicted using different approaches and putative contaminant sequences in the published genome assemblies. We developed and applied a comprehensive customized workflow to predict genes from these genomes. The observed variation among predicted genes resulting from our workflow agreed with current understanding of phylogenetic relationships among these taxa, whereas the variation among the previously published genes was largely biased by the distinct approaches used in each instance. Importantly, these biases affect the inference of homologous gene families and synteny among genomes, thus impacting biological interpretation of these data. Our results demonstrate that a consistent gene-prediction approach is critical for comparative analysis of dinoflagellate genomes.  相似文献   

9.
Lyssaviruses are RNA viruses with single-strand, negative-sense genomes responsible for rabies-like diseases in mammals. To date, genomic and evolutionary studies have most often utilized partial genome sequences, particularly of the nucleoprotein and glycoprotein genes, with little consideration of genome-scale evolution. Herein, we report the first genomic and evolutionary analysis using complete genome sequences of all recognised lyssavirus genotypes, including 14 new complete genomes of field isolates from 6 genotypes and one genotype that is completely sequenced for the first time. In doing so we significantly increase the extent of genome sequence data available for these important viruses. Our analysis of these genome sequence data reveals that all lyssaviruses have the same genomic organization. A phylogenetic analysis reveals strong geographical structuring, with the greatest genetic diversity in Africa, and an independent origin for the two known genotypes that infect European bats. We also suggest that multiple genotypes may exist within the diversity of viruses currently classified as 'Lagos Bat'. In sum, we show that rigorous phylogenetic techniques based on full length genome sequence provide the best discriminatory power for genotype classification within the lyssaviruses.  相似文献   

10.
The genome sequences provide a first glimpse into the genomic basis of the biological diversity of filamentous fungi and yeast. The genome sequence of the budding yeast, Saccharomyces cerevisiae, with a small genome size, unicellular growth, and rich history of genetic and molecular analyses was a milestone of early genomics in the 1990s. The subsequent completion of fission yeast, Schizosaccharomyces pombe and genetic model, Neurospora crassa initiated a revolution in the genomics of the fungal kingdom. In due course of time, a substantial number of fungal genomes have been sequenced and publicly released, representing the widest sampling of genomes from any eukaryotic kingdom. An ambitious genome-sequencing program provides a wealth of data on metabolic diversity within the fungal kingdom, thereby enhancing research into medical science, agriculture science, ecology, bioremediation, bioenergy, and the biotechnology industry. Fungal genomics have higher potential to positively affect human health, environmental health, and the planet’s stored energy. With a significant increase in sequenced fungal genomes, the known diversity of genes encoding organic acids, antibiotics, enzymes, and their pathways has increased exponentially. Currently, over a hundred fungal genome sequences are publicly available; however, no inclusive review has been published. This review is an initiative to address the significance of the fungal genome-sequencing program and provides the road map for basic and applied research.  相似文献   

11.
12.
Community genomics in microbial ecology and evolution   总被引:1,自引:0,他引:1  
It is possible to reconstruct near-complete, and possibly complete, genomes of the dominant members of microbial communities from DNA that is extracted directly from the environment. Genome sequences from environmental samples capture the aggregate characteristics of the strain population from which they were derived. Comparison of the sequence data within and among natural populations can reveal the evolutionary processes that lead to genome diversification and speciation. Community genomic datasets can also enable subsequent gene expression and proteomic studies to determine how resources are invested and functions are distributed among community members. Ultimately, genomics can reveal how individual species and strains contribute to the net activity of the community.  相似文献   

13.
Xu J 《Molecular ecology》2006,15(7):1713-1731
Microbial ecology examines the diversity and activity of micro-organisms in Earth's biosphere. In the last 20 years, the application of genomics tools have revolutionized microbial ecological studies and drastically expanded our view on the previously underappreciated microbial world. This review first introduces the basic concepts in microbial ecology and the main genomics methods that have been used to examine natural microbial populations and communities. In the ensuing three specific sections, the applications of the genomics in microbial ecological research are highlighted. The first describes the widespread application of multilocus sequence typing and representational difference analysis in studying genetic variation within microbial species. Such investigations have identified that migration, horizontal gene transfer and recombination are common in natural microbial populations and that microbial strains can be highly variable in genome size and gene content. The second section highlights and summarizes the use of four specific genomics methods (phylogenetic analysis of ribosomal RNA, DNA-DNA re-association kinetics, metagenomics, and micro-arrays) in analysing the diversity and potential activity of microbial populations and communities from a variety of terrestrial and aquatic environments. Such analyses have identified many unexpected phylogenetic lineages in viruses, bacteria, archaea, and microbial eukaryotes. Functional analyses of environmental DNA also revealed highly prevalent, but previously unknown, metabolic processes in natural microbial communities. In the third section, the ecological implications of sequenced microbial genomes are briefly discussed. Comparative analyses of prokaryotic genomic sequences suggest the importance of ecology in determining microbial genome size and gene content. The significant variability in genome size and gene content among strains and species of prokaryotes indicate the highly fluid nature of prokaryotic genomes, a result consistent with those from multilocus sequence typing and representational difference analyses. The integration of various levels of ecological analyses coupled to the application and further development of high throughput technologies are accelerating the pace of discovery in microbial ecology.  相似文献   

14.
Mining bacterial genomes for antimicrobial targets   总被引:2,自引:0,他引:2  
The elucidation of whole-genome sequences is expected to have a revolutionary impact on the discovery of novel medicines. With the availability of complete genome sequences of more than 30 different species, the field of antimicrobial drug discovery has the opportunity to access a remarkable diversity of genomic information. In this review, I summarize how microbial genomics has changed strategies of drug discovery by applying bioinformatics, novel genetic approaches and genomics-based technologies, including analysis of gene expression using DNA microarrays.  相似文献   

15.
Despite their importance to evolution, ecology, and cell biology, eukaryotes that acquired plastids through secondary endosymbiosis remain poorly studied from a genomic standpoint. Chromalveolata, a eukaryotic supergroup proposed to have descended from a heterotrophic eukaryote that acquired a red algal plastid by secondary endosymbiosis, includes four major lineages (alveolates, cryptophytes, haptophytes, and heterokonts). The chromalveolates exhibit remarkable diversity of cellular organization, and the available data suggest that they exhibit equal diversity in their genome organization. One of the most obvious differences in cellular organization is the retention of a highly reduced red algal nucleus in cryptophytes (also known as cryptomonads), but there are other major differences among chromalveolate lineages, including the loss of photosynthesis in multiple lineages. Although the hypothesis of chromalveolate monophyly is appealing, there is limited support for the hypothesis from nuclear genes, and questions have even been raised about the monophyly of chromalveolate plastids. Evidence for the chromalveolate hypothesis from large‐scale nuclear data sets is reviewed, and alternative hypotheses are described. The potential for integrating information from chromalveolate genomics into functional genomics is described, emphasizing both the methodological challenges and the opportunities for future phylogenomic analyses of these groups.  相似文献   

16.
Pathways database system: an integrated system for biological pathways   总被引:1,自引:0,他引:1  
MOTIVATION: During the next phase of the Human Genome Project, research will focus on functional studies of attributing functions to genes, their regulatory elements, and other DNA sequences. To facilitate the use of genomic information in such studies, a new modeling perspective is needed to examine and study genome sequences in the context of many kinds of biological information. Pathways are the logical format for modeling and presenting such information in a manner that is familiar to biological researchers. RESULTS: In this paper we present an integrated system, called Pathways Database System, with a set of software tools for modeling, storing, analyzing, visualizing, and querying biological pathways data at different levels of genetic, molecular, biochemical and organismal detail. The novel features of the system include: (a) genomic information integrated with other biological data and presented from a pathway, rather than from the DNA sequence, perspective; (b) design for biologists who are possibly unfamiliar with genomics, but whose research is essential for annotating gene and genome sequences with biological functions; (c) database design, implementation and graphical tools which enable users to visualize pathways data in multiple abstraction levels, and to pose predetermined queries; and (d) an implementation that allows for web(XML)-based dissemination of query outputs (i.e. pathways data) to researchers in the community, giving them control on the use of pathways data. AVAILABILITY: Available on request from the authors.  相似文献   

17.
ASAP: the Alternative Splicing Annotation Project   总被引:2,自引:0,他引:2  
Recently, genomics analyses have demonstrated that alternative splicing is widespread in mammalian genomes (30-60% of genes reported to have multiple isoforms), and may be one of their most important mechanisms of functional regulation. However, by comparison with other genomics data such as genome annotation, SNPs, or gene expression, there exists relatively little database infrastructure for the study of alternative splicing. We have constructed an online database ASAP (the Alternative Splicing Annotation Project) for biologists to access and mine the enormous wealth of alternative splicing information coming from genomics and proteomics. ASAP is based on genome-wide analyses of alternative splicing in human (30 793 alternative splice relationships found) from detailed alignment of expressed sequences onto the genomic sequence. ASAP provides precise gene exon-intron structure, alternative splicing, tissue specificity of alternative splice forms, and protein isoform sequences resulting from alternative splicing. Moreover, it can help biologists design probe sequences for distinguishing specific mRNA isoforms. ASAP is intended to be a community resource for collaborative annotation of alternative splice forms, their regulation, and biological functions. The URL for ASAP is http://www.bioinformatics.ucla.edu/ASAP.  相似文献   

18.
Comparative genomics is an essential tool to unravel how genomes change over evolutionary time and to gain clues on the links between functional genomics and evolution. In prokaryotes, the large, good quality, genome sequences available in public databases and the recently developed large-scale computational methods, offer an unprecedent view on the ecology and evolution of microorganisms through comparative genomics. In this work, we examined the links among genome structure (i.e., the sequential distribution of nucleotides itself by detrended fluctuation analysis, DFA) and genomic diversity (i.e., gene functionality by Clusters of Orthologous Genes, COGs) in 828 full sequenced prokaryotic genomes from 548 different bacteria and archaea species. DFA scaling exponent α indicated persistent long-range correlations (fractality) in each genome analyzed. Higher resolution power was found when considering the sequential succession of purine (AG) vs. pyrimidine (CT) bases than either keto (GT) to amino (AC) forms or strongly (GC) vs. weakly (AT) bonded nucleotides. Interestingly, the phyla Aquificae, Fusobacteria, Dictyoglomi, Nitrospirae, and Thermotogae were closer to archaea than to their bacterial counterparts. A strong significant correlation was found between scaling exponent α and COGs distribution, and we consistently observed that the larger α the more heterogeneous was the gene distribution within each functional category, suggesting a close relationship between primary nucleotides sequence structure and functional genes composition.  相似文献   

19.
20.
How variation in the genome translates into biological diversity and new species originate has endured as the mystery of mysteries in evolutionary biology. African cichlid fishes are prime model systems to address speciation‐related questions for their remarkable taxonomic and phenotypic diversity, and the possible role of gene flow in this process. Here, we capitalize on genome sequencing and phylogenomic analyses to address the relative impacts of incomplete lineage sorting, introgression and hybrid speciation in the Neolamprologus savoryi‐complex (the ‘Princess cichlids’) from Lake Tanganyika. We present a time‐calibrated species tree based on whole‐genome sequences and provide strong evidence for incomplete lineage sorting in the early phases of diversification and multiple introgression events affecting different stages. Importantly, we find that the Neolamprologus chromosomes show centre‐to‐periphery biases in nucleotide diversity, sequence divergence, GC content, incomplete lineage sorting and rates of introgression, which are likely modulated by recombination density and linked selection. The detection of heterogeneous genomic landscapes has strong implications on the genomic mechanisms involved in speciation. Collinear chromosomal regions can be protected from gene flow and harbour incompatibility genes if they reside in lowly recombining regions, and coupling can evolve between nonphysically linked genomic regions (chromosome centres in particular). Simultaneously, higher recombination towards chromosome peripheries makes these more dynamic, evolvable regions where adaptation polymorphisms have a fertile ground. Hence, differences in genome architecture could explain the levels of taxonomic and phenotypic diversity seen in taxa with collinear genomes and might have contributed to the spectacular cichlid diversity observed today.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号