首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
While genome sequencing is becoming ever more routine, genome annotation remains a challenging process. Identification of the coding sequences within the genomic milieu presents a tremendous challenge, especially for eukaryotes with their complex gene architectures. Here, we present a method to assist the annotation process through the use of proteomic data and bioinformatics. Mass spectra of digested protein preparations of the organism of interest were acquired and searched against a protein database created by a six-frame translation of the genome. The identified peptides were mapped back to the genome, compared to the current annotation, and then categorized as supporting or extending the current genome annotation. We named the classified peptides Expressed Peptide Tags (EPTs). The well-annotated bacterium Rhodopseudomonas palustris was used as a control for the method and showed a high degree of correlation between EPT mapping and the current annotation, with 86% of the EPTs confirming existing gene calls and less than 1% of the EPTs expanding on the current annotation. The eukaryotic plant pathogens Phytophthora ramorum and Phytophthora sojae, whose genomes have been recently sequenced and are much less well-annotated, were also subjected to this method. A series of algorithmic steps were taken to increase the confidence of EPT identification for these organisms, including generation of smaller subdatabases to be searched against, and definition of EPT criteria that accommodates the more complex eukaryotic gene architecture. As expected, the analysis of the Phytophthora species showed less correlation between EPT mapping and their current annotation. While approximately 76% of Phytophthora EPTs supported the current annotation, a portion of them (7.7% and 12.9% for P. ramorum and P. sojae, respectively) suggested modification to current gene calls or identified novel genes that were missed by the current genome annotation of these organisms.  相似文献   

2.
After almost a decade of work, the sequencing, assembly, and annotation of the genome of the fungal pathogen Candida albicans is finally close at hand. This review covers the early history of the C. albicans genome project, from the release of early assemblies that provided the impetus for an explosion in functional genomics research, to a community-based annotation and a preview of the work that was necessary for the production of a final genome assembly.  相似文献   

3.
Halopiger xanaduensis is the type species of the genus Halopiger and belongs to the euryarchaeal family Halobacteriaceae. H. xanaduensis strain SH-6, which is designated as the type strain, was isolated from the sediment of a salt lake in Inner Mongolia, Lake Shangmatala. Like other members of the family Halobacteriaceae, it is an extreme halophile requiring at least 2.5 M salt for growth. We report here the sequencing and annotation of the 4,355,268 bp genome, which includes one chromosome and three plasmids. This genome is part of a Joint Genome Institute (JGI) Community Sequencing Program (CSP) project to sequence diverse haloarchaeal genomes.  相似文献   

4.
马铃薯致病疫霉研究进展   总被引:3,自引:0,他引:3  
马铃薯致病疫霉(Phytophthora infestans)属卵菌纲(Oomycetes)霜霉目(Peronosporales)腐霉科(Pythiaceae)疫霉属(Phytophthora),是马铃薯和番茄晚疫病病原菌。由于晚疫病对马铃薯生产的毁灭性和严重性,对致病疫霉的研究一直是关注的重点。本文首先对病害引起的症状、发生特点及流行规律进行阐述,对有性生殖发生的遗传规律和多种交配型共存的大环境下病原菌群体结构变异特点进行归纳总结。随着2009年致病疫霉基因组测序的完成,本文比对了疫霉属目前已完成测序各个种的基因组学特点,介绍了致病疫霉在效应子克隆方面的研究进展及线粒体基因组研究现状,阐述了功能基因组学的两个重要技术:高密度遗传连锁图谱(high density linkage mapping)和全基因组关联分析(genome-wide association study,GWAS),及其在挖掘致病疫霉重要功能基因上的应用。本文有助于了解致病疫霉研究热点及后续突破方向,可为深入解析致病疫霉的功能基因及致病机制提供参考,对开发马铃薯晚疫病菌药物靶标及预测病害的大规模流行趋势也具有重要意...  相似文献   

5.
Lower eukaryotes of the kingdom Fungi include a variety of biotechnologically important yeast species that are in the focus of genome research for more than a decade. Due to the rapid progress in ultra-fast sequencing technologies, the amount of available yeast genome data increases steadily. Thus, an efficient bioinformatics platform is required that covers genome assembly, eukaryotic gene prediction, genome annotation, comparative yeast genomics, and metabolic pathway reconstruction. Here, we present a bioinformatics platform for yeast genomics named RAPYD addressing the key requirements of extensive yeast sequence data analysis. The first step is a comprehensive regional and functional annotation of a yeast genome. A region prediction pipeline was implemented to obtain reliable and high-quality predictions of coding sequences and further genome features. Functions of coding sequences are automatically determined using a configurable prediction pipeline. Based on the resulting functional annotations, a metabolic pathway reconstruction module can be utilized to rapidly generate an overview of organism-specific features and metabolic blueprints. In a final analysis step shared and divergent features of closely related yeast strains can be explored using the comparative genomics module. An in-depth application example of the yeast Meyerozyma guilliermondii illustrates the functionality of RAPYD. A user-friendly web interface is available at https://rapyd.cebitec.uni-bielefeld.de.  相似文献   

6.
7.
Staphylothermus hellenicus belongs to the order Desulfurococcales within the archaeal phylum Crenarchaeota. Strain P8(T) is the type strain of the species and was isolated from a shallow hydrothermal vent system at Palaeochori Bay, Milos, Greece. It is a hyperthermophilic, anaerobic heterotroph. Here we describe the features of this organism together with the complete genome sequence and annotation. The 1,580,347 bp genome with its 1,668 protein-coding and 48 RNA genes was sequenced as part of a DOE Joint Genome Institute (JGI) Laboratory Sequencing Program (LSP) project.  相似文献   

8.
9.
Structural proteomics: a tool for genome annotation   总被引:1,自引:0,他引:1  
In any newly sequenced genome, 30% to 50% of genes encode proteins with unknown molecular or cellular function. Fortunately, structural genomics is emerging as a powerful approach of functional annotation. Because of recent developments in high-throughput technologies, ongoing structural genomics projects are generating new structures at an unprecedented rate. In the past year, structural studies have identified many new structural motifs involved in enzymatic catalysis or in binding ligands or other macromolecules (DNA, RNA, protein). The efficiency by which function is deduced from structure can be further improved by the integration of structure with bioinformatics and other experimental approaches, such as screening for enzymatic activity or ligand binding.  相似文献   

10.
Phytophthora species are devastating plant pathogens in both agricultural and natural environments. Due to their significant economic and environmental impact, there has been increasing interest in Phytophthora genetics and genomics, culminating in the recent release of three complete genome sequences (P. ramorum, P. sojae, and P. infestans). In this study, genome and other large sequence databases were used to identify over 225 potential genetic markers for phylogenetic analyses. Here, we present a genus-wide phylogeny for 82 Phytophthora species using seven of the most informative loci (approximately 8700 nucleotide sites). Our results support the division of the genus into 10 well-supported clades. The relationships among these clades were rigorously evaluated using a number of phylogenetic methods. This is the most comprehensive study of Phytophthora relationships to date, and many newly discovered species have been included. A more resolved phylogeny of Phytophthora species will allow for better interpretations of the overall evolutionary history of the genus.  相似文献   

11.
Phytophthora species are devastating plant pathogens in both agricultural and natural environments. Due to their significant economic and environmental impact, there has been increasing interest in Phytophthora genetics and genomics, culminating in the recent release of three complete genome sequences (P. ramorum, P. sojae, and P. infestans). In this study, genome and other large sequence databases were used to identify over 225 potential genetic markers for phylogenetic analyses. Here, we present a genus-wide phylogeny for 82 Phytophthora species using seven of the most informative loci (approximately 8700 nucleotide sites). Our results support the division of the genus into 10 well-supported clades. The relationships among these clades were rigorously evaluated using a number of phylogenetic methods. This is the most comprehensive study of Phytophthora relationships to date, and many newly discovered species have been included. A more resolved phylogeny of Phytophthora species will allow for better interpretations of the overall evolutionary history of the genus.  相似文献   

12.
The enediynes are one of the most fascinating families of bacterial natural products given their unprecedented molecular architecture and extraordinary cytotoxicity. Enediynes are rare with only 11 structurally characterized members and four additional members isolated in their cycloaromatized form. Recent advances in DNA sequencing have resulted in an explosion of microbial genomes. A virtual survey of the GenBank and JGI genome databases revealed 87 enediyne biosynthetic gene clusters from 78 bacteria strains, implying that enediynes are more common than previously thought. Here we report the construction and analysis of an enediyne genome neighborhood network (GNN) as a high-throughput approach to analyze secondary metabolite gene clusters. Analysis of the enediyne GNN facilitated rapid gene cluster annotation, revealed genetic trends in enediyne biosynthetic gene clusters resulting in a simple prediction scheme to determine 9- versus 10-membered enediyne gene clusters, and supported a genomic-based strain prioritization method for enediyne discovery.  相似文献   

13.
The use and development of post-genomic tools naturally depends on large-scale genome sequencing projects. The usefulness of post-genomic applications is dependent on the accuracy of genome annotations, for which the correct identification of intron-exon borders in complex genomes of eukaryotic organisms is often an error-prone task. Although automated algorithms for predicting intron-exon structures are available, supporting exon evidence is necessary to achieve comprehensive genome annotation. Besides cDNA and EST support, peptides identified via MS/MS can be used as extrinsic evidence in a proteogenomic approach. We describe an improved version of the Genomic Peptide Finder (GPF), which aligns de novo predicted amino acid sequences to the genomic DNA sequence of an organism while correcting for peptide sequencing errors and accounting for the possibility of splicing. We have coupled GPF and the gene finding program AUGUSTUS in a way that provides automatic structural annotations of the Chlamydomonas reinhardtii genome, using highly unbiased GPF evidence. A comparison of the AUGUSTUS gene set incorporating GPF evidence to the standard JGI FM4 (Filtered Models 4) gene set reveals 932 GPF peptides that are not contained in the Filtered Models 4 gene set. Furthermore, the GPF evidence improved the AUGUSTUS gene models by altering 65 gene models and adding three previously unidentified genes.  相似文献   

14.
15.
The Prokaryotic Super Program Advisory Committee met on March 27, 2013 for their annual review the Prokaryotic Super Program at the DOE Joint Genome Institute. As is the case with any site visit or program review, the objective is to evaluate progress in meeting organizational objectives, provide feedback to from the user-community and to assist the JGI in formulating plans for the coming year. The advisors want to commend the JGI for its central role in developing new technologies and capabilities, and for catalyzing the formation of new collaborative user communities. Highlights of the post-meeting exchanges among the advisors focused on the importance of programmatic initiatives including:• GEBA, which serves as a phylogenetic “base-map” on which our knowledge of functional diversity can be layered.• FEBA, which promises to provide new insights into the physiological capabilities of prokaryotes under highly standardized conditions.• Single-cell genomics technology, which is seen to significantly enhance our ability to interpret genomic and metagenomic data and broaden the scope of the GEBA program to encompass at least a part of the microbial “dark-matter”.• IMG, which is seen to play a central role in JGI programs and is viewed as a strategically important asset in the JGI portfolio.On this latter point, the committee encourages the formation of a strategic relationship between IMG and the Kbase to ensure that the intelligence, deep knowledge and experience captured in the former is not lost. The committee strongly urges the DOE to continue its support for maintaining this critical resource.  相似文献   

16.
This report from the International Summer School 'From Genome to Life', held at the Institute d'Etudes Scientifiques de Cargèse in Corsica in July 2002, covers the talks of the invited speakers. The topics of the talks can be broadly grouped into the areas of genome annotation, comparative and evolutionary genomics, functional genomics, proteomics, structural genomics, pharmacogenomics, and organelle genomes, epigenetics and RNA.  相似文献   

17.
Functional annotation is routinely performed for large-scale genomics projects and databases. Researchers working on more specific problems, for instance on an individual pathway or complex, also need to be able to quickly, completely and accurately annotate sequences. The Bioverse sequence annotation server (http://bioverse.compbio.washington.edu) provides a web-based interface to allow users to submit protein sequences to the Bioverse framework. Sequences are functionally and structurally annotated and potential contextual annotations are provided. Researchers can also submit candidate genomes for annotation of all proteins encoded by the genome (proteome).  相似文献   

18.
The 4th Bologna Winter School on Biotechnologies was held on 9-15 February 2003 at the University of Bologna, Italy, with the specific aim of discussing recent developments in bioinformatics. The school provided an opportunity for students and scientists to debate current problems in computational biology and possible solutions. The course, co-supported (as last year) by the European Science Foundation program on Functional Genomics, focused mainly on hot topics in structural genomics, including recent CASP and CAPRI results, recent and promising genomewide predictions, protein-protein and protein-DNA interaction predictions and genome functional annotation. The topics were organized into four main sections (http://www.biocomp.unibo.it).  相似文献   

19.
Phytophthora spp. are serious pathogens that threaten numerous cultivated crops, trees, and natural vegetation worldwide. The soybean pathogen P. sojae has been developed as a model oomycete. Here, we report a bacterial artificial chromosome (BAC)-based, integrated physical map of the P. sojae genome. We constructed two BAC libraries, digested 8,681 BACs with seven restriction enzymes, end labeled the digested fragments with four dyes, and analyzed them with capillary electrophoresis. Fifteen data sets were constructed from the fingerprints, using individual dyes and all possible combinations, and were evaluated for contig assembly. In all, 257 contigs were assembled from the XhoI data set, collectively spanning approximately 132 Mb in physical length. The BAC contigs were integrated with the draft genome sequence of P. sojae by end sequencing a total of 1,440 BACs that formed a minimal tiling path. This enabled the 257 contigs of the BAC map to be merged with 207 sequence scaffolds to form an integrated map consisting of 79 superscaffolds. The map represents the first genome-wide physical map of a Phytophthora sp. and provides a valuable resource for genomics and molecular biology research in P. sojae and other Phytophthora spp. In one illustration of this value, we have placed the 350 members of a superfamily of putative pathogenicity effector genes onto the map, revealing extensive clustering of these genes.  相似文献   

20.
The chicken genome is sequenced and this, together with microarray and other functional genomics technologies, makes post-genomic research possible in the chicken. At this time, however, such research is hindered by a lack of genomic structural and functional annotations. Bio-ontologies have been developed for different annotation requirements, as well as to facilitate data sharing and computational analysis, but these are not yet optimally utilized in the chicken. Here we discuss genomic annotation and bio-ontologies. We focus specifically on the Gene Ontology (GO), chicken GO annotations and how these can facilitate functional genomics in the chicken. The GO is the most developed and widely used bio-ontology. It is the de facto standard for functional annotation. Despite its critical importance in analyzing microarray and other functional genomics data, relatively few chicken gene products have any GO annotation. When these are available, the average quality of chicken gene products annotations (defined using evidence code weight and annotation depth) is much less than in mouse. Moreover, tools allowing chicken researchers to easily and rapidly use the GO are either lacking or hard to use. To address all of these problems we developed ChickGO and AgBase. Chicken GO annotations are provided by complementary work at MSU-AgBase and EBI-GOA. The GO tools pipeline at AgBase uses GO to derive functional and biological significance from microarray and other functional genomics data. Not only will improved genomic annotation and tools to use these annotations benefit the chicken research community but they will also facilitate research in other avian species and comparative genomics.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号