首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 625 毫秒
1.

Background  

Most of the existing in silico phosphorylation site prediction systems use machine learning approach that requires preparing a good set of classification data in order to build the classification knowledge. Furthermore, phosphorylation is catalyzed by kinase enzymes and hence the kinase information of the phosphorylated sites has been used as major classification data in most of the existing systems. Since the number of kinase annotations in protein sequences is far less than that of the proteins being sequenced to date, the prediction systems that use the information found from the small clique of kinase annotated proteins can not be considered as completely perfect for predicting outside the clique. Hence the systems are certainly not generalized. In this paper, a novel generalized prediction system, PPRED (Phosphorylation PREDictor) is proposed that ignores the kinase information and only uses the evolutionary information of proteins for classifying phosphorylation sites.  相似文献   

2.

Background  

Balanus amphitrite is a barnacle commonly used in biofouling research. Although many aspects of its biology have been elucidated, the lack of genetic information is impeding a molecular understanding of its life cycle. As part of a wider multidisciplinary approach to reveal the biogenic cues influencing barnacle settlement and metamorphosis, we have sequenced and annotated the first cDNA library for B. amphitrite. We also present a systematic validation of potential reference genes for normalization of quantitative real-time PCR (qRT-PCR) data obtained from different developmental stages of this animal.  相似文献   

3.

Background  

While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets across 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase.  相似文献   

4.

Background  

Lately, there has been a great interest in the application of information extraction methods to the biomedical domain, in particular, to the extraction of relationships of genes, proteins, and RNA from scientific publications. The development and evaluation of such methods requires annotated domain corpora.  相似文献   

5.
6.

Background  

A nearly complete collection of gene-deletion mutants (96% of annotated open reading frames) of the yeast Saccharomyces cerevisiae has been systematically constructed. Tag microarrays are widely used to measure the fitness of each mutant in a mutant mixture. The tag array experiments can have a complex experimental design, such as time course measurements and drug treatment with multiple dosages.  相似文献   

7.

Background  

Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as GO, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the GO terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology.  相似文献   

8.

Background  

Although the protein-coding sequences in the Saccharomyces cerevisiae genome have been studied and annotated extensively, much less is known about the extent and characteristics of the untranslated regions of yeast mRNAs.  相似文献   

9.
10.

Background  

Corynebacterium diphtheriae, the causative agent of diphtheria, is well-investigated in respect to toxin production, while little is known about C. diphtheriae factors crucial for colonization of the host. In this study, we investigated the function of surface-associated protein DIP1281, previously annotated as hypothetical invasion-associated protein.  相似文献   

11.

Background  

The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The metabolomic data produced need to be structured and annotated in a machine-usable form to facilitate the exploration of the hidden links between the genes and their functions.  相似文献   

12.

Background  

Helicobacter pylori is the causative agent for gastritis, and peptic and duodenal ulcers. The bacterium displays 5-6 polar sheathed flagella that are essential for colonisation and persistence in the gastric mucosa. The biochemistry and genetics of flagellar biogenesis in H. pylori has not been fully elucidated. Bioinformatics analysis suggested that the gene HP0256, annotated as hypothetical, was a FliJ homologue. In Salmonella, FliJ is a chaperone escort protein for FlgN and FliT, two proteins that themselves display chaperone activity for components of the hook, the rod and the filament.  相似文献   

13.
JCoDA: a tool for detecting evolutionary selection   总被引:1,自引:0,他引:1  

Background  

The incorporation of annotated sequence information from multiple related species in commonly used databases (Ensembl, Flybase, Saccharomyces Genome Database, Wormbase, etc.) has increased dramatically over the last few years. This influx of information has provided a considerable amount of raw material for evaluation of evolutionary relationships. To aid in the process, we have developed JCoDA (Java Codon Delimited Alignment) as a simple-to-use visualization tool for the detection of site specific and regional positive/negative evolutionary selection amongst homologous coding sequences.  相似文献   

14.

Background  

Twenty-eight genes putatively encoding cytosolic glutathione transferases have been identified in the Anopheles gambiae genome. We manually annotated these genes and then confirmed the annotation by sequencing of A. gambiae cDNAs. Phylogenetic analysis with the 37 putative GST genes from Drosophila and representative GSTs from other taxa was undertaken to develop a nomenclature for insect GSTs. The epsilon class of insect GSTs has previously been implicated in conferring insecticide resistance in several insect species. We compared the expression level of all members of this GST class in two strains of A. gambiae to determine whether epsilon GST expression is correlated with insecticide resistance status.  相似文献   

15.

Background  

Chromatin immunoprecipitation combined with DNA microarrays (ChIP-chip) is a high-throughput assay for DNA-protein-binding or post-translational chromatin/histone modifications. However, the raw microarray intensity readings themselves are not immediately useful to researchers, but require a number of bioinformatic analysis steps. Identified enriched regions need to be bioinformatically annotated and compared to related datasets by statistical methods.  相似文献   

16.

Background  

Several strains of bacteria have sequenced and annotated genomes, which have been used in conjunction with biochemical and physiological data to reconstruct genome-scale metabolic networks. Such reconstruction amounts to a two-dimensional annotation of the genome. These networks have been analyzed with a constraint-based formalism and a variety of biologically meaningful results have emerged. Staphylococcus aureus is a pathogenic bacterium that has evolved resistance to many antibiotics, representing a significant health care concern. We present the first manually curated elementally and charge balanced genome-scale reconstruction and model of S. aureus' metabolic networks and compute some of its properties.  相似文献   

17.

Background  

Tribolium castaneum is a species of Coleoptera, the largest and most diverse order of all eukaryotes. Components of the innate immune system are hardly known in this insect, which is in a key phylogenetic position to inform us about genetic innovations accompanying the evolution of holometabolous insects. We have annotated immunity-related genes and compared them with homologous molecules from other species.  相似文献   

18.

Background  

Bacillus subtilis is an organism of interest because of its extensive industrial applications, its similarity to pathogenic organisms, and its role as the model organism for Gram-positive, sporulating bacteria. In this work, we introduce a new genome-scale metabolic model of B. subtilis 168 called iBsu1103. This new model is based on the annotated B. subtilis 168 genome generated by the SEED, one of the most up-to-date and accurate annotations of B. subtilis 168 available.  相似文献   

19.

Background  

aaTHEP1, the gene product of aq_1292 from Aquifex aeolicus, shows sequence homology to proteins from most thermophiles, hyperthermophiles, and higher organisms such as man, mouse, and fly. In contrast, there are almost no homologous proteins in mesophilic unicellular microorganisms. aaTHEP1 is a thermophilic enzyme exhibiting both ATPase and GTPase activity in vitro. Although annotated as a nucleotide kinase, such an activity could not be confirmed for aaTHEP1 experimentally and the in vivo function of aaTHEP1 is still unknown.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号