首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The recognition of gene/protein names in literature is one of the pivotal steps in the processing of biological literatures for information extraction or data mining. We have compiled a lexicon of biomedical words (conserved patterns/ potential motifs) which has the combination of only 20 alphabets of amino acids. The remaining 6 letters of the English alphabets (B, J, O, U, X, Z) are treated as invalid amino acid characters (to our context), We have jumbled the 6 letters for the sake of usage and convenience and termed as ’JUZBOX‘ and these characters were filtered in the biomedical lexicon. Undoubtedly, the generation of biomedical words from protein sequence using JUZBOX have applications specific for functional annotation.

Availability

JUZBOX is available freely at http://www.spices.res.in/juzbox  相似文献   

2.
Summary : FT is a tool written in C++, which implements the Fourier analysis method to locate periodicities in aminoacid or DNA sequences. It is provided for free public use on a WWW server with a Java interface. Availability : The server address is http://o2.db. uoa.gr/FT Contact : shamodr@atlas.uoa.gr   相似文献   

3.
4.
Rapid increase in protein sequence information from genome sequencing projects demand the intervention of bioinformatics tools to recognize interesting gene-products and associated function. Often, multiple algorithms need to be employed to improve accuracy in predictions and several structure prediction algorithms are on the public domain. Here, we report the availability of an Integrated Web-server as a bioinformatics online package dedicated for in-silico analysis of protein sequence and structure data (IWS). IWS provides web interface to both in-house and widely accepted programs from major bioinformatics groups, organized as 10 different modules. IWS also provides interactive images for Analysis Work Flow, which will provide transparency to the user to carry out analysis by moving across modules seamlessly and to perform their predictions in a rapid manner. AVAILABILITY: IWS IS AVAILABLE FROM THE URL: http://caps.ncbs.res.in/iws.  相似文献   

5.
6.
This paper presents Tcoffee@igs, a new server provided to the community by Hewlet Packard computers and the Centre National de la Recherche Scientifique. This server is a web-based tool dedicated to the computation, the evaluation and the combination of multiple sequence alignments. It uses the latest version of the T-Coffee package. Given a set of unaligned sequences, the server returns an evaluated multiple sequence alignment and the associated phylogenetic tree. This server also makes it possible to evaluate the local reliability of an existing alignment and to combine several alternative multiple alignments into a single new one. Tcoffee@igs can be used for aligning protein, RNA or DNA sequences. Datasets of up to 100 sequences (2000 residues long) can be processed. The server and its documentation are available from: http://igs-server.cnrs-mrs.fr/Tcoffee/.  相似文献   

7.
Power-aware computing has emerged as a significant concern in data centers. In this work, we develop empirical models for estimating the power consumed by web servers. These models can be used by on-the-fly power-saving algorithms and are imperative for simulators that evaluate the power behavior of workloads. To apply power saving methodologies and algorithms at the data center level, we must first be able to measure or estimate the power and performance of individual servers running in the data centers. We show a novel method for developing full system web server power models that reduces non-linear relationships among performance measurements and system power and prunes model parameters. The web server power models use as parameters performance indicators read from the machine internal performance counters. We evaluate our approach on an AMD Opteron-based web server and on an Intel i7-based web sever. Our best model displays an average absolute error of 1.92 % for Intel i7 server and 1.46 % for AMD Opteron as compared to actual measurements, and 90th percentile for the absolute percent error equals to 2.66 % for Intel i7 and 2.08 % for AMD Opteron.  相似文献   

8.
GeneInfoMiner is a web-based system for searching Medline abstracts using sequence ID lists such as GenBank accession numbers derived from high-throughput experiments. It will map query results to MeSH topics to facilitate the exploration of the biological significance of the sequence ID lists. GeneInfoMiner is based on a custom gene and protein name identification engine that can map gene and protein names to important molecular biology databases.  相似文献   

9.
M Källberg  H Wang  S Wang  J Peng  Z Wang  H Lu  J Xu 《Nature protocols》2012,7(8):1511-1522
A key challenge of modern biology is to uncover the functional role of the protein entities that compose cellular proteomes. To this end, the availability of reliable three-dimensional atomic models of proteins is often crucial. This protocol presents a community-wide web-based method using RaptorX (http://raptorx.uchicago.edu/) for protein secondary structure prediction, template-based tertiary structure modeling, alignment quality assessment and sophisticated probabilistic alignment sampling. RaptorX distinguishes itself from other servers by the quality of the alignment between a target sequence and one or multiple distantly related template proteins (especially those with sparse sequence profiles) and by a novel nonlinear scoring function and a probabilistic-consistency algorithm. Consequently, RaptorX delivers high-quality structural models for many targets with only remote templates. At present, it takes RaptorX ~35 min to finish processing a sequence of 200 amino acids. Since its official release in August 2011, RaptorX has processed ~6,000 sequences submitted by ~1,600 users from around the world.  相似文献   

10.
An Internet computing server has been developed to identify all the occurrences of the internal sequence repeats in a protein and DNA sequences. Further, an option is provided for the users to check the occurrence(s) of the resultant sequence repeats in the other sequence and structure (Protein Data Bank) databases. The databases deployed in the proposed computing engine are up-to-date and thus the users will get the latest information available in the respective databases. The server is freely accessible over the World Wide Web (WWW). AVAILABILITY: http://bioserver1.physics.iisc.ernet.in/fair/  相似文献   

11.
BAGET (Bacterial and Archaeal Gene Exploration Tool) is a web service designed to facilitate extraction, by molecular geneticists and phylogeneticists, of specific gene and protein sequences from completely determined prokaryotic genomes. Upon selection of a particular prokaryotic organism and gene, two levels of visual gene context information are provided on a single dynamic page: (i) a graphical representation of a user defined portion of the chromosome centered on the gene of interest and (ii) the DNA sequence of the query gene, of the immediate neighboring genes and the intergenic regions each identified by a consistent color code. The aminoacid sequence is provided for protein-coding query genes. Query results can be exported as a rich text format (RTF) word processor file for printing, archival or further analysis. AVAILABILITY: http://archaea.u-psud.fr/bin/baget.dll.  相似文献   

12.
Salt bridges can play important roles in protein structure and function and have stabilizing and destabilizing effects in protein folding. ESBRI is a software available as web tool which analyses the salt bridges in a protein structure, starting from the atomic coordinates. In the case of protein complexes, the salt bridges between protein chains can be evaluated, as well as those among specific charged amino acids and the different protein subunits, in order to obtain useful information regard the protein-protein interaction.  相似文献   

13.
The Northeast Structural Genomics Consortium (NESG) is one of nine NIH-funded pilot projects created to develop technologies needed for structural studies of proteins on a genome-wide scale. One of the most challenging aspects of this emerging field is the production of protein samples amenable to structural determination. To do this efficiently, all steps in the protein production pipeline must be automated. Here we describe the Primer program (linked from http://www-nmr.cabm.rutgers.edu/bioinformatics, www-nmr.cabm.rutgers.edu/bioinformatics, a web-based primer design program freely available to the scientific community, which was created to automate this time consuming and laborious task. This program has the ability to simultaneously calculate plasmid specific primer sets for multiple open reading frame (ORF) targets, including 96-well and greater formats. Primer includes a library of commonly used plasmid systems and possesses the ability to upload user-defined plasmid systems. In addition to calculating gene-specific annealing regions for each target, the program also adds appropriate restriction endonuclease recognition or viral recombination sites while preserving a reading frame with plasmid based fusions. Primer has several useful features such as sorting calculated primer sets by target size, facilitating interpretation of PCR amplifications by agarose gel electrophoresis, as well as supplying the molecular biologist with many important characteristics of each target such as the expected size of the PCR amplified DNA fragment and internal restriction sites. The NESG has cloned over 1500 genes using oligonucleotide primers designed by Primer.  相似文献   

14.
As scientific literature databases like MEDLINE increase in size, so does the time required to search them. Scientists must frequently inspect long lists of references manually, often just reading the titles. XplorMed is a web tool that aids MEDLINE searching by summarizing the subjects contained in the results, thus allowing users to focus on subjects of interest. Here we describe new features added to XplorMed during the last 2 years (http://www.bork.embl-heidelberg.de/xplormed/).  相似文献   

15.
16.
Fan L  Yao YG 《Mitochondrion》2011,11(2):351-356
MitoTool, a web-based bioinformatics platform, is designed for deciphering human mitochondrial DNA (mtDNA) data in batch mode. The platform has advantages in (i) parsing diverse types of mtDNA data; (ii) automatically classifying haplogroup according to mtDNA sequences or variants; (iii) discovering possibly missing variants of the samples with claimed haplogroups status; (iv) estimating the evolutionary conservation index, protein coding effect and potential pathogenicity of certain substitutions; (v) performing statistical analysis for haplogroup distribution frequency between case and control groups. Furthermore, it offers an integrated database for retrieving five types of mitochondrion-related information. The MitoTool is freely accessed at http://www.mitotool.org.  相似文献   

17.
GRIMM: genome rearrangements web server   总被引:14,自引:0,他引:14  
SUMMARY: Genome Rearrangements In Man and Mouse (GRIMM) is a tool for analyzing rearrangements of gene orders in pairs of unichromosomal and multichromosomal genomes, with either signed or unsigned gene data. Although there are several programs for analyzing rearrangements in unichromosomal genomes, this is the first to analyze rearrangements in multichromosomal genomes. GRIMM also provides a new algorithm for analyzing comparative maps for which gene directions are unknown. AVAILABILITY: A web server, with instructions and sample data, is available at http://www-cse.ucsd.edu/groups/bioinformatics/GRIMM.  相似文献   

18.
The web server MetalDetector classifies histidine residues in proteins into one of two states (free or metal bound) and cysteines into one of three states (free, metal bound or disulfide bridged). A decision tree integrates predictions from two previously developed methods (DISULFIND and Metal Ligand Predictor). Cross-validated performance assessment indicates that our server predicts disulfide bonding state at 88.6% precision and 85.1% recall, while it identifies cysteines and histidines in transition metal-binding sites at 79.9% precision and 76.8% recall, and at 60.8% precision and 40.7% recall, respectively. AVAILABILITY: Freely available at http://metaldetector.dsi.unifi.it. SUPPLEMENTARY INFORMATION: Details and data can be found at http://metaldetector.dsi.unifi.it/help.php.  相似文献   

19.
Gaggle Tool Creator (GTC) is a web application which provides access to public annotation, interaction, orthology, and genomic data for hundreds of organisms, and enables instant analysis of the data using many popular web-based and desktop applications.  相似文献   

20.
The pseudo oligonucleotide composition, or pseudo K-tuple nucleotide composition (PseKNC), can be used to represent a DNA or RNA sequence with a discrete model or vector yet still keep considerable sequence order information, particularly the global or long-range sequence order information, via the physicochemical properties of its constituent oligonucleotides. Therefore, the PseKNC approach may hold very high potential for enhancing the power in dealing with many problems in computational genomics and genome sequence analysis. However, dealing with different DNA or RNA problems may need different kinds of PseKNC. Here, we present a flexible and user-friendly web server for PseKNC (at http://lin.uestc.edu.cn/pseknc/default.aspx) by which users can easily generate many different modes of PseKNC according to their need by selecting various parameters and physicochemical properties. Furthermore, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the current web server to generate their desired PseKNC without the need to follow the complicated mathematical equations, which are presented in this article just for the integrity of PseKNC formulation and its development. It is anticipated that the PseKNC web server will become a very useful tool in computational genomics and genome sequence analysis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号