首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 531 毫秒
1.
The genome sequence DataBase   总被引:1,自引:0,他引:1       下载免费PDF全文
The Genome Sequence DataBase (GSDB) is a database of publicly available nucleotide sequences and their associated biological and bibliographic information. Several notable changes have occurred in the past year: GSDB stopped accepting data submissions from researchers; ownership of data submitted to GSDB was transferred to GenBank; sequence analysis capabilities were expanded to include Smith-Waterman and Frame Search; and Sequence Viewer became available to Mac users. The content of GSDB remains up-to-date because publicly available data is acquired from the International Nucleotide Sequence Database Collaboration databases (IC) on a nightly basis. This allows GSDB to continue providing researchers with the ability to analyze, query and retrieve nucleotide sequences in the database. GSDB and its related tools are freely accessible from the URL: http://www.ncgr.org  相似文献   

2.
Novotny M  Madsen D  Kleywegt GJ 《Proteins》2004,54(2):260-270
When a new protein structure has been determined, comparison with the database of known structures enables classification of its fold as new or belonging to a known class of proteins. This in turn may provide clues about the function of the protein. A large number of fold comparison programs have been developed, but they have never been subjected to a comprehensive and critical comparative analysis. Here we describe an evaluation of 11 publicly available, Web-based servers for automatic fold comparison. Both their functionality (e.g., user interface, presentation, and annotation of results) and their performance (i.e., how well established structural similarities are recognized) were assessed. The servers were subjected to a battery of performance tests covering a broad spectrum of folds as well as special cases, such as multidomain proteins, Calpha-only models, new folds, and NMR-based models. The CATH structural classification system was used as a reference. These tests revealed the strong and weak sides of each server. On the whole, CE, DALI, MATRAS, and VAST showed the best performance, but none of the servers achieved a 100% success rate. Where no structurally similar proteins are found by any individual server, it is recommended to try one or two other servers before any conclusions concerning the novelty of a fold are put on paper.  相似文献   

3.
Antibodies are useful tools to characterize the components of the human proteome and to validate potential protein biomarkers discovered through various clinical proteomics efforts. The lack of validation results across various applications for most antibodies often makes it necessary to perform cumbersome investigations to ensure specificity of a particular antibody in a certain application. A need therefore exists for a standardized system for sharing validation data about publicly available antibodies and to allow antibody providers as well as users to contribute and edit experimental evidence data, including data also on the antigen. Here we describe a new publicly available portal called Antibodypedia, which has been developed to allow sharing of information regarding validation of antibodies in which providers can submit their own validation results and reliability scores. We report standardized validation criteria and submission rules for applications such as Western blots, protein arrays, immunohistochemistry, and immunofluorescence. The contributor is expected to provide experimental evidence and a validation score for each antibody, and the users can subsequently provide feedback and comments on the use of the antibody. The database thus provides a virtual resource of publicly available antibodies toward human proteins with accompanying experimental evidence supporting an individual validation score for each antibody in an application-specific manner.  相似文献   

4.
Sequencing ribosomal RNA (rRNA) genes is currently the method of choice for phylogenetic reconstruction, nucleic acid based detection and quantification of microbial diversity. The ARB software suite with its corresponding rRNA datasets has been accepted by researchers worldwide as a standard tool for large scale rRNA analysis. However, the rapid increase of publicly available rRNA sequence data has recently hampered the maintenance of comprehensive and curated rRNA knowledge databases. A new system, SILVA (from Latin silva, forest), was implemented to provide a central comprehensive web resource for up to date, quality controlled databases of aligned rRNA sequences from the Bacteria, Archaea and Eukarya domains. All sequences are checked for anomalies, carry a rich set of sequence associated contextual information, have multiple taxonomic classifications, and the latest validly described nomenclature. Furthermore, two precompiled sequence datasets compatible with ARB are offered for download on the SILVA website: (i) the reference (Ref) datasets, comprising only high quality, nearly full length sequences suitable for in-depth phylogenetic analysis and probe design and (ii) the comprehensive Parc datasets with all publicly available rRNA sequences longer than 300 nucleotides suitable for biodiversity analyses. The latest publicly available database release 91 (August 2007) hosts 547 521 sequences split into 461 823 small subunit and 85 689 large subunit rRNAs.  相似文献   

5.
The MIPS Rice (Oryza sativa) database (MOsDB; http://mips.gsf.de/proj/rice) provides a comprehensive data collection dedicated to the genome information of rice. Rice (O. sativa L.) is one of the most important food crops for over half the world's population and serves as a major model system in cereal genome research. MOsDB integrates data from two publicly available rice genomic sequences, O. sativa L. ssp. indica and O. sativa L. ssp. japonica. Besides regularly updated rice genome sequence information, MOsDB provides an integrated resource for associated analysis data, e.g. internal and external annotation information as well as a complex characterization of all annotated rice genes. The MOsDB web interface supports various search options and allows browsing the database content. MOsDB is continuously expanding to include an increasing range of data type and the growing amount of information on the rice genome.  相似文献   

6.
SUMMARY: The searchable mutant database PLPMDB has been developed to provide rapid and simple access to relevant mutant information on pyridoxal-5'-phosphate dependent enzymes. All data have been extracted from publications and publicly available databases, then organized in a relational database to enable searching via a web-based search form. The current version of PLPMDB contains 688 mutants described in 220 research papers. The database is a useful tool for planning mutant experiments and for interpretation of information from such experiments. AVAILABILITY: PLPMDB is freely accessible from http://www.studiofmp.com/plpmdb/index.htm.  相似文献   

7.
gpDB is a publicly accessible, relational database, containing information about G-proteins, G-protein coupled receptors (GPCRs) and effectors, as well as information concerning known interactions between these molecules. The sequences are classified according to a hierarchy of different classes, families and subfamilies based on literature search. The main innovation besides the classification of G-proteins, GPCRs and effectors is the relational model of the database, describing the known coupling specificity of GPCRs to their respective alpha subunits of G-proteins, and also the specific interaction between G-proteins and their effectors, a unique feature not available in any other database. AVAILABILITY: http://bioinformatics.biol.uoa.gr/gpDB CONTACT: shamodr@biol.uoa.gr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

8.
In the face of drastically rising drug discovery costs, strategies promising to reduce development timelines and expenditures are being pursued. Computer-aided virtual screening and repurposing approved drugs are two such strategies that have shown recent success. Herein, we report the creation of a highly-curated in silico database of chemical structures representing approved drugs, chemical isolates from traditional medicinal herbs, and regulated chemicals, termed the SWEETLEAD database. The motivation for SWEETLEAD stems from the observance of conflicting information in publicly available chemical databases and the lack of a highly curated database of chemical structures for the globally approved drugs. A consensus building scheme surveying information from several publicly accessible databases was employed to identify the correct structure for each chemical. Resulting structures are filtered for the active pharmaceutical ingredient, standardized, and differing formulations of the same drug were combined in the final database. The publically available release of SWEETLEAD (https://simtk.org/home/sweetlead) provides an important tool to enable the successful completion of computer-aided repurposing and drug discovery campaigns.  相似文献   

9.
DEPD: a novel database for differentially expressed proteins   总被引:4,自引:0,他引:4  
SUMMARY: The Differentially Expressed Protein Database was designed to store the output of comparative proteomics studies and provides a publicly available query and analysis platform for data mining. The database contains information about more than 3000 differentially expressed proteins (DEPs) manually extracted from the published literature, including relevant biological, experimental and methodological elements. Tools for visualization and functional analysis of DEPs are provided via a user-friendly webinterface. AVAILABILITY: http://protchem.hunnu.edu.cn/depd/.  相似文献   

10.
SUMMARY: Lipoxygenases are a family of enzymes involved in a variety of human diseases like inflammation, asthma, artherosclerosis and cancer. The lipoxygenases database (LOX-DB) aims to be a web accessible compendium of information in particular on the mammalian members of this multigene family. This resource includes molecular structures, reference data, tools for structural and computational analysis as well as links to related information maintained by others. The data can be retrieved by the use of various search options and analyzed applying publicly available visualization tools. AVAILABILITY: LOX-DB is available at http://www.dkfz-heidelberg.de/spec/lox-db/  相似文献   

11.
The PANTHER database was designed for high-throughput analysis of protein sequences. One of the key features is a simplified ontology of protein function, which allows browsing of the database by biological functions. Biologist curators have associated the ontology terms with groups of protein sequences rather than individual sequences. Statistical models (Hidden Markov Models, or HMMs) are built from each of these groups. The advantage of this approach is that new sequences can be automatically classified as they become available. To ensure accurate functional classification, HMMs are constructed not only for families, but also for functionally distinct subfamilies. Multiple sequence alignments and phylogenetic trees, including curator-assigned information, are available for each family. The current version of the PANTHER database includes training sequences from all organisms in the GenBank non-redundant protein database, and the HMMs have been used to classify gene products across the entire genomes of human, and Drosophila melanogaster. The ontology terms and protein families and subfamilies, as well as Drosophila gene c;assifications, can be browsed and searched for free. Due to outstanding contractual obligations, access to human gene classifications and to protein family trees and multiple sequence alignments will temporarily require a nominal registration fee. PANTHER is publicly available on the web at http://panther.celera.com.  相似文献   

12.
13.
GenBank   总被引:51,自引:4,他引:47       下载免费PDF全文
The GenBank((R))sequence database incorporates publicly available DNA sequences of >55 000 different organisms, primarily through direct submission of sequence data from individual laboratories and large-scale sequencing projects. Most submissions are made using the BankIt (Web) or Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Data exchange with the EMBL Data Library and the DNA Data Bank of Japan helps ensure comprehensive worldwide coverage. GenBank data is accessible through NCBI's integrated retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping and protein structure information, plus the biomedical literature via PubMed. Sequence similarity searching is provided by the BLAST family of programs. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. NCBI also offers a wide range of WWW retrieval and analysis services based on GenBank data. The GenBank database and related resources are freely accessible via the NCBI home page at http://www.ncbi.nlm.nih.gov  相似文献   

14.
A QTL resource and comparison tool for pigs: PigQTLDB   总被引:12,自引:2,他引:10  
During the past decade, efforts to map quantitative trait loci (QTL) in pigs have resulted in hundreds of QTL being reported for growth, meat quality, reproduction, disease resistance, and other traits. It is a challenge to locate, interpret, and compare QTL results from different studies. We have developed a pig QTL database (PigQTLdb) that integrates available pig QTL data in the public domain, thus, facilitating the use of this QTL data in future studies. We also developed a pig trait classification system to standardize names of traits and to simplify organization and searching of the trait data. These steps made it possible to compare primary data from diverse sources and methods. We used existing pig map databases and other publicly available data resources (such as PubMed) to avoid redundant developmental work. The PigQTLdb was also designed to include data representing major genes and markers associated with a large effect on economically important traits. To date, over 790 QTL from 73 publications have been curated into the database. Those QTL cover more than 300 different traits. The data have been submitted to the Entrez Gene and the Map Viewer resources at NCBI, where the information about markers was matched to marker records in NCBI’s UniSTS database. Having these data in a public resource like NCBI allows regularly updated automatic matching of markers to public sequence data by e-PCR. The submitted data, and the results of these calculations, are retrievable from NCBI via Entrez Gene, Map Viewer, and UniSTS. Efforts were undertaken to improve the integrated functional genomics resources for pigs.  相似文献   

15.
16.
The germplasm of the genus Nicotiana contains more than 5,000 accessions and plays an important role in modern biological research. Tobacco can be used as a model system to develop methodologies for plant transformation and for investigating gene function. In order to develop the study of Nicotiana, a large quantity of data on germplasm, sequences, molecular markers and genetically modified tobacco was required for in-depth and systematic collation and research. It became necessary to establish a special database for tobacco genetics and breeding. The tobacco genetics and breeding (TGB, http://yancao.sdau.edu.cn/tgb) database was developed with the aim of bringing together tobacco genetics and breeding. The database has three main features: (1) a materials database with information on 1,472 Nicotiana germplasm accessions, as well as updated genomic and expressed sequence tag (EST) data available from the public database; (2) a molecular markers database containing a total of 12,388 potential intron polymorphisms 10,551 EST-simple sequence repeat (EST-SSR) and 66,297 genomic-SSR markers; and (3) an applications database with genetic maps and some genetically modified studies in tobacco. The TGB database also makes Basic Local Alignment Search Tool and primer designing tools publicly available. As far as can be ascertained, the TGB database is the first tobacco genetics and breeding database to be created, and all this comprehensive information will aid basic research into Nicotiana and other related plants. It will serve as an excellent resource for the online tobacco research community.  相似文献   

17.
18.
The Botany Array Resource provides the means for obtaining and archiving microarray data for Arabidopsis thaliana as well as biologist-friendly tools for viewing and mining both our own and other's data, for example, from the AtGenExpress Consortium. All the data produced are publicly available through the web interface of the database at http://bbc.botany.utoronto.ca. The database has been designed in accordance with the Minimum Information About a Microarray Experiment convention -- all expression data are associated with the corresponding experimental details. The database is searchable and it also provides a set of useful and easy-to-use web-based data-mining tools for researchers with sophisticated yet understandable output graphics. These include Expression Browser for performing 'electronic Northerns', Expression Angler for identifying genes that are co-regulated with a gene of interest, and Promomer for identifying potential cis-elements in the promoters of individual or co-regulated genes.  相似文献   

19.
Multiple sequence alignment was performed against eight proteases from the Flaviviridae family using ClustalW to illustrate conserved domains. Two sets of prediction approaches were applied and the results compared. Firstly, secondary structure prediction was performed using available structure prediction servers. The second approach made use of the information on the secondary structures extracted from structure prediction servers, threading techniques and DSSP database of some of the templates used in the threading techniques. Consensus on the one-dimensional secondary structure of Den2 protease was obtained from each approach and evaluated against data from the recently crystallised Den2 NS2B/NS3 obtained from the Protein Data Bank (PDB). Results indicated the second approach to show higher accuracy compared to the use of prediction servers only. Thus, it is plausible that this approach is applicable to the initial stage of structural studies of proteins with low amino acid sequence homology against other available proteins in the PDB.  相似文献   

20.
GenBank          下载免费PDF全文
The GenBank sequence database incorporates publicly available DNA sequences of more than 105 000 different organisms, primarily through direct submission of sequence data from individual laboratories and large-scale sequencing projects. Most submissions are made using the BankIt (web) or Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Data exchange with the EMBL Data Library and the DNA Data Bank of Japan helps ensure comprehensive worldwide coverage. GenBank data is accessible through NCBI’s integrated retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical literature via PubMed. Sequence similarity searching is provided by the BLAST family of programs. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. NCBI also offers a wide range of World Wide Web retrieval and analysis services based on GenBank data. The GenBank database and related resources are freely accessible via the NCBI home page at http://www.ncbi.nlm.nih.gov.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号