Similar literature
 Found 20 similar documents; search took 31 ms
1.
The 1999 SWISS-2DPAGE database update   Total citations: 9; self-citations: 0; citations by others: 9
SWISS-2DPAGE (http://www.expasy.ch/ch2d/) is an annotated two-dimensional polyacrylamide gel electrophoresis (2-DE) database established in 1993. The current release contains 24 reference maps from human and mouse biological samples, as well as from Saccharomyces cerevisiae, Escherichia coli and Dictyostelium discoideum origin. These reference maps now have 2824 identified spots, corresponding to 614 separate protein entries in the database, in addition to virtual entries for each SWISS-PROT sequence or any user-entered amino acid sequence. Last year's improvements to the SWISS-2DPAGE database are as follows: three new maps have been created and several others have been updated; cross-references to newly built federated 2-DE databases have been added; new functions to access the data have been provided through the ExPASy proteomics server.

2.
The SWISS-2DPAGE database: what has changed during the last year.   Total citations: 1; self-citations: 0; citations by others: 1
SWISS-2DPAGE (http://www.expasy.ch/ch2d/) is an annotated two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) database established in 1993. The current release contains 21 reference maps from human and mouse biological samples, as well as from Saccharomyces cerevisiae, Escherichia coli and Dictyostelium discoideum origin. These reference maps now have 2480 identified spots, corresponding to 528 separate protein entries in the database, in addition to virtual entries for each SWISS-PROT sequence. During the last year, SWISS-2DPAGE has undergone major changes. Six new maps have been added, and new functions to access the data have been provided through the ExPASy server. Finally, an important change concerns the database funding source.

3.
We have updated the Protein Sequence-Structure Analysis Relational Database (PSSARD) first published in the Int. J. Biol. Macromol. 36 (2005) 259-262 corresponding to 1573 representative protein chains selected from the Protein Data Bank (PDB). In this, the updated and revised PSSARD (Version 2.0), we have included all proteins in the Protein Data Bank available at the time of developing this database including the NMR PDB entries. The current database corresponds to 22,752 X-ray PDB entries and 3977 NMR PDB entries and is separated accordingly in order to facilitate the appropriate database search. The representative protein chains can also be separately accessed within the current database. We have made a provision to combine more than one field to query the database and the results of any search can be used to carry out further nested searches using a combination of queries. We have provided hyperlinks to the individual PDB entries obtained as the result of any search in PSSARD in order to obtain additional details relevant to the protein structure. Certain applications useful to identify domains and structural motifs are discussed.

4.
EXProt (database for EXPerimentally verified Protein functions) is a new non-redundant database containing protein sequences for which the function has been experimentally verified. It is a selection of 3976 entries from the Prokaryotes section of the EMBL Nucleotide Sequence Database, Release 66, and 375 entries from the Pseudomonas Community Annotation Project (PseudoCAP). The entries in EXProt all have a unique ID number and provide information about the organism, protein sequence, functional annotation, link to entry in original database, and if known, gene name and link to references in PubMed/Medline. The EXProt web page (http://www.cmbi.nl/EXProt) provides further details of the database and a link to a BLAST search (blastp & blastx) of the database. The EXProt entries are indexed in SRS (http://www.cmbi.nl/srs/) and can be searched by means of keywords. Authors can be reached by email (exprot@cmbi.kun.nl).

5.
A database was established from human hemofiltrate (HF) that consisted of a mass database and a sequence database, with the aim of analyzing the composition of the peptide fraction in human blood. To establish a mass database, all 480 fractions of a peptide bank generated from HF were analyzed by MALDI-TOF mass spectrometry. Using this method, over 20 000 molecular masses representing native, circulating peptides were detected. Estimation of repeatedly detected masses suggests that approximately 5000 different peptides were recorded. More than 95% of the detected masses are smaller than 15 000, indicating that HF predominantly contains peptides. The sequence database contains over 340 entries from 75 different protein and peptide precursors. Fifty-five percent of the entries are fragments from plasma proteins (fibrinogen A 13%, albumin 10%, β2-microglobulin 8.5%, cystatin C 7%, and fibrinogen B 6%). Seven percent of the entries represent peptide hormones, growth factors and cytokines. Thirty-three percent belong to protein families such as complement factors, enzymes, enzyme inhibitors and transport proteins. Five percent represent novel peptides of which some show homology to known peptide and protein families. The coexistence of processed peptide fragments, biologically active peptides and peptide precursors suggests that HF reflects the peptide composition of plasma. Interestingly, protein modules such as EGF domains (meprin Aα-fragments), somatomedin-B domains (vitronectin fragments), thyroglobulin domains (insulin-like growth factor-binding proteins), and Kazal-type inhibitor domains were identified. Alignment of sequenced fragments to their precursor proteins and the analysis of their cleavage sites revealed that there are different processing pathways of plasma proteins in vivo.

6.
A set of computer programs is described which constitutes a clone database management system. Maintenance of the database and the stocks of material is designed to be under the control of one person or group of people, who may insert, delete or modify data entries, and who may interrogate the database as to which stocks are in need of checking. The system is organised in such a way that information is freely and speedily available to all users. Database entries may be accessed by name or key word.

7.
Biomolecular NMR chemical shift data are key information for the functional analysis of biomolecules and the development of new techniques for NMR studies utilizing chemical shift statistical information. Structural genomics projects are major contributors to the accumulation of protein chemical shift information. The management of the large quantities of NMR data generated by each project in a local database and the transfer of the data to the public databases are still formidable tasks because of the complicated nature of NMR data. Here we report an automated and efficient system developed for the deposition and annotation of a large number of data sets including ¹H, ¹³C and ¹⁵N resonance assignments used for the structure determination of proteins. We have demonstrated the feasibility of our system by applying it to over 600 entries from the internal database generated by the RIKEN Structural Genomics/Proteomics Initiative (RSGI) to the public database, BioMagResBank (BMRB). We have assessed the quality of the deposited chemical shifts by comparing them with those predicted from the PDB coordinate entry for the corresponding protein. The same comparison for other matched BMRB/PDB entries deposited from 2001-2011 has been carried out and the results suggest that the RSGI entries greatly improved the quality of the BMRB database. Since the entries include chemical shifts acquired under strikingly similar experimental conditions, these NMR data can be expected to be a promising resource to improve current technologies as well as to develop new NMR methods for protein studies.

8.
Proteins may simultaneously exist at, or move between, two or more different subcellular locations. Proteins with multiple locations or dynamic features of this kind are particularly interesting because they may have some very special biological functions intriguing to investigators in both basic research and drug discovery. For instance, among the 6408 human protein entries that have experimentally observed subcellular location annotations in the Swiss-Prot database (version 50.7, released 19-Sept-2006), 973 (approximately 15%) have multiple location sites. The number of total human protein entries (except those annotated with "fragment" or those with less than 50 amino acids) in the same database is 14,370, meaning a gap of (14,370-6408)=7962 entries for which no knowledge is available about their subcellular locations. Although one can use the computational approach to predict the desired information for the gap, so far all the existing methods for predicting human protein subcellular localization are limited to the case of a single location site. To overcome such a barrier, a new ensemble classifier, named Hum-mPLoc, was developed that can be used to deal with the case of multiple location sites as well. Hum-mPLoc is freely accessible to the public as a web server at http://202.120.37.186/bioinf/hum-multi. Meanwhile, for the convenience of people working in the relevant areas, Hum-mPLoc has been used to identify all human protein entries in the Swiss-Prot database that do not have subcellular location annotations or are annotated as being uncertain. The large-scale results thus obtained have been deposited in a downloadable file prepared with Microsoft Excel and named "Tab_Hum-mPLoc.xls". This file is available at the same website and will be updated twice a year to include new entries of human proteins and reflect the continuous development of Hum-mPLoc.

9.
The effectiveness of any proteomics database search depends on the theoretical candidate information contained in the protein database. Unfortunately, candidate entries from protein databases such as UniProt rarely contain all the post-translational modifications (PTMs), disulfide bonds, or endogenous cleavages of interest to researchers. These omissions can limit discovery of novel and biologically important proteoforms. Conversely, searching for a specific proteoform becomes a computationally difficult task for heavily modified proteins. Both situations require updates to the database through user-annotated entries. Unfortunately, manually creating properly formatted UniProt Extensible Markup Language (XML) files is tedious and prone to errors. ProSight Annotator solves these issues by providing a graphical interface for adding user-defined features to UniProt-formatted XML files for better informed proteoform searches. It can be downloaded from http://prosightannotator.northwestern.edu.

10.
SUMMARY: The EMBL Nucleotide Sequence Database, maintained at the European Bioinformatics Institute, is Europe's primary nucleotide sequence database. Its entries are subject to changes, but only the most recent versions are preserved in the database. The EMBL Sequence Version Archive is a new publicly available database that also retains the earlier versions of these entries. AVAILABILITY: http://www.ebi.ac.uk/embl/sva/

11.
Using two-dimensional gel electrophoresis (2-DE) and electrospray-tandem mass spectrometry (ESI-MS/MS), we have started the proteome analysis of the cell line Nicotiana tabacum cv. Bright Yellow-2 (tobacco BY-2). The BY-2 cell suspension culture is widely used as a model system to study the growth and development of plant cells. We present a protocol describing the sample preparation and 2-DE, enabling us to separate and display more than 1000 proteins from this cell culture. A reference gel was generated, using immobilized pH gradient isoelectric focusing in a linear gradient from pH 3 to 10 and 12% sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). Although the tobacco genome is not sequenced yet, a range of protein spots from this reference map was identified by means of a semi-automated liquid chromatography-ESI-quadrupole time of flight-tandem MS (LC-ESI-QTOF-MS-MS) setup and cross-species matching. These data were integrated in a database, which can be accessed at http://tby2-www.uia.ac.be/tby2/. On the on-line reference map, the identified protein spots are hyperlinked to individual protein entries. Each protein entry contains all identification information, as well as links to relevant entries in other on-line databases. Comprehensive search functions are implemented. Especially for an unsequenced but widespread model organism like tobacco BY-2, such a reference database is a convenient source for protein information that brings protein identification within reach without the need for extensive MS. This publicly accessible database provides a solid basis for tobacco BY-2 proteomics in the future.

12.
EXProt is a non-redundant protein database containing a selection of entries from genome annotation projects and public databases, aimed at including only proteins with an experimentally verified function. In EXProt release 2.0 we have collected entries from the Pseudomonas aeruginosa community annotation project (PseudoCAP), the Escherichia coli genome and proteome database (GenProtEC) and the translated coding sequences from the Prokaryotes division of EMBL nucleotide sequence database, which are described as having an experimentally verified function. Each entry in EXProt has a unique ID number and contains information about the species, amino acid sequence, functional annotation and, in most cases, links to references in MEDLINE/PubMed and to the entry in the original database. EXProt is indexed in SRS at CMBI (http://www.cmbi.kun.nl/srs/) and can be searched with BLAST and FASTA through the EXProt web page (http://www.cmbi.kun.nl/EXProt/).

13.
We introduce a metric for local sequence alignments that has utility for accelerating optimal alignment searches without loss of sensitivity. The metric's triangle inequality property permits identification of redundant database entries guaranteed to have optimal alignments to the query sequence that fall below a specified score threshold, thereby permitting comparisons to these entries to be skipped. We prove the existence of the metric for a variety of scoring systems, including the most commonly used ones, and show that a triangle inequality can be established as well for nucleotide-to-protein sequence comparisons. We discuss a database clustering and search strategy that takes advantage of the triangle inequality. The strategy permits moderate but significant acceleration of searches against the widely used "nr" protein database. It also provides a theoretically based method for database clustering in general and provides a standard against which to compare heuristic clustering strategies.
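The abstract above does not give the alignment metric itself, so the following is only a minimal sketch of the general idea of triangle-inequality pruning. It assumes Levenshtein edit distance as a stand-in metric, and the names `levenshtein`, `search_with_pruning`, and the cluster layout (representative, radius, members) are hypothetical, not the authors' method. If a query is far enough from a cluster representative, every member of that cluster is guaranteed to be out of range and can be skipped without comparison.

```python
from typing import Callable, List, Tuple

Cluster = Tuple[str, int, List[str]]  # (representative, radius, other members)


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (a true metric; stand-in only)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def search_with_pruning(
    query: str,
    clusters: List[Cluster],
    max_dist: int,
    dist: Callable[[str, str], int] = levenshtein,
) -> Tuple[List[str], int]:
    """Return (entries within max_dist of query, number of comparisons made).

    Assumes every member m of a cluster satisfies dist(rep, m) <= radius.
    """
    hits: List[str] = []
    comparisons = 0
    for rep, radius, members in clusters:
        d_rep = dist(query, rep)
        comparisons += 1
        if d_rep <= max_dist:
            hits.append(rep)
        # Triangle inequality: dist(query, m) >= d_rep - dist(rep, m)
        #                                     >= d_rep - radius.
        # If that lower bound already exceeds max_dist, skip all members.
        if d_rep - radius > max_dist:
            continue
        for m in members:
            comparisons += 1
            if dist(query, m) <= max_dist:
                hits.append(m)
    return hits, comparisons
```

In this sketch the pruning never discards a true hit, because the skip condition is a strict lower bound on the distance to every cluster member; the saving depends entirely on how tight the cluster radii are.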

14.
UniSave: the UniProtKB sequence/annotation version database   Total citations: 1; self-citations: 0; citations by others: 1
SUMMARY: The UniProtKB Sequence/Annotation Version database (UniSave) is a comprehensive archive of UniProtKB/Swiss-Prot and UniProtKB/TrEMBL entry versions. All changed Swiss-Prot and TrEMBL entries are loaded into the UniSave as part of the public bi-weekly UniProtKB releases. Unlike the UniProtKB, which contains only the latest Swiss-Prot and TrEMBL entry versions, the UniSave provides access to previous versions of these entries. AVAILABILITY: http://www.ebi.ac.uk/uniprot/unisave

15.
A novel database, under the acronym RISSC (Ribosomal Intergenic Spacer Sequence Collection), has been created. It compiles more than 1600 entries of edited DNA sequence data from the 16S-23S ribosomal spacers present in most prokaryotes and organelles (e.g. mitochondria and chloroplasts) and is accessible through the Internet (http://ulises.umh.es/RISSC), where systematic searches for specific words can be conducted, as well as BLAST-type sequence searches. Additionally, a characteristic feature of this region, the presence/absence and nature of tRNA genes within the spacer, is included in all the entries, even when not previously indicated in the original database. All these combined features could provide a useful documentation tool for studies on evolution, identification, typing and strain characterization, among others.

16.

Background  

The MEDLINE database contains over 12 million references to scientific literature, with about 3/4 of recent articles including an abstract of the publication. Retrieval of entries using queries with keywords is useful for human users who need to obtain small selections. However, particular analyses of the literature or database developments may need the complete ranking of all the references in the MEDLINE database as to their relevance to a topic of interest. This report describes a method that does this ranking using the differences in word content between MEDLINE entries related to a topic and the whole of MEDLINE, in a computational time appropriate for an article search query engine.
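The abstract above does not state the scoring formula, so the following is a minimal sketch of one plausible way to rank documents by word-content differences. It assumes a smoothed log ratio of document frequencies (topic set versus whole collection) as the word weight; the function names `tokenize`, `word_weights`, and `rank`, and the toy documents in the usage note, are hypothetical and do not come from the report.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase whitespace tokenizer (a deliberate simplification)."""
    return text.lower().split()


def word_weights(topic_docs: List[str], all_docs: List[str]) -> Dict[str, float]:
    """Weight each word by how over-represented it is in the topic set.

    Uses document frequencies with add-one smoothing (an assumption).
    """
    topic_df = Counter(w for d in topic_docs for w in set(tokenize(d)))
    all_df = Counter(w for d in all_docs for w in set(tokenize(d)))
    n_topic, n_all = len(topic_docs), len(all_docs)
    return {
        w: math.log(
            ((topic_df[w] + 1) / (n_topic + 2)) / ((all_df[w] + 1) / (n_all + 2))
        )
        for w in all_df
    }


def rank(all_docs: List[str], weights: Dict[str, float]) -> List[Tuple[float, str]]:
    """Score every document by the summed weights of its distinct words."""
    scored = [
        (sum(weights.get(w, 0.0) for w in set(tokenize(d))), d) for d in all_docs
    ]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

For example, with a four-document collection in which the first two documents form the topic set, `rank(all_docs, word_weights(all_docs[:2], all_docs))` places words shared by topic documents above words that appear only elsewhere. Because the weights are computed once, scoring the full collection is a single pass, which is in keeping with the abstract's emphasis on search-engine-like running time.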

17.
The androgen receptor gene mutations database.   Total citations: 3; self-citations: 0; citations by others: 3
The androgen receptor gene mutations database is a comprehensive listing of mutations published in journals and meeting proceedings. The majority of mutations are point mutations identified in patients with androgen insensitivity syndrome. Information is included regarding the phenotype, the nature and location of the mutations, as well as the effects of the mutations on the androgen binding activity of the receptor. The current version of the database contains 149 entries, of which 114 are unique mutations. The database is available from EMBL (NetServ@EMBL-Heidelberg.DE) or as a Macintosh Filemaker file (mc33001@musica.mcgill.ca).

18.
SENTRA, available via URL http://wit.mcs.anl.gov/WIT2/Sentra/, is a database of proteins associated with microbial signal transduction. The database currently includes the classical two-component signal transduction pathway proteins and methyl-accepting chemotaxis proteins, but will be expanded to also include other classes of signal transduction systems that are modulated by phosphorylation or methylation reactions. Although the majority of database entries are from prokaryotic systems, eukaryotic proteins with bacterial-like signal transduction domains are also included. Currently SENTRA contains signal transduction proteins in 34 complete and almost completely sequenced prokaryotic genomes, as well as sequences from 243 organisms available in public databases (SWISS-PROT and EMBL). The analysis was carried out within the framework of the WIT2 system, which is designed and implemented to support genetic sequence analysis and comparative analysis of sequenced genomes.

19.
MicroRNAs are small noncoding RNAs that play an important role in the regulation of various biological processes through their interaction with cellular messenger RNAs. They are frequently dysregulated in cancer and have shown great potential as tissue-based markers for cancer classification and prognostication. MicroRNAs are also present in extracellular human body fluids such as serum, plasma, saliva, and urine. Most circulating microRNAs present in human plasma and serum cofractionate with the Argonaute2 (Ago2) protein. However, circulating microRNAs have also been found in membrane-bound vesicles such as exosomes. Since microRNAs circulate in the bloodstream in a highly stable, extracellular form, they may be used as blood-based biomarkers for cancer and other diseases. A knowledge base of extracellular circulating miRNAs is a fundamental tool for biomedical research. In this work, we present miRandola, a comprehensive manually curated classification of extracellular circulating miRNAs. miRandola is connected to miRò, the miRNA knowledge base, allowing users to infer the potential biological functions of circulating miRNAs and their connections with phenotypes. The miRandola database contains 2132 entries, with 581 unique mature miRNAs and 21 types of samples. miRNAs are classified into four categories, based on their extracellular form: miRNA-Ago2 (173 entries), miRNA-exosome (856 entries), miRNA-HDL (20 entries) and miRNA-circulating (1083 entries). miRandola is available online at: http://atlas.dmi.unict.it/mirandola/index.html.

20.
PhosphoBase: a database of phosphorylation sites.   Total citations: 2; self-citations: 0; citations by others: 2
PhosphoBase is a database of experimentally verified phosphorylation sites. Version 1.0 contains 156 entries and 398 experimentally determined phosphorylation sites. Entries are compiled and revised from the literature and from major protein sequence databases such as SwissProt and PIR. The entries provide information about the phosphoprotein and the exact position of its phosphorylation sites. Furthermore, some of the entries contain kinetic data obtained from enzyme assays on specific peptides. To illustrate the use of data extracted from PhosphoBase we present a sequence logo displaying the overall conservation of positions around serines phosphorylated by protein kinase A (PKA). PhosphoBase is available on the WWW at http://www.cbs.dtu.dk/databases/PhosphoBase/
