首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The iProClass database is an integrated resource that provides comprehensive family relationships and structural and functional features of proteins, with rich links to various databases. It is extended from ProClass, a protein family database that integrates PIR superfamilies and PROSITE motifs. The iProClass currently consists of more than 200,000 non-redundant PIR and SWISS-PROT proteins organized with more than 28,000 superfamilies, 2600 domains, 1300 motifs, 280 post-translational modification sites and links to more than 30 databases of protein families, structures, functions, genes, genomes, literature and taxonomy. Protein and family summary reports provide rich annotations, including membership information with length, taxonomy and keyword statistics, full family relationships, comprehensive enzyme and PDB cross-references and graphical feature display. The database facilitates classification-driven annotation for protein sequence databases and complete genomes, and supports structural and functional genomic research. The iProClass is implemented in Oracle 8i object-relational system and available for sequence search and report retrieval at http://pir.georgetown.edu/iproclass/.  相似文献   

2.
The apple (Malus domestica) is one of the most economically important fruit crops in the world, due its importance to human nutrition and health. To analyze the function and evolution of different apple genes, we developed apple gene function and gene family database (AppleGFDB) for collecting, storing, arranging, and integrating functional genomics information of the apple. The AppleGFDB provides several layers of information about the apple genes, including nucleotide and protein sequences, chromosomal locations, gene structures, and any publications related to these annotations. To further analyze the functional genomics data of apple genes, the AppleGFDB was designed to enable users to easily retrieve information through a suite of interfaces, including gene ontology, protein domain and InterPro. In addition, the database provides tools for analyzing the expression profiles and microRNAs of the apple. Moreover, all of the analyzed and collected data can be downloaded from the database. The database can also be accessed using a convenient web server that supports a full-text search, a BLAST sequence search, and database browsing. Furthermore, to facilitate cooperation among apple researchers, AppleGFDB is presented in a user-interactive platform, which provides users with the opportunity to modify apple gene annotations and submit publication information for related genes. AppleGFDB is available at http://www.applegene.org or http://gfdb.sdau.edu.cn/.  相似文献   

3.
MOTIVATION: The PFDB (Protein Family Database) is a new database designed to integrate protein family-related data with relevant functional and genomic data. It currently manages biological data for three projects-the CATH protein domain database (Orengo et al., 1997; Pearl et al., 2001), the VIDA virus domains database (Albà et al., 2001) and the Gene3D database (Buchan et al., 2001). The PFDB has been designed to accommodate protein families identified by a variety of sequence based or structure based protocols and provides a generic resource for biological research by enabling mapping between different protein families and diverse biochemical and genetic data, including complete genomes. RESULTS: A characteristic feature of the PFDB is that it has a number of meta-level entities (for example aggregation, collection and inclusion) represented as base tables in the final design. The explicit representation of relationships at the meta-level has a number of advantages, including flexibility-both in terms of the range of queries that can be formulated and the ability to integrate new biological entities within the existing design. A potential drawback with this approach-poor performance caused by the number of joins across meta-level tables-is avoided by implementing the PFDB with materialized views using the mature relational database technology of Oracle 8i. The resultant database is both fast and flexible. This paper presents the principles on which the database has been designed and implemented, and describes the current status of the database and query facilities supported.  相似文献   

4.
Bread wheat (Triticum aestivum) is one of the most important crop plants, globally providing staple food for a large proportion of the human population. However, improvement of this crop has been limited due to its large and complex genome. Advances in genomics are supporting wheat crop improvement. We provide a variety of web-based systems hosting wheat genome and genomic data to support wheat research and crop improvement. WheatGenome.info is an integrated database resource which includes multiple web-based applications. These include a GBrowse2-based wheat genome viewer with BLAST search portal, TAGdb for searching wheat second-generation genome sequence data, wheat autoSNPdb, links to wheat genetic maps using CMap and CMap3D, and a wheat genome Wiki to allow interaction between diverse wheat genome sequencing activities. This system includes links to a variety of wheat genome resources hosted at other research organizations. This integrated database aims to accelerate wheat genome research and is freely accessible via the web interface at http://www.wheatgenome.info/.  相似文献   

5.
6.

Background  

Structural and functional research often requires the computation of sets of protein structures based on certain properties of the proteins, such as sequence features, fold classification, or functional annotation. Compiling such sets using current web resources is tedious because the necessary data are spread over many different databases. To facilitate this task, we have created COLUMBA, an integrated database of annotations of protein structures.  相似文献   

7.
ProClass is a protein family database that organizes non-redundant sequence entries into families defined collectively by PIR superfamilies and PROSITE patterns. By combining global similarities and functional motifs into a single classification scheme, ProClass helps to reveal domain and family relationships and classify multi-domain proteins. The database currently consists of >155 000 sequence entries retrieved from both PIR-International and SWISS-PROT databases. Approximately 92 000 or 60% of the ProClass entries are classified into approximately 6000 families, including a large number of new members detected by our GeneFIND family identification system. The ProClass motif collection contains approximately 72 000 motif sequences and >1300 multiple alignments for all PROSITE patterns, including >21 000 matches not listed in PROSITE and mostly detected from unique PIR sequences. To maximize family information retrieval, the database provides links to various protein family, domain, alignment and structural class databases. With its high classification rate and comprehensive family relationships, ProClass can be used to support full-scale genomic annotation. The database, now being implemented in an object-relational database management system, is available for online sequence search and record retrieval from our WWW server at http://pir.georgetown.edu/gfserver/proclass.html  相似文献   

8.
Protocadherin family: diversity, structure, and function   总被引:1,自引:0,他引:1  
Protocadherins are predominantly expressed in the nervous system, and constitute the largest subgroup within the cadherin superfamily. The recent structural elucidation of the amino-terminal cadherin domain in an archetypal protocadherin revealed unique and remarkable features: the lack of an interface for homophilic adhesiveness found in classical cadherins, and the presence of loop structures specific to the protocadherin family. The unique features of protocadherins extend to their genomic organization. Recent findings have revealed unexpected allelic and combinatorial gene regulation for clustered protocadherins, a major subgroup in the protocadherin family. The unique structural repertoire and unusual gene regulation of the protocadherin family may provide the molecular basis for the extraordinary diversity of the nervous system.  相似文献   

9.
BioSilico is a web-based database system that facilitates the search and analysis of metabolic pathways. Heterogeneous metabolic databases including LIGAND, ENZYME, EcoCyc and MetaCyc are integrated in a systematic way, thereby allowing users to efficiently retrieve the relevant information on enzymes, biochemical compounds and reactions. In addition, it provides well-designed view pages for more detailed summary information. BioSilico is developed as an extensible system with a robust systematic architecture.  相似文献   

10.
11.
This paper describes the first maize database of proteins separated by two-dimensional electrophoresis. Fifty-six coleoptile proteins and 18 leaf proteins from two maize lines were partially microsequenced. Thirty-six proteins (49%) displayed high similarity with database proteins. Nine of these proteins, representing five different functions, had never been described in maize. No conclusive function could be found for 45 polypeptides (61% of the microsequenced proteins). In addition, an alternative identification method, based on amino acid analysis, allowed candidates to be proposed for 17 proteins out of 44 additional proteins analyzed in the coleoptiles. These results are stored in a database which also includes, when available, genetic information about the chromosomal location of structural genes and regulatory factors of proteins. This database is being used in the context of a project on the genetic mapping of the expressed genome in maize.  相似文献   

12.
Tau protein: an update on structure and function   总被引:2,自引:0,他引:2  
  相似文献   

13.
Microbes utilize enzymes to perform a variety of functions. Enzymes are biocatalysts working as highly efficient machines at the molecular level. In the past, enzymes have been viewed as static entities and their function has been explained on the basis of direct structural interactions between the enzyme and the substrate. A variety of experimental and computational techniques, however, continue to reveal that proteins are dynamically active machines, with various parts exhibiting internal motions at a wide range of time-scales. Increasing evidence also indicates that these internal protein motions play a role in promoting protein function such as enzyme catalysis. Moreover, the thermodynamical fluctuations of the solvent, surrounding the protein, have an impact on internal protein motions and, therefore, on enzyme function. In this review, we describe recent biochemical and theoretical investigations of internal protein dynamics linked to enzyme catalysis. In the enzyme cyclophilin A, investigations have lead to the discovery of a network of protein vibrations promoting catalysis. Cyclophilin A catalyzes peptidyl-prolyl cis/trans isomerization in a variety of peptide and protein substrates. Recent studies of cyclophilin A are discussed in detail and other enzymes (dihydrofolate reductase and liver alcohol dehydrogenase) where similar discoveries have been reported are also briefly discussed. The detailed characterization of the discovered networks indicates that protein dynamics plays a role in rate-enhancement achieved by enzymes. An integrated view of enzyme structure, dynamics and function have wide implications in understanding allosteric and co-operative effects, as well as protein engineering of more efficient enzymes and novel drug design.  相似文献   

14.
15.
Zea mays DataBase (ZmDB) seeks to provide a comprehensive view of maize (corn) genetics by linking genomic sequence data with gene expression analysis and phenotypes of mutant plants. ZmDB originated in 1999 as the Web portal for a large project of maize gene discovery, sequencing and phenotypic analysis using a transposon tagging strategy and expressed sequence tag (EST) sequencing. Recently, ZmDB has broadened its scope to include all public maize ESTs, genome survey sequences (GSSs), and protein sequences. More than 170 000 ESTs are currently clustered into approximately 20 000 contigs and about an equal number of apparent singlets. These clusters are continuously updated and annotated with respect to potential encoded protein products. More than 100 000 GSSs are similarly assembled and annotated by spliced alignment with EST and protein sequences. The ZmDB interface provides quick access to analytical tools for further sequence analysis. Every sequence record is linked to several display options and similarity search tools, including services for multiple sequence alignment, protein domain determination and spliced alignment. Furthermore, ZmDB provides web-based ordering of materials generated in the project, including ESTs, ordered collections of genomic sequences tagged with the RescueMu transposon and microarrays of amplified ESTs. ZmDB can be accessed at http://zmdb.iastate.edu/.  相似文献   

16.
FlyMine is a data warehouse that addresses one of the important challenges of modern biology: how to integrate and make use of the diversity and volume of current biological data. Its main focus is genomic and proteomics data for Drosophila and other insects. It provides web access to integrated data at a number of different levels, from simple browsing to construction of complex queries, which can be executed on either single items or lists.  相似文献   

17.
The National Agricultural Biotechnology Information Center (NABIC) reconstructed an AllergenPro database for allergenic proteins analysis and allergenicity prediction. The AllergenPro is an integrated web-based system providing information about allergen in foods, microorganisms, animals and plants. The allergen database has the three main features namely, (1) allergen list with epitopes, (2) searching of allergen using keyword, and (3) methods for allergenicity prediction. This updated AllergenPro outputs the search based allergen information through a user-friendly web interface, and users can run tools for allergenicity prediction using three different methods namely, (1) FAO/WHO, (2) motif-based and (3) epitope-based methods.

Availability

The database is available for free at http://nabic.rda.go.kr/allergen/  相似文献   

18.
19.
20.
MOTIVATION: The rapid increase in the number of structures in the Protein Databank (PDB) makes it difficult to find all structures in a given protein class. Automatically-maintained web-based summaries are one solution to this problem. RESULTS: Summary of Antibody Crystal Structures (SACS), a self-maintaining web-site containing summary information on antibody structures in the PDB, is described. Mirrored PDB data are processed automatically using a Make-based system to identify new antibody structures. The PDB header records and sequence data are then parsed to identify a number of features of the structure and the data are stored using eXtensible Markup Language (XML). eXtensible Stylesheet Language: Transformations (XSLT), a new style sheet language for XML, is used to generate Hypertext Markup Language (HTML) pages containing either a one-line summary of every structure or a more detailed page describing a single antibody.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号