首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
There has been considerable recent interest in proteomic analyses of plasma for the purpose of discovering biomarkers. Profiling N-linked glycopeptides is a particularly promising method because the population of N-linked glycosites represents the proteomes of plasma, the cell surface, and secreted proteins at very low redundancy and provides a compelling link between the tissue and plasma proteomes. Here, we describe UniPep http://www.unipep.org--a database of human N-linked glycosites--as a resource for biomarker discovery.  相似文献   

2.
3.

Background

Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size.

Results

Here we present the Rice TE database (RiTE-db) - a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies.

Conclusions

This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1762-3) contains supplementary material, which is available to authorized users.  相似文献   

4.
5.
An Antarctic strain (NJ-7) of Chlorella vulgaris possesses the same 18S rRNA sequence as that of a temperate strain (UTEX259), but shows significantly higher freezing tolerance than the latter. Suppression subtractive hybridization (SSH) was performed to identify genes of intensified expression in NJ-7 relative to UTEX259. Among the genes identified, Ccor1 and Ccor2, co-organized in the same gene cluster Ccor1-Ccor2-Ccor1-Ccor2, showed much higher expression levels in NJ-7 than in UTEX259 at both 20°C and 4°C. As detected by Northern blot and Western blot analyses, the two genes were cold-inducible in NJ-7 but almost not expressed in UTEX259. Their encoded products are predicted to share 55.7% identity to each other and possess physicochemical characteristics similar to that of late embryogenesis abundant (LEA) proteins in plants. The purified recombinant Ccor1 and Ccor2 showed high heat-stability and could act as cryoprotectants to lactate dehydrogenase in vitro. Based on their expression patterns and protein characteristics, we propose that Ccor1 and Ccor2 are two novel LEA proteins and are related to the greatly enhanced freezing tolerance in the Antarctic strain.  相似文献   

6.
The Olfactory Receptor Database (ORDB) is a WWW-accessible database that has been expanded from an olfactory receptor resource to a chemoreceptor resource. It stores data on six classes of G-protein-coupled sensory chemoreceptors: (i) olfactory receptor-like proteins, (ii) vomeronasal receptors, (iii) insect olfactory receptors, (iv) worm chemo-receptors, (v) taste papilla receptors and (vi) fungal pheromone receptors. A complementary database of the ligands of these receptors (OdorDB) has been constructed and is publicly available in a pilot mode. The database schema of ORDB has been changed from traditional relational to EAV/CR (Entity-Attribute-Value with Classes and Relationships), which allows the interoperability of ORDB with other related databases as well as the creation of intra-database associations among objects. This inter-operability facilitates users to follow information from odor molecule binding to its putative receptor, to the properties of the neuron expressing the receptor, to a computational model of activity of olfactory bulb neurons. In addition, tools and resources have been added allowing users to access interactive phylogenetic trees and alignments of sensory chemoreceptors. ORDB is available via the WWW at http://ycmi.med. yale.edu/senselab/ordb/  相似文献   

7.
The Homeodomain Resource is an annotated collection of non-redundant protein sequences, three-dimensional structures and genomic information for the homeodomain protein family. Release 2.0 contains 765 full-length homeodomain-containing sequences, 29 experimentally derived structures and 116 homeobox loci implicated in human genetic disorders. Entries are fully hyperlinked to facilitate easy retrieval of the original records from source databases. A simple search engine with a graphical user interface is provided to query the component databases and assemble customized data sets. A new feature for this release is the addition of more automated methods for database searching, maintenance and implementation of efficient data management. The Homeodomain Resource is freely available through the WWW at http://genome.nhgri.nih.gov/homeodomain  相似文献   

8.
9.
The Mouse Genome Database supports the use of mice in genome research, offering researchers information on gene characterization, genetic maps, comparative genomic data, and phenotypes.  相似文献   

10.
TRIPLES is a web-accessible database of TRansposon-Insertion Phenotypes, Localization and Expression in Saccharomyces cerevisiae—a relational database housing nearly half a million data points generated from an ongoing study using large-scale transposon mutagenesis to characterize gene function in yeast. At present, TRIPLES contains three principal data sets (i.e. phenotypic data, protein localization data and expression data) for over 3500 annotated yeast genes as well as several hundred non-annotated open reading frames. In addition, the TRIPLES web site provides online order forms linked to each data set so that users may request any strain or reagent generated from this project free of charge. In response to user requests, the TRIPLES web site has undergone several recent modifications. Our localization data have been supplemented with approximately 500 fluorescent micrographs depicting actual staining patterns observed upon indirect immunofluorescence analysis of indicated epitope-tagged proteins. These localization data, as well as all other data sets within TRIPLES, are now available in full as tab-delimited text. To accommodate increased reagent requests, all orders are now cataloged in a separate database, and users are notified immediately of order receipt and shipment. Also, TRIPLES is one of five sites incorporated into the new functional analysis tool Function Junction provided by the Saccharomyces Genome Database. TRIPLES may be accessed from the Yale Genome Analysis Center (YGAC) homepage at http://ygac.med.yale.edu.  相似文献   

11.
The PRINTS database: a resource for identification of protein families   总被引:4,自引:0,他引:4  
The PRINTS database houses a collection of protein fingerprints, which may be used to assign family and functional attributes to uncharacterised sequences, such as those currently emanating from the various genome-sequencing projects. The April 2002 release includes 1,700 family fingerprints, encoding approximately 10,500 motifs, covering a range of globular and membrane proteins, modular polypeptides and so on. Fingerprints are groups of conserved motifs that, taken together, provide diagnostic protein family signatures. They derive much of their potency from the biological context afforded by matching motif neighbours; this makes them at once more flexible and powerful than single-motif approaches. The technique further departs from other pattern-matching methods by readily allowing the creation of fingerprints at superfamily-, family- and subfamily-specific levels, thereby allowing more fine-grained diagnoses. Here, we provide an overview of the method of protein fingerprinting and how the results of fingerprint analyses are used to build PRINTS and its relational cousin, PRINTS-S.  相似文献   

12.
MOTIVATION: Increasing antibiotics resistance in human pathogens represents a pressing public health issue worldwide for which novel antibiotic therapies based on antimicrobial peptides (AMPs) may offer one possible solution. In the current study, we utilized publicly available data on AMPs to construct hidden Markov models (HMMs) that enable recognition of individual classes of antimicrobials peptides (such as defensins, cathelicidins, cecropins, etc.) with up to 99% accuracy and can be used for discovering novel AMP candidates. RESULTS: HMM models for both mature peptides and propeptides were constructed. A total of 146 models for mature peptides and 40 for propeptides have been developed for individual AMP classes. These were created by clustering and analyzing AMP sequences available in the public sources and by consequent iterative scanning of the Swiss-Prot database for previously unknown gene-coded AMPs. As a result, an additional 229 additional AMPs have been identified from Swiss-Prot, and all but 34 could be associated with known antimicrobial activities according to the literature. The final set of 1045 mature peptides and 253 propeptides have been organized into the open-source AMPer database. AVAILABILITY: The developed HMM-based tools and AMP sequences can be accessed through the AMPer resource at http://www.cnbi2.com/cgi-bin/amp.pl  相似文献   

13.
PhosphaBase is an ontology-driven database resource containing information on the protein phosphatase family. It is the first public resource dedicated to protein phosphatases, which are enzymes that perform dephosphorylation reactions. In conjunction with the phosphorylation action of protein kinases, phosphatases are involved in important control and communication mechanisms in the cell. They have also been implicated in many human diseases, including diabetes and obesity, cancers, and neurodegenerative conditions. PhosphaBase aims to centralize the growing base of knowledge in the phosphatase research domain. The resource is built around a formal, domain-specific DAML+OIL ontology, and the data are collected from heterogeneous biological sources using Gene Ontology terms as a means of data extraction. The overall ontology-driven architecture provides a robust structure with distinct advantages for sustainability and provides the potential for the development of diagnostic tools, as well as a data repository.  相似文献   

14.
15.
Ng KW  Lawson J  Garner HR 《BioTechniques》2004,37(2):218, 220-218, 222
PathoGene is a web-based resource that streamlines the process of predicting genes in microorganisms and designs PCR primers for amplification to facilitate sequence analysis and experimentation. PathoGene currently supports primer design for every complete microbial, viral, and fungal genome as cataloged in GenBank by the National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/). The resulting primers can then be subjected to a stand-alone Basic Local Alignment Search Tool (BLAST) system called PathoBLAST in which the predicted PCR product and/or primers can be compared against the genome of interest or a similar genome to find related genes or estimate primer quality.  相似文献   

16.
17.
The peptaibols are a large family of membrane-active peptides with considerable sequence homology, but with different biological properties and three-dimensional structures. They constitute a rich resource of naturally occurring 'mutants' which are potentially valuable for structure/function studies of ion channels. A searchable on-line database of sequences and structures of the peptaibols has been created at http://www.cryst.bbk.ac.uk/peptaibol, as a resource for the biological and structural community. In this paper, the contents and organization of the website are discussed as well as procedures for submission of new entries to the database. At present, more than 300 peptaibol sequences are stored in the database. Each sequence entry contains its full literature reference and information about its biological source. Tools are provided for searching for specific peptaibol sequences or groupings of sequences, and for locating peptaibols containing specified sequence motifs. In addition the website acts as a database for structural information. The coordinates of all currently available peptaibol x-ray and NMR structures are included and complemented, where appropriate. with molecular graphics illustrations. These include figures of model channel structures and comparisons between different peptaibol structures. The peptaibol database thus provides a tool for ready access to information and a means of investigating the sequences and structures of this class of polypeptides.  相似文献   

18.
19.
Griss J  Haudek-Prinz V  Gerner C 《Proteomics》2011,11(5):1000-1004
Clinical proteomics faces extremely complex and variable data. Here, we present an updated version of the Griss Proteomics Database Engine (GPDE): A free biological proteomic database specifically designed for clinical proteomics and biomarker discovery (http://gpde.sourceforge.net). It combines experiments based on investigated cell types thereby supporting customizable biological meta-analyses. Through the new features described here, the GPDE now became a powerful yet easy-to-use tool to support the fast identification and reliable evaluation of biomarker candidates.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号