首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 203 毫秒
1.
The PDBsum web server provides structural analyses of the entries in the Protein Data Bank (PDB). Two recent additions are described here. The first is the detailed analysis of the SARS‐CoV‐2 virus protein structures in the PDB. These include the variants of concern, which are shown both on the sequences and 3D structures of the proteins. The second addition is the inclusion of the available AlphaFold models for human proteins. The pages allow a search of the protein against existing structures in the PDB via the Sequence Annotated by Structure (SAS) server, so one can easily compare the predicted model against experimentally determined structures. The server is freely accessible to all at http://www.ebi.ac.uk/pdbsum.  相似文献   

2.
The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in‐depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence‐based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.  相似文献   

3.
We modeled 3D structures of all SARS‐CoV‐2 proteins, generating 2,060 models that span 69% of the viral proteome and provide details not available elsewhere. We found that ˜6% of the proteome mimicked human proteins, while ˜7% was implicated in hijacking mechanisms that reverse post‐translational modifications, block host translation, and disable host defenses; a further ˜29% self‐assembled into heteromeric states that provided insight into how the viral replication and translation complex forms. To make these 3D models more accessible, we devised a structural coverage map, a novel visualization method to show what is—and is not—known about the 3D structure of the viral proteome. We integrated the coverage map into an accompanying online resource (https://aquaria.ws/covid) that can be used to find and explore models corresponding to the 79 structural states identified in this work. The resulting Aquaria‐COVID resource helps scientists use emerging structural data to understand the mechanisms underlying coronavirus infection and draws attention to the 31% of the viral proteome that remains structurally unknown or dark.  相似文献   

4.
SUMMARY: A number of freely available text mining tools have been put together to extract highly reliable Drosophila gene interaction data from text. The system has been tested with The Interactive Fly, showing low recall (27-34%), but very high precision (93-97%). AVAILABILITY: The extracted data and a web interface for submission of texts to GIFT analysis are available at http://gift.cryst.bbk.ac.uk/gift CONTACT: n.domedel_puig@cryst.bbk.ac.uk SUPPLEMENTARY INFORMATION: Additional documentation, such as the dictionaries and the reference sets, are available at the GIFT website.  相似文献   

5.
Deeper understanding of liver pathophysiology would benefit from a comprehensive quantitative proteome resource at cell type resolution to predict outcome and design therapy. Here, we quantify more than 150,000 sequence‐unique peptides aggregated into 10,000 proteins across total liver, the major liver cell types, time course of primary cell cultures, and liver disease states. Bioinformatic analysis reveals that half of hepatocyte protein mass is comprised of enzymes and 23% of mitochondrial proteins, twice the proportion of other liver cell types. Using primary cell cultures, we capture dynamic proteome remodeling from tissue states to cell line states, providing useful information for biological or pharmaceutical research. Our extensive data serve as spectral library to characterize a human cohort of non‐alcoholic steatohepatitis and cirrhosis. Dramatic proteome changes in liver tissue include signatures of hepatic stellate cell activation resembling liver cirrhosis and providing functional insights. We built a web‐based dashboard application for the interactive exploration of our resource (www.liverproteome.org).  相似文献   

6.
Enzymes are critical proteins in every organism. They speed up essential chemical reactions, help fight diseases, and have a wide use in the pharmaceutical and manufacturing industries. Wet lab experiments to figure out an enzyme''s function are time consuming and expensive. Therefore, the need for computational approaches to address this problem are becoming necessary. Usually, an enzyme is extremely specific in performing its function. However, there exist enzymes that can perform multiple functions. A multi‐functional enzyme has vast potential as it reduces the need to discover/use different enzymes for different functions. We propose an approach to predict a multi‐functional enzyme''s function up to the most specific fourth level of the hierarchy of the Enzyme Commission (EC) number. Previous studies can only predict the function of the enzyme till level 1. Using a dataset of 2,583 multi‐functional enzymes, we achieved a hierarchical subset accuracy of 71.4% and a Macro F1 Score of 96.1% at the fourth level. The robustness of the network was further tested on a multi‐functional isoforms dataset. Our method is broadly applicable and may be used to discover better enzymes. The web‐server can be freely accessed at http://hecnet.cbrlab.org/.  相似文献   

7.
Olivibacter sitiensis Ntougias et al. 2007 is a member of the family Sphingobacteriaceae, phylum Bacteroidetes. Members of the genus Olivibacter are phylogenetically diverse and of significant interest. They occur in diverse habitats, such as rhizosphere and contaminated soils, viscous wastes, composts, biofilter clean-up facilities on contaminated sites and cave environments, and they are involved in the degradation of complex and toxic compounds. Here we describe the features of O. sitiensis AW-6T, together with the permanent-draft genome sequence and annotation. The organism was sequenced under the Genomic Encyclopedia for Bacteria and Archaea (GEBA) project at the DOE Joint Genome Institute and is the first genome sequence of a species within the genus Olivibacter. The genome is 5,053,571 bp long and is comprised of 110 scaffolds with an average GC content of 44.61%. Of the 4,565 genes predicted, 4,501 were protein-coding genes and 64 were RNA genes. Most protein-coding genes (68.52%) were assigned to a putative function. The identification of 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase-coding genes indicates involvement of this organism in the catechol catabolic pathway. In addition, genes encoding for β-1,4-xylanases and β-1,4-xylosidases reveal the xylanolytic action of O. sitiensis.  相似文献   

8.
9.
This article describes the development and creation of the Protein Circular Dichroism Data Bank (PCDDB), a deposition and searchable data bank for validated circular dichroism spectra located at http://pcddb.cryst.bbk.ac.uk/.  相似文献   

10.
Methods to reliably assess the accuracy of genome sequence data are lacking. Currently completeness is only described qualitatively and mis-assemblies are overlooked. Here we present REAPR, a tool that precisely identifies errors in genome assemblies without the need for a reference sequence. We have validated REAPR on complete genomes or de novo assemblies from bacteria, malaria and Caenorhabditis elegans, and demonstrate that 86% and 82% of the human and mouse reference genomes are error-free, respectively. When applied to an ongoing genome project, REAPR provides corrected assembly statistics allowing the quantitative comparison of multiple assemblies. REAPR is available at http://www.sanger.ac.uk/resources/software/reapr/.  相似文献   

11.
The interaction environment of a protein in a cellular network is important in defining the role that the protein plays in the system as a whole, and thus its potential suitability as a drug target. Despite the importance of the network environment, it is neglected during target selection for drug discovery. Here, we present the first systematic, comprehensive computational analysis of topological, community and graphical network parameters of the human interactome and identify discriminatory network patterns that strongly distinguish drug targets from the interactome as a whole. Importantly, we identify striking differences in the network behavior of targets of cancer drugs versus targets from other therapeutic areas and explore how they may relate to successful drug combinations to overcome acquired resistance to cancer drugs. We develop, computationally validate and provide the first public domain predictive algorithm for identifying druggable neighborhoods based on network parameters. We also make available full predictions for 13,345 proteins to aid target selection for drug discovery. All target predictions are available through canSAR.icr.ac.uk. Underlying data and tools are available at https://cansar.icr.ac.uk/cansar/publications/druggable_network_neighbourhoods/.  相似文献   

12.
Over the past 35 years, developmental geneticists have made impressive progress toward an understanding of how genes specify morphology and function, particularly as they relate to the specification of each physical component of an organism. In the last 20 years, male courtship behavior in Drosophila melanogaster has emerged as a robust model system for the study of genetic specification of behavior. Courtship behavior is both complex and innate, and a single gene, fruitless (fru), is both necessary and sufficient for all aspects of the courtship ritual. Typically, loss of male-specific Fruitless protein function results in male flies that perform the courtship ritual incorrectly, slowly, or not at all. Here we describe a novel requirement for fru: we have identified a group of cells in which male Fru proteins are required to reduce the speed of courtship initiation. In addition, we have identified a gene, Trapped in endoderm 1 (Tre1), which is required in these cells for normal courtship and mating behavior. Tre1 encodes a G-protein-coupled receptor required for establishment of cell polarity and cell migration and has previously not been shown to be involved in courtship behavior. We describe the results of feminization of the Tre1-expressing neurons, as well as the effects on courtship behavior of mutation of Tre1. In addition, we show that Tre1 is expressed in a sexually dimorphic pattern in the central and peripheral nervous systems and investigate the role of the Tre1 cells in mate identification.  相似文献   

13.
Members of the M13 class of metalloproteases have been implicated in diseases and in reproductive fitness. Nevertheless, their physiological role remains poorly understood. To obtain a tractable model with which to analyze this protein family’s function, we characterized the gene family in Drosophila melanogaster and focused on reproductive phenotypes. The D. melanogaster genome contains 24 M13 class protease homologs, some of which are orthologs of human proteases, including neprilysin. Many are expressed in the reproductive tracts of either sex. Using RNAi we individually targeted the five Nep genes most closely related to vertebrate neprilysin, Nep1-5, to investigate their roles in reproduction. A reduction in Nep1, Nep2, or Nep4 expression in females reduced egg laying. Nep1 and Nep2 are required in the CNS and the spermathecae for wild-type fecundity. Females that are null for Nep2 also show defects as hosts of sperm competition as well as an increased rate of depletion for stored sperm. Furthermore, eggs laid by Nep2 mutant females are fertilized normally, but arrest early in embryonic development. In the male, only Nep1 was required to induce normal patterns of female egg laying. Reduction in the expression of Nep2-5 in the male did not cause any dramatic effects on reproductive fitness, which suggests that these genes are either nonessential for male fertility or perform redundant functions. Our results suggest that, consistent with the functions of neprilysins in mammals, these proteins are also required for reproduction in Drosophila, opening up this model system for further functional analysis of this protein class and their substrates.  相似文献   

14.
wings apart (wap) is a recessive, semilethal gene located on the X chromosome in Drosophila melanogaster, which is required for normal wing-vein patterning. We show that the wap mutation also results in loss of the adult jump muscle. We use complementation mapping and gene-specific RNA interference to localize the wap locus to the proximal X chromosome. We identify the annotated gene CG14614 as the gene affected by the wap mutation, since one wap allele contains a non-sense mutation in CG14614, and a genomic fragment containing only CG14614 rescues the jump-muscle phenotypes of two wap mutant alleles. The wap gene lies centromere-proximal to touch-insensitive larva B and centromere-distal to CG14619, which is tentatively assigned as the gene affected in introverted mutants. In mutant wap animals, founder cell precursors for the jump muscle are specified early in development, but are later lost. Through tissue-specific knockdowns, we demonstrate that wap function is required in both the musculature and the nervous system for normal jump-muscle formation. wap/CG14614 is homologous to vertebrate wdr68, DDB1 and CUL4 associated factor 7, which also are expressed in neuromuscular tissues. Thus, our findings provide insight into mechanisms of neuromuscular development in higher animals and facilitate the understanding of neuromuscular diseases that may result from mis-expression of muscle-specific or neuron-specific genes.  相似文献   

15.
The spectrum of mutations discovered in cancer genomes can be explained by the activity of a few elementary mutational processes. We present a novel probabilistic method, EMu, to infer the mutational signatures of these processes from a collection of sequenced tumors. EMu naturally incorporates the tumor-specific opportunity for different mutation types according to sequence composition. Applying EMu to breast cancer data, we derive detailed maps of the activity of each process, both genome-wide and within specific local regions of the genome. Our work provides new opportunities to study the mutational processes underlying cancer development. EMu is available at http://www.sanger.ac.uk/resources/software/emu/.  相似文献   

16.
17.
18.
19.
Since the discovery of microRNAs (miRNAs) only two decades ago, they have emerged as an essential component of the gene regulatory machinery. miRNAs have seemingly paradoxical features: a single miRNA is able to simultaneously target hundreds of genes, while its presence is mostly dispensable for animal viability under normal conditions. It is known that miRNAs act as stress response factors; however, it remains challenging to determine their relevant targets and the conditions under which they function. To address this challenge, we propose a new workflow for miRNA function analysis, by which we found that the evolutionarily young miRNA family, the mir-310s (mir-310/mir-311/mir-312/mir-313), are important regulators of Drosophila metabolic status. mir-310s-deficient animals have an abnormal diet-dependent expression profile for numerous diet-sensitive components, accumulate fats, and show various physiological defects. We found that the mir-310s simultaneously repress the production of several regulatory factors (Rab23, DHR96, and Ttk) of the evolutionarily conserved Hedgehog (Hh) pathway to sharpen dietary response. As the mir-310s expression is highly dynamic and nutrition sensitive, this signal relay model helps to explain the molecular mechanism governing quick and robust Hh signaling responses to nutritional changes. Additionally, we discovered a new component of the Hh signaling pathway in Drosophila, Rab23, which cell autonomously regulates Hh ligand trafficking in the germline stem cell niche. How organisms adjust to dietary fluctuations to sustain healthy homeostasis is an intriguing research topic. These data are the first to report that miRNAs can act as executives that transduce nutritional signals to an essential signaling pathway. This suggests miRNAs as plausible therapeutic agents that can be used in combination with low calorie and cholesterol diets to manage quick and precise tissue-specific responses to nutritional changes.  相似文献   

20.
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号