共查询到20条相似文献,搜索用时 203 毫秒
1.
Roman A. Laskowski Janet M. Thornton 《Protein science : a publication of the Protein Society》2022,31(1):283
The PDBsum web server provides structural analyses of the entries in the Protein Data Bank (PDB). Two recent additions are described here. The first is the detailed analysis of the SARS‐CoV‐2 virus protein structures in the PDB. These include the variants of concern, which are shown both on the sequences and 3D structures of the proteins. The second addition is the inclusion of the available AlphaFold models for human proteins. The pages allow a search of the protein against existing structures in the PDB via the Sequence Annotated by Structure (SAS) server, so one can easily compare the predicted model against experimentally determined structures. The server is freely accessible to all at http://www.ebi.ac.uk/pdbsum. 相似文献
2.
Tobias H. Olsen Fergus Boyles Charlotte M. Deane 《Protein science : a publication of the Protein Society》2022,31(1):141
The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B‐cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in‐depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence‐based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download. 相似文献
3.
Sen I ODonoghue Andrea Schafferhans Neblina Sikta Christian Stolte Sandeep Kaur Bosco K Ho Stuart Anderson James B Procter Christian Dallago Nicola Bordin Matt Adcock Burkhard Rost 《Molecular systems biology》2021,17(9)
We modeled 3D structures of all SARS‐CoV‐2 proteins, generating 2,060 models that span 69% of the viral proteome and provide details not available elsewhere. We found that ˜6% of the proteome mimicked human proteins, while ˜7% was implicated in hijacking mechanisms that reverse post‐translational modifications, block host translation, and disable host defenses; a further ˜29% self‐assembled into heteromeric states that provided insight into how the viral replication and translation complex forms. To make these 3D models more accessible, we devised a structural coverage map, a novel visualization method to show what is—and is not—known about the 3D structure of the viral proteome. We integrated the coverage map into an accompanying online resource (https://aquaria.ws/covid) that can be used to find and explore models corresponding to the 79 structural states identified in this work. The resulting Aquaria‐COVID resource helps scientists use emerging structural data to understand the mechanisms underlying coronavirus infection and draws attention to the 31% of the viral proteome that remains structurally unknown or dark. 相似文献
4.
SUMMARY: A number of freely available text mining tools have been put together to extract highly reliable Drosophila gene interaction data from text. The system has been tested with The Interactive Fly, showing low recall (27-34%), but very high precision (93-97%). AVAILABILITY: The extracted data and a web interface for submission of texts to GIFT analysis are available at http://gift.cryst.bbk.ac.uk/gift CONTACT: n.domedel_puig@cryst.bbk.ac.uk SUPPLEMENTARY INFORMATION: Additional documentation, such as the dictionaries and the reference sets, are available at the GIFT website. 相似文献
5.
Lili Niu Philipp E Geyer Rajat Gupta Alberto Santos Florian Meier Sophia Doll Nicolai J Wewer Albrechtsen Sabine Klein Cristina Ortiz Frank E Uschner Robert Schierwagen Jonel Trebicka Matthias Mann 《Molecular systems biology》2022,18(5)
Deeper understanding of liver pathophysiology would benefit from a comprehensive quantitative proteome resource at cell type resolution to predict outcome and design therapy. Here, we quantify more than 150,000 sequence‐unique peptides aggregated into 10,000 proteins across total liver, the major liver cell types, time course of primary cell cultures, and liver disease states. Bioinformatic analysis reveals that half of hepatocyte protein mass is comprised of enzymes and 23% of mitochondrial proteins, twice the proportion of other liver cell types. Using primary cell cultures, we capture dynamic proteome remodeling from tissue states to cell line states, providing useful information for biological or pharmaceutical research. Our extensive data serve as spectral library to characterize a human cohort of non‐alcoholic steatohepatitis and cirrhosis. Dramatic proteome changes in liver tissue include signatures of hepatic stellate cell activation resembling liver cirrhosis and providing functional insights. We built a web‐based dashboard application for the interactive exploration of our resource (www.liverproteome.org). 相似文献
6.
Kinaan Aamir Khan Safyan Aman Memon Hammad Naveed 《Protein science : a publication of the Protein Society》2021,30(9):1935
Enzymes are critical proteins in every organism. They speed up essential chemical reactions, help fight diseases, and have a wide use in the pharmaceutical and manufacturing industries. Wet lab experiments to figure out an enzyme''s function are time consuming and expensive. Therefore, the need for computational approaches to address this problem are becoming necessary. Usually, an enzyme is extremely specific in performing its function. However, there exist enzymes that can perform multiple functions. A multi‐functional enzyme has vast potential as it reduces the need to discover/use different enzymes for different functions. We propose an approach to predict a multi‐functional enzyme''s function up to the most specific fourth level of the hierarchy of the Enzyme Commission (EC) number. Previous studies can only predict the function of the enzyme till level 1. Using a dataset of 2,583 multi‐functional enzymes, we achieved a hierarchical subset accuracy of 71.4% and a Macro F1 Score of 96.1% at the fourth level. The robustness of the network was further tested on a multi‐functional isoforms dataset. Our method is broadly applicable and may be used to discover better enzymes. The web‐server can be freely accessed at http://hecnet.cbrlab.org/. 相似文献
7.
Spyridon Ntougias Alla Lapidus James Han Konstantinos Mavromatis Amrita Pati Amy Chen Hans-Peter Klenk Tanja Woyke Constantinos Fasseas Nikos C. Kyrpides Georgios I. Zervakis 《Standards in genomic sciences》2014,9(3):783-793
Olivibacter sitiensis Ntougias et al. 2007 is a member of the family Sphingobacteriaceae, phylum Bacteroidetes. Members of the genus Olivibacter are phylogenetically diverse and of significant interest. They occur in diverse habitats, such as rhizosphere and contaminated soils, viscous wastes, composts, biofilter clean-up facilities on contaminated sites and cave environments, and they are involved in the degradation of complex and toxic compounds. Here we describe the features of O. sitiensis AW-6T, together with the permanent-draft genome sequence and annotation. The organism was sequenced under the Genomic Encyclopedia for Bacteria and Archaea (GEBA) project at the DOE Joint Genome Institute and is the first genome sequence of a species within the genus Olivibacter. The genome is 5,053,571 bp long and is comprised of 110 scaffolds with an average GC content of 44.61%. Of the 4,565 genes predicted, 4,501 were protein-coding genes and 64 were RNA genes. Most protein-coding genes (68.52%) were assigned to a putative function. The identification of 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase-coding genes indicates involvement of this organism in the catechol catabolic pathway. In addition, genes encoding for β-1,4-xylanases and β-1,4-xylosidases reveal the xylanolytic action of O. sitiensis. 相似文献
8.
9.
This article describes the development and creation of the Protein Circular Dichroism Data Bank (PCDDB), a deposition and searchable data bank for validated circular dichroism spectra located at http://pcddb.cryst.bbk.ac.uk/. 相似文献
10.
Martin Hunt Taisei Kikuchi Mandy Sanders Chris Newbold Matthew Berriman Thomas D Otto 《Genome biology》2013,14(5):R47
Methods to reliably assess the accuracy of genome sequence data are lacking. Currently completeness is only described qualitatively and mis-assemblies are overlooked. Here we present REAPR, a tool that precisely identifies errors in genome assemblies without the need for a reference sequence. We have validated REAPR on complete genomes or de novo assemblies from bacteria, malaria and Caenorhabditis elegans, and demonstrate that 86% and 82% of the human and mouse reference genomes are error-free, respectively. When applied to an ongoing genome project, REAPR provides corrected assembly statistics allowing the quantitative comparison of multiple assemblies. REAPR is available at http://www.sanger.ac.uk/resources/software/reapr/. 相似文献
11.
Costas Mitsopoulos Amanda C. Schierz Paul Workman Bissan Al-Lazikani 《PLoS computational biology》2015,11(12)
The interaction environment of a protein in a cellular network is important in defining the role that the protein plays in the system as a whole, and thus its potential suitability as a drug target. Despite the importance of the network environment, it is neglected during target selection for drug discovery. Here, we present the first systematic, comprehensive computational analysis of topological, community and graphical network parameters of the human interactome and identify discriminatory network patterns that strongly distinguish drug targets from the interactome as a whole. Importantly, we identify striking differences in the network behavior of targets of cancer drugs versus targets from other therapeutic areas and explore how they may relate to successful drug combinations to overcome acquired resistance to cancer drugs. We develop, computationally validate and provide the first public domain predictive algorithm for identifying druggable neighborhoods based on network parameters. We also make available full predictions for 13,345 proteins to aid target selection for drug discovery. All target predictions are available through canSAR.icr.ac.uk. Underlying data and tools are available at https://cansar.icr.ac.uk/cansar/publications/druggable_network_neighbourhoods/. 相似文献
12.
Over the past 35 years, developmental geneticists have made impressive progress
toward an understanding of how genes specify morphology and function, particularly as
they relate to the specification of each physical component of an organism. In the
last 20 years, male courtship behavior in Drosophila melanogaster
has emerged as a robust model system for the study of genetic specification of
behavior. Courtship behavior is both complex and innate, and a single gene,
fruitless (fru), is both necessary and sufficient for all aspects of the
courtship ritual. Typically, loss of male-specific Fruitless protein function results
in male flies that perform the courtship ritual incorrectly, slowly, or not at all.
Here we describe a novel requirement for fru: we have identified a group of cells in which male Fru
proteins are required to reduce the speed of courtship initiation. In addition, we
have identified a gene, Trapped in endoderm
1 (Tre1), which is required in these cells for normal courtship
and mating behavior. Tre1 encodes a G-protein-coupled receptor required for
establishment of cell polarity and cell migration and has previously not been shown
to be involved in courtship behavior. We describe the results of feminization of the
Tre1-expressing neurons, as well as the effects on courtship
behavior of mutation of Tre1. In addition, we show that Tre1 is expressed in a sexually dimorphic pattern in the
central and peripheral nervous systems and investigate the role of the
Tre1 cells in mate identification. 相似文献
13.
Jessica L. Sitnik Carmen Francis Korneel Hens Roger Huybrechts Mariana F. Wolfner Patrick Callaerts 《Genetics》2014,196(3):781-797
Members of the M13 class of metalloproteases have been implicated in diseases and in reproductive fitness. Nevertheless, their physiological role remains poorly understood. To obtain a tractable model with which to analyze this protein family’s function, we characterized the gene family in Drosophila melanogaster and focused on reproductive phenotypes. The D. melanogaster genome contains 24 M13 class protease homologs, some of which are orthologs of human proteases, including neprilysin. Many are expressed in the reproductive tracts of either sex. Using RNAi we individually targeted the five Nep genes most closely related to vertebrate neprilysin, Nep1-5, to investigate their roles in reproduction. A reduction in Nep1, Nep2, or Nep4 expression in females reduced egg laying. Nep1 and Nep2 are required in the CNS and the spermathecae for wild-type fecundity. Females that are null for Nep2 also show defects as hosts of sperm competition as well as an increased rate of depletion for stored sperm. Furthermore, eggs laid by Nep2 mutant females are fertilized normally, but arrest early in embryonic development. In the male, only Nep1 was required to induce normal patterns of female egg laying. Reduction in the expression of Nep2-5 in the male did not cause any dramatic effects on reproductive fitness, which suggests that these genes are either nonessential for male fertility or perform redundant functions. Our results suggest that, consistent with the functions of neprilysins in mammals, these proteins are also required for reproduction in Drosophila, opening up this model system for further functional analysis of this protein class and their substrates. 相似文献
14.
Ginny R. Morriss Carmelita T. Jaramillo Crystal M. Mikolajczak Sandy Duong MaryAnn S. Jaramillo Richard M. Cripps 《Genetics》2013,195(3):927-940
wings apart (wap) is a recessive, semilethal gene located on the X chromosome in Drosophila melanogaster, which is required for normal wing-vein patterning. We show that the wap mutation also results in loss of the adult jump muscle. We use complementation mapping and gene-specific RNA interference to localize the wap locus to the proximal X chromosome. We identify the annotated gene CG14614 as the gene affected by the wap mutation, since one wap allele contains a non-sense mutation in CG14614, and a genomic fragment containing only CG14614 rescues the jump-muscle phenotypes of two wap mutant alleles. The wap gene lies centromere-proximal to touch-insensitive larva B and centromere-distal to CG14619, which is tentatively assigned as the gene affected in introverted mutants. In mutant wap animals, founder cell precursors for the jump muscle are specified early in development, but are later lost. Through tissue-specific knockdowns, we demonstrate that wap function is required in both the musculature and the nervous system for normal jump-muscle formation. wap/CG14614 is homologous to vertebrate wdr68, DDB1 and CUL4 associated factor 7, which also are expressed in neuromuscular tissues. Thus, our findings provide insight into mechanisms of neuromuscular development in higher animals and facilitate the understanding of neuromuscular diseases that may result from mis-expression of muscle-specific or neuron-specific genes. 相似文献
15.
Andrej Fischer Christopher JR Illingworth Peter J Campbell Ville Mustonen 《Genome biology》2013,14(4):R39
The spectrum of mutations discovered in cancer genomes can be explained by the activity of a few elementary mutational processes. We present a novel probabilistic method, EMu, to infer the mutational signatures of these processes from a collection of sequenced tumors. EMu naturally incorporates the tumor-specific opportunity for different mutation types according to sequence composition. Applying EMu to breast cancer data, we derive detailed maps of the activity of each process, both genome-wide and within specific local regions of the genome. Our work provides new opportunities to study the mutational processes underlying cancer development. EMu is available at http://www.sanger.ac.uk/resources/software/emu/. 相似文献
16.
17.
18.
19.
Ibrahim ?mer ?i?ek Samir Karaca Marko Brankatschk Suzanne Eaton Henning Urlaub Halyna R. Shcherbata 《Genetics》2016,202(3):1167-1183
Since the discovery of microRNAs (miRNAs) only two decades ago, they have emerged as an essential component of the gene regulatory machinery. miRNAs have seemingly paradoxical features: a single miRNA is able to simultaneously target hundreds of genes, while its presence is mostly dispensable for animal viability under normal conditions. It is known that miRNAs act as stress response factors; however, it remains challenging to determine their relevant targets and the conditions under which they function. To address this challenge, we propose a new workflow for miRNA function analysis, by which we found that the evolutionarily young miRNA family, the mir-310s (mir-310/mir-311/mir-312/mir-313), are important regulators of Drosophila metabolic status. mir-310s-deficient animals have an abnormal diet-dependent expression profile for numerous diet-sensitive components, accumulate fats, and show various physiological defects. We found that the mir-310s simultaneously repress the production of several regulatory factors (Rab23, DHR96, and Ttk) of the evolutionarily conserved Hedgehog (Hh) pathway to sharpen dietary response. As the mir-310s expression is highly dynamic and nutrition sensitive, this signal relay model helps to explain the molecular mechanism governing quick and robust Hh signaling responses to nutritional changes. Additionally, we discovered a new component of the Hh signaling pathway in Drosophila, Rab23, which cell autonomously regulates Hh ligand trafficking in the germline stem cell niche. How organisms adjust to dietary fluctuations to sustain healthy homeostasis is an intriguing research topic. These data are the first to report that miRNAs can act as executives that transduce nutritional signals to an essential signaling pathway. This suggests miRNAs as plausible therapeutic agents that can be used in combination with low calorie and cholesterol diets to manage quick and precise tissue-specific responses to nutritional changes. 相似文献
20.
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins. 相似文献