首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 140 毫秒
1.

Background

The binding of peptide fragments of antigens to class II MHC is a crucial step in initiating a helper T cell immune response. The identification of such peptide epitopes has potential applications in vaccine design and in better understanding autoimmune diseases and allergies. However, comprehensive experimental determination of peptide-MHC binding affinities is infeasible due to MHC diversity and the large number of possible peptide sequences. Computational methods trained on the limited experimental binding data can address this challenge. We present the MultiRTA method, an extension of our previous single-type RTA prediction method, which allows the prediction of peptide binding affinities for multiple MHC allotypes not used to train the model. Thus predictions can be made for many MHC allotypes for which experimental binding data is unavailable.

Results

We fit MultiRTA models for both HLA-DR and HLA-DP using large experimental binding data sets. The performance in predicting binding affinities for novel MHC allotypes, not in the training set, was tested in two different ways. First, we performed leave-one-allele-out cross-validation, in which predictions are made for one allotype using a model fit to binding data for the remaining MHC allotypes. Comparison of the HLA-DR results with those of two other prediction methods applied to the same data sets showed that MultiRTA achieved performance comparable to NetMHCIIpan and better than the earlier TEPITOPE method. We also directly tested model transferability by making leave-one-allele-out predictions for additional experimentally characterized sets of overlapping peptide epitopes binding to multiple MHC allotypes. In addition, we determined the applicability of prediction methods like MultiRTA to other MHC allotypes by examining the degree of MHC variation accounted for in the training set. An examination of predictions for the promiscuous binding CLIP peptide revealed variations in binding affinity among alleles as well as potentially distinct binding registers for HLA-DR and HLA-DP. Finally, we analyzed the optimal MultiRTA parameters to discover the most important peptide residues for promiscuous and allele-specific binding to HLA-DR and HLA-DP allotypes.

Conclusions

The MultiRTA method yields competitive performance but with a significantly simpler and physically interpretable model compared with previous prediction methods. A MultiRTA prediction webserver is available at http://bordnerlab.org/MultiRTA.
  相似文献   

2.
Zhang L  Chen Y  Wong HS  Zhou S  Mamitsuka H  Zhu S 《PloS one》2012,7(2):e30483

Motivation

Accurate identification of peptides binding to specific Major Histocompatibility Complex Class II (MHC-II) molecules is of great importance for elucidating the underlying mechanism of immune recognition, as well as for developing effective epitope-based vaccines and promising immunotherapies for many severe diseases. Due to extreme polymorphism of MHC-II alleles and the high cost of biochemical experiments, the development of computational methods for accurate prediction of binding peptides of MHC-II molecules, particularly for the ones with few or no experimental data, has become a topic of increasing interest. TEPITOPE is a well-used computational approach because of its good interpretability and relatively high performance. However, TEPITOPE can be applied to only 51 out of over 700 known HLA DR molecules.

Method

We have developed a new method, called TEPITOPEpan, by extrapolating from the binding specificities of HLA DR molecules characterized by TEPITOPE to those uncharacterized. First, each HLA-DR binding pocket is represented by amino acid residues that have close contact with the corresponding peptide binding core residues. Then the pocket similarity between two HLA-DR molecules is calculated as the sequence similarity of the residues. Finally, for an uncharacterized HLA-DR molecule, the binding specificity of each pocket is computed as a weighted average in pocket binding specificities over HLA-DR molecules characterized by TEPITOPE.

Result

The performance of TEPITOPEpan has been extensively evaluated using various data sets from different viewpoints: predicting MHC binding peptides, identifying HLA ligands and T-cell epitopes and recognizing binding cores. Among the four state-of-the-art competing pan-specific methods, for predicting binding specificities of unknown HLA-DR molecules, TEPITOPEpan was roughly the second best method next to NETMHCIIpan-2.0. Additionally, TEPITOPEpan achieved the best performance in recognizing binding cores. We further analyzed the motifs detected by TEPITOPEpan, examining the corresponding literature of immunology. Its online server and PSSMs therein are available at http://www.biokdd.fudan.edu.cn/Service/TEPITOPEpan/.  相似文献   

3.

Background

Prediction of the binding ability of antigen peptides to major histocompatibility complex (MHC) class II molecules is important in vaccine development. The variable length of each binding peptide complicates this prediction. Motivated by a text mining model designed for building a classifier from labeled and unlabeled examples, we have developed an iterative supervised learning model for the prediction of MHC class II binding peptides.

Results

A linear programming (LP) model was employed for the learning task at each iteration, since it is fast and can re-optimize the previous classifier when the training sets are altered. The performance of the new model has been evaluated with benchmark datasets. The outcome demonstrates that the model achieves an accuracy of prediction that is competitive compared to the advanced predictors (the Gibbs sampler and TEPITOPE). The average areas under the ROC curve obtained from one variant of our model are 0.753 and 0.715 for the original and homology reduced benchmark sets, respectively. The corresponding values are respectively 0.744 and 0.673 for the Gibbs sampler and 0.702 and 0.667 for TEPITOPE.

Conclusion

The iterative learning procedure appears to be effective in prediction of MHC class II binders. It offers an alternative approach to this important predictionproblem.  相似文献   

4.

Objectives

To investigate the effects of operational process conditions on expression of MHC class II protein from a stable Drosophila S2 cell line.

Results

When the Drosophila S2 cells were grown in vented orbitally shaken TubeSpin bioreactor 600 containers, cell growth was improved three-fold and the yield of recombinant major histocompatibility (MHC) class II protein (HLA-DR12xHis) increased four-fold over the levels observed for the same cells cultivated in roller bottles (RB) without vented caps. Culturing in RB with vented caps while increasing the rotation speed from 6 rpm to 18 rpm also improved cell growth five-fold and protein productivity three-fold which is comparable to the levels observed in the orbitally shaken containers. Protein activity was found to be almost identical between the two vessel systems tested.

Conclusions

Optimized cell culture conditions and a more efficient vessel type can enhance gas transfer and mixing and lead to substantial improvement of recombinant product yields from S2 cells.
  相似文献   

5.
CD4 positive T helper cells control many aspects of specific immunity. These cells are specific for peptides derived from protein antigens and presented by molecules of the extremely polymorphic major histocompatibility complex (MHC) class II system. The identification of peptides that bind to MHC class II molecules is therefore of pivotal importance for rational discovery of immune epitopes. HLA-DR is a prominent example of a human MHC class II. Here, we present a method, NetMHCIIpan, that allows for pan-specific predictions of peptide binding to any HLA-DR molecule of known sequence. The method is derived from a large compilation of quantitative HLA-DR binding events covering 14 of the more than 500 known HLA-DR alleles. Taking both peptide and HLA sequence information into account, the method can generalize and predict peptide binding also for HLA-DR molecules where experimental data is absent. Validation of the method includes identification of endogenously derived HLA class II ligands, cross-validation, leave-one-molecule-out, and binding motif identification for hitherto uncharacterized HLA-DR molecules. The validation shows that the method can successfully predict binding for HLA-DR molecules-even in the absence of specific data for the particular molecule in question. Moreover, when compared to TEPITOPE, currently the only other publicly available prediction method aiming at providing broad HLA-DR allelic coverage, NetMHCIIpan performs equivalently for alleles included in the training of TEPITOPE while outperforming TEPITOPE on novel alleles. We propose that the method can be used to identify those hitherto uncharacterized alleles, which should be addressed experimentally in future updates of the method to cover the polymorphism of HLA-DR most efficiently. We thus conclude that the presented method meets the challenge of keeping up with the MHC polymorphism discovery rate and that it can be used to sample the MHC "space," enabling a highly efficient iterative process for improving MHC class II binding predictions.  相似文献   

6.

Background

Predicting B-cell epitopes is very important for designing vaccines and drugs to fight against the infectious agents. However, due to the high complexity of this problem, previous prediction methods that focus on linear and conformational epitope prediction are both unsatisfactory. In addition, antigen interacting with antibody is context dependent and the coarse binary classification of antigen residues into epitope and non-epitope without the corresponding antibody may not reveal the biological reality. Therefore, we take a novel way to identify epitopes by using associations between antibodies and antigens.

Results

Given a pair of antibody-antigen sequences, the epitope residues can be identified by two types of associations: paratope-epitope interacting biclique and cooccurrent pattern of interacting residue pairs. As the association itself does not include the neighborhood information on the primary sequence, residues' cooperativity and relative composition are then used to enhance our method. Evaluation carried out on a benchmark data set shows that the proposed method produces very good performance in terms of accuracy. After compared with other two structure-based B-cell epitope prediction methods, results show that the proposed method is competitive to, sometimes even better than, the structure-based methods which have much smaller applicability scope.

Conclusions

The proposed method leads to a new way of identifying B-cell epitopes. Besides, this antibody-specified epitope prediction can provide more precise and helpful information for wet-lab experiments.
  相似文献   

7.

Background

STAT1 and IRF1 collaborate to induce interferon-γ (IFNγ) stimulated genes (ISGs), but the extent to which they act alone or together is unclear. The effect of single nucleotide polymorphisms (SNPs) on in vivo binding is also largely unknown.

Results

We show that IRF1 binds at proximal or distant ISG sites twice as often as STAT1, increasing to sixfold at the MHC class I locus. STAT1 almost always bound with IRF1, while most IRF1 binding events were isolated. Dual binding sites at remote or proximal enhancers distinguished ISGs that were responsive to IFNγ versus cell-specific resistant ISGs, which showed fewer and mainly single binding events. Surprisingly, inducibility in one cell type predicted ISG-responsiveness in other cells. Several dbSNPs overlapped with STAT1 and IRF1 binding motifs, and we developed methodology to rapidly assess their effects. We show that in silico prediction of SNP effects accurately reflects altered binding both in vitro and in vivo.

Conclusions

These data reveal broad cooperation between STAT1 and IRF1, explain cell type specific differences in ISG-responsiveness, and identify genetic variants that may participate in the pathogenesis of immune disorders.
  相似文献   

8.

Introduction

Human plasma metabolomics offer powerful tools for understanding disease mechanisms and identifying clinical biomarkers for diagnosis, efficacy prediction and patient stratification. Although storage conditions can affect the reliability of data from metabolites, strict control of these conditions remains challenging, particularly when clinical samples are included from multiple centers. Therefore, it is necessary to consider stability profiles of each analyte.

Objectives

The purpose of this study was to extract unstable metabolites from vast metabolome data and identify factors that cause instability.

Method

Plasma samples were obtained from five healthy volunteers, were stored under ten different conditions of time and temperature and were quantified using leading-edge metabolomics. Instability was evaluated by comparing quantitation values under each storage condition with those obtained after ?80 °C storage.

Result

Stability profiling of the 992 metabolites showed time- and temperature-dependent increases in numbers of significantly changed metabolites. This large volume of data enabled comparisons of unstable metabolites with their related molecules and allowed identification of causative factors, including compound-specific enzymatic activity in plasma and chemical reactivity. Furthermore, these analyses indicated extreme instability of 1-docosahexaenoylglycerol, 1-arachidonoylglycerophosphate, cystine, cysteine and N6-methyladenosine.

Conclusion

A large volume of data regarding storage stability was obtained. These data are a contribution to the discovery of biomarker candidates without misselection based on unreliable values and to the establishment of suitable handling procedures for targeted biomarker quantification.
  相似文献   

9.

Background

The heme-protein interactions are essential for various biological processes such as electron transfer, catalysis, signal transduction and the control of gene expression. The knowledge of heme binding residues can provide crucial clues to understand these activities and aid in functional annotation, however, insufficient work has been done on the research of heme binding residues from protein sequence information.

Methods

We propose a sequence-based approach for accurate prediction of heme binding residues by a novel integrative sequence profile coupling position specific scoring matrices with heme specific physicochemical properties. In order to select the informative physicochemical properties, we design an intuitive feature selection scheme by combining a greedy strategy with correlation analysis.

Results

Our integrative sequence profile approach for prediction of heme binding residues outperforms the conventional methods using amino acid and evolutionary information on the 5-fold cross validation and the independent tests.

Conclusions

The novel feature of an integrative sequence profile achieves good performance using a reduced set of feature vector elements.
  相似文献   

10.

Background  

Antigen presenting cells (APCs) sample the extra cellular space and present peptides from here to T helper cells, which can be activated if the peptides are of foreign origin. The peptides are presented on the surface of the cells in complex with major histocompatibility class II (MHC II) molecules. Identification of peptides that bind MHC II molecules is thus a key step in rational vaccine design and developing methods for accurate prediction of the peptide:MHC interactions play a central role in epitope discovery. The MHC class II binding groove is open at both ends making the correct alignment of a peptide in the binding groove a crucial part of identifying the core of an MHC class II binding motif. Here, we present a novel stabilization matrix alignment method, SMM-align, that allows for direct prediction of peptide:MHC binding affinities. The predictive performance of the method is validated on a large MHC class II benchmark data set covering 14 HLA-DR (human MHC) and three mouse H2-IA alleles.  相似文献   

11.

Background

Current risk prediction models in heart failure (HF) including clinical characteristics and biomarkers only have moderate predictive value. The aim of this study was to use matrix assisted laser desorption ionisation mass spectrometry (MALDI-MS) profiling to determine if a combination of peptides identified with MALDI-MS will better predict clinical outcomes of patients with HF.

Methods

A cohort of 100 patients with HF were recruited in the biomarker discovery phase (50 patients who died or had a HF hospital admission vs. 50 patients who did not have an event). The peptide extraction from plasma samples was performed using reversed phase C18. Then samples were analysed using MALDI-MS. A multiple peptide biomarker model was discovered that was able to predict clinical outcomes for patients with HF. Finally, this model was validated in an independent cohort with 100 patients with HF.

Results

After normalisation and alignment of all the processed spectra, a total of 11,389 peptides (m/z) were detected using MALDI-MS. A multiple biomarker model was developed from 14 plasma peptides that was able to predict clinical outcomes in HF patients with an area under the receiver operating characteristic curve (AUC) of 1.000 (p?=?0.0005). This model was validated in an independent cohort with 100 HF patients that yielded an AUC of 0.817 (p?=?0.0005) in the biomarker validation phase. Addition of this model to the BIOSTAT risk prediction model increased the predictive probability for clinical outcomes of HF from an AUC value of 0.643 to an AUC of 0.823 (p?=?0.0021). Moreover, using the prediction model of fourteen peptides and the composite model of the multiple biomarker of fourteen peptides with the BIOSTAT risk prediction model achieved a better predictive probability of time-to-event in prediction of clinical events in patients with HF (p?=?0.0005).

Conclusions

The results obtained in this study suggest that a cluster of plasma peptides using MALDI-MS can reliably predict clinical outcomes in HF that may help enable precision medicine in HF.
  相似文献   

12.

Objectives

To explore the effects of Lin28A on progression of osteocarcinoma (OS) cells.

Results

Lin28A mRNA and protein expressions were significantly increased in OS tissues compared with that in normal adjacent tissues. Expressions of Lin28A and long noncoding RNA MALAT1 were positively correlated. Patients with higher Lin28A expression had shorter overall survival. Moreover, Lin28A knockdown inhibited OS cells proliferation, migration, invasion and promoted cell apoptosis; Lin28A was found to harbor binding sites on MALAT1 sequences and associated with MALAT1, and increased MALAT1 stability and expression. Notably, the inhibition of Lin28A knockdown was attenuated or even reversed by MALAT1 overexpression.

Conclusions

RNA binding protein Lin28A could facilitate OS cells progression by associating with the long noncoding RNA MALAT1.
  相似文献   

13.

Background

Liquid chromatography combined with tandem mass spectrometry is an important tool in proteomics for peptide identification. Liquid chromatography temporally separates the peptides in a sample. The peptides that elute one after another are analyzed via tandem mass spectrometry by measuring the mass-to-charge ratio of a peptide and its fragments. De novo peptide sequencing is the problem of reconstructing the amino acid sequences of a peptide from this measurement data. Past de novo sequencing algorithms solely consider the mass spectrum of the fragments for reconstructing a sequence.

Results

We propose to additionally exploit the information obtained from liquid chromatography. We study the problem of computing a sequence that is not only in accordance with the experimental mass spectrum, but also with the chromatographic retention time. We consider three models for predicting the retention time and develop algorithms for de novo sequencing for each model.

Conclusions

Based on an evaluation for two prediction models on experimental data from synthesized peptides we conclude that the identification rates are improved by exploiting the chromatographic information. In our evaluation, we compare our algorithms using the retention time information with algorithms using the same scoring model, but not the retention time.
  相似文献   

14.

Background

Horse fibrinogen has been identified as a plasma specific ferritin-binding protein. There are two ways in the binding of ferritin-binding protein with ferritin: one is direct binding and the other is indirect binding which is heme-mediated. The aim of this study was to analyze the binding between horse fibrinogen and ferritin.

Findings

Although fibrinogen in horse plasma did not show the binding to ferritin coated on the plate wells, after following heat-treatment (60°C, 30 min) of horse plasma, plasma fibrinogen as well as purified horse fibrinogen bound to plates coated with horse spleen ferritin, but not with its apoferritin which lost heme as well as iron after the treatment of reducing reagent. Binding of purified or plasma fibrinogen to ferritin was inhibited by hemin and Sn-protoporphyrin IX (Sn-PPIX), but not by PPIX or Zn-PPIX.

Conclusions

Heat-treatment of horse plasma enabled plasma fibrinogen to bind to plate well coated with holo-ferritin. From the binding analysis of fibrinogen and ferritin, it is suggested that horse fibrinogen recognized iron or tin in complexed with the heme- or the hemin-ring, and also suggest that some fibrinogens circulate in the form of a complex with ferritin and/or heat-labile factors which inhibit the binding of fibrinogen with ferritin.
  相似文献   

15.

Background

The arboviruses Zika virus (ZIKV) and Dengue virus (DENV) have important epidemiological impact in Brazil and other tropical regions of the world. Recently, it was shown that previous humoral immunity to DENV enhances ZIKV replication in vitro, which may lead to more severe forms of the disease. Thus, traditional approaches of vaccine development aiming to control viral infection through neutralizing antibodies may induce cross-reactive enhancing antibodies. In contrast, cellular immune response was shown to be capable of controlling DENV infection independently of antibodies. The aim of the present study was to design a flavivirus NS5 protein capable of inducing a cellular immune response against DENV and ZIKV.

Methods

A consensus sequence of ZIKV NS5 protein was designed among isolates from various continents. Epitopes were predicted for the most prevalent alleles of class I and II HLA in the Brazilian population. Then, this epitopes were analyzed with regard to their conservation, population coverage and distribution along the whole antigen.

Results

Nineteen epitopes predicted to be more reactive (percentile rank <1) and 100% conserved among ZIKV and DENV serotypes were selected. The distribution of such epitopes along the protein was shown on a three-dimensional model and population coverage was calculated for different regions of the world. The designed protein was predicted to be stable and the distribution of selected epitopes was shown to be homogeneous along domains. The population coverage of selected epitopes was higher than 50% for most of tropical areas of the world.

Conclusion

Such results indicate that the proposed antigen has the potential to induce protective cellular immune response to ZIKV and DENV in different human populations of the world.
  相似文献   

16.

Introduction

Collecting feces is easy. It offers direct outcome to endogenous and microbial metabolites.

Objectives

In a context of lack of consensus about fecal sample preparation, especially in animal species, we developed a robust protocol allowing untargeted LC-HRMS fingerprinting.

Methods

The conditions of extraction (quantity, preparation, solvents, dilutions) were investigated in bovine feces.

Results

A rapid and simple protocol involving feces extraction with methanol (1/3, M/V) followed by centrifugation and a step filtration (10 kDa) was developed.

Conclusion

The workflow generated repeatable and informative fingerprints for robust metabolome characterization.
  相似文献   

17.

Background

Cancer immunotherapy uses one’s own immune system to fight cancerous cells. As immune system is hard-wired to distinguish self and non-self, cancer immunotherapy is predicted to target cancerous cells specifically, therefore is less toxic than chemotherapy and radiation therapy, two major treatments for cancer. Cancer immunologists have spent decades to search for the specific targets in cancerous cells.

Methods

Due to the recent advances in high throughput sequencing and bioinformatics, evidence has merged that the neoantigens in cancerous cells are probably the cancer-specific targets that lead to the destruction of cancer.We will review the transplantable murine tumor models for cancer immunotherapy and the bioinformatics tools used to navigate mouse genome to identify tumor-rejecting neoantigens.

Results

Several groups have independently identified point mutations that can be recognized by T cells of host immune system. It is consistent with the note that the formation of peptide-MHC I-TCR complex is critical to activate T cells. Both anchor residue and TCR-facing residue mutations have been reported. While TCR-facing residue mutations may directly activate specific T cells, anchor residue mutations improve the binding of peptides to MHC I molecules, which increases the presentation of peptides and the T cell activation indirectly.

Conclusions

Our work indicates that the affinity of neoepitopes for MHC I is not a predictor for anti-tumor immune responses in mice. Instead differential agretopic index (DAI), the numerical difference of epitope-MHC I affinities between the mutated and un-mutated sequences is a significant predictor. A similar bioinformatics pipeline has been developed to generate personalized vaccines to treat human ovarian cancer in a Phase I clinical trial.
  相似文献   

18.

Background

Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long standing goals in molecular and computational biology.

Methods

We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such as binding propensity, neighboring residues in the vicinity of binding sites, conservation score and conformational switching.

Results

We observed that the binding propensities of amino acid residues are specific for protein-protein complexes. Further, typical dipeptides and tripeptides showed high preference for binding, which is unique to protein-protein complexes. Most of the binding site residues are highly conserved among homologous sequences. Our analysis showed that 7% of residues changed their conformations upon protein-protein complex formation and it is 9.2% and 6.6% in the binding and non-binding sites, respectively. Specifically, the residues Glu, Lys, Leu and Ser changed their conformation from coil to helix/strand and from helix to coil/strand. Leu, Ser, Thr and Val prefer to change their conformation from strand to coil/helix.

Conclusions

The results obtained in this study will be helpful for understanding and predicting the binding sites in protein-protein complexes.
  相似文献   

19.
Lyu  Chuqiao  Wang  Lei  Zhang  Juhua 《BMC genomics》2018,19(10):905-165

Background

The DNase I hypersensitive sites (DHSs) are associated with the cis-regulatory DNA elements. An efficient method of identifying DHSs can enhance the understanding on the accessibility of chromatin. Despite a multitude of resources available on line including experimental datasets and computational tools, the complex language of DHSs remains incompletely understood.

Methods

Here, we address this challenge using an approach based on a state-of-the-art machine learning method. We present a novel convolutional neural network (CNN) which combined Inception like networks with a gating mechanism for the response of multiple patterns and longterm association in DNA sequences to predict multi-scale DHSs in Arabidopsis, rice and Homo sapiens.

Results

Our method obtains 0.961 area under curve (AUC) on Arabidopsis, 0.969 AUC on rice and 0.918 AUC on Homo sapiens.

Conclusions

Our method provides an efficient and accurate way to identify multi-scale DHSs sequences by deep learning.
  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号