首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Many drugs with very different affinity to a large number of receptors are described. Thus, in this work, we selected drug-target pairs (DTPs/nDTPs) of drugs with high affinity/nonaffinity for different targets. Quantitative structure-activity relationship (QSAR) models become a very useful tool in this context because they substantially reduce time and resource-consuming experiments. Unfortunately, most QSAR models predict activity against only one protein target and/or they have not been implemented on a public Web server yet, freely available online to the scientific community. To solve this problem, we developed a multitarget QSAR (mt-QSAR) classifier combining the MARCH-INSIDE software for the calculation of the structural parameters of drug and target with the linear discriminant analysis (LDA) method in order to seek the best model. The accuracy of the best LDA model was 94.4% (3,859/4,086 cases) for training and 94.9% (1,909/2,012 cases) for the external validation series. In addition, we implemented the model into the Web portal Bio-AIMS as an online server entitled MARCH-INSIDE Nested Drug-Bank Exploration & Screening Tool (MIND-BEST), located at http://miaja.tic.udc.es/Bio-AIMS/MIND-BEST.php . This online tool is based on PHP/HTML/Python and MARCH-INSIDE routines. Finally, we illustrated two practical uses of this server with two different experiments. In experiment 1, we report for the first time a MIND-BEST prediction, synthesis, characterization, and MAO-A and MAO-B pharmacological assay of eight rasagiline derivatives, promising for anti-Parkinson drug design. In experiment 2, we report sampling, parasite culture, sample preparation, 2-DE, MALDI-TOF and -TOF/TOF MS, MASCOT search, 3D structure modeling with LOMETS, and MIND-BEST prediction for different peptides as new protein of the found in the proteome of the bird parasite Trichomonas gallinae, which is promising for antiparasite drug targets discovery.  相似文献   

2.
There are many protein ligands and/or drugs described with very different affinity to a large number of target proteins or receptors. In this work, we selected Ligands or Drug-target pairs (DTPs/nDTPs) of drugs with high affinity/non-affinity for different targets. Quantitative Structure-Activity Relationships (QSAR) models become a very useful tool in this context to substantially reduce time and resources consuming experiments. Unfortunately most QSAR models predict activity against only one protein target and/or have not been implemented in the form of public web server freely accessible online to the scientific community. To solve this problem, we developed here a multi-target QSAR (mt-QSAR) classifier using the MARCH-INSIDE technique to calculate structural parameters of drug and target plus one Artificial Neuronal Network (ANN) to seek the model. The best ANN model found is a Multi-Layer Perceptron (MLP) with profile MLP 20:20-15-1:1. This MLP classifies correctly 611 out of 678 DTPs (sensitivity=90.12%) and 3083 out of 3408 nDTPs (specificity=90.46%), corresponding to training accuracy=90.41%. The validation of the model was carried out by means of external predicting series. The model classifies correctly 310 out of 338 DTPs (sensitivity=91.72%) and 1527 out of 1674 nDTP (specificity=91.22%) in validation series, corresponding to total accuracy=91.30% for validation series (predictability). This model favorably compares with other ANN models developed in this work and Machine Learning classifiers published before to address the same problem in different aspects. We implemented the present model at web portal Bio-AIMS in the form of an online server called: Non-Linear MARCH-INSIDE Nested Drug-Bank Exploration & Screening Tool (NL MIND-BEST), which is located at URL: http://miaja.tic.udc.es/Bio-AIMS/NL-MIND-BEST.php. This online tool is based on PHP/HTML/Python and MARCH-INSIDE routines. Finally we illustrated two practical uses of this server with two different experiments. In experiment 1, we report by first time Quantum QSAR study, synthesis, characterization, and experimental assay of antiplasmodial and cytotoxic activities of oxoisoaporphine alkaloids derivatives as well as NL MIND-BEST prediction of potential target proteins. In experiment 2, we report sampling, parasite culture, sample preparation, 2-DE, MALDI-TOF, and -TOF/TOF MS, MASCOT search, MM/MD 3D structure modeling, and NL MIND-BEST prediction for different peptides a new protein of the found in the proteome of the human parasite Giardia lamblia, which is promising for anti-parasite drug-targets discovery.  相似文献   

3.
Lipid-Binding Proteins (LIBPs) or Fatty Acid-Binding Proteins (FABPs) play an important role in many diseases such as different types of cancer, kidney injury, atherosclerosis, diabetes, intestinal ischemia and parasitic infections. Thus, the computational methods that can predict LIBPs based on 3D structure parameters became a goal of major importance for drug-target discovery, vaccine design and biomarker selection. In addition, the Protein Data Bank (PDB) contains 3000+ protein 3D structures with unknown function. This list, as well as new experimental outcomes in proteomics research, is a very interesting source to discover relevant proteins, including LIBPs. However, to the best of our knowledge, there are no general models to predict new LIBPs based on 3D structures. We developed new Quantitative Structure-Activity Relationship (QSAR) models based on 3D electrostatic parameters of 1801 different proteins, including 801 LIBPs. We calculated these electrostatic parameters with the MARCH-INSIDE software and they correspond to the entire protein or to specific protein regions named core, inner, middle, and surface. We used these parameters as inputs to develop a simple Linear Discriminant Analysis (LDA) classifier to discriminate 3D structure of LIBPs from other proteins. We implemented this predictor in the web server named LIBP-Pred, freely available at , along with other important web servers of the Bio-AIMS portal. The users can carry out an automatic retrieval of protein structures from PDB or upload their custom protein structural models from their disk created with LOMETS server. We demonstrated the PDB mining option performing a predictive study of 2000+ proteins with unknown function. Interesting results regarding the discovery of new Cancer Biomarkers in humans or drug targets in parasites have been discussed here in this sense.  相似文献   

4.
We developed a computational method to predict the retention times of peptides in HPLC using artificial neural networks (ANN). We performed stepwise multiple linear regressions and selected for ANN input amino acids that significantly affected the LC retention time. Unlike conventional linear models, the trained ANN accurately predicted the retention time of peptides containing up to 50 amino acid residues. In 834 peptides, there was a strong correlation (R2 = 0.928) between measured and predicted retention times. We demonstrated the utility of our method by the prediction of the retention time of 121,273 peptides resulting from LysC-digestion of the Escherichia coli proteome. Our approach is useful for the proteome-wide characterization of peptides and the identification of unknown peptide peaks obtained in proteome analysis.  相似文献   

5.
A new matrix-assisted laser desorption/ionization time of flight mass spectrometer (MALDI-ToF MS), developed specifically for the identification and characterization of proteins and peptides in proteomic investigations, is described. The mass spectrometer which can be integrated with the 2-D gel electrophoresis workflow is a bench-top instrument, enabling rapid, reliable and unattended protein identification in low-, as well as high-throughput proteomics applications. To obtain precise information on peptide sequences, the instrument utilizes a timed ion gate and a unique quadratic field reflectron (Z2 technology), allowing single-run, post-source decay (PSD) of selected peptides. In this study, the performance of the instrument in reflectron, PSD and linear mode, respectively, was investigated. The results showed that the limit of detection for a single peptide in reflectron mode was 125 amol with a signal to noise ratio exceeding 20. Average mass resolution for peptides larger than 2000 u was around 13,000 full width, half maximum (FWHM). The limit for protein identification during peptide mass fingerprinting (PMF) was 500 amol with a sequence coverage of 18%. Mass error during PMF analysis was less than 15 ppm for 17 out of 25 (68%) identified peptides. In PSD mode, a complete series of y-ions of a CAF-derivatized peptide could be obtained from 3.75 fmol of material. The average mass error of PSD-generated fragments was less than 0.14 u. Finally, in linear mode, intact proteins with molecular masses greater than 300,000 u were detected with mass errors below 0.2%.  相似文献   

6.
Separation and identification of hydrophobic membrane proteins is a major challenge in proteomics. Identification of such sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE)-separated proteins by peptide mass fingerprinting (PMF) via matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) is frequently hampered by the insufficient amount of peptides being generated and their low signal intensity. Using the seven helical transmembrane-spanning proton pump bacteriorhodopsin as model protein, we demonstrate here that SDS removal from hydrophobic proteins by ion-pair extraction prior to in-gel tryptic proteolysis leads to a tenfold higher sensitivity in mass spectrometric identification via PMF, with respect to initial protein load on SDS-PAGE. Furthermore, parallel sequencing of the generated peptides by electrospray ionization-mass spectrometry (ESI-MS) and tandem mass spectrometry (MS/MS) was possible without further sample cleanup. We also show identification of other membrane proteins by this protocol, as proof of general applicability.  相似文献   

7.
Predicting the bioactivity of peptides and proteins is an important challenge in drug development and protein engineering. In this study we introduce a novel approach, the so-called “physics and chemistry-driven artificial neural network (Phys-Chem ANN)”, to deal with such a problem. Unlike the existing ANN approaches, which were designed under the inspiration of biological neural system, the Phys-Chem ANN approach is based on the physical and chemical principles, as well as the structural features of proteins. In the Phys-Chem ANN model the “hidden layers” are no longer virtual “neurons”, but real structural units of proteins and peptides. It is a hybridization approach, which combines the linear free energy concept of quantitative structure-activity relationship (QSAR) with the advanced mathematical technique of ANN. The Phys-Chem ANN approach has adopted an iterative and feedback procedure, incorporating both machine-learning and artificial intelligence capabilities. In addition to making more accurate predictions for the bioactivities of proteins and peptides than is possible with the traditional QSAR approach, the Phys-Chem ANN approach can also provide more insights about the relationship between bioactivities and the structures involved than the ANN approach does. As an example of the application of the Phys-Chem ANN approach, a predictive model for the conformational stability of human lysozyme is presented.  相似文献   

8.
9.
10.
Peptide mass fingerprinting (PMF) is a powerful technique for identification of proteins derived from in-gel digests by virtue of their matrix-assisted laser desorption/ionization-time of flight mass spectra. However, there are circumstances where the under-representation of peptides in the mass spectrum and the complexity of the source proteome mean that PMF is inadequate as an identification tool. In this paper, we show that identification is substantially enhanced by inclusion of composition data for a single amino acid. Labelling in vivo with a stable isotope labelled amino acid (in this paper, decadeuterated leucine) identifies the number of such amino acids in each digest fragment, and show a considerable gain in the ability of PMF to identify the parent protein. The method is tolerant to the extent of labelling, and as such, may be applicable to a range of single cell systems.  相似文献   

11.
The molecular sizes of hydrolysates of acetylated and succinylated caseins by pepsin-pancreatin were examined by gel filtration on Sephadex G-15. There were more large peptides (average residual number > 15) in the hydrolysates of the acylated caseins than there were in the hydrolysate of unmodified casein. In these large peptides from the acylated caseins the contents of Nε-acyl-lysines were high. The digestibility of Nε-acetyl and Nε-succinyl lysine bonds in peptides by aminopeptidases [(EC 3.4.11.1) and (EC 3.4.11.2)] and watermelon carboxypeptidase [model enzyme of cathepsin A (EC 3.4.16.1)] was examined using digest of acylated caseins by pepsin, trypsin and α-chymotrypsin and some synthetic peptides. All peptidases released either Nε-acetyl or Nε-succinyl-lysine from peptides.

The hydrolytic processes of acetylated and succinylated proteins before and after intestinal absorption are discussed.  相似文献   

12.
Development of angiotensin I-converting enzyme (ACE, EC 3.4.15.1) inhibitory peptides from food protein is under extensive research as alternative for the prevention of hypertension. However, it is difficult to identify peptides released from food sources. To accelerate the progress of peptide identification, a three layer back propagation neural network model was established to predict the ACE-inhibitory activity of pentapeptides derived from bovine hemoglobin by simulated enzyme digestion. The pentapeptide WTQRF has the best predicted value with experimental IC50 23.93 μM. The potential molecular mechanism of the WTQRF / ACE interaction was investigated by flexible docking.  相似文献   

13.
14.
15.
A two-dimensional (2D) separation method was used to decrease sample complexity in analysis of tryptic peptides from glomerular membrane proteins by tandem mass spectrometry (MS/MS). The first dimension was carried out by electrocapture (EC), which fractionates peptides according to electrophoretic mobility. The second dimension was reverse-phase liquid chromatography (RP-LC), in which EC fractions were further separated and analyzed online by MS/MS. Using this methodology, we now identify 102 glomerular proteins (57 membrane proteins). Many peptides were possible to observe and select for MS/MS only using the 2D approach. Others were detectable in both one-dimensional (1D, without the EC step) and 2D experiments but were selectable for sequence analysis only from the 2D separations because the decrease in complexity then gives time for the mass analyzer to select the peptide and switch to the MS/MS mode. A minority of the peptides were detectable only in the 1D mode (presumably because of handling losses), but at the end this did not decrease the number of proteins identified by the 2D separation. After a database search, the combination of EC and RP-LC MS/MS versus a 1D RP-LC MS/MS separation resulted in a threefold increase in the number of proteins identified and improved the sequence coverage in the identifications, bringing our proteome-identified glomerular proteins to 282.  相似文献   

16.
Peptide mass fingerprinting (PMF) is a valuable method for rapid and high-throughput protein identification using the proteomics approach. Automated search engines, such as Ms-Fit, Mascot, ProFound, and Peptldent, have facilitated protein identification through PMF. The potential to obtain a true MS protein identification result depends on the choice of algorithm as well as experimental factors that influence the information content in MS data. When mass spectral data are incomplete and/or have low mass accuracy, the “number of matches” approach may be inadequate for a useful identification. Several studies have evaluated factors influencing the quality of mass spectrometry (MS) experiments. Missed cleavages, posttranslational modifications of peptides and contaminants (e.g., keratin) are important factors that can affect the results of MS analyses by influencing the identification process as well as the quality of the MS spectra. We compared search engines frequently used to identify proteins fromHomo sapiens andHalobacterium salinarum by evaluating factors, including data-based and mass tolerance to develop an improved search engine for PMF. This study may provide information to help develop a more effective algorithm for protein identification in each species through PMF.  相似文献   

17.
18.
Gay S  Binz PA  Hochstrasser DF  Appel RD 《Proteomics》2002,2(10):1374-1391
Matrix-assisted laser desorption/ionization-time of flight mass spectrometry has become a valuable tool in proteomics. With the increasing acquisition rate of mass spectrometers, one of the major issues is the development of accurate, efficient and automatic peptide mass fingerprinting (PMF) identification tools. Current tools are mostly based on counting the number of experimental peptide masses matching with theoretical masses. Almost all of them use additional criteria such as isoelectric point, molecular weight, PTMs, taxonomy or enzymatic cleavage rules to enhance prediction performance. However, these identification tools seldom use peak intensities as parameter as there is currently no model predicting the intensities based on the physicochemical properties of peptides. In this work, we used standard datamining methods such as classification and regression methods to find correlations between peak intensities and the properties of the peptides composing a PMF spectrum. These methods were applied on a dataset comprising a series of PMF experiments involving 157 proteins. We found that the C4.5 method gave the more informative results for the classification task (prediction of the presence or absence of a peptide in a spectra) and M5' for the regression methods (prediction of the normalized intensity of a peptide peak). The C4.5 result correctly classified 88% of the theoretical peaks; whereas the M5' peak intensities had a correlation coefficient of 0.6743 with the experimental peak intensities. These methods enabled us to obtain decision and model trees that can be directly used for prediction and identification of PMF results. The work performed permitted to lay the foundations of a method to analyze factors influencing the peak intensity of PMF spectra. A simple extension of this analysis could lead to improve the accuracy of the results by using a larger dataset. Additional peptide characteristics or even PMF experimental parameters can also be taken into account in the datamining process to analyze their influence on the peak intensity. Furthermore, this datamining approach can certainly be extended to the tandem mass spectrometry domain or other mass spectrometry derived methods.  相似文献   

19.
When searching for prospective novel peptides, it is difficult to determine the biological activity of a peptide based only on its sequence. The “trial and error” approach is generally laborious, expensive and time consuming due to the large number of different experimental setups required to cover a reasonable number of biological assays. To simulate a virtual model for Hymenoptera insects, 166 peptides were selected from the venoms and hemolymphs of wasps, bees and ants and applied to a mathematical model of multivariate analysis, with nine different chemometric components: GRAVY, aliphaticity index, number of disulfide bonds, total residues, net charge, pI value, Boman index, percentage of alpha helix, and flexibility prediction. Principal component analysis (PCA) with non-linear iterative projections by alternating least-squares (NIPALS) algorithm was performed, without including any information about the biological activity of the peptides. This analysis permitted the grouping of peptides in a way that strongly correlated to the biological function of the peptides. Six different groupings were observed, which seemed to correspond to the following groups: chemotactic peptides, mastoparans, tachykinins, kinins, antibiotic peptides, and a group of long peptides with one or two disulfide bonds and with biological activities that are not yet clearly defined. The partial overlap between the mastoparans group and the chemotactic peptides, tachykinins, kinins and antibiotic peptides in the PCA score plot may be used to explain the frequent reports in the literature about the multifunctionality of some of these peptides. The mathematical model used in the present investigation can be used to predict the biological activities of novel peptides in this system, and it may also be easily applied to other biological systems.  相似文献   

20.
Identification of proteins by mass spectrometry (MS) is an essential step in proteomic studies and is typically accomplished by either peptide mass fingerprinting (PMF) or amino acid sequencing of the peptide. Although sequence information from MS/MS analysis can be used to validate PMF-based protein identification, it may not be practical when analyzing a large number of proteins and when high- throughput MS/MS instrumentation is not readily available. At present, a vast majority of proteomic studies employ PMF. However, there are huge disparities in criteria used to identify proteins using PMF. Therefore, to reduce incorrect protein identification using PMF, and also to increase confidence in PMF-based protein identification without accompanying MS/MS analysis, definitive guiding principles are essential. To this end, we propose a value-based scoring system that provides guidance on evaluating when PMF-based protein identification can be deemed sufficient without accompanying amino acid sequence data from MS/MS analysis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号