Similar literature
 Found 20 similar articles (search time: 31 ms)
1.
For data-independent acquisition by means of sequential window acquisition of all theoretical fragment ion spectra (SWATH), a reference library of data-dependent acquisition (DDA) runs is typically used to correlate the quantitative data from the fragment ion spectra with peptide identifications. The quality and coverage of such a reference library are therefore essential when processing SWATH data. In general, library sizes can be increased by reducing the impact of DDA precursor selection with replicate runs or fractionation. However, these strategies can affect the match between the library and SWATH measurement, and thus larger library sizes do not necessarily correspond to improved SWATH quantification. Here, three fractionation strategies to increase local library size were compared to standard library building using replicate DDA injection: protein SDS-PAGE fractionation, peptide high-pH RP-HPLC fractionation and MS-acquisition gas phase fractionation. The impact of these libraries on SWATH performance was evaluated in terms of the number of extracted peptides and proteins, the match quality of the peptides and the extraction reproducibility of the transitions. These analyses were conducted using the hydrophilic proteome of differentiating human embryonic stem cells. Our results show that SWATH quantitative results and interpretations are affected by the choice of fractionation technique. Data are available via ProteomeXchange with identifier PXD006190.

2.
3.
There are more than 200 completed genomes and over 1 million nonredundant sequences in public repositories. Although the structural data are sparser (approximately 13,000 nonredundant structures solved to date), several powerful sequence-based methodologies now allow these structures to be mapped onto related regions in a significant proportion of genome sequences. We review a number of publicly available strategies for providing structural annotations for genome sequences, and we describe the protocol adopted to provide CATH structural annotations for completed genomes. In particular, we assess the performance of several sequence-based protocols employing Hidden Markov model (HMM) technologies for superfamily recognition, including a new approach (SAMOSA [sequence augmented models of structure alignments]) that exploits multiple structural alignments from the CATH domain structure database when building the models. Using a data set of remote homologs detected by structure comparison and manually validated in CATH, a single-seed HMM library was able to recognize 76% of the data set. Including the SAMOSA models in the HMM library showed little gain in homolog recognition, although a slight improvement in alignment quality was observed for very remote homologs. However, using an expanded 1D-HMM library, CATH-ISL increased the coverage to 86%. The single-seed HMM library has been used to annotate the protein sequences of 120 genomes from all three major kingdoms, allowing up to 70% of the genes or partial genes to be assigned to CATH superfamilies. It has also been used to recruit sequences from Swiss-Prot and TrEMBL into CATH domain superfamilies, expanding the CATH database eightfold.

4.
To search for a common structural motif in the phosphate-binding sites of protein-mononucleotide complexes, we investigated the structural variety of phosphate-binding schemes by an all-against-all comparison of 491 binding sites found in the Protein Data Bank. We found four frequently occurring structural motifs composed of protein atoms interacting with phosphate groups, each of which appears in different protein superfamilies with different folds. The most frequently occurring motif, which we call the structural P-loop, is shared by 13 superfamilies and is characterized by a four-residue fragment, GXXX, interacting with a phosphate group through the backbone atoms. Various sequence motifs, including Walker's A motif or the P-loop, turn out to be a structural P-loop found in a few specific superfamilies. The other three motifs are found in pairs of superfamilies: protein kinase and glutathione synthetase ATPase domain-like, actin-like ATPase domain and nucleotidyltransferase, and FMN-linked oxidoreductase and PRTase.

5.
We describe an experimental procedure to mimic the formation of long (over 40 residues) co-oligopeptide sequences in many identical copies, which may have occurred in prebiotic molecular evolution. The basic hypothesis is that chain formation is based on the stepwise fragment condensation of randomly generated short oligopeptides, whereby the elongation takes place under the contingent environmental constraints (solubility, pH, salinity), which eliminate most of the products and thus determine the selection towards one particular small set of chains. The present work aims at verifying the validity of this scheme. To do so, we utilize a classic synthetic procedure based on the Merrifield solid-phase synthesis of peptides for the synthesis of randomly produced peptides as well as for their stepwise fragment condensation. Thus, starting from a library of peptides with n=10, the first condensation step produces a library of 16 peptides with 20 residues each (n=20), of which only four remain water-soluble and are therefore capable of undergoing the next fragment condensation step. This gives rise to 16 peptides with n=30, out of which twelve precipitate out under the chosen pH and buffer conditions and are eliminated. Finally, a 44-residue-long water-soluble de novo protein is obtained. This has no homologies or similarities with extant proteins and, based on circular dichroism (CD), it assumes a stable three-dimensional fold. In agreement with the CD data, molecular-modelling simulations suggest a helical fold for the protein with poor, if any, structural homology with known proteins. The implication of this procedure as a general mechanism for the etiology of de novo macromolecular sequences and globular proteins in the origin of life is briefly discussed.

6.
7.
Label-free quantitation by measurement of peptide fragment signal intensity (MS2 quantitation) is a technique that has seen limited use due to the stochastic nature of data-dependent acquisition (DDA). However, data-independent acquisition has the potential to make large-scale MS2 quantitation a more viable technique. In this study we used an implementation of data-independent acquisition, SWATH, to perform label-free protein quantitation in a model bacterium, Clostridium stercorarium. Four tryptic digests analyzed by SWATH were probed by an ion library containing information on peptide mass and retention time obtained from DDA experiments. Application of this ion library to SWATH data quantified 1030 proteins with at least two peptides quantified (~40% of predicted proteins in the C. stercorarium genome) in each replicate. Quantitative results obtained were very consistent between biological replicates (R² ~ 0.960). Protein quantitation by summation of peptide fragment signal intensities was also highly consistent between biological replicates (R² ~ 0.930), indicating that this approach may have increased viability compared to recent applications in label-free protein quantitation. SWATH-based quantitation was able to consistently detect differences in relative protein quantity, and it provided coverage for a number of proteins that were missed in some samples by DDA analysis.
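The two quantities this abstract reports, protein abundance obtained by summing fragment intensities and the R² between replicates, can be sketched as follows. This is an illustrative sketch only: the function names, the input tuple layout and the use of a plain (untransformed) Pearson correlation are assumptions, not details taken from the study.

```python
from collections import defaultdict


def protein_intensities(fragments):
    """Sum fragment ion intensities per protein.

    `fragments` is an iterable of (protein_id, peptide, intensity) tuples;
    this layout is a hypothetical one chosen for the example.
    """
    totals = defaultdict(float)
    for protein, _peptide, intensity in fragments:
        totals[protein] += intensity
    return dict(totals)


def r_squared(xs, ys):
    """Squared Pearson correlation between two equally long value lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return (cov * cov) / (var_x * var_y)


def replicate_r_squared(replicate_a, replicate_b):
    """Compare two replicates on the proteins quantified in both."""
    a = protein_intensities(replicate_a)
    b = protein_intensities(replicate_b)
    shared = sorted(a.keys() & b.keys())
    return r_squared([a[p] for p in shared], [b[p] for p in shared])
```

In practice intensities are often log-transformed before correlation; the abstract does not say whether that was done here, so the sketch leaves the values untouched.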

8.
An understanding of structural changes and self-assembly of proteins, which are thought to involve specific peptide–peptide interactions, will contribute to the development of therapeutic agents and diagnostics for the detection of conformational diseases. We hypothesize that certain peptides may contribute to the conformational change of prion proteins. The present paper describes the discovery of prion-related synthetic peptides which influence structural conversion of recombinant bovine prion protein. The peptides designed are prion-protein fragments containing core domains consisting of α-helical (human prion protein fragment 180–195) and known β-sheet (human prion protein fragment 169–175) structures. Additionally, several previously reported β-sheet breaker peptides and a conjugate consisting of β-sheet and α-helix segments based on the secondary structures of human prion protein, designated HPPSH, have been chemically synthesized by the conventional Fmoc solid-phase method and characterized by circular dichroism and the Thioflavin T fluorescence method. Our data indicated that the co-existence of peptides, HPPSH or other prion fragment peptides involving the toxic core sequence (fragment 106–126), influenced the aggregation rate and the lag time of fibril formation of recombinant bovine prion protein, with the exception of the core sequence itself. The method will be used for discovery of responsible material from natural resources. The designed peptides can also be used for bio-detection.

9.
Spectral libraries have emerged as a viable alternative to protein sequence databases for peptide identification. These libraries contain previously detected peptide sequences and their corresponding tandem mass spectra (MS/MS). Search engines can then identify peptides by comparing experimental MS/MS scans to those in the library. Many of these algorithms employ the dot product score for measuring the quality of a spectrum-spectrum match (SSM). This scoring system does not offer a clear statistical interpretation and ignores fragment ion m/z discrepancies in the scoring. We developed a new spectral library search engine, Pepitome, which employs statistical systems for scoring SSMs. Pepitome outperformed the leading library search tool, SpectraST, when analyzing data sets acquired on three different mass spectrometry platforms. We characterized the reliability of spectral library searches by confirming shotgun proteomics identifications through RNA-Seq data. Applying spectral library and database searches on the same sample revealed their complementary nature. Pepitome identifications enabled the automation of quality analysis and quality control (QA/QC) for shotgun proteomics data acquisition pipelines.
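The dot product score that this abstract criticizes can be illustrated with a minimal sketch. It assumes spectra are lists of (m/z, intensity) pairs and uses fixed-width binning; the bin width and function names are hypothetical choices for the example and do not reproduce the scoring of SpectraST or Pepitome.

```python
import math


def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins (bin index -> intensity)."""
    binned = {}
    for mz, intensity in peaks:
        key = int(mz // bin_width)
        binned[key] = binned.get(key, 0.0) + intensity
    return binned


def normalized_dot_product(peaks_a, peaks_b, bin_width=1.0):
    """Cosine similarity between two binned spectra, in the range 0..1."""
    a = bin_spectrum(peaks_a, bin_width)
    b = bin_spectrum(peaks_b, bin_width)
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Two peaks that fall into the same bin contribute fully to the score regardless of how far apart they are inside that bin, which is one way to see why a score of this kind ignores fragment ion m/z discrepancies.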

10.
Conotoxins are small, disulfide-rich peptides that target ion channels and G protein-coupled receptors. They show promise for treating chronic pain, epilepsy, cardiovascular diseases, and other conditions. Conotoxins may be classified into 11 superfamilies (A, D, I1, I2, J, L, M, O, P, S, and T) according to disulfide connectivity, the highly conserved N-terminal precursor sequence, and similar modes of action. Successful prediction of the superfamily of a mature conotoxin peptide is important for understanding the biological and pharmacological functions of the toxins. In this study, a new algorithm combining the increment of diversity with a modified Mahalanobis discriminant is presented to predict five superfamilies using the pseudo amino acid composition. Jackknife cross-validation shows that the overall prediction sensitivity and specificity are 88% and 91%, respectively. The algorithm is also used to predict three O-conotoxin families, with 72% sensitivity and 78% specificity. These results indicate that the conotoxin superfamily peptides correlate with their amino acid compositions.
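The increment of diversity named in this abstract can be sketched under an assumption: the code below uses the commonly cited definition D(X) = N log N − Σ nᵢ log nᵢ and ID(X, Y) = D(X+Y) − D(X) − D(Y), applied to plain residue counts. The abstract does not give its formula, its features (pseudo amino acid composition) or the Mahalanobis step, so none of those are reproduced here, and the class profiles shown in the test are toy data.

```python
import math
from collections import Counter


def diversity(counts):
    """Assumed definition: D(X) = N*log(N) - sum(n_i*log(n_i)) over non-zero counts."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return total * math.log(total) - sum(
        n * math.log(n) for n in counts.values() if n > 0
    )


def increment_of_diversity(counts_x, counts_y):
    """ID(X, Y) = D(X + Y) - D(X) - D(Y); small values mean similar composition."""
    merged = Counter(counts_x) + Counter(counts_y)
    return diversity(merged) - diversity(counts_x) - diversity(counts_y)


def classify(sequence, class_profiles):
    """Pick the class whose pooled residue counts give the smallest increment.

    `class_profiles` maps a label to a Counter of residue counts; it is a
    hypothetical stand-in for a trained superfamily profile.
    """
    query = Counter(sequence)
    return min(
        class_profiles,
        key=lambda label: increment_of_diversity(query, class_profiles[label]),
    )
```

The increment is zero when the two sources have identical proportions and grows as their compositions diverge, which is what makes it usable as a similarity measure between a query peptide and a class.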

11.
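Research progress on rice cysteine proteinase inhibitors   Total citations: 2, self-citations: 0, citations by others: 2
This review covers recent research progress on the rice cysteine proteinase inhibitor oryzacystatin (OC). Cysteine proteinase inhibitors (CPIs) are collectively known as the cystatin superfamily. OCI contains the typical conserved sequence of the CPI family, Glu-Val-Val-Ala-Gly, a segment indispensable for its inhibitory activity. OCII has also been cloned from a rice cDNA library. OCI and OCII are highly homologous in sequence but differ markedly in their inhibitory action on proteinases. OC confers strong resistance against coleopteran insects. With regard to disease resistance, OC can inhibit the growth of rice blast fungus mycelium.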

12.
The design, synthesis and binding affinity for VEGFR-1 receptors of a small library of linear and cyclic analogues of the VEGF(81-91) fragment are described. Cyclic 11- and 10-mer peptide derivatives were prepared using parallel solid-phase protocols. The formation of hydrocarbon alkene-bridged cyclic peptides was achieved through optimized ring-closing metathesis reactions from linear derivatives with conveniently located allylGly residues. Alkane-bridged analogues were successfully obtained by subsequent on-resin hydrogenation. Binding assays showed that some of these compounds were able to compete with labeled VEGF for interaction with the VEGFR-1 receptor. Several peptide derivatives, 2, 7 and 8, showed modest but significant binding affinity, indicating that the designed peptides could mimic the VEGF(81-91) fragment and therefore disrupt the VEGF/VEGFR-1 interaction. This opens the way for using these peptides as a starting point for biological and pharmacological tools to investigate this protein-protein system in depth.

13.
W R Pearson 《Genomics》1991,11(3):635-650
The sensitivity and selectivity of the FASTA and the Smith-Waterman protein sequence comparison algorithms were evaluated using the superfamily classification provided in the National Biomedical Research Foundation/Protein Identification Resource (PIR) protein sequence database. Sequences from each of the 34 superfamilies in the PIR database with 20 or more members were compared against the protein sequence database. The similarity scores of the related and unrelated sequences were determined using either the FASTA program or the Smith-Waterman local similarity algorithm. These two sets of similarity scores were used to evaluate the ability of the two comparison algorithms to identify distantly related protein sequences. The FASTA program using the ktup = 2 sensitivity setting performed as well as the Smith-Waterman algorithm for 19 of the 34 superfamilies. Increasing the sensitivity by setting ktup = 1 allowed FASTA to perform as well as Smith-Waterman on an additional 7 superfamilies. The rigorous Smith-Waterman method performed better than FASTA with ktup = 1 on 8 superfamilies, including the globins, immunoglobulin variable regions, calmodulins, and plastocyanins. Several strategies for improving the sensitivity of FASTA were examined. The greatest improvement in sensitivity was achieved by optimizing a band around the best initial region found for every library sequence. For every superfamily except the globins and immunoglobulin variable regions, this strategy was as sensitive as a full Smith-Waterman. For some sequences, additional sensitivity was achieved by including conserved but nonidentical residues in the lookup table used to identify the initial region.
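For readers unfamiliar with the Smith-Waterman local similarity algorithm compared in this abstract, a minimal scoring sketch follows. It assumes a simple match/mismatch scheme with a linear gap penalty; the study itself would have used a protein substitution matrix, and the parameter values below are illustrative assumptions only.

```python
def smith_waterman_score(seq_a, seq_b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between two sequences.

    Uses the standard recurrence
        H[i][j] = max(0, H[i-1][j-1] + s(a_i, b_j), H[i-1][j] + gap, H[i][j-1] + gap)
    and keeps only the previous row, so memory is O(len(seq_b)).
    """
    prev = [0] * (len(seq_b) + 1)
    best = 0
    for ch_a in seq_a:
        curr = [0]
        for j, ch_b in enumerate(seq_b, start=1):
            diag = prev[j - 1] + (match if ch_a == ch_b else mismatch)
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best
```

The floor at zero is what makes the alignment local: a poorly matching stretch resets the score instead of dragging down a good region elsewhere. Filling the whole matrix for every library sequence is the cost that the banded optimization described in the abstract tries to avoid.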

14.
Isolation of a gene encoding a glycosylated cytokinin oxidase from maize   Total citations: 23, self-citations: 0, citations by others: 23
The major cytokinin oxidase in immature maize kernels was purified to homogeneity. Selected tryptic peptides were used to design degenerate oligonucleotide primers for PCR isolation of a fragment of the oxidase gene. Hybridization of the PCR fragment to a maize genomic library allowed isolation of a full-length cytokinin oxidase gene (ckx1). The gene encodes a protein of approximately 57 kDa that possesses a signal peptide, eight consensus N-glycosylation sequences and a consensus FAD binding sequence. Expression of ckx1 in Pichia caused secretion of active glycosylated cytokinin oxidase that contains a substrate-reducible FAD. The gene displays sequence homology with a putative oxidoreductase from Arabidopsis thaliana and with the fas5 gene from Rhodococcus fascians.

15.
Bovine P2 Protein: Sequence at the NH2-Terminal of the Protein   Total citations: 2, self-citations: 2, citations by others: 0
Sequence data from key fragments of the P2 protein established the order of cyanogen bromide (CNBr) peptides in the structure of the protein and the primary structure for approximately one-half of the molecule. Data were obtained from the three tryptic peptides of the blocked NH2-terminal CNBr peptide (CN3), the large CNBr peptide of P2 protein (CN1), and a fragment obtained from P2 by cleavage at tryptophan with 2-(2-nitrophenylsulfenyl)-3-methyl-3'-bromoindolenine. This last fragment was found to contain an overlapping sequence that proved the juxtaposition of CN1 and CN3 in P2 protein. Thus, based on this fact and the characteristics of the CNBr peptides, the P2 structure is composed of CNBr peptides in the order: CN3-CN1-CN2(Val)-CN2(Lys). A comparison was made between the partial sequence of P2 protein and the equivalent portion of the structure of bovine myelin basic protein. The structures of these two proteins were found to be distinctly different, although certain similarities were found.

16.
The authors present fragment screening data obtained using a label-free parallel analysis approach where the binding of fragment library compounds to 4 different target proteins can be screened simultaneously using surface plasmon resonance detection. They suggest this method as a first step in fragment screening to identify and select binders, reducing the demanding requirements on subsequent X-ray or nuclear magnetic resonance studies, and as a valuable "clean-up" tool to eliminate unwanted promiscuous binders from libraries. A small directed fragment library of known thrombin binders and a general 500-compound fragment library were used in this study. Thrombin, blocked thrombin, carbonic anhydrase, and glutathione-S-transferase were immobilized on the sensor chip surface, and the direct binding of the fragments was studied in real time. Only 12 µg of each protein is needed for screening of a 3000-compound fragment library. For screening, a binding site-blocked target used as a reference facilitates the identification of binding site-selective hits, and the signals from other reference proteins facilitate the elimination of false positives. The scope and limitations of this screening approach are discussed for both target-directed and general fragment libraries.

17.
Evolution of function in protein superfamilies, from a structural perspective   Total citations: 29, self-citations: 0, citations by others: 29
The recent growth in protein databases has revealed the functional diversity of many protein superfamilies. We have assessed the functional variation of homologous enzyme superfamilies containing two or more enzymes, as defined by the CATH protein structure classification, by way of the Enzyme Commission (EC) scheme. Combining sequence and structure information to identify relatives, the majority of superfamilies display variation in enzyme function, with 25% of superfamilies in the PDB having members of different enzyme types. We determined the extent of functional similarity at different levels of sequence identity for 486,000 homologous pairs (enzyme/enzyme and enzyme/non-enzyme), with structural and sequence relatives included. For single and multi-domain proteins, variation in EC number is rare above 40% sequence identity, and above 30%, the first three digits may be predicted with an accuracy of at least 90%. For more distantly related proteins sharing less than 30% sequence identity, functional variation is significant, and below this threshold, structural data are essential for understanding the molecular basis of observed functional differences. To explore the mechanisms for generating functional diversity during evolution, we have studied in detail 31 diverse structural enzyme superfamilies for which structural data are available. A large number of variations and peculiarities are observed, at the atomic level through to gross structural rearrangements. Almost all superfamilies exhibit functional diversity generated by local sequence variation and domain shuffling. Commonly, substrate specificity is diverse across a superfamily, whilst the reaction chemistry is maintained. In many superfamilies, the position of catalytic residues may vary despite playing equivalent functional roles in related proteins. The implications of functional diversity within superfamilies for the structural genomics projects are discussed.
More detailed information on these superfamilies is available at http://www.biochem.ucl.ac.uk/bsm/FAM-EC/.
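The comparison of EC numbers described in this abstract (for example, whether "the first three digits" agree between two homologues) can be sketched as a count of shared leading levels. The function name and the treatment of an unassigned level written as "-" are assumptions made for illustration, not details from the paper.

```python
def shared_ec_levels(ec_a, ec_b):
    """Number of leading EC levels (0 to 4) on which two EC numbers agree.

    EC numbers are dotted strings such as "3.4.21.5". An unassigned level
    written as "-" is treated as a non-match (an assumption of this sketch).
    """
    shared = 0
    for level_a, level_b in zip(ec_a.split("."), ec_b.split(".")):
        if level_a != level_b or level_a == "-":
            break
        shared += 1
    return shared


def same_function_to_level(ec_a, ec_b, level=3):
    """True if the two enzymes agree on at least `level` leading EC digits."""
    return shared_ec_levels(ec_a, ec_b) >= level
```

Under this reading, a pair such as "1.1.1.1" and "1.1.1.2" conserves the reaction chemistry to three digits while differing in the fourth.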

18.
We herein report recent advances in our understanding of transport protein evolution. Numerous families of complex transmembrane transport proteins are believed to have arisen from short channel-forming amphipathic or hydrophobic peptides by various types of intragenic duplication events. Distinct pathways distinguish families, demonstrating independent origins for some, and allowing assignment of others to superfamilies. Some families have diversified in topology, whereas others have remained uniform. An example of 'retroevolution' was discovered where a more complex carrier gave rise to a structurally and functionally simpler channel. The results described in this review article expand our understanding of protein evolution.

19.
Quite recently, a few antibodies against bulk material surfaces have been selected from a human repertoire antibody library, and they are attracting immense interest for the bottom-up integration of nanomaterials. Here, we constructed antibody fragments with binding affinity and specificity for nonbiological inorganic material surfaces by grafting material-binding peptides into loops of the complementarity determining region (CDR) of antibodies. Loops were replaced by peptides with affinity for zinc oxide and silver material surfaces. Selection of the CDR loop for replacement was critical to the functionalization of the grafted fragments; grafting the material-binding peptide into the CDR2 loop gave the antibody fragments the same affinity and selectivity as the peptides used. Structural insight into the scaffold fragment used implies that the material-binding peptide should be grafted onto the most exposed CDR loop of the scaffold fragment. We show that the CDR-grafting technique enables the stepwise construction of antibodies with affinity for nonbiological materials.

20.
Conotoxins are small, disulfide-rich peptides that target a broad spectrum of ion channels and neuronal receptors. They offer promising avenues in the treatment of chronic pain, epilepsy and cardiovascular diseases. Assignment of newly sequenced mature conotoxins into appropriate superfamilies using a computational approach could provide valuable preliminary information on the biological and pharmacological functions of the toxins. However, creation of protein sequence patterns for the reliable identification and classification of new conotoxin sequences may not be effective due to the hypervariability of mature toxins. With the aim of formulating an in silico approach for the classification of conotoxins into superfamilies, we have incorporated the concept of pseudo-amino acid composition to represent a peptide in a mathematical framework that includes the sequence-order effect along with conventional amino acid composition. The polarity index attribute, which encodes information such as residue surface buriability, polarity, and hydropathy, was used to store the sequence-order effect. Several methods, including BLAST, the ISort (Intimate Sorting) predictor, the least Hamming distance algorithm, the least Euclidean distance algorithm and multi-class support vector machines (SVMs), were explored for superfamily identification. The SVMs outperform the other methods, providing an overall accuracy of 88.1% for all correct predictions with a generalized squared correlation of 0.75, using a jackknife cross-validation test for the A, M, O and T superfamilies and a negative set consisting of short cysteine-rich sequences from different eukaryotes having diverse functions. The computed sensitivity and specificity for the superfamilies were found to be in the range of 84.0-94.1% and 80.0-95.5%, respectively, attesting to the efficacy of multi-class SVMs for the successful in silico classification of the conotoxins into their superfamilies.
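To make the composition-based classification idea in this abstract concrete, here is a minimal sketch that represents a peptide by its conventional 20-component amino acid composition and assigns it to the class with the nearest centroid in Euclidean distance. This is an assumption-laden simplification: it omits the sequence-order (pseudo-amino acid) terms and the polarity index, the nearest-centroid rule is only one possible reading of a "least Euclidean distance" method, and the reference sequences in the test are toy strings rather than real conotoxins.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence):
    """Fraction of each of the 20 standard amino acids in a sequence."""
    counts = [sequence.count(aa) for aa in AMINO_ACIDS]
    total = sum(counts)
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [c / total for c in counts]


def euclidean(u, v):
    """Euclidean distance between two equally long vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def nearest_class(sequence, reference):
    """Label of the class whose mean composition is closest to the query.

    `reference` maps a label to a non-empty list of example sequences
    (hypothetical training data for this sketch).
    """
    query = composition(sequence)
    best_label, best_distance = None, math.inf
    for label, examples in reference.items():
        vectors = [composition(s) for s in examples]
        centroid = [sum(column) / len(vectors) for column in zip(*vectors)]
        distance = euclidean(query, centroid)
        if distance < best_distance:
            best_label, best_distance = label, distance
    return best_label
```

A pure composition vector discards residue order entirely, which is the gap that the pseudo-amino acid composition described in the abstract is meant to fill.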
