Similar literature
Found 20 similar articles (search time: 406 ms)
1.
2.
Mathematical tools developed in the context of Shannon information theory were used to analyze the meaning of the BLOSUM score, which was split into three components termed the BLOSUM spectrum (or BLOSpectrum). These relate respectively to the sequence convergence (the stochastic similarity of the two protein sequences), to the background frequency divergence (typicality of the amino acid probability distribution in each sequence), and to the target frequency divergence (compliance of the amino acid variations between the two sequences with the protein model implicit in the BLOCKS database). This treatment sharpens protein sequence comparison, provides a rationale for the biological significance of the obtained score, and helps to identify weakly related sequences. Moreover, the BLOSpectrum can guide the choice of the most appropriate scoring matrix, tailoring it to the evolutionary divergence associated with the two sequences, or indicate whether a compositionally adjusted matrix could perform better.
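For orientation, a BLOSUM-style score is a scaled log-odds ratio of target to background frequencies. The following is a minimal Python sketch under stated assumptions: the two-letter alphabet, the frequencies, and the function names are all hypothetical illustrations, not values from the BLOCKS database, and the sketch does not implement the BLOSpectrum decomposition itself.

```python
import math

def log_odds_scores(target, background, scale=2.0):
    """Return s[(a, b)] = scale * log2(q_ab / (p_a * p_b)).

    With scale=2.0 the scores are in half-bit units.
    """
    return {
        (a, b): scale * math.log2(q / (background[a] * background[b]))
        for (a, b), q in target.items()
    }

def relative_entropy(target, background):
    """Expected score per aligned pair, in bits: sum of q_ab * log2(q_ab / (p_a * p_b))."""
    return sum(
        q * math.log2(q / (background[a] * background[b]))
        for (a, b), q in target.items()
    )

# Hypothetical toy alphabet {A, B}; all frequencies are made up for illustration.
background = {"A": 0.5, "B": 0.5}
target = {("A", "A"): 0.4, ("A", "B"): 0.1, ("B", "A"): 0.1, ("B", "B"): 0.4}

scores = log_odds_scores(target, background)
H = relative_entropy(target, background)
```

In this toy setting, identical pairs occur more often than chance and receive positive scores, while mismatched pairs receive negative ones; `H` is the average information per aligned position.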

3.
4.
A decoding algorithm is tested that mechanistically models the progressive alignments that arise as the mRNA moves past the rRNA tail during translation elongation. Each of these alignments provides an opportunity for hybridization between the single-stranded, 3′-terminal nucleotides of the 16S rRNA and the spatially accessible window of mRNA sequence, from which a free energy value can be calculated. Using this algorithm, we show that a periodic energetic pattern of frequency 1/3 is revealed. This periodic signal exists in the majority of coding regions of eubacterial genes, but not in the non-coding regions encoding the 16S and 23S rRNAs. Signal analysis reveals that the population of coding regions of each bacterial species has a mean phase that is correlated in a statistically significant way with species (G+C) content. These results suggest that the periodic signal could function as a synchronization signal for the maintenance of reading frame and that codon usage provides a mechanism for manipulation of signal phase.
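A periodicity of frequency 1/3 in a per-nucleotide series can be checked with a single Fourier coefficient. The sketch below is a generic illustration, not the authors' decoding algorithm: the function name and the toy "free-energy-like" series are assumptions made for the example.

```python
import cmath

def power_at_frequency(signal, freq):
    """Normalized squared DFT magnitude of the mean-centered signal at `freq` (cycles per sample)."""
    n = len(signal)
    mean = sum(signal) / n
    acc = sum(
        (x - mean) * cmath.exp(-2j * cmath.pi * freq * k)
        for k, x in enumerate(signal)
    )
    return abs(acc) ** 2 / n

# Hypothetical series that repeats every three positions (one value per nucleotide).
periodic = [-1.0, 0.0, 0.5] * 20
flat = [0.3] * 60

p3 = power_at_frequency(periodic, 1 / 3)       # large: the series has period 3
p4 = power_at_frequency(periodic, 1 / 4)       # near zero: no period-4 component
p3_flat = power_at_frequency(flat, 1 / 3)      # near zero: constant series
```

A strong value at 1/3 relative to neighbouring frequencies is the kind of signature the abstract describes for coding regions.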

5.
6.
7.
8.
9.
10.
11.
A Boolean network is a model used to study the interactions between different genes in genetic regulatory networks. In this paper, we present several algorithms using gene ordering and feedback vertex sets to identify singleton attractors and small attractors in Boolean networks. We analyze the average-case time complexities of some of the proposed algorithms. For instance, it is shown that the outdegree-based ordering algorithm for finding singleton attractors works in O(1.19^n) time for K = 2, which is much faster than the naive O(2^n) time algorithm, where n is the number of genes and K is the maximum indegree. We performed extensive computational experiments on these algorithms, which resulted in good agreement with theoretical results. In contrast, we give a simple and complete proof showing that finding an attractor with the shortest period is NP-hard.
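To make the notion of a singleton attractor concrete, here is a minimal Python sketch of the naive exhaustive search that the abstract uses as its baseline: it checks every one of the 2^n states for being a fixed point. It is not the outdegree-based ordering algorithm from the paper, and the three-gene network and function name are hypothetical.

```python
from itertools import product

def singleton_attractors(update_functions):
    """Naive search over all 2^n states: return states x with f_i(x) == x_i for every gene i."""
    n = len(update_functions)
    fixed = []
    for state in product((0, 1), repeat=n):
        if all(f(state) == state[i] for i, f in enumerate(update_functions)):
            fixed.append(state)
    return fixed

# Hypothetical 3-gene network with maximum indegree 2.
rules = [
    lambda s: s[1] and s[2],  # gene 0 <- gene 1 AND gene 2
    lambda s: s[0],           # gene 1 <- gene 0
    lambda s: s[0] or s[2],   # gene 2 <- gene 0 OR gene 2
]

attractors = singleton_attractors(rules)
```

Ordering-based methods aim to discard partial assignments early rather than testing every complete state as this sketch does.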

12.
13.
14.
A complete understanding of the biological functions of large signaling peptides (>4 kDa) requires comprehensive characterization of their amino acid sequences and post-translational modifications, which presents significant analytical challenges. In the past decade, there has been great success with mass spectrometry-based de novo sequencing of small neuropeptides. However, these approaches are less applicable to larger neuropeptides because of the inefficient fragmentation of peptides larger than 4 kDa and their lower endogenous abundance. The conventional proteomics approach focuses on large-scale determination of protein identities via database searching, lacking the ability for in-depth elucidation of individual amino acid residues. Here, we present a multifaceted MS approach for identification and characterization of large crustacean hyperglycemic hormone (CHH)-family neuropeptides, a class of peptide hormones that play central roles in the regulation of many important physiological processes of crustaceans. Six crustacean CHH-family neuropeptides (8–9.5 kDa), including two novel peptides with extensive disulfide linkages and PTMs, were fully sequenced without reference to genomic databases. High-definition de novo sequencing was achieved by a combination of bottom-up, off-line top-down, and on-line top-down tandem MS methods. Statistical evaluation indicated that these methods provided complementary information for sequence interpretation and increased the local identification confidence of each amino acid. Further investigations by MALDI imaging MS mapped the spatial distribution and colocalization patterns of various CHH-family neuropeptides in the neuroendocrine organs, revealing that two CHH-subfamilies are involved in distinct signaling pathways.
Neuropeptides and hormones comprise a diverse class of signaling molecules involved in numerous essential physiological processes, including analgesia, reward, food intake, learning and memory (1). 
Disorders of the neurosecretory and neuroendocrine systems influence many pathological processes. For example, obesity results from failure of energy homeostasis in association with endocrine alterations (2, 3). Previous work from our lab, using crustaceans as model organisms, found that multiple neuropeptides were implicated in control of food intake, including RFamides, tachykinin-related peptides, RYamides, and pyrokinins (4–6).
Crustacean hyperglycemic hormone (CHH) family neuropeptides play a central role in energy homeostasis of crustaceans (7–17). The hyperglycemic response of the CHHs was first reported after injection of crude eyestalk extract in crustaceans. Based on their preprohormone organization, the CHH family can be grouped into two subfamilies: subfamily-I containing CHH, and subfamily-II containing molt-inhibiting hormone (MIH) and mandibular organ-inhibiting hormone (MOIH). The preprohormones of subfamily-I have a CHH precursor-related peptide (CPRP) that is cleaved off during processing, whereas the preprohormones of subfamily-II lack the CPRP (9). Uncovering their physiological functions will provide new insights into neuroendocrine regulation of energy homeostasis.
Characterization of CHH-family neuropeptides is challenging. They comprise more than 70 amino acids and often contain multiple post-translational modifications (PTMs) and complex disulfide bridge connections (7). In addition, physiological concentrations of these peptide hormones are typically below the picomolar level, and most crustacean species do not have available genome and proteome databases to assist MS-based sequencing.
MS-based neuropeptidomics provides a powerful tool for rapid discovery and analysis of a large number of endogenous peptides from the brain and the central nervous system. Our group and others have greatly expanded the peptidomes of many model organisms (3, 18–33). 
For example, we have discovered more than 200 neuropeptides, with several neuropeptide families consisting of as many as 20–40 members, in a simple crustacean model system (5, 6, 25–31, 34). However, a majority of these neuropeptides are small peptides 5–15 amino acid residues long, leaving a gap in identifying larger signaling peptides from organisms without sequenced genomes. The observed lack of larger peptide hormones can be attributed to the lack of effective de novo sequencing strategies for neuropeptides larger than 4 kDa, which are inherently more difficult to fragment using conventional techniques (34–37). Although classical proteomics studies examine larger proteins, these tools are limited to identification based on database searching with one or more matching peptides, without complete amino acid sequence coverage (36, 38).
Large populations of neuropeptides from 4–10 kDa exist in the nervous systems of both vertebrates and invertebrates (9, 39, 40). Understanding their functional roles requires sufficient molecular knowledge and a unique analytical approach. Therefore, developing effective and reliable methods for de novo sequencing of large neuropeptides at the individual amino acid residue level is an urgent need in neurobiology. In this study, we present a multifaceted MS strategy aimed at high-definition de novo sequencing and comprehensive characterization of the CHH-family neuropeptides in the crustacean central nervous system. The high-definition de novo sequencing was achieved by a combination of three methods: (1) enzymatic digestion and LC-tandem mass spectrometry (MS/MS) bottom-up analysis to generate detailed sequences of proteolytic peptides; (2) off-line LC fractionation and subsequent top-down MS/MS to obtain high-quality fragmentation maps of intact peptides; and (3) on-line LC coupled to top-down MS/MS to allow rapid sequence analysis of low-abundance peptides. 
Combining the three methods overcomes the limitations of each, and thus offers complementary and high-confidence determination of amino acid residues. We report the complete sequence analysis of six CHH-family neuropeptides including the discovery of two novel peptides. With the accurate molecular information, MALDI imaging and ion mobility MS were conducted for the first time to explore their anatomical distribution and biochemical properties.

15.
Cysteine proteases of the papain superfamily are implicated in a number of cellular processes and are important virulence factors in the pathogenesis of parasitic disease. These enzymes have therefore emerged as promising targets for antiparasitic drugs. We report the crystal structures of three major parasite cysteine proteases, cruzain, falcipain-3, and the first reported structure of rhodesain, in complex with a class of potent, small molecule, cysteine protease inhibitors, the vinyl sulfones. These data, in conjunction with comparative inhibition kinetics, provide insight into the molecular mechanisms that drive cysteine protease inhibition by vinyl sulfones, the binding specificity of these important proteases and the potential of vinyl sulfones as antiparasitic drugs.
Sleeping sickness (African trypanosomiasis), caused by Trypanosoma brucei, and malaria, caused by Plasmodium falciparum, are significant parasitic diseases of sub-Saharan Africa (1). Chagas' disease (South American trypanosomiasis), caused by Trypanosoma cruzi, affects approximately 16–18 million people in South and Central America. For all three of these protozoan diseases, resistance and toxicity to current therapies make treatment increasingly problematic, and thus the development of new drugs is an important priority (2–4).
T. cruzi, T. brucei, and P. falciparum produce an array of potential target enzymes implicated in pathogenesis and host cell invasion, including a number of essential and closely related papain-family cysteine proteases (5, 6). Inhibitors of cruzain and rhodesain, major cathepsin L-like papain-family cysteine proteases of T. cruzi and T. brucei rhodesiense (7–10), display considerable antitrypanosomal activity (11, 12), and some classes have been shown to cure T. cruzi infection in mouse models (11, 13, 14).
In P. falciparum, the papain-family cysteine proteases falcipain-2 (FP-2) and falcipain-3 (FP-3) are known to catalyze the proteolysis of host hemoglobin, a process that is essential for the development of erythrocytic parasites (15–17). Specific inhibitors, targeted to both enzymes, display antiplasmodial activity (18). However, although the abnormal phenotype of FP-2 knock-outs is “rescued” during later stages of trophozoite development (17), FP-3 has proved recalcitrant to gene knock-out (16), suggesting a critical function for this enzyme and underscoring its potential as a drug target.
Sequence analyses and substrate profiling identify cruzain, rhodesain, and FP-3 as cathepsin L-like, and several studies describe classes of small molecule inhibitors that target multiple cathepsin L-like cysteine proteases, some with overlapping antiparasitic activity (19–22). Among these small molecules, vinyl sulfones have been shown to be effective inhibitors of a number of papain family-like cysteine proteases (19, 23–27). Vinyl sulfones have many desirable attributes, including selectivity for cysteine proteases over serine proteases, stable inactivation of the target enzyme, and relative inertness in the absence of the protease target active site (25). This class has also been shown to have desirable pharmacokinetic and safety profiles in rodents, dogs, and primates (28, 29). We have determined the crystal structures of cruzain, rhodesain, and FP-3 bound to vinyl sulfone inhibitors and performed inhibition kinetics for each enzyme. Our results highlight key areas of interaction between proteases and inhibitors. These results help validate the vinyl sulfones as a class of antiparasitic drugs and provide structural insights to facilitate the design or modification of other small molecule inhibitor scaffolds.

16.
17.
18.
19.
Glycoprotein structure determination and quantification by MS requires efficient isolation of glycopeptides from a proteolytic digest of complex protein mixtures. Here we show that the use of acids as ion-pairing reagents in normal-phase chromatography (IP-NPLC) considerably increases the hydrophobicity differences between non-glycopeptides and glycopeptides, thereby resulting in the reproducible isolation of N-linked high mannose type and sialylated glycopeptides from the tryptic digest of a ribonuclease B and fetuin mixture. The elution order of non-glycopeptides relative to glycopeptides in IP-NPLC is predictable by their hydrophobicity values calculated using the Wimley-White water/octanol hydrophobicity scale. O-linked glycopeptides can be efficiently isolated from fetuin tryptic digests using IP-NPLC when N-glycans are first removed with PNGase. IP-NPLC recovers close to 100% of bacterial N-linked glycopeptides modified with non-sialylated heptasaccharides from tryptic digests of periplasmic protein extracts from Campylobacter jejuni 11168 and its pglD mutant. Label-free nano-flow reversed-phase LC-MS is used for quantification of differentially expressed glycopeptides from the C. jejuni wild-type and pglD mutant followed by identification of these glycoproteins using multiple stage tandem MS. This method further confirms the acetyltransferase activity of PglD and demonstrates for the first time that heptasaccharides containing monoacetylated bacillosamine are transferred to proteins in both the wild-type and mutant strains. We believe that IP-NPLC will be a useful tool for quantitative glycoproteomics.
Protein glycosylation is a biologically significant and complex post-translational modification, involved in cell-cell and receptor-ligand interactions (1–4). In fact, clinical biomarkers and therapeutic targets are often glycoproteins (5–9). 
Comprehensive glycoprotein characterization, involving glycosylation site identification, glycan structure determination, site occupancy, and glycan isoform distribution, is a technical challenge, particularly for quantitative profiling of complex protein mixtures (10–13). Both N- and O-glycans are structurally heterogeneous (i.e. a single site may have different glycans attached or be only partially occupied). Therefore, the MS1 signals from glycopeptides originating from a glycoprotein are often weaker than those from non-glycopeptides. In addition, the ionization efficiency of glycopeptides is low compared with that of non-glycopeptides and is often suppressed in the presence of non-glycopeptides (11–13). When the MS signals of glycopeptides are relatively high in simple protein digests, diagnostic sugar oxonium ion fragments produced by, for example, front-end collisional activation can be used to detect them. However, when peptides and glycopeptides co-elute, parent ion scanning is required to selectively detect the glycopeptides (14). This can be problematic in terms of sensitivity, especially for detecting glycopeptides in digests of complex protein extracts.
Isolation of glycopeptides from proteolytic digests of complex protein mixtures can greatly enhance the MS signals of glycopeptides using reversed-phase LC-ESI-MS (RPLC-ESI-MS) or MALDI-MS (15–24). Hydrazide chemistry is used to isolate, identify, and quantify N-linked glycopeptides effectively, but this method involves lengthy chemical procedures and does not preserve the glycan moieties, thereby losing valuable information on glycan structure and site occupancy (15–17). Capturing glycopeptides with lectins has been widely used, but restricted specificities and nonspecific binding are major drawbacks of this method (18–21). Under reversed-phase LC conditions, glycopeptides from tryptic digests of gel-separated glycoproteins have been enriched using graphite powder medium (22). 
In this case, however, a second digestion with proteinase K is required for trimming down the peptide moieties of tryptic glycopeptides so that the glycopeptides (typically <5 amino acid residues) essentially resemble the glycans with respect to hydrophilicity for subsequent separation. Moreover, the short peptide sequences of the proteinase K digest are often inadequate for de novo sequencing of the glycopeptides.
Glycopeptide enrichment under normal-phase LC (NPLC) conditions has been demonstrated using various hydrophilic media and different capture and elution conditions (23–28). NPLC allows either direct enrichment of peptides modified by various N-linked glycan structures using a ZIC®-HILIC column (23–27) or targeting sialylated glycopeptides using a titanium dioxide micro-column (28). However, NPLC is neither effective for enriching less hydrophilic glycopeptides, e.g. the five high mannose type glycopeptides modified by 7–11 monosaccharide units from a tryptic digest of ribonuclease B (RNase B), nor for enriching O-linked glycopeptides of bovine fetuin using a ZIC-HILIC column (23). The use of Sepharose medium for enriching glycopeptides yielded only modest recovery of glycopeptides (28). In addition, binding of hydrophilic non-glycopeptides to these hydrophilic media contaminates the enriched glycopeptides (23, 28).
We have recently developed an ion-pairing normal-phase LC (IP-NPLC) method to enrich glycopeptides from complex tryptic digests using Sepharose medium and salts or bases as ion-pairing reagents (29). Though reasonably effective, the technique still left room for significant improvement. For example, the method demonstrated relatively modest glycopeptide selectivity, providing only 16% recovery for high mannose type glycopeptides (29). Here we report on a new IP-NPLC method using acids as ion-pairing reagents and polyhydroxyethyl aspartamide (A) as the stationary phase for the effective isolation of tryptic glycopeptides. 
The method was developed and evaluated using a tryptic digest of RNase B and fetuin mixture. In addition, we demonstrate that O-linked glycopeptides can be effectively isolated from a fetuin tryptic digest by IP-NPLC after removal of the N-linked glycans by PNGase F.
The new IP-NPLC method was used to enrich N-linked glycopeptides from the tryptic digests of protein extracts of wild-type (wt) and PglD mutant strains of Campylobacter jejuni NCTC 11168. C. jejuni has a unique N-glycosylation system that glycosylates periplasmic and inner membrane proteins containing the extended N-linked sequon, D/E-X-N-X-S/T, where X is any amino acid other than proline (30–32). The N-linked glycan of C. jejuni has been previously determined to be GalNAc-α1,4-GalNAc-α1,4-[Glcβ1,3]-GalNAc-α1,4-GalNAc-α1,4-GalNAc-α1,3-Bac-β1 (BacGalNAc5Glc residue mass: 1406 Da), where Bac is 2,4-diacetamido-2,4,6-trideoxyglucopyranose (30). In addition, the glycan structure of C. jejuni is conserved, unlike in eukaryotic systems (30–32). IP-NPLC recovered close to 100% of the bacterial N-linked glycopeptides with virtually no contamination of non-glycopeptides. Furthermore, we demonstrate for the first time that acetylation of bacillosamine is incomplete in the wt using IP-NPLC and label-free MS.
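The extended sequon D/E-X-N-X-S/T (X not proline) mentioned above maps directly onto a regular expression. The following Python sketch is an illustration only: the function name and the test sequence are hypothetical, and it is not part of the authors' workflow.

```python
import re

# Extended bacterial N-linked sequon D/E-X-N-X-S/T, where X is any residue except proline.
# The lookahead lets overlapping motifs be reported as well.
SEQUON = re.compile(r"(?=([DE][^P]N[^P][ST]))")

def find_sequons(protein):
    """Return (0-based index of the Asn, matched motif) for each sequon in a one-letter sequence."""
    return [(m.start() + 2, m.group(1)) for m in SEQUON.finditer(protein)]

# Hypothetical sequence: one valid sequon (DANGT) and two candidates spoiled by proline.
hits = find_sequons("MKDANGTLLEPNASDQNPT")
```

Here `EPNAS` and `DQNPT` are rejected because a proline occupies one of the X positions, so only the asparagine in `DANGT` is reported.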

20.
Helicobacter pylori CagA plays a key role in gastric carcinogenesis. Upon delivery into gastric epithelial cells, CagA binds and deregulates SHP-2 phosphatase, a bona fide oncoprotein, thereby causing sustained ERK activation and impaired focal adhesions. CagA also binds and inhibits PAR1b/MARK2, one of the four members of the PAR1 family of kinases, to elicit epithelial polarity defect. In nonpolarized gastric epithelial cells, CagA induces the hummingbird phenotype, an extremely elongated cell shape characterized by a rear retraction defect. This morphological change is dependent on CagA-deregulated SHP-2 and is thus thought to reflect the oncogenic potential of CagA. In this study, we investigated the role of the PAR1 family of kinases in the hummingbird phenotype. We found that CagA binds not only PAR1b but also other PAR1 isoforms, with order of strength as follows: PAR1b > PAR1d ≥ PAR1a > PAR1c. Binding of CagA with PAR1 isoforms inhibits the kinase activity. This abolishes the ability of PAR1 to destabilize microtubules and thereby promotes disassembly of focal adhesions, which contributes to the hummingbird phenotype. Consistently, PAR1 knockdown potentiates induction of the hummingbird phenotype by CagA. The morphogenetic activity of CagA was also found to be augmented through inhibition of non-muscle myosin II. Because myosin II is functionally associated with PAR1, perturbation of PAR1-regulated myosin II by CagA may underlie the defect of rear retraction in the hummingbird phenotype. Our findings reveal that CagA systemically inhibits PAR1 family kinases and indicate that malfunctioning of microtubules and myosin II by CagA-mediated PAR1 inhibition cooperates with deregulated SHP-2 in the morphogenetic activity of CagA.
Infection with cagA (cytotoxin-associated gene A)-positive Helicobacter pylori strains is the strongest risk factor for the development of gastric carcinoma, the second leading cause of cancer-related death worldwide (1–3). 
The cagA gene is located within a 40-kb DNA fragment, termed the cag pathogenicity island, which is specifically present in the genome of cagA-positive H. pylori strains (4–6). In addition to cagA, there are ∼30 genes in the cag pathogenicity island, many of which encode a bacterial type IV secretion system that delivers the cagA-encoded CagA protein into gastric epithelial cells (7–10). Upon delivery into gastric epithelial cells, CagA is localized to the plasma membrane, where it undergoes tyrosine phosphorylation at the C-terminal Glu-Pro-Ile-Tyr-Ala motifs by Src family kinases or c-Abl kinase (11–14). The C-terminal Glu-Pro-Ile-Tyr-Ala-containing region of CagA is noted for the structural diversity among distinct H. pylori isolates. Oncogenic potential of CagA has recently been confirmed by a study showing that systemic expression of CagA in mice induces gastrointestinal and hematological malignancies (15).
When expressed in gastric epithelial cells, CagA induces morphological transformation termed the hummingbird phenotype, which is characterized by the development of one or two long and thin protrusions resembling the beak of the hummingbird. It has been thought that the hummingbird phenotype is related to the oncogenic action of CagA (7, 16–19). Pathophysiological relevance for the hummingbird phenotype in gastric carcinogenesis has recently been provided by the observation that infection with H. pylori carrying CagA with greater ability to induce the hummingbird phenotype is more closely associated with gastric carcinoma (20–23). Elevated motility of hummingbird cells (cells showing the hummingbird phenotype) may also contribute to invasion and metastasis of gastric carcinoma.
In host cells, CagA interacts with the SHP-2 phosphatase, C-terminal Src kinase, and Crk adaptor in a tyrosine phosphorylation-dependent manner (16, 24, 25) and also associates with Grb2 adaptor and c-Met in a phosphorylation-independent manner (26, 27). 
Among these CagA targets, much attention has been focused on SHP-2 because the phosphatase has been recognized as a bona fide oncoprotein, gain-of-function mutations of which are found in various human malignancies (17, 18, 28). Stable interaction of CagA with SHP-2 requires CagA dimerization, which is mediated by a 16-amino acid CagA-multimerization (CM) sequence present in the C-terminal region of CagA (29). Upon complex formation, CagA aberrantly activates SHP-2 and thereby elicits sustained ERK MAP kinase activation that promotes mitogenesis (30). Also, CagA-activated SHP-2 dephosphorylates and inhibits focal adhesion kinase (FAK), causing impaired focal adhesions. It has been shown previously that both aberrant ERK activation and FAK inhibition by CagA-deregulated SHP-2 are involved in induction of the hummingbird phenotype (31).
Partitioning-defective 1 (PAR1)/microtubule affinity-regulating kinase (MARK) is an evolutionarily conserved serine/threonine kinase originally isolated in C. elegans (32–34). Mammalian cells possess four structurally related PAR1 isoforms, PAR1a/MARK3, PAR1b/MARK2, PAR1c/MARK1, and PAR1d/MARK4 (35–37). Among these, PAR1a, PAR1b, and PAR1c are expressed in a variety of cells, whereas PAR1d is predominantly expressed in neural cells (35, 37). These PAR1 isoforms phosphorylate microtubule-associated proteins (MAPs) and thereby destabilize microtubules (35, 38), allowing asymmetric distribution of molecules that are involved in the establishment and maintenance of cell polarity.
In polarized epithelial cells, CagA disrupts the tight junctions and causes loss of apical-basolateral polarity (39, 40). This CagA activity involves the interaction of CagA with PAR1b/MARK2 (19, 41). CagA directly binds to the kinase domain of PAR1b in a tyrosine phosphorylation-independent manner and inhibits the kinase activity. Notably, CagA binds to PAR1b via the CM sequence (19). 
Because PAR1b is present as a dimer in cells (42), CagA may passively homodimerize upon complex formation with the PAR1 dimer via the CM sequence, and this PAR1-directed CagA dimer would form a stable complex with SHP-2 through its two SH2 domains.
Because of the critical role of CagA in gastric carcinogenesis (7, 16–19), it is important to elucidate the molecular basis underlying the morphogenetic activity of CagA. In this study, we investigated the role of PAR1 isoforms in induction of the hummingbird phenotype by CagA, and we obtained evidence that CagA-mediated inhibition of PAR1 kinases contributes to the development of the morphological change by perturbing microtubules and non-muscle myosin II.
