Similar literature
 20 similar articles found (search time: 31 ms)
1.
Proteins that share even low sequence homologies are known to adopt similar folds. The beta-propeller structural motif is one such example. Identifying sequences that adopt a beta-propeller fold is useful to annotate protein structure and function. Often, tandem sequence repeats provide the necessary signal for identifying beta-propellers in proteins. In our recent analysis to identify cell surface proteins in archaeal and bacterial genomes, we identified some proteins that contain novel tandem repeats "LVIVD", "RIVW" and "LGxL". In this work, based on protein fold predictions and three-dimensional comparative modeling methods, we predicted that these repeat types fold as beta-propellers. Further, the evolutionary trace analysis of all proteins constituting amino acid sequence repeats in beta-propellers suggests that the novel repeats have diverged from a common ancestor.

2.
Animal and plant eukaryotic pathogens, such as the human malaria parasite Plasmodium falciparum and the potato late blight agent Phytophthora infestans, are widely divergent eukaryotic microbes. Yet they both produce secretory virulence and pathogenic proteins that alter host cell functions. In P. falciparum, export of parasite proteins to the host erythrocyte is mediated by leader sequences shown to contain a host-targeting (HT) motif centered on an RxLx (E, D, or Q) core: this motif appears to signify a major pathogenic export pathway with hundreds of putative effectors. Here we show that a secretory protein of P. infestans, which is perceived by plant disease resistance proteins and induces hypersensitive plant cell death, contains a leader sequence that is equivalent to the Plasmodium HT-leader in its ability to export fusion of green fluorescent protein (GFP) from the P. falciparum parasite to the host erythrocyte. This export is dependent on an RxLR sequence conserved in P. infestans leaders, as well as in leaders of all ten secretory oomycete proteins shown to function inside plant cells. The RxLR motif is also detected in hundreds of secretory proteins of P. infestans, Phytophthora sojae, and Phytophthora ramorum and has high value in predicting host-targeted leaders. A consensus motif further reveals E/D residues enriched within approximately 25 amino acids downstream of the RxLR, which are also needed for export. Together the data suggest that in these plant pathogenic oomycetes, a consensus HT motif may reside in an extended sequence of approximately 25-30 amino acids, rather than in a short linear sequence. Evidence is presented that although the consensus is much shorter in P. falciparum, information sufficient for vacuolar export is contained in a region of approximately 30 amino acids, which includes sequences flanking the HT core. Finally, positional conservation between Phytophthora RxLR and P. falciparum RxLx (E, D, Q) is consistent with the idea that the context of their presentation is constrained. These studies provide the first evidence to our knowledge that eukaryotic microbes share equivalent pathogenic HT signals and thus conserved mechanisms to access host cells across plant and animal kingdoms that may present unique targets for prophylaxis across divergent pathogens.
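
The two motif cores named in this abstract, RxLR and RxLx(E, D, or Q), can be illustrated with a small pattern scan. The following is only a minimal sketch and not the authors' prediction method: the function names, the toy sequences, and the `min_acidic` threshold are hypothetical, and the 25-residue window simply mirrors the "approximately 25 amino acids downstream" mentioned above.

```python
import re

# Motif cores as described in the abstract; "." stands for any residue (x).
PLASMODIUM_HT = re.compile(r"R.L.[EDQ]")   # RxLx(E, D, or Q)
OOMYCETE_RXLR = re.compile(r"R.LR")        # RxLR


def find_motifs(seq, pattern):
    """Return 0-based start positions of all matches, including overlapping ones."""
    lookahead = re.compile(f"(?=({pattern.pattern}))")
    return [m.start() for m in lookahead.finditer(seq)]


def has_acidic_downstream(seq, start, window=25, min_acidic=2):
    """Check for E/D residues in the `window` residues after a 4-residue RxLR.

    `min_acidic` is an arbitrary illustrative threshold, not a published value.
    """
    tail = seq[start + 4 : start + 4 + window]
    return sum(tail.count(residue) for residue in "ED") >= min_acidic


if __name__ == "__main__":
    toy_leader = "MKLSRALRAAEEDD"  # hypothetical sequence, not a real protein
    for pos in find_motifs(toy_leader, OOMYCETE_RXLR):
        print(pos, has_acidic_downstream(toy_leader, pos))
```

A real predictor would also need signal-peptide detection and positional constraints, which this sketch leaves out.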

3.
Repeat proteins comprise tandem arrays of a small structural motif. Their structure is defined and stabilized by interactions between residues that are close in the primary sequence. Several studies have investigated whether their structural modularity translates into modular thermodynamic properties. Tetratricopeptide repeat proteins (TPRs) are a class in which the repeated unit is a 34 amino acid helix-turn-helix motif. In this work, we use differential scanning calorimetry (DSC) to study the equilibrium stability of a series of TPR proteins with different numbers of an identical consensus repeat, from 2 to 20, CTPRa2 to CTPRa20. The DSC data provides direct evidence that the folding/unfolding transition of CTPR proteins does not fit a two-state folding model. Our results confirm and expand earlier studies on TPR proteins, which showed that apparent two-state unfolding curves are better fit by linear statistical mechanics models: 1D Ising models in which each repeat is treated as an independent folding unit.

4.
There are several different families of repeat proteins. In each, a distinct structural motif is repeated in tandem to generate an elongated structure. The nonglobular, extended structures that result are particularly well suited to present a large surface area and to function as interaction domains. Many repeat proteins have been demonstrated experimentally to fold and function as independent domains. In tetratricopeptide (TPR) repeats, the repeat unit is a helix-turn-helix motif. The majority of TPR motifs occur as three to over 12 tandem repeats in different proteins. The majority of TPR structures in the Protein Data Bank are of isolated domains. Here we present the high-resolution structure of NlpI, the first structure of a complete TPR-containing protein. We show that in this instance the TPR motifs do not fold and function as an independent domain, but are fully integrated into the three-dimensional structure of a globular protein. The NlpI structure is also the first TPR structure from a prokaryote. It is of particular interest because it is a membrane-associated protein, and mutations in it alter septation and virulence.

5.
Repeat proteins contain tandem arrays of a simple structural motif. In contrast to globular proteins, repeat proteins are stabilized only by interactions between residues that are relatively close together in the sequence, with no "long-range" interactions. Our work focuses on the tetratricopeptide repeat (TPR), a 34 amino acid helix-turn-helix motif found in tandem arrays in many natural proteins. Earlier, we reported the design and characterization of a series of consensus TPR (CTPR) proteins, which are built as arrays of multiple tandem copies of a 34 amino acid consensus sequence. Here, we present the results of extensive hydrogen exchange (HX) studies of the folding-unfolding behavior of two CTPR proteins (CTPR2 and CTPR3). We used HX to detect and characterize partially folded species that are populated at low frequency in the nominally folded state. We show that for both proteins the equilibrium folding-unfolding transition is non-two-state, but sequential, with the outermost helices showing a significantly higher probability than inner helices of being unfolded. We show that the experimentally observed unfolding behavior is consistent with the predictions of a simple Ising model, in which individual helices are treated as "spin-equivalents". The results that we present have general implications for our understanding of the thermodynamic properties of repeat proteins.

6.
Genes containing multiple coding mini- and microsatellite repeats are highly dynamic components of genomes. Frequent recombination events within these tandem repeats lead to changes in repeat numbers, which in turn alter the amino acid sequence of the corresponding protein. In bacteria and yeasts, the expansion of such coding repeats in cell wall proteins is associated with alterations in immunogenicity, adhesion, and pathogenesis. We hypothesized that identification of repeat-containing putative cell wall proteins in the human pathogen Aspergillus fumigatus may reveal novel pathogenesis-related elements. Here, we report that the genome of A. fumigatus contains as many as 292 genes with internal repeats. Fourteen of 30 selected genes showed size variation of their repeat-containing regions among 11 clinical A. fumigatus isolates. Four of these genes, Afu3g08990, Afu2g05150 (MP-2), Afu4g09600, and Afu6g14090, encode putative cell wall proteins containing a leader sequence and a glycosylphosphatidylinositol anchor motif. All four genes are expressed and produce variable-size mRNA encoding a discrete number of repeat amino acid units. Their expression was altered during development and in response to cell wall-disrupting agents. Deletion of one of these genes, Afu3g08990, resulted in a phenotype characterized by rapid conidial germination and reduced adherence to extracellular matrix, suggestive of an alteration in cell wall characteristics. The Afu3g08990 protein was localized to the cell walls of dormant and germinating conidia. Our findings suggest that a subset of the A. fumigatus cell surface proteins may be hypervariable due to recombination events in their internal tandem repeats. This variation may provide the functional diversity in cell surface antigens which allows rapid adaptation to the environment and/or elusion of the host immune system.

7.
8.
Mucins are macromolecules that line the cells in contact with the external environment and protect the epithelium against constant attacks such as digestive fluids, microorganisms, pollutants, and toxins. Mucins are the main components of mucus and are synthesized and secreted by specialized cells of the epithelium (goblet cells, cells of mucous glands) or by non-mucin-secreting cells. Human mucin genes show common features: large size of their mRNAs, large nucleotide tandem repeat domains, and complex expression at both the tissue and cellular levels. Since 1987, 21 MUC symbols have been used to designate genes encoding O-glycoproteins containing tandem repeat domains rich in serine, threonine and proline. Some of these genes encode true mucins while others encode non-mucin adhesion O-glycoproteins. In this paper, we propose a classification based on sequence similarities and expression areas. Two main families can be distinguished: secreted mucins or gel-forming mucins (MUC2, MUC5AC, MUC5B, MUC6), and membrane-bound mucins (MUC1, MUC3, MUC4, MUC12, MUC17). Muc-deficient mice will provide important models in the study of functional relationships between these two mucin families.

9.
10.
We report characterisation of three copies of a novel repeat sequence isolated from a Mycobacterium bovis genomic library. The repeat occurs within open reading frames, potentially encoding a conserved tandem array of a pentapeptide sequence with the consensus X-Gly-Asn-X-Gly. The tandem array is present up to five times in M. bovis and it is proposed that they may occur in a family of genes expressing functionally related proteins. We postulate that these proteins may play a role in binding of M. bovis to host cell receptors.
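
The X-Gly-Asn-X-Gly consensus described above lends itself to a simple illustration. The sketch below is hypothetical and not taken from the paper: it treats a tandem array as two or more back-to-back pentapeptides matching the consensus (the minimum of two is an assumption), and the test sequence is made up.

```python
import re

# X-Gly-Asn-X-Gly in one-letter code is xGNxG; require at least two copies in a row.
PENTAPEPTIDE_ARRAY = re.compile(r"(?:.GN.G){2,}")


def tandem_arrays(seq):
    """Return (start, repeat_count) for each tandem array of xGNxG units."""
    return [(m.start(), len(m.group()) // 5) for m in PENTAPEPTIDE_ARRAY.finditer(seq)]


if __name__ == "__main__":
    toy = "MMAGNTGSGNAGLGNLGKK"  # hypothetical sequence with three consecutive units
    print(tandem_arrays(toy))
```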

11.
T cell activation through the CD2 cell surface receptor is transmitted by proline-rich sequences within its cytoplasmic tail. A membrane-proximal proline-rich tandem repeat, involved in cytokine production, is recognized by the intracellular CD2 binding protein CD2BP2. We solved the solution structure of the CD2 binding domain of CD2BP2, which we name the glycine-tyrosine-phenylalanine (GYF) domain. The GYF sequence is part of a structurally unique bulge-helix-bulge motif that constitutes the major binding site for the CD2 tail. A hydrophobic surface patch is created by motif residues that are highly conserved among a variety of proteins from diverse eukaryotic species. Thus, the architecture of the GYF domain may be widely used in protein-protein associations.

12.
The cag-pathogenicity-island-encoded type IV secretion system of Helicobacter pylori functions to translocate the effector protein CagA directly through the plasma membrane of gastric epithelial cells. Similar to other secretion systems, the Cag type IV secretion system elaborates a surface filament structure, which is unusually sheathed by the large cag-pathogenicity-island-encoded protein CagY. CagY is distinguished by unusual amino acid composition and extensive repetitive sequence organised into two defined repeat regions. The second and major repeat region (CagYrpt2) has a regular disposition of six repetitive motifs, which are subject to deletion and duplication, facilitating the generation of CagY size and phenotypic variants. In this study, we show CagYrpt2 to comprise two highly thermostable and acid-stable α-helical structural motifs, the most abundant of which (motif A) occurs in tandem arrays of one to six repeats terminally flanked by single copies of the second repeat (motif B). Isolated motifs demonstrate hetero- and homomeric interactions, suggesting a propensity for uniform assembly of discrete structural subunit motifs within the larger CagYrpt2 structure. Consistent with this, CagY proteins comprising substantially different repeat 2 motif organisations demonstrate equivalent CagA translocation competence, illustrating a remarkable structural and functional tolerance for precise deletion and duplication of motif subunits. We provide the first insight into the structural basis for CagYrpt2 assembly that accommodates both the variable motif sequence composition and the extensive contraction/expansion of repeat modules within the CagYrpt2 region.

13.
Topological characteristics of helical repeat proteins.   Total citations: 23, self-citations: 0, citations by others: 23
The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem repeats of an alpha-helical structural unit, creating extended superhelical structures that are ideally suited to create a protein recognition interface.

14.
Phytophthora infestans, the organism responsible for the Irish famine, causes late blight, a re-emerging disease of potato and tomato. Little is known about the molecular evolution of P. infestans genes. To identify candidate effector genes (virulence or avirulence genes) that may have co-evolved with the host, we mined expressed sequence tag (EST) data from infection stages of P. infestans for secreted and potentially polymorphic genes. This led to the identification of scr74, a gene that encodes a predicted 74-amino acid secreted cysteine-rich protein with similarity to the Phytophthora cactorum phytotoxin PcF. The expression of scr74 was upregulated approximately 60-fold 2 to 4 days after inoculation of tomato and was also significantly induced during early stages of colonization of potato. The scr74 gene was found to belong to a highly polymorphic gene family within P. infestans with 21 different sequences identified. Using the approximate and maximum likelihood (ML) methods, we found that diversifying selection likely caused the extensive polymorphism observed within the scr74 gene family. Pairwise comparisons of 17 scr74 sequences revealed elevated ratios of nonsynonymous to synonymous nucleotide-substitution rates, particularly in the mature region of the proteins. Using ML, all 21 polymorphic amino acid sites were identified to be under diversifying selection. Of these 21 amino acids, 19 are located in the mature protein region, suggesting that selection may have acted on the functional portions of the proteins. Further investigation of gene copy number and organization revealed that the scr74 gene family comprises at least three copies located in a region of no more than 300 kb of the P. infestans genome. We found evidence that recombination contributed to sequence divergence within at least one gene locus. These results led us to propose an evolutionary model that involves gene duplication and recombination, followed by functional divergence of scr74 genes. This study provides support for using diversifying selection as a criterion for identifying candidate effector genes from sequence databases.

15.
Phytophthora infestans is a devastating phytopathogenic oomycete that causes late blight on tomato and potato. Recent genome sequencing efforts of P. infestans and other Phytophthora species are generating vast amounts of sequence data providing opportunities to unlock the complex nature of pathogenesis. However, accurate annotation of Phytophthora genomes will be a significant challenge. Most of the information about gene structure in these species was gathered from a handful of genes resulting in significant limitations for development of ab initio gene-calling programs. In this study, we collected a total of 150 bioinformatically determined near full-length cDNA (FLcDNA) sequences of P. infestans that were predicted to contain full open reading frame sequences. We performed detailed computational analyses of these FLcDNA sequences to obtain a snapshot of P. infestans gene structure, gauge the degree of sequence conservation between P. infestans genes and those of Phytophthora sojae and Phytophthora ramorum, and identify patterns of gene conservation between P. infestans and various eukaryotes, particularly fungi, for which genome-wide translated protein sequences are available. These analyses helped us to define the structural characteristics of P. infestans genes using a validated data set. We also determined the degree of sequence conservation within the genus Phytophthora and identified a set of fast evolving genes. Finally, we identified a set of genes that are shared between Phytophthora and fungal phytopathogens but absent in animal fungal pathogens. These results confirm that plant pathogenic oomycetes and fungi share virulence components, and suggest that eukaryotic microbial pathogens that share similar lifestyles also share a similar set of genes independently of their phylogenetic relatedness.

16.
17.
Repeat proteins are ubiquitous and are involved in a myriad of essential processes. They are typically non-globular structures that act as diverse scaffolds for the mediation of protein-protein interactions. These excitingly different structures, which arise from tandem arrays of a repeated structural motif, have generated significant interest with respect to protein engineering and design. Recent advances have been made in the design and characterisation of repeat proteins. The highlights include re-engineering of binding specificity, quantitative models of repeat protein stability and kinetic studies of repeat protein folding.

18.
19.
The membrane-tethered mucins are cell surface-associated dimeric or multimeric molecules with extracellular, transmembrane and cytoplasmic portions that arise from cleavage of the primary polypeptide chain. Following the first cleavage, which may be cotranslational, the subunits remain closely associated through undefined noncovalent interactions. These mucins all share a common structural motif, the SEA module, which is found in many other membrane-associated proteins that are released from the cell surface and has been implicated in both the cleavage events and association of the subunits. Here we examine the SEA modules of three membrane-tethered mucins, MUC1, MUC3 and MUC12, which have significant sequence homology within the SEA domain. We previously identified the primary cleavage site within the MUC1 SEA domain as FRPG/SVVV, a sequence that is highly conserved in MUC3 and MUC12. We now show by site-directed mutagenesis that the F, G and S residues are important for the efficiency of the cleavage reaction but not indispensable and that amino acids outside this motif are probably important. These data are consistent with a new model of the MUC1 SEA domain that is based on the solution structure of the MUC16 SEA module, derived by NMR spectroscopy. Further, we demonstrate that cleavage of human MUC3 and MUC12 occurs within the SEA domain. However, the SEA domains of MUC1, MUC3 and MUC12 are not interchangeable, suggesting that either these modules alone are insufficient to mediate efficient cleavage or that the 3D structure of the hybrid molecules does not adequately re-create an accessible cleavage site.

20.