首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
We have determined the crystal structures of three homologous proteins from the pathogenic protozoans Leishmania donovani, Leishmania major, and Trypanosoma cruzi. We propose that these proteins represent a new subfamily within the isochorismatase superfamily (CDD classification cd004310). Their overall fold and key active site residues are structurally homologous both to the biochemically well-characterized N-carbamoylsarcosine-amidohydrolase, a cysteine hydrolase, and to the phenazine biosynthesis protein PHZD (isochorismase), an aspartyl hydrolase. All three proteins are annotated as mitochondrial-associated ribonuclease Mar1, based on a previous characterization of the homologous protein from L. tarentolae. This would constitute a new enzymatic activity for this structural superfamily, but this is not strongly supported by the observed structures. In these protozoan proteins, the extended active site is formed by inter-subunit association within a tetramer, which implies a distinct evolutionary history and substrate specificity from the previously characterized members of the isochorismatase superfamily. The characterization of the active site is supported crystallographically by the presence of an unidentified ligand bound at the active site cysteine of the T. cruzi structure.  相似文献   

2.
A detailed knowledge of a protein's functional site is an absolute prerequisite for understanding its mode of action at the molecular level. However, the rapid pace at which sequence and structural information is being accumulated for proteins greatly exceeds our ability to determine their biochemical roles experimentally. As a result, computational methods are required which allow for the efficient processing of the evolutionary information contained in this wealth of data, in particular that related to the nature and location of functionally important sites and residues. The method presented here, referred to as conserved functional group (CFG) analysis, relies on a simplified representation of the chemical groups found in amino acid side-chains to identify functional sites from a single protein structure and a number of its sequence homologues. We show that CFG analysis can fully or partially predict the location of functional sites in approximately 96% of the 470 cases tested and that, unlike other methods available, it is able to tolerate wide variations in sequence identity. In addition, we discuss its potential in a structural genomics context, where automation, scalability and efficiency are critical, and an increasing number of protein structures are determined with no prior knowledge of function. This is exemplified by our analysis of the hypothetical protein Ydde_Ecoli, whose structure was recently solved by members of the North East Structural Genomics consortium. Although the proposed active site for this protein needs to be validated experimentally, this example illustrates the scope of CFG analysis as a general tool for the identification of residues likely to play an important role in a protein's biochemical function. Thus, our method offers a convenient solution to rapidly and automatically process the vast amounts of data that are beginning to emerge from structural genomics projects.  相似文献   

3.
Comparative studies of the proteomes from different organisms have provided valuable information about protein domain distribution in the kingdoms of life. Earlier studies have been limited by the fact that only about 50% of the proteomes could be matched to a domain. Here, we have extended these studies by including less well-defined domain definitions, Pfam-B and clustered domains, MAS, in addition to Pfam-A and SCOP domains. It was found that a significant fraction of these domain families are homologous to Pfam-A or SCOP domains. Further, we show that all regions that do not match a Pfam-A or SCOP domain contain a significantly higher fraction of disordered structure. These unstructured regions may be contained within orphan domains or function as linkers between structured domains. Using several different definitions we have re-estimated the number of multi-domain proteins in different organisms and found that several methods all predict that eukaryotes have approximately 65% multi-domain proteins, while the prokaryotes consist of approximately 40% multi-domain proteins. However, these numbers are strongly dependent on the exact choice of cut-off for domains in unassigned regions. In conclusion, all eukaryotes have similar fractions of multi-domain proteins and disorder, whereas a high fraction of repeating domain is distinguished only in multicellular eukaryotes. This implies a role for repeats in cell-cell contacts while the other two features are important for intracellular functions.  相似文献   

4.
Seventy integral membrane proteins from the Mycobacterium tuberculosis genome have been cloned and expressed in Escherichia coli. A combination of T7 promoter-based vectors with hexa-His affinity tags and BL21 E. coli strains with additional tRNA genes to supplement sparsely used E. coli codons have been most successful. The expressed proteins have a wide range of molecular weights and number of transmembrane helices. Expression of these proteins has been observed in the membrane and insoluble fraction of E. coli cell lysates and, in some cases, in the soluble fraction. The highest expression levels in the membrane fraction were restricted to a narrow range of molecular weights and relatively few transmembrane helices. In contrast, overexpression in insoluble aggregates was distributed over a broad range of molecular weights and number of transmembrane helices.  相似文献   

5.
The natively disordered protein alpha-synuclein is the primary component of Lewy bodies, the cellular hallmark of Parkinson's disease. Most studies of this protein are performed in dilute solution, but its biologically relevant role is performed in the crowded environment inside cells. We addressed the effects of macromolecular crowding on alpha-synuclein by combining NMR data acquired in living Escherichia coli with in vitro NMR data. The crowded environment in the E.coli periplasm prevents a conformational change that is detected at 35 degrees C in dilute solution. This change is associated with an increase in hydrodynamic radius and the formation of secondary structure in the N-terminal 100 amino acid residues. By preventing this temperature-induced conformational change, crowding in the E.coli periplasm stabilizes the disordered monomer. We obtain the same stabilization in vitro upon crowding alpha-synuclein with 300 g/l of bovine serum albumin, indicating that crowding alone is sufficient to stabilize the disordered, monomeric protein. Two disease-associated variants (A30P and A53T) behave in the same way in both dilute solution and in the E.coli periplasm. These data reveal the importance of approaching the effects of macromolecular crowding on a case-by-case basis. Additionally, our work shows that discrete structured protein conformations may not be achieved by alpha-synuclein inside cells, implicating the commonly overlooked aspect of macromolecular crowding as a possible factor in the etiology of Parkinson's disease.  相似文献   

6.
Rubisco is a very large, complex and one of the most abundant proteins in the world and comprises up to 50% of all soluble protein in plants. The activity of Rubisco, the enzyme that catalyzes CO2 assimilation in photosynthesis, is regulated by Rubisco activase (Rca). In the present study, we searched for hypothetical protein of Vitis vinifera which has putative Rubisco activase function. The Arabidopsis and tobacco Rubisco activase protein sequences were used as seed sequences to search against Vitis vinifera in UniprotKB database. The selected hypothetical proteins of Vitis vinifera were subjected to sequence, structural and functional annotation. Subcellular localization predictions suggested it to be cytoplasmic protein. Homology modelling was used to define the three-dimensional (3D) structure of selected hypothetical proteins of Vitis vinifera. Template search revealed that all the hypothetical proteins share more than 80% sequence identity with structure of green-type Rubisco activase from tobacco, indicating proteins are evolutionary conserved. The homology modelling was generated using SWISS-MODEL. Several quality assessment and validation parameters computed indicated that homology models are reliable. Further, functional annotation through PFAM, CATH, SUPERFAMILY, CDART suggested that selected hypothetical proteins of Vitis vinifera contain ATPase family associated with various cellular activities (AAA) and belong to the AAA+ super family of ring-shaped P-loop containing nucleoside triphosphate hydrolases. This study will lead to research in the optimization of the functionality of Rubisco which has large implication in the improvement of plant productivity and resource use efficiency.  相似文献   

7.
Immobilised metal affinity chromatography (IMAC) is the most widely used technique for single-step purification of recombinant proteins. However, despite its use in the purification of heterologue proteins in the eubacteria Escherichia coli for decades, the presence of native E. coli proteins that exhibit a high affinity for divalent cations such as nickel, cobalt or copper has remained problematic. This is of particular relevance when recombinant molecules are not expressed at high levels or when their overexpression induces that of native bacterial proteins due to pleiotropism and/or in response to stress conditions. Identification of such contaminating proteins is clearly relevant to those involved in the purification of histidine-tagged proteins either at small/medium scale or in high-throughput processes. The work presented here reviews the native proteins from E. coli most commonly co-purified by IMAC, including Fur, Crp, ArgE, SlyD, GlmS, GlgA, ODO1, ODO2, YadF and YfbG. The binding of these proteins to metal-chelating resins can mostly be explained by their native metal-binding functions or their possession of surface clusters of histidine residues. However, some proteins fall outside these categories, implying that a further class of interactions may account for their ability to co-purify with histidine-tagged proteins. We propose a classification of these E. coli native proteins based on their physicochemical, structural and functional properties.  相似文献   

8.
Ebolavirus is the pathogen for Ebola Hemorrhagic Fever (EHF). This disease exhibits a high fatality rate and has recently reached a historically epidemic proportion in West Africa. Out of the 5 known Ebolavirus species, only Reston ebolavirus has lost human pathogenicity, while retaining the ability to cause EHF in long-tailed macaque. Significant efforts have been spent to determine the three-dimensional (3D) structures of Ebolavirus proteins, to study their interaction with host proteins, and to identify the functional motifs in these viral proteins. Here, in light of these experimental results, we apply computational analysis to predict the 3D structures and functional sites for Ebolavirus protein domains with unknown structure, including a zinc-finger domain of VP30, the RNA-dependent RNA polymerase catalytic domain and a methyltransferase domain of protein L. In addition, we compare sequences of proteins that interact with Ebolavirus proteins from RESTV-resistant primates with those from RESTV-susceptible monkeys. The host proteins that interact with GP and VP35 show an elevated level of sequence divergence between the RESTV-resistant and RESTV-susceptible species, suggesting that they may be responsible for host specificity. Meanwhile, we detect variable positions in protein sequences that are likely associated with the loss of human pathogenicity in RESTV, map them onto the 3D structures and compare their positions to known functional sites. VP35 and VP30 are significantly enriched in these potential pathogenicity determinants and the clustering of such positions on the surfaces of VP35 and GP suggests possible uncharacterized interaction sites with host proteins that contribute to the virulence of Ebolavirus.  相似文献   

9.
Analysis of the oligomeric state of a protein may provide insights into its physiological functions. Because membrane proteins are considered to be the workhorses of energy generation and polypeptide and nutrient transportation, in this study we characterized the membrane-associated proteome of Streptomyces coelicolor by two-dimensional (2D) blue native/sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS–PAGE), high-resolution clear native/native PAGE, and native/SDS–PAGE. A total of 77 proteins were identified, and 20 proteins belonging to 15 complexes were characterized. Moreover, the resolution of high-resolution clear native/SDS–PAGE is much higher than that of blue native/SDS–PAGE. OBP (SCO5477) and BldKB (SCO5113) were identified as the main protein spots from the membrane fractions of S. coelicolor M145, suggesting that these two proteins are involved in extracellular peptide transportation. These two transporters exhibited multiple oligomeric states in the native PAGE system, which may suggest their multiple physiological functions in the development of S. coelicolor.  相似文献   

10.
Although animal breeding was practiced long before the science of genetics and the relevant disciplines of population and quantitative genetics were known, breeding programs have mainly relied on simply selecting and mating the best individuals on their own or relatives’ performance. This is based on sound quantitative genetic principles, developed and expounded by Lush, who attributed much of his understanding to Wright, and formalized in Fisher’s infinitesimal model. Analysis at the level of individual loci and gene frequency distributions has had relatively little impact. Now with access to genomic data, a revolution in which molecular information is being used to enhance response with “genomic selection” is occurring. The predictions of breeding value still utilize multiple loci throughout the genome and, indeed, are largely compatible with additive and specifically infinitesimal model assumptions. I discuss some of the history and genetic issues as applied to the science of livestock improvement, which has had and continues to have major spin-offs into ideas and applications in other areas.THE success of breeders in effecting immense changes in domesticated animals and plants greatly influenced Darwin’s insight into the power of selection and implications to evolution by natural selection. Following the Mendelian rediscovery, attempts were soon made to accommodate within the particulate Mendelian framework the continuous nature of many traits and the observation by Galton (1889) of a linear regression of an individual’s height on that of a relative, with the slope dependent on degree of relationship. A polygenic Mendelian model was first proposed by Yule (1902) (see Provine 1971; Hill 1984). After input from Pearson, Yule again, and Weinberg (who developed the theory a long way but whose work was ignored), its first full exposition in modern terms was by Ronald A. Fisher (1918) (biography by Box 1978). His analysis of variance partitioned the genotypic variance into additive, dominance and epistatic components. Sewall Wright (biography by Provine 1986) had by then developed the path coefficient method and subsequently (Wright 1921) showed how to compute inbreeding and relationship coefficients and their consequent effects on genetic variation of additive traits. His approach to relationship in terms of the correlation of uniting gametes may be less intuitive at the individual locus level than Malécot’s (1948) subsequent treatment in terms of identity by descent, but it transfers directly to the correlation of relatives for quantitative traits with additive effects.From these basic findings, the science of animal breeding was largely developed and expounded by Jay L. Lush (1896–1982) (see also commentaries by Chapman 1987 and Ollivier 2008). He was from a farming family and became interested in genetics as an undergraduate at Kansas State. Although his master’s degree was in genetics, his subsequent Ph.D. at the University of Wisconsin was in animal reproductive physiology. Following 8 years working in animal breeding at the University of Texas he went to Iowa State College (now University) in Ames in 1930. Wright was Lush’s hero: ‘I wish to acknowledge especially my indebtedness to Sewall Wright for many published and unpublished ideas upon which I have drawn, and for his friendly counsel” (Lush 1945, in the preface to his book Animal Breeding Plans). Lush commuted in 1931 to the University of Chicago to audit Sewall Wright’s course in statistical genetics and consult him. Speaking at the Poultry Breeders Roundtable in 1969: he said, “Those were by far the most fruitful 10 weeks I ever had.” (Chapman 1987, quoting A. E. Freeman). Lush was also exposed to and assimilated the work and ideas of R. A. Fisher, who lectured at Iowa State through the summers of 1931 and 1936 at the behest of G. W. Snedecor.Here I review Lush’s contributions and then discuss how animal breeding theory and methods have subsequently evolved. They have been based mainly on statistical methodology, supported to some extent by experiment and population genetic theory. Recently, the development of genomic methods and their integration into classical breeding theory has opened up ways to greatly enhance rates of genetic improvement. Lush focused on livestock improvement and spin-off into other areas was coincidental; but he had contact with corn breeders in Ames and beyond and made contributions to evolutionary biology and human genetics mainly through his developments in theory (e.g., Falconer 1965; Robertson 1966; Lande 1976, 1979; see also Hill and Kirkpatrick 2010). I make no attempt to be comprehensive, not least in choice of citations.  相似文献   

11.
Trichomonas vaginalis causes trichomoniasis, second most sexually transmitted disease. The genome sequence draft of T. vaginalis was published by The Institute of Genomic Research reveals an abnormally large genome size of 160 Mb. It was speculated that a significant portion of the proteome contains paralogous proteins. The present study was aimed at identification and analysis of the paralogous proteins. The all against all search approach is used to identify the paralogous proteins. The dataset of proteins was retrieved from TIGR and TrichDB FTP server. The BLAST-P program performed all against all database searches against the protein database of Trichomonas vaginalis available at NCBI genome database. In the present study about 50,000 proteins were searched where 2,700 proteins were found to be paralogous under the rigid selection criteria. The Pfam database search has identified significant number of paralogous proteins which were further categorized among different 1496 paralogous protein in pfam families, 1027 paralogous protein contains domain, 60 proteins were having different repeats and 1092 paralogous protein sequences of clans. Such identification and functional annotation of paralogous proteins will also help in removing paralogous proteins from possible drug targets in future. Presence of huge number of paralogous proteins across wide range of gene families and domains may be one of the possible mechanisms involved in the T. vaginalis genome expansion and evolution.  相似文献   

12.
Babor M  Gerzon S  Raveh B  Sobolev V  Edelman M 《Proteins》2008,70(1):208-217
Metal ions are crucial for protein function. They participate in enzyme catalysis, play regulatory roles, and help maintain protein structure. Current tools for predicting metal-protein interactions are based on proteins crystallized with their metal ions present (holo forms). However, a majority of resolved structures are free of metal ions (apo forms). Moreover, metal binding is a dynamic process, often involving conformational rearrangement of the binding pocket. Thus, effective predictions need to be based on the structure of the apo state. Here, we report an approach that identifies transition metal-binding sites in apo forms with a resulting selectivity >95%. Applying the approach to apo forms in the Protein Data Bank and structural genomics initiative identifies a large number of previously unknown, putative metal-binding sites, and their amino acid residues, in some cases providing a first clue to the function of the protein.  相似文献   

13.
Many stably folded proteins are proposed to contain long, unstructured loops. A series of hybrid proteins (EbE1-4) containing the folded scaffold of photosystem I accessory protein E (PsaE), an SH3-like protein, and the 40-residue heme-binding loop of cytochrome b(5) was created to inspect the dependence of thermodynamic and kinetic parameters on the residues at the interface of folded and flexible regions. Compared to the simplest hybrid (EbE1), the chimeras differed by Gly insertions (EbE2, EbE3) or an asymmetric four-residue restructuring of loop termini (EbE4). NMR spectroscopy indicated that the chimeras retained the PsaE topology; native and unfolded state solubilities, however, were affected to varying degrees. Thermal and chemical denaturation experiments revealed that the EbE2 and EbE1 constructs resulted in a modest destabilization of the PsaE core, whereas apparent stability was increased by >5 kJ/mol in EbE4. EbE3 aggregated at microM concentrations and was not studied in detail. EbE4 populated two native states (N1 and N2), which differed by hydrophobic core packing and C-terminal interactions. At room temperature, the population ratio ( approximately 3-4:1) favored the state whose spectroscopic properties most resembled those of PsaE (N1). EbE4 also demonstrated altered folding kinetics, displaying multiple slow phases related to the population of intermediates and possibly N2. It was concluded that loop anchors can affect protein properties, including stability, via short-range effects on local structure and long-range communication with the packed hydrophobic core. Modification of the attachment points appears to be a possible stepping stone in the transition from one three-dimensional structure to another.  相似文献   

14.
Auditory neuropathy spectrum disorder (ANSD) is caused by dys-synchronous auditory neural response as a result of impairment of the functions of the auditory nerve or inner hair cells, or synapses between inner hair cells and the auditory nerve. To identify a causative gene causing ANSD in the Korean population, we conducted gene screening of the OTOF, DIAPH3, and PJVK genes in 19 unrelated Korean patients with ANSD. A novel nonsense mutation (p.Y1064X) and a known pathogenic mutation (p.R1939Q) of the OTOF gene were identified in a patient as compound heterozygote. Pedigree analysis for these mutations showed co-segregation of mutation genotype and the disease in the family, and it supported that the p.Y1064X might be a novel genetic cause of autosomal recessive ANSD. A novel missense variant p.K1017R (c.3050A>G) in the DIAPH3 gene was also identified in the heterozygous state. In contrast, no mutation was detected in the PJVK gene. These results indicate that no major causative gene has been reported to date in the Korean population and that pathogenic mutations in undiscovered candidate genes may have an effect on ANSD.  相似文献   

15.
Hydroxyl radical footprinting (HRF) is a nonspecific protein footprinting method that has been increasingly used in recent years to analyze protein structure. The method oxidatively modifies solvent accessible sites in proteins, which changes upon alterations in the protein, such as ligand binding or a change in conformation. For HRF to provide accurate structural information, the method must probe the native structure of proteins. This requires careful experimental controls since an abundance of oxidative modifications can induce protein unfolding. Fast photochemical oxidation of proteins (FPOP) is a HRF method that generates hydroxyl radicals via photo‐dissociation of hydrogen peroxide using an excimer laser. The addition of a radical scavenger to the FPOP reaction reduces the lifetime of the radical, limiting the levels of protein oxidation. A direct assay is needed to ensure FPOP is probing the native conformation of the protein. Here, we report using enzymatic activity as a direct assay to validate that FPOP is probing the native structure of proteins. By measuring the catalytic activity of lysozyme and invertase after FPOP modification, we demonstrate that FPOP does not induce protein unfolding.  相似文献   

16.
This is the first attempt to resolve the phylogenetic relationship between different syngens of Paramecium bursaria and to investigate at a molecular level the intraspecific differentiation of strains originating from very distant geographical locations. Herein we introduce a new collection of five P. bursaria syngens maintained at St Petersburg State University, as the international collection of syngens was lost in the 1960s. To analyze the degree of speciation within Paramecium bursaria, we examined 26 strains belonging to five different syngens from distant and geographically isolated localities using rDNA (ITS1-5.8S-ITS2-5'LSU) fragments, mitochondrial cytochrome c oxidase subunit I (COI), and H4 gene fragments. It was shown that P. bursaria strains of the same syngens cluster together in all three inferred molecular phylogenies. The genetic diversity among the studied P. bursaria strains based on rDNA sequences was rather low. The COI divergence of Paramecium bursaria was also definitely lower than that observed in the Paramecium aurelia complex. The nucleotide sequences of the H4 gene analyzed in the present study indicate the extent of genetic differences between the syngens of Paramecium bursaria. Our study demonstrates the diagnostic value of molecular markers, which are important tools in the identification of Paramecium bursaria syngens.  相似文献   

17.
Rice functional genomics is a scientific approach that seeks to identify and define the function of rice genes, and uncover when and how genes work together to produce phenotypic traits. Rapid progress in rice genome sequencing has facilitated research in rice functional genomics in China. The Ministry of Science and Technology of China has funded two major rice functional genomics research programmes for building up the infrastructures of the functional genomics study such as developing rice functional genomics tools and resources. The programmes were also aimed at cloning and functional analyses of a number of genes controlling important agronomic traits from rice. National and international collaborations on rice functional genomics study are accelerating rice gene discovery and application.  相似文献   

18.
To gain more structural and functional information on the actomyosin complexes, we have engineered chimera proteins carrying the entire Dictyostelium actin in the loop 2 sequence of the motor domain of Dictyostelium myosin II. Although the chimera proteins were unable to polymerize by themselves, addition of skeletal actin promoted polymerization. Electron microscopic observation demonstrated that the chimera proteins were incorporated into actin filaments, when copolymerized with skeletal actin. Copolymerization with skeletal actin greatly enhanced the MgATPase, while the chimera proteins without added skeletal actin hydrolyzed ATP at a very low rate. These results indicate that the actin part and the motor domain part of the chimera proteins are correctly folded, but the chimera proteins are structurally stressed so that efficient polymerization is inhibited.  相似文献   

19.
It is recognized now that intrinsically disordered proteins (IDPs), which do not have unique 3D structures as a whole or in noticeable parts, constitute a significant fraction of any given proteome. IDPs are characterized by an astonishing structural and functional diversity that defines their ability to be universal regulators of various cellular pathways. Programmed cell death (PCD) is one of the most intricate cellular processes where the cell uses specialized cellular machinery and intracellular programs to kill itself. This cell-suicide mechanism enables metazoans to control cell numbers and to eliminate cells that threaten the animal''s survival. PCD includes several specific modules, such as apoptosis, autophagy, and programmed necrosis (necroptosis). These modules are not only tightly regulated but also intimately interconnected and are jointly controlled via a complex set of protein–protein interactions. To understand the role of the intrinsic disorder in controlling and regulating the PCD, several large sets of PCD-related proteins across 28 species were analyzed using a wide array of modern bioinformatics tools. This study indicates that the intrinsic disorder phenomenon has to be taken into consideration to generate a complete picture of the interconnected processes, pathways, and modules that determine the essence of the PCD. We demonstrate that proteins involved in regulation and execution of PCD possess substantial amount of intrinsic disorder. We annotate functional roles of disorder across and within apoptosis, autophagy, and necroptosis processes. Disordered regions are shown to be implemented in a number of crucial functions, such as protein–protein interactions, interactions with other partners including nucleic acids and other ligands, are enriched in post-translational modification sites, and are characterized by specific evolutionary patterns. We mapped the disorder into an integrated network of PCD pathways and into the interactomes of selected proteins that are involved in the p53-mediated apoptotic signaling pathway.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号