共查询到20条相似文献,搜索用时 0 毫秒
1.
Ko‐Hsin Chin Zhao‐Wei Huang Kun‐Chou Wei Chia‐Cheng Chou Cheng‐Chung Lee Hui‐Lin Shr Fei Philip Gao Ping‐Chiang Lyu Andrew H.‐J. Wang Shan‐Ho Chou 《Acta Crystallographica. Section F, Structural Biology Communications》2005,61(7):691-693
Xanthomonas campestris pv. campestris strain 17 is a Gram‐negative yellow‐pigmented pathogenic bacterium that causes black rot, one of the major worldwide diseases of cruciferous crops. Its genome contains approximately 4500 genes, one third of which have no known structure and/or function yet are highly conserved among several different bacterial genuses. One of these gene products is XC1692 protein, containing 141 amino acids. It was overexpressed in Escherichia coli, purified and crystallized in a variety of forms using the hanging‐drop vapour‐diffusion method. The crystals diffract to at least 1.45 Å resolution. They are hexagonal and belong to space group P63, with unit‐cell parameters a = b = 56.9, c = 71.0 Å. They contain one molecule per asymmetric unit. 相似文献
2.
Chao‐Yu Yang Ko‐Hsin Chin Chia‐Cheng Chou Hui‐Lin Shr Fei Philip Gao Ping‐Chiang Lyu Andrew H.‐J. Wang Shan‐Ho Chou 《Acta Crystallographica. Section F, Structural Biology Communications》2005,61(7):703-705
Xanthomonas campestris pv. campestris is a Gram‐negative yellow‐pigmented pathogenic bacterium that causes black rot, one of the major worldwide diseases of cruciferous crops. Its genome contains approximately 4500 genes, roughly one third of which have no known structure and/or function. However, some genes of unknown function are highly conserved among several different bacterial genuses. XC6422 is one such conserved hypothetical protein and has been overexpressed in Escherichia coli, purified and crystallized in a variety of forms using the hanging‐drop vapour‐diffusion method. Crystals grew to approximately 2 × 1.5 × 0.4 mm in size after one week and diffracted to at least 1.6 Å resolution. They belong to the monoclinic space group C2, with one molecule per asymmetric unit and unit‐cell parameters a = 75.8, b = 79.3, c = 38.2 Å, β = 109.4°. Determination of this structure may provide insights into the protein's function. 相似文献
3.
Identification of protein biochemical functions based on their three-dimensional structures is now required in the post-genome-sequencing era. Ligand binding is one of the major biochemical functions of proteins, and thus the identification of ligands and their binding sites is the starting point for the function identification. Previously we reported our first trial on structure-based function prediction, based on the similarity searches of molecular surfaces against the functional site database. Here we describe the extension of our first trial by expanding the search database to whole heteroatom binding sites appearing within the Protein Data Bank (PDB) with the new analysis protocol. In addition, we have determined the similarity threshold line, by using 10 structure pairs with solved free and complex structures. Finally, we extensively applied our method to newly determined hypothetical proteins, including some without annotations, and evaluated the performance of our methods. 相似文献
4.
Ko‐Hsin Chin Wei‐Tien Kuo Chia‐Cheng Chou Hui‐Lin Shr Ping‐Chiang Lyu Andrew H.‐J. Wang Shan‐Ho Chou 《Acta Crystallographica. Section F, Structural Biology Communications》2005,61(7):694-696
Xanthomonas campestris pv. campestris is a Gram‐negative yellow‐pigmented pathogenic bacterium that causes black rot, one of the major worldwide diseases of cruciferous crops. Its genome contains approximately 4500 genes, roughly one third of which have no known structure and/or function. However, some of these unknown genes are highly conserved among several different bacterial genuses. XC229 is one such protein containing 134 amino acids. It was overexpressed in Escherichia coli, purified and crystallized using the hanging‐drop vapour‐diffusion method. The crystal diffracted to a resolution of at least 1.80 Å. It is cubic and belongs to space group I2x3, with unit‐cell parameters a = b = c = 106.8 Å. It contains one or two molecules per asymmetric unit. 相似文献
5.
Kristina Bckbro Annette Roos Edward N. Baker Vickery L. Arcus 《Acta Crystallographica. Section D, Structural Biology》2004,60(4):733-735
Structural genomics offers a potential route to the discovery of protein function. As part of a structural genomics project focused on the hyperthermophilic crenarchaeon Pyrobaculum aerophilum, a conserved hypothetical protein, PAE2754, has been expressed in Escherichia coli, purified and crystallized. Because of the difficulties of preparing interpretable heavy‐atom derivatives with limited resolution and 8–12 molecules in the asymmetric unit, two leucine residues were selected for mutation to methionine. The double mutant L65M/L80M was created, expressed incorporating SeMet and crystallized. The crystals are monoclinic, space group P21, with unit‐cell parameters a = 56.4, b = 193.3, c = 60.5 Å, β = 94.6° and eight molecules (two tetramers) in the asymmetric unit. The crystals diffract to 2.75 Å resolution and are suitable for MAD phasing. 相似文献
6.
Sz‐Kai Ruan Ko‐Hsin Chin Hui‐Lin Shr Ping‐Chiang Lyu Andrew H.‐J. Wang Shan‐Ho Chou 《Acta Crystallographica. Section F, Structural Biology Communications》2007,63(1):30-33
XC5848, a hypothetical protein from the pathogenic bacterium Xanthomonas campestris that causes black rot, has been chosen as a potential target for the discovery of novel folds. It is unique to the Xanthomonas genus and has significant sequence identity mainly to corresponding proteins from the Xanthomonas genus. In this paper, the cloning, overexpression, purification and crystallization of the XC5848 protein are reported. The XC5848 crystals diffracted to a resolution of at least 1.68 Å. They belong to the orthorhombic space group P212121, with unit‐cell parameters a = 48.13, b = 51.62, c = 82.32 Å. Two molecules were found in each asymmetric unit. Preliminary structural studies nevertheless indicate that XC5848 belongs to the highly conserved Sm‐like α‐β‐β‐β‐β fold. However, significant differences in sequence and structure were observed. It therefore represents a novel variant of the crucial Sm‐like motif that is heavily involved in mRNA splicing and degradation. 相似文献
7.
Makoto Nakabayashi Naoki Shibata Hirofumi Komori Yasufumi Ueda Hitoshi Iino Akio Ebihara Seiki Kuramitsu Yoshiki Higuchi 《Acta Crystallographica. Section F, Structural Biology Communications》2005,61(12):1027-1031
The crystal structure of a conserved hypothetical protein, TTHA0849 from Thermus thermophilus HB8, has been determined at 2.4 Å resolution as a part of a structural and functional genomics project on T. thermophilus HB8. The main‐chain folding shows a compact α+β motif, forming a hydrophobic cavity in the molecule. A structural similarity search reveals that it resembles those steroidogenic acute regulatory proteins that contain the lipid‐transfer (START) domain, even though TTHA0849 shows comparatively weak sequence identity to polyketide cyclases. However, the size of the ligand‐binding cavity is distinctly smaller than other START domain‐containing proteins, suggesting that it catalyses the transfer of smaller ligand molecules. 相似文献
8.
Protein function elucidation often relies heavily on amino acid sequence analysis and other bioinformatics approaches. The reliance is extended to structure homology modeling for ligand docking and protein–protein interaction mapping. However, sequence analysis of RPA3313 exposes a large, unannotated class of hypothetical proteins mostly from the Rhizobiales order. In the absence of sequence and structure information, further functional elucidation of this class of proteins has been significantly hindered. A high quality NMR structure of RPA3313 reveals that the protein forms a novel split ββαβ fold with a conserved ligand binding pocket between the first β‐strand and the N‐terminus of the α‐helix. Conserved residue analysis and protein–protein interaction prediction analyses reveal multiple protein binding sites and conserved functional residues. Results of a mass spectrometry proteomic analysis strongly point toward interaction with the ribosome and its subunits. The combined structural and proteomic analyses suggest that RPA3313 by itself or in a larger complex may assist in the transportation of substrates to or from the ribosome for further processing. Proteins 2016; 85:93–102. © 2016 Wiley Periodicals, Inc. 相似文献
9.
The crystal structure of a conserved hypothetical protein from Escherichia coli has been determined using X-ray crystallography. The protein belongs to the Cluster of Orthologous Group COG1553 (National Center for Biotechnology Information database, NLM, NIH), for which there was no structural information available until now. Structural homology search with DALI algorism indicated that this protein has a new fold with no obvious similarity to those of other proteins with known three-dimensional structures. The protein quaternary structure consists of a dimer of trimers, which makes a characteristic cylinder shape. There is a large closed cavity with approximate dimensions of 16 Å × 16 Å × 20 Å in the center of the hexameric structure. Six putative active sites are positioned along the equatorial surface of the hexamer. There are several highly conserved residues including two possible functional cysteines in the putative active site. The possible molecular function of the protein is discussed. 相似文献
10.
The organization of ribosomal proteins in 16 prokaryotic genomes was studied as an example of comparative genome analyses of gene systems. Hypothetical ribosomal protein-containing operons were constructed. These operons also contained putative genes and other non-ribosomal genes. The correspondences among these genes across different organisms were clarified by sequence homology computations. In this way a cross tabulation of 70 ribosomal proteins genes was constructed. On average, these were organized into 9-14 operons in each genome. There were also 25 non-ribosomal or putative genes in these mainly ribosomal protein operons. Hence the table contains 95 genes in total. It was found that: (i) the conservation of the block of about 20 r-proteins in the L3 and L4 operons across almost the entire eubacteria and archaebacteria is remarkable; (ii) some operons only belong to eubacteria or archaebacteria; (iii) although the ribosomal protein operons are highly conserved within domain, there are fine variations in some operons across different organisms within each domain, and these variations are informative on the evolutionary relations among the organisms. This method provides a new potential for studying the origin and evolution of old species. 相似文献
11.
The organization of ribosomal proteins in 16 prokaryotic genomes was studied as an example of comparative genome analyses of gene systems. Hypothetical ribosomal protein-containing operons were constructed. These operons also contained putative genes and other non-ribosomal genes. The correspondences among these genes across different organisms were clarified by sequence homology computations. In this way a cross tabulation of 70 ribosomal proteins genes was constructed. On average, these were organized into 9-14 operons in each genome. There were also 25 non-ribosomal or putative genes in these mainly ribosomal protein operons. Hence the table contains 95 genes in total. It was found that: (i) the conservation of the block of about 20 r-proteins in the L3 and L4 operons across almost the entire eubacteria and ar-chaebacteria is remarkable; (ii) some operons only belong to eubacteria or archaebacte-ria; (iii) although the ribosomal protein operons are highly conserved within domain, there are fine variat 相似文献
12.
A detailed knowledge of a protein's functional site is an absolute prerequisite for understanding its mode of action at the molecular level. However, the rapid pace at which sequence and structural information is being accumulated for proteins greatly exceeds our ability to determine their biochemical roles experimentally. As a result, computational methods are required which allow for the efficient processing of the evolutionary information contained in this wealth of data, in particular that related to the nature and location of functionally important sites and residues. The method presented here, referred to as conserved functional group (CFG) analysis, relies on a simplified representation of the chemical groups found in amino acid side-chains to identify functional sites from a single protein structure and a number of its sequence homologues. We show that CFG analysis can fully or partially predict the location of functional sites in approximately 96% of the 470 cases tested and that, unlike other methods available, it is able to tolerate wide variations in sequence identity. In addition, we discuss its potential in a structural genomics context, where automation, scalability and efficiency are critical, and an increasing number of protein structures are determined with no prior knowledge of function. This is exemplified by our analysis of the hypothetical protein Ydde_Ecoli, whose structure was recently solved by members of the North East Structural Genomics consortium. Although the proposed active site for this protein needs to be validated experimentally, this example illustrates the scope of CFG analysis as a general tool for the identification of residues likely to play an important role in a protein's biochemical function. Thus, our method offers a convenient solution to rapidly and automatically process the vast amounts of data that are beginning to emerge from structural genomics projects. 相似文献
13.
Gene order in prokaryotes is conserved to a much lesser extent than protein sequences. Only some operons, primarily those that encode physically interacting proteins, are conserved in all or most of the bacterial and archaeal genomes. Nevertheless, even the limited conservation of operon organisation that is observed provides valuable evolutionary and functional clues through multiple genome comparisons. With the rapid growth in the number and diversity of sequenced prokaryotic genomes, functional inferences for uncharacterized genes located in the same conserved gene neighborhood with well-studied genes are becoming increasingly important. In this review, we discuss various computational approaches for identification of conserved gene strings and construction of local alignments of gene orders in prokaryotic genomes. 相似文献
14.
15.
Willem P. C. Stemmer 《Biotechnology and Bioprocess Engineering》2002,7(3):121-129
Existing methods for optimization of sequences by random mutagenesis generate libraries with a small number of mostly deleterious mutations, resulting in libraries containing a large fraction of non-functional clones that explore only a small part of squence space. Large numbers of clones need to be screened to find the rare mutants with improvements. Library display formats are useful to screen very large libraries but impose screening limitations that limit the value of this approach for most commercial applications. By contrast, in both classical breeding and in DNA shuffling, natural diversity is permutated by homologous recombination, generating libraries of very high quality, from which improved clones can be identified with a small number of complex screens. Given that this small number of screens can be performed under the conditions of actual use of the product, commercially relevant improvements can be reliably obtained. 相似文献
16.
Sugimori N Torizawa T Aceti DJ Thao S Markley JL Kainosho M 《Journal of biomolecular NMR》2004,30(3):357-358
17.
Kristensen DM Chen BY Fofanov VY Ward RM Lisewski AM Kimmel M Kavraki LE Lichtarge O 《Protein science : a publication of the Protein Society》2006,15(6):1530-1536
The annotation of protein function has not kept pace with the exponential growth of raw sequence and structure data. An emerging solution to this problem is to identify 3D motifs or templates in protein structures that are necessary and sufficient determinants of function. Here, we demonstrate the recurrent use of evolutionary trace information to construct such 3D templates for enzymes, search for them in other structures, and distinguish true from spurious matches. Serine protease templates built from evolutionarily important residues distinguish between proteases and other proteins nearly as well as the classic Ser-His-Asp catalytic triad. In 53 enzymes spanning 33 distinct functions, an automated pipeline identifies functionally related proteins with an average positive predictive power of 62%, including correct matches to proteins with the same function but with low sequence identity (the average identity for some templates is only 17%). Although these template building, searching, and match classification strategies are not yet optimized, their sequential implementation demonstrates a functional annotation pipeline which does not require experimental information, but only local molecular mimicry among a small number of evolutionarily important residues. 相似文献
18.
19.
Metal ions are crucial for protein function. They participate in enzyme catalysis, play regulatory roles, and help maintain protein structure. Current tools for predicting metal-protein interactions are based on proteins crystallized with their metal ions present (holo forms). However, a majority of resolved structures are free of metal ions (apo forms). Moreover, metal binding is a dynamic process, often involving conformational rearrangement of the binding pocket. Thus, effective predictions need to be based on the structure of the apo state. Here, we report an approach that identifies transition metal-binding sites in apo forms with a resulting selectivity >95%. Applying the approach to apo forms in the Protein Data Bank and structural genomics initiative identifies a large number of previously unknown, putative metal-binding sites, and their amino acid residues, in some cases providing a first clue to the function of the protein. 相似文献
20.
Kazunari Yoneda Hideaki Tsuge Haruhiko Sakuraba Nobuhiko Katunuma Seiki Kuramitsu Takeshi Kawabata Toshihisa Ohshima 《Acta Crystallographica. Section F, Structural Biology Communications》2005,61(7):636-639
The crystal structure of ST1625p, a protein encoded by a hypothetical open reading frame ST1625 in the genome of the hyperthermophilic archaeon Sulfolobus tokodaii, was determined at 2.2 Å resolution. The only sequence similarity exhibited by the amino‐acid sequence of ST1625p was a 33% identity with the sequence of SSO0983p from S. solfataricus. The 19 kDa monomeric protein was observed to consist of a right‐handed superhelix assembled from a tandem repeat of ten α‐helices. A structural homology search using the DALI and MATRAS algorithms indicates that this protein can be classified as a helical repeat protein. 相似文献