共查询到20条相似文献,搜索用时 15 毫秒
1.
Kim SH Shin DH Choi IG Schulze-Gahmen U Chen S Kim R 《Journal of structural and functional genomics》2003,4(2-3):129-135
The dramatically increasing number of new protein sequences arising from genomics 4 proteomics requires the need for methods to rapidly and reliably infer the molecular and cellular functions of these proteins. One such approach, structural genomics, aims to delineate the total repertoire of protein folds in nature, thereby providing three-dimensional folding patterns for all proteins and to infer molecular functions of the proteins based on the combined information of structures and sequences. The goal of obtaining protein structures on a genomic scale has motivated the development of high throughput technologies and protocols for macromolecular structure determination that have begun to produce structures at a greater rate than previously possible. These new structures have revealed many unexpected functional inferences and evolutionary relationships that were hidden at the sequence level. Here, we present samples of structures determined at Berkeley Structural Genomics Center and collaborators laboratories to illustrate how structural information provides and complements sequence information to deduce the functional inferences of proteins with unknown molecular functions.Two of the major premises of structural genomics are to discover a complete repertoire of protein folds in nature and to find molecular functions of the proteins whose functions are not predicted from sequence comparison alone. To achieve these objectives on a genomic scale, new methods, protocols, and technologies need to be developed by multi-institutional collaborations worldwide. As part of this effort, the Protein Structure Initiative has been launched in the United States (PSI; www.nigms.nih.gov/funding/psi.html). Although infrastructure building and technology development are still the main focus of structural genomics programs [1–6], a considerable number of protein structures have already been produced, some of them coming directly out of semi-automated structure determination pipelines [6–10]. The Berkeley Structural Genomics Center (BSGC) has focused on the proteins of Mycoplasma or their homologues from other organisms as its structural genomics targets because of the minimal genome size of the Mycoplasmas as well as their relevance to human and animal pathogenicity (http://www.strgen.org). Here we present several protein examples encompassing a spectrum of functional inferences obtainable from their three-dimensional structures in five situations, where the inferences are new and testable, and are not predictable from protein sequence information alone. 相似文献
2.
Kim SH Shin DH Liu J Oganesyan V Chen S Xu QS Kim JS Das D Schulze-Gahmen U Holbrook SR Holbrook EL Martinez BA Oganesyan N DeGiovanni A Lou Y Henriquez M Huang C Jancarik J Pufan R Choi IG Chandonia JM Hou J Gold B Yokota H Brenner SE Adams PD Kim R 《Journal of structural and functional genomics》2005,6(2-3):63-70
The initial aim of the Berkeley Structural Genomics Center is to obtain a near-complete structural complement of two minimal
organisms, closely related pathogens Mycoplasma genitalium and M. pneumoniae. The former has fewer than 500 genes and the latter fewer than 700 genes. To achieve this goal, the current protein targets
have been selected starting with those predicted to be most tractable and likely to yield new structural and functional information.
During the past 3 years, the semi-automated structural genomics pipeline has been set up from cloning, expression, purification,
and ultimately to structural determination. The results from the pipeline substantially increased the coverage of the protein
fold space of M. pneumoniae and M. genitalium. Furthermore, about 1/2 of the structures of ‘unique’ protein sequences revealed new and novel folds, and over 2/3 of the
structures of previously annotated ‘hypothetical proteins’ inferred their molecular functions. 相似文献
3.
Phillips GN Fox BG Markley JL Volkman BF Bae E Bitto E Bingman CA Frederick RO McCoy JG Lytle BL Pierce BS Song J Twigger SN 《Journal of structural and functional genomics》2007,8(2-3):73-84
The Center for Eukaryotic Structural Genomics (CESG) produces and solves the structures of proteins from eukaryotes. We have
developed and operate a pipeline to both solve structures and to test new methodologies. Both NMR and X-ray crystallography
methods are used for structure solution. CESG chooses targets based on sequence dissimilarity to known structures, medical
relevance, and nominations from members of the scientific community. Many times proteins qualify in more than one of these
categories. Here we review some of the structures that have connections to human health and disease. 相似文献
4.
Almo SC Bonanno JB Sauder JM Emtage S Dilorenzo TP Malashkevich V Wasserman SR Swaminathan S Eswaramoorthy S Agarwal R Kumaran D Madegowda M Ragumani S Patskovsky Y Alvarado J Ramagopal UA Faber-Barata J Chance MR Sali A Fiser A Zhang ZY Lawrence DS Burley SK 《Journal of structural and functional genomics》2007,8(2-3):121-140
5.
Bandyopadhyay D Huan J Liu J Prins J Snoeyink J Wang W Tropsha A 《Protein science : a publication of the Protein Society》2006,15(6):1537-1543
We describe a method to assign a protein structure to a functional family using family-specific fingerprints. Fingerprints represent amino acid packing patterns that occur in most members of a family but are rare in the background, a nonredundant subset of PDB; their information is additional to sequence alignments, sequence patterns, structural superposition, and active-site templates. Fingerprints were derived for 120 families in SCOP using Frequent Subgraph Mining. For a new structure, all occurrences of these family-specific fingerprints may be found by a fast algorithm for subgraph isomorphism; the structure can then be assigned to a family with a confidence value derived from the number of fingerprints found and their distribution in background proteins. In validation experiments, we infer the function of new members added to SCOP families and we discriminate between structurally similar, but functionally divergent TIM barrel families. We then apply our method to predict function for several structural genomics proteins, including orphan structures. Some predictions have been corroborated by other computational methods and some validated by subsequent functional characterization. 相似文献
6.
Abergel C Coutard B Byrne D Chenivesse S Claude JB Deregnaucourt C Fricaux T Gianesini-Boutreux C Jeudy S Lebrun R Maza C Notredame C Poirot O Suhre K Varagnol M Claverie JM 《Journal of structural and functional genomics》2003,4(2-3):141-157
With more than 100 antibacterial drugs at our disposal in the 1980s, the problem of bacterial infection was considered solved. Today, however, most hospital infections are insensitive to several classes of antibacterial drugs, and deadly strains of Staphylococcus aureus resistant to vancomycin – the last resort antibiotic – have recently begin to appear. Other life-threatening microbes, such as Enterococcus faecalis and Mycobacterium tuberculosis are already able to resist every available antibiotic. There is thus an urgent, and continuous need for new, preferably large-spectrum, antibacterial molecules, ideally targeting new biochemical pathways. Here we report on the progress of our structural genomics program aiming at the discovery of new antibacterial gene targets among evolutionary conserved genes of uncharacterized function. A series of bioinformatic and comparative genomics analyses were used to identify a set of 221 candidate genes common to Gram-positive and Gram-negative bacteria. These genes were split between two laboratories. They are now submitted to a systematic 3-D structure determination protocol including cloning, protein expression and purification, crystallization, X-ray diffraction, structure interpretation, and function prediction. We describe here our strategies for the 111 genes processed in our laboratory. Bioinformatics is used at most stages of the production process and out of 111 genes processed – and 17 months into the project – 108 have been successfully cloned, 103 have exhibited detectable expression, 84 have led to the production of soluble protein, 46 have been purified, 12 have led to usable crystals, and 7 structures have been determined. 相似文献
7.
Structural and nucleotide-binding properties of YajQ and YnaF,two Escherichia coli proteins of unknown function 下载免费PDF全文
Saveanu C Miron S Borza T Craescu CT Labesse G Gagyi C Popescu A Schaeffer F Namane A Laurent-Winter C Bârzu O Gilles AM 《Protein science : a publication of the Protein Society》2002,11(11):2551-2560
Structural genomics is a new approach in functional assignment of proteins identified via whole-genome sequencing programs. Its rationale is that nonhomologous proteins performing similar or related biological functions might have similar tertiary structure. We used dye pseudoaffinity chromatography, two-dimensional gel electrophoresis, and mass spectrometry to identify two novel Escherichia coli nucleotide-binding proteins, YnaF and YajQ. YnaF exhibited significant sequence identity with MJ0577, an ATP-binding protein from a hyperthermophile (Methanococcus jannaschii), and with UspA, a protein from Haemophilus influenzae that belongs to the Universal Stress Protein family. YnaF conserves the ATP-binding site and the dimeric structure observed in the crystal of MJ0577. The protein YajQ, present in many bacterial genomes, is missing in eukaryotes. In the absence of significant similarities of YajQ to any solved structure, we determined its structural and ligand-binding properties by NMR and isothermal titration calorimetry. We demonstrate that YajQ is composed of two domains, each centered on a beta-sheet, that are connected by two helical segments. NMR studies, corroborated with local sequence conservation among YajQ homologs in various bacteria, indicate that one of the beta-sheets is mostly involved in biological activity. 相似文献
8.
9.
冠状病毒(coronaviridae / Coronaviruse,CoV)是一类对人及家畜具有严重危害的病原微生物,其中的SARS(Severe Acute Respiratory Syndromes)- CoV更是于2003年引起全球性爆发,对人类健康和全球经济造成了严重威胁和重大损失。冠状病毒拥有目前已知最大的正链RNA基因组。侵染细胞后,它通过直接翻译与不连续转录-翻译得到一系列病毒蛋白,包括非结构蛋白、结构蛋白与附属蛋白。对冠状病毒转录/复制过程具有重要作用的非结构蛋白与附属蛋白,由于与目前研究较为深入的蛋白序列相似性很低,因此系统地开展与冠状病毒转录/复制密切相关的蛋白质的结构基因组研究,不仅能够从分子水平深入了解冠状病毒转录复制的分子机制,而且对于有针对性地设计抗冠状病毒的药物具有非常关键的作用。因此,美国,欧洲及我国的多个研究组实施了与冠状病毒及其相关病原体的结构基因组计划,使人类对冠状病毒蛋白质结构与功能的认识进入了一个新阶段。本文将对目前世界范围内冠状病毒结构基因组研究取得的进展进行综合性地展示与介绍。 相似文献
10.
Pusey ML Liu ZJ Tempel W Praissman J Lin D Wang BC Gavira JA Ng JD 《Progress in biophysics and molecular biology》2005,88(3):359-386
The common goal for structural genomic centers and consortiums is to decipher as quickly as possible the three-dimensional structures for a multitude of recombinant proteins derived from known genomic sequences. Since X-ray crystallography is the foremost method to acquire atomic resolution for macromolecules, the limiting step is obtaining protein crystals that can be useful of structure determination. High-throughput methods have been developed in recent years to clone, express, purify, crystallize and determine the three-dimensional structure of a protein gene product rapidly using automated devices, commercialized kits and consolidated protocols. However, the average number of protein structures obtained for most structural genomic groups has been very low compared to the total number of proteins purified. As more entire genomic sequences are obtained for different organisms from the three kingdoms of life, only the proteins that can be crystallized and whose structures can be obtained easily are studied. Consequently, an astonishing number of genomic proteins remain unexamined. In the era of high-throughput processes, traditional methods in molecular biology, protein chemistry and crystallization are eclipsed by automation and pipeline practices. The necessity for high-rate production of protein crystals and structures has prevented the usage of more intellectual strategies and creative approaches in experimental executions. Fundamental principles and personal experiences in protein chemistry and crystallization are minimally exploited only to obtain “low-hanging fruit” protein structures. We review the practical aspects of today's high-throughput manipulations and discuss the challenges in fast pace protein crystallization and tools for crystallography. Structural genomic pipelines can be improved with information gained from low-throughput tactics that may help us reach the higher-bearing fruits. Examples of recent developments in this area are reported from the efforts of the Southeast Collaboratory for Structural Genomics (SECSG). 相似文献
11.
12.
Xu QS Jancarik J Lou Y Kuznetsova K Yakunin AF Yokota H Adams P Kim R Kim SH 《Journal of structural and functional genomics》2005,6(4):269-279
Phosphotransacetylase (Pta) [EC 2.3.1.8] plays a major role in acetate metabolism by catalyzing the reversible transfer of the acetyl group between coenzyme A (CoA) and orthophosphate: CH(3)COSCoA+HPO(4)(2-)<-->CH(3)COOPO(3)(2-) +CoASH. In this study, we report the crystal structures of Pta from Bacillus subtilis at 2.75 A resolution and its complex with acetyl phosphate, one of its substrates, at 2.85 A resolution. In addition, the Pta activity of the enzyme has been assayed. The enzyme folds into an alpha/beta architecture with two domains separated by a prominent cleft, very similar to two other known Pta structures. The enzyme-acetyl phosphate complex structure reveals a few potential substrate binding sites. Two of them are located in the middle of the interdomain cleft: each one is surrounded by a region of strictly and highly conserved residues. High structural similarities are found with 4-hydroxythreonine-4-phosphate dehydrogenase (PdxA), and isocitrate and isopropylmalate dehydrogenases, all of which utilize NADP+ as their cofactor, which binds in the interdomain cleft. Their substrate binding sites are close to the acetyl phosphate binding sites of Pta in the cleft as well. These results suggest that the CoA is likely to bind to the interdomain cleft of Pta in a similar way as NADP+ binds to the other three enzymes. 相似文献
13.
Robert M. Stroud Senyon Choe James Holton H. Ronald Kaback Witek Kwiatkowski Daniel L. Minor Roland Riek Andrej Sali Henning Stahlberg William Harries 《Journal of structural and functional genomics》2009,10(2):193-208
A synopsis of the 2007 annual progress report for the Center for Structures of Membrane Proteins, a specialized center of
the Protein Structure Initiative. 相似文献
14.
We investigated structural reorganization of two different kinds of molecular sheets derived from the cellulose II crystal using molecular dynamics (MD) simulations, in order to identify the initial structure of the cellulose crystal in the course of its regeneration process from solution. After a one-nanosecond simulation, the molecular sheet formed by van der Waals forces along the () crystal plane did not change its structure in an aqueous environment, while the other one formed by hydrogen bonds along the (1 1 0) crystal plane changed into a van der Waals-associated molecular sheet, such as the former. The two structures that were calculated showed substantial similarities such as the high occupancy of intramolecular hydrogen bonds between O3H and O5 of over 0.75, few intermolecular hydrogen bonds, and the high occurrence of hydrogen bonding with water. The convergence of the two structures into one denotes that the van der Waals-associated molecular sheet can be the initial structure of the cellulose crystal formed in solution. The main chain conformations were almost the same as those in the cellulose II crystal except for a −16° shift of φ (dihedral angle of O5-C1-O1-C4) and the gauche-gauche conformation of the hydroxymethyl side group appears probably due to its hydrogen bonding with water. These results suggest that the van der Waals-associated molecular sheet becomes stable in an aqueous environment with its hydrophobic inside and hydrophilic periphery. Contrary to this, a benzene environment preferred a hydrogen-bonded molecular sheet, which is expected to be the initial structure formed in benzene. 相似文献
15.
Lipidic cubic phase-grown crystals yielded high resolution structures of a number of archaeal retinal proteins, the molecular mechanisms of which are being revealed as structures of photocycle intermediates become available. The structural basis for bacteriorhodopsin's mechanism of proton pumping is discussed, revealing a well-synchronized sequence of molecular events. Comparison with the high resolution structures of the halide pump halorhodopsin, as well as with the receptor sensory rhodopsin II, illustrates how small and localized structural changes result in functional divergence. Fundamental principles of energy transduction and sensory reception in the archaeal rhodopsins, which may have relevance to other systems, are discussed. 相似文献
16.
在蛋白质工程、绿色生物制造以及合成生物学等研究领域中,对重要催化反应的重塑和合成路径的优化搭建,都依赖于对相关蛋白质结构与功能的深入了解。合成生物技术近年来的飞速发展对关键菌种及生物催化过程中的蛋白质的性能提出了更高要求,相关研究的关键是获得大批量、高纯度目的蛋白,并进行快速、准确的构效关系研究。中国科学院天津工业生物技术研究所建所10年来,在工业蛋白质领域进行了多年的积累,成功搭建成了蛋白质结构生物学平台;并在植物天然产物合成相关萜类合成酶、白色污染降解的聚对苯二甲酸乙二酯(polyethylene terephthalate, PET)塑料降解酶以及生物质转化利用相关酶等方面获得了一些进展,通过对这些蛋白进行结构和功能的研究,为许多研究工作提供了理论依据。蛋白质结构功能研究相关技术的不断发展,将加速合成生物学的学术和工业应用研究,推动我国生物制造领域的科技创新升级。 相似文献
17.
Lundstrom K Wagner R Reinhart C Desmyter A Cherouati N Magnin T Zeder-Lutz G Courtot M Prual C André N Hassaine G Michel H Cambillau C Pattus F 《Journal of structural and functional genomics》2006,7(2):77-91
Production of recombinant receptors has been one of the major bottlenecks in structural biology on G protein-coupled receptors
(GPCRs). The MePNet (Membrane Protein Network) was established to overexpress a large number of GPCRs in three major expression
systems, based on Escherichia coli, Pichia pastoris and Semliki Forest virus (SFV) vectors. Evaluation by immunodetection demonstrated that 50% of a total of 103 GPCRs were
expressed in bacterial inclusion bodies, 94% in yeast cell membranes and 95% in SFV-infected mammalian cells. The expression
levels varied from low to high and the various GPCR families and subtypes were analyzed for their expressability in each expression
system. More than 60% of the GPCRs were expressed at milligram levels or higher in one or several systems, compatible to structural
biology applications. Functional activity was determined by binding assays in yeast and mammalian cells and the correlation
between immunodetection and binding activity was analyzed. 相似文献
18.
McGeehan JE Streeter SD Thresh SJ Taylor JE Shevtsov MB Kneale GG 《Journal of molecular biology》2011,409(2):177-188
Controller proteins play a key role in the temporal regulation of gene expression in bacterial restriction-modification (R-M) systems and are important mediators of horizontal gene transfer. They form the basis of a highly cooperative, concentration-dependent genetic switch involved in both activation and repression of R-M genes. Here we present biophysical, biochemical, and high-resolution structural analysis of a novel class of controller proteins, exemplified by C.Csp231I. In contrast to all previously solved C-protein structures, each protein subunit has two extra helices at the C-terminus, which play a large part in maintaining the dimer interface. The DNA binding site of the protein is also novel, having largely AAAA tracts between the palindromic recognition half-sites, suggesting tight bending of the DNA. The protein structure shows an unusual positively charged surface that could form the basis for wrapping the DNA completely around the C-protein dimer. 相似文献
19.
Enterovirus 71(EV71) is one of the main pathogens that causes hand-foot-and-mouth disease(HFMD). HFMD caused by EV71 infection is mostly self-limited; however, some infections can cause severe neurological diseases, such as aseptic meningitis, brain stem encephalitis, and even death. There are still no effective clinical drugs used for the prevention and treatment of HFMD. Studying EV71 protein function is essential for elucidating the EV71 replication process and developing anti-EV71 drugs and vaccines. In this review, we summarized the recent progress in the studies of EV71 noncoding regions(50 UTR and 30 UTR) and all structural and nonstructural proteins, especially the key motifs involving in viral infection, replication, and immune regulation. This review will promote our understanding of EV71 virus replication and pathogenesis, and will facilitate the development of novel drugs or vaccines to treat EV71. 相似文献
20.
In the field of evolutionary structural genomics, methods are needed to evaluate why genomes evolved to contain the fold distributions that are observed. In order to study the effects of population dynamics in the evolved genomes we need fast and accurate evolutionary models which can analyze the effects of selection, drift and fixation of a protein sequence in a population that are grounded by physical parameters governing the folding and binding properties of the sequence. In this study, various knowledge-based, force field, and statistical methods for protein folding have been evaluated with four different folds: SH2 domains, SH3 domains, Globin-like, and Flavodoxin-like, to evaluate the speed and accuracy of the energy functions. Similarly, knowledge-based and force field methods have been used to predict ligand binding specificity in SH2 domain. To demonstrate the applicability of these methods, the dynamics of evolution of new binding capabilities by an SH2 domain is demonstrated. 相似文献