首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Structural genomics began as a global effort in the 1990s to determine the tertiary structures of all protein families as a response to large-scale genome sequencing projects. The immediate outcome was an influx of tens of thousands of protein structures, many of which had unknown functions. At the time, the value of structural genomics was controversial. However, the structures themselves were only the most obvious output. In addition, these newly solved structures motivated the emergence of huge data science and infrastructure efforts, which, together with advances in Deep Learning, have brought about a revolution in computational molecular biology. Here, we review some of the computational research carried out at the Protein Data Bank Japan (PDBj) during the Protein 3000 project under the leadership of Haruki Nakamura, much of which continues to flourish today.  相似文献   

2.
The dramatically increasing number of new protein sequences arising from genomics 4 proteomics requires the need for methods to rapidly and reliably infer the molecular and cellular functions of these proteins. One such approach, structural genomics, aims to delineate the total repertoire of protein folds in nature, thereby providing three-dimensional folding patterns for all proteins and to infer molecular functions of the proteins based on the combined information of structures and sequences. The goal of obtaining protein structures on a genomic scale has motivated the development of high throughput technologies and protocols for macromolecular structure determination that have begun to produce structures at a greater rate than previously possible. These new structures have revealed many unexpected functional inferences and evolutionary relationships that were hidden at the sequence level. Here, we present samples of structures determined at Berkeley Structural Genomics Center and collaborators laboratories to illustrate how structural information provides and complements sequence information to deduce the functional inferences of proteins with unknown molecular functions.Two of the major premises of structural genomics are to discover a complete repertoire of protein folds in nature and to find molecular functions of the proteins whose functions are not predicted from sequence comparison alone. To achieve these objectives on a genomic scale, new methods, protocols, and technologies need to be developed by multi-institutional collaborations worldwide. As part of this effort, the Protein Structure Initiative has been launched in the United States (PSI; www.nigms.nih.gov/funding/psi.html). Although infrastructure building and technology development are still the main focus of structural genomics programs [1–6], a considerable number of protein structures have already been produced, some of them coming directly out of semi-automated structure determination pipelines [6–10]. The Berkeley Structural Genomics Center (BSGC) has focused on the proteins of Mycoplasma or their homologues from other organisms as its structural genomics targets because of the minimal genome size of the Mycoplasmas as well as their relevance to human and animal pathogenicity (http://www.strgen.org). Here we present several protein examples encompassing a spectrum of functional inferences obtainable from their three-dimensional structures in five situations, where the inferences are new and testable, and are not predictable from protein sequence information alone.  相似文献   

3.
A major goal of structural genomics is the provision of a structural template for a large fraction of protein domains. The magnitude of this task depends on the number and nature of protein sequence families. With a large number of bacterial genomes now fully sequenced, it is possible to obtain improved estimates of the number and diversity of families in that kingdom. We have used an automated clustering procedure to group all sequences in a set of genomes into protein families. Bench-marking shows the clustering method is sensitive at detecting remote family members, and has a low level of false positives. This comprehensive protein family set has been used to address the following questions. (1) What is the structure coverage for currently known families? (2) How will the number of known apparent families grow as more genomes are sequenced? (3) What is a practical strategy for maximizing structure coverage in future? Our study indicates that approximately 20% of known families with three or more members currently have a representative structure. The study indicates also that the number of apparent protein families will be considerably larger than previously thought: We estimate that, by the criteria of this work, there will be about 250,000 protein families when 1000 microbial genomes have been sequenced. However, the vast majority of these families will be small, and it will be possible to obtain structural templates for 70-80% of protein domains with an achievable number of representative structures, by systematically sampling the larger families.  相似文献   

4.
The current pace of structural biology now means that protein three-dimensional structure can be known before protein function, making methods for assigning homology via structure comparison of growing importance. Previous research has suggested that sequence similarity after structure-based alignment is one of the best discriminators of homology and often functional similarity. Here, we exploit this observation, together with a merger of protein structure and sequence databases, to predict distant homologous relationships. We use the Structural Classification of Proteins (SCOP) database to link sequence alignments from the SMART and Pfam databases. We thus provide new alignments that could not be constructed easily in the absence of known three-dimensional structures. We then extend the method of Murzin (1993b) to assign statistical significance to sequence identities found after structural alignment and thus suggest the best link between diverse sequence families. We find that several distantly related protein sequence families can be linked with confidence, showing the approach to be a means for inferring homologous relationships and thus possible functions when proteins are of known structure but of unknown function. The analysis also finds several new potential superfamilies, where inspection of the associated alignments and superimpositions reveals conservation of unusual structural features or co-location of conserved amino acids and bound substrates. We discuss implications for Structural Genomics initiatives and for improvements to sequence comparison methods.  相似文献   

5.
在后基因组时代,随着大量物种全基因组序列的获得,结构生物学家面临着结构基因组学的新机遇和挑战。与传统的结构生物学不同的是,结构基因组学的研究主要集中在结构和功能未知并且与从前研究的蛋白质相似性很小的蛋白质。准确的来讲,结构基因组学通过高通量蛋白质表达、结构解析来完成所有蛋白质家族的结构表征,从而能够通过结构预测功能。加州结构基因组学联合实验室发展了高度自动化的蛋白质合成、结晶、结构解析生产线。然而由于一些蛋白质不能被结晶,要想覆盖所有蛋白质结构域还有很大困难。Wuthrich的研究小组通过一些高通量的目的蛋白质筛选和NMR结构解析的方法解决了这一难题。与X射线晶体学解析蛋白质结构相比,NMR技术由于能够解析更接近生理状态的溶液结构而具有互补性。通过获得溶液中的蛋白质稳定性、动力学特征和相互作用信息,正如在朊蛋白和SARS相关蛋白的研究中所表现的那样,NMR技术从扩大已知的蛋白质结构数据库、新的蛋白质功能到化学生物学研究中都扮演着激动人心的角色。  相似文献   

6.
Structural genomics can be defined as structural biology on a large number of target proteins in parallel. This approach plays an important role in modern structure-based drug design. Although a number of structural genomics initiatives have been initiated, relatively few are associated with integral membrane proteins. This indicates the difficulties in expression, purification, and crystallization of membrane proteins, which has also been confirmed by the existence of some 100 high-resolution structures of membrane proteins among the more than 30,000 entries in public databases. Paradoxically, membrane proteins represent 60–70% of current drug targets and structural knowledge could both improve and speed up the drug discovery process. In order to improve the sucess rate for structure resolution of membrane proteins structural genomics networks have been established.  相似文献   

7.
Structural genomics: computational methods for structure analysis   总被引:2,自引:0,他引:2       下载免费PDF全文
The success of structural genomics initiatives requires the development and application of tools for structure analysis, prediction, and annotation. In this paper we review recent developments in these areas; specifically structure alignment, the detection of remote homologs and analogs, homology modeling and the use of structures to predict function. We also discuss various rationales for structural genomics initiatives. These include the structure-based clustering of sequence space and genome-wide function assignment. It is also argued that structural genomics can be integrated into more traditional biological research if specific biological questions are included in target selection strategies.  相似文献   

8.
9.
10.
As the number of complete genomes that have been sequenced keeps growing, unknown areas of the protein space are revealed and new horizons open up. Most of this information will be fully appreciated only when the structural information about the encoded proteins becomes available. The goal of structural genomics is to direct large-scale efforts of protein structure determination, so as to increase the impact of these efforts. This review focuses on current approaches in structural genomics aimed at selecting representative proteins as targets for structure determination. We will discuss the concept of representative structures/folds, the current methodologies for identifying those proteins, and computational techniques for identifying proteins which are expected to adopt new structural folds.  相似文献   

11.
Tuberculosis (TB) is a devastating disease of worldwide importance. The availability of the genome sequence of Mycobacterium tuberculosis (Mtb), the causative agent, has stimulated a large variety of genome-scale initiatives. These include international structural genomics efforts which have the dual aim of characterising potential new drug targets and addressing key aspects of the biology of Mtb. This review highlights the various ways in which structural analysis has illuminated the biological activities of Mtb gene products, which were previously of unknown or uncertain function. Key information comes from the protein fold, from bound ligands, solvent molecules, ions etc. or from unexpectedly modified amino acid residues. Most importantly, the three dimensional structure of a protein permits the integration of data from many sources, both bioinformatic and experimental, to develop testable functional hypotheses. This has led to many new insights into TB biology.  相似文献   

12.
The Center for Eukaryotic Structural Genomics (CESG) produces and solves the structures of proteins from eukaryotes. We have developed and operate a pipeline to both solve structures and to test new methodologies. Both NMR and X-ray crystallography methods are used for structure solution. CESG chooses targets based on sequence dissimilarity to known structures, medical relevance, and nominations from members of the scientific community. Many times proteins qualify in more than one of these categories. Here we review some of the structures that have connections to human health and disease.  相似文献   

13.
We have developed and tested a simple and efficient protein purification method for biophysical screening of proteins and protein fragments by nuclear magnetic resonance (NMR) and optical methods, such as circular dichroism spectroscopy. The method constitutes an extension of previously described protocols for gene expression and protein solubility screening [M. Hammarstr?m et al., (2002), Protein Science 11, 313]. Using the present purification scheme it is possible to take several target proteins, produced as fusion proteins, from cell pellet to NMR spectrum and obtain a judgment on the suitability for further structural or biophysical studies in less than 1 day. The method is independent of individual protein properties as long as the target protein can be produced in soluble form with a fusion partner. Identical procedures for cell culturing, lysis, affinity chromatography, protease cleavage, and NMR sample preparation then initially require only optimization for different fusion partner and protease combinations. The purification method can be automated, scaled up or down, and extended to a traditional purification scheme. We have tested the method on several small human proteins produced in Escherichia coli and find that the method allows for detection of structured proteins and unfolded or molten globule-like proteins.  相似文献   

14.
简要介绍了结构基因组学研究中,用于测定蛋白质结构的X射线分析在解决衍射相位问题方面的最新进展。  相似文献   

15.
Overton IM  Barton GJ 《FEBS letters》2006,580(16):4005-4009
Target selection and ranking is fundamental to structural genomics. We present a Z-score scale, the "OB-Score", to rank potential targets by their predicted propensity to produce diffraction-quality crystals. The OB-Score is derived from a matrix of predicted isoelectric point and hydrophobicity values for nonredundant PDB entries solved to or=1 member with a high OB-Score, presenting favourable candidates for structural studies.  相似文献   

16.
This Perspective, arising from a workshop held in July 2008 in Buffalo NY, provides an overview of the role NMR has played in the United States Protein Structure Initiative (PSI), and a vision of how NMR will contribute to the forthcoming PSI-Biology program. NMR has contributed in key ways to structure production by the PSI, and new methods have been developed which are impacting the broader protein NMR community.  相似文献   

17.
Solution NMR structure determination of proteins revisited   总被引:2,自引:2,他引:0  
This 'Perspective' bears on the present state of protein structure determination by NMR in solution. The focus is on a comparison of the infrastructure available for NMR structure determination when compared to protein crystal structure determination by X-ray diffraction. The main conclusion emerges that the unique potential of NMR to generate high resolution data also on dynamics, interactions and conformational equilibria has contributed to a lack of standard procedures for structure determination which would be readily amenable to improved efficiency by automation. To spark renewed discussion on the topic of NMR structure determination of proteins, procedural steps with high potential for improvement are identified.  相似文献   

18.
Structure determination has already proven useful for lead optimization and direct drug design. The number of high-resolution structures available in public databases today exceeds 30,000 and will definitely aid in structure-based drug design. Structural genomics approaches covering whole genomes, topologically similar proteins or gene families are great assets for further progress in the development of new drugs. However, membrane proteins representing 70% of current drug targets are poorly characterized structurally. The problems have been related to difficulties in obtaining large amount of recombinant membrane proteins as well as their purification and structure determination. Structural genomics has proven successful in developing new methods in areas from expression to structure determination by studying a large number of target proteins in parallel.  相似文献   

19.
Knowledge of the three-dimensional structures of proteins is the key to unlocking the full potential of genomic information. There are two distinct directions along which cutting-edge research in structural biology is currently moving towards this goal. On the one hand, tightly focused long-term research in individual laboratories is leading to the determination of the structures of macromolecular assemblies of ever-increasing size and complexity. On the other hand, large consortia of structural biologists, inspired by the pace of genome sequencing, are developing strategies to determine new protein structures rapidly, so that it will soon be possible to predict reasonably accurate structures for most protein domains. We anticipate that a small number of complex systems, studied in depth, will provide insights across the field of biology with the aid of genome-based comparative structural analysis.  相似文献   

20.
Knowledge of the three-dimensional structures of proteins is the key to unlocking the full potential of genomic information. There are two distinct directions along which cutting-edge research in structural biology is currently moving towards this goal. On the one hand, tightly focused long-term research in individual laboratories is leading to the determination of the structures of macromolecular assemblies of ever-increasing size and complexity. On the other hand, large consortia of structural biologists, inspired by the pace of genome sequencing, are developing strategies to determine new protein structures rapidly, so that it will soon be possible to predict reasonably accurate structures for most protein domains. We anticipate that a small number of complex systems, studied in depth, will provide insights across the field of biology with the aid of genome-based comparative structural analysis.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号