Similar literature
 20 similar articles found; search time: 15 ms
1.
Using a structural similarity clustering of protein domains, the protein domain universe graph (PDUG), and a hierarchical functional annotation, the gene ontology (GO), as two evolutionary lenses, we find that each structural cluster (domain fold) exhibits a distribution of functions that is unique to it. These functional distributions are functional fingerprints that are specific to characteristic structural clusters and vary from cluster to cluster. Furthermore, as the structural similarity threshold for domain clustering in the PDUG is relaxed, we observe an influx of earlier-diverged domains into clusters. These domains join clusters without destroying the functional fingerprint. These results can be understood in light of a divergent evolution scenario that posits correlated divergence of structural and functional traits in protein domains from one or a few progenitors.

2.
3.
α‐Helical membrane proteins exist in an anisotropic environment that strongly influences their folding, stability, and architecture. This architecture is far more complex than a simple bundle of transmembrane helices, notably because of helix deformations, prosthetic groups, and extramembrane structures. However, the role and the distribution of such heterogeneity in the supramolecular organization of membrane proteins remain poorly investigated. Using a nonredundant subset of α‐helical membrane proteins, we have annotated and analyzed the statistics of several types of new elements such as incomplete helices, intramembrane loops, helical extensions of helical transmembrane domains, extracellular loops, and helices lying parallel to the membrane surface. The relevance of the annotation scheme was studied using residue composition, statistics, physical chemistry, and symmetry of their distribution in relation to the immediate membrane environment. Calculation of hydrophobicity using different scales shows that different structural elements appear to have affinities coherent with their position in the membrane. Examination of the annotation scheme suggests that there is considerable information content in the amino acid compositions of the different elements, suggesting that it might be useful for structural prediction. More importantly, the proposed annotation will help to decipher the complex hierarchy of interactions involved in membrane protein architecture. © 2009 Wiley Periodicals, Inc. Biopolymers 91: 815–829, 2009. This article was originally published online as an accepted preprint. The “Published Online” date corresponds to the preprint version. You can request a copy of the preprint by emailing the Biopolymers editorial office at biopolymers@wiley.com

4.
5.
Baoqiang Cao  Ron Elber 《Proteins》2010,78(4):985-1003
We investigate small sequence adjustments (of one or a few amino acids) that induce large conformational transitions between distinct and stable folds of proteins. Such transitions are intriguing from evolutionary and protein‐design perspectives. They make it possible to search for ancient protein structures or to design protein switches that flip between folds and functions. A network of sequence flow between protein folds is computed for representative structures of the Protein Data Bank. The computed network is dense; on average, each structure is connected to tens of other folds. Proteins that attract sequences from a higher than expected number of neighboring folds are more likely to be enzymes and to have an alpha/beta fold. The large number of connections between folds may reflect the need of enzymes to adjust their structures for alternative substrates. The network of the Cro family is discussed, and we speculate that capacity is an important factor (but not the only one) that determines protein evolution. The experimentally observed flip from an all alpha to an alpha + beta fold is examined with the network tools. A kinetic model for the transition of sequences between the folds (with only protein stability in mind) is proposed. Proteins 2010. © 2009 Wiley‐Liss, Inc.

6.
The causal relationship between protein structural change and ligand binding was classified and annotated for 839 nonredundant pairs of crystal structures in the Protein Data Bank, one with and the other without a bound low-molecular-weight ligand molecule. Protein structural changes were first classified into either domain or local motions depending on the size of the moving protein segments. Whether the protein motion was coupled with ligand binding was then evaluated based on the location of the ligand binding site and by application of the linear response theory of protein structural change. Protein motions coupled with ligand binding were further classified into either closure or opening motions. This classification revealed the following: (i) domain motions coupled with ligand binding are dominated by closure motions, which can be described by the linear response theory; (ii) local motions frequently accompany order-disorder or α-helix-coil conformational transitions; and (iii) transferase activity (Enzyme Commission number 2) is the predominant function among coupled domain closure motions. This could be explained by the closure motion acting to insulate the reaction site of these enzymes from environmental water.

7.
Wang X 《RNA (New York, N.Y.)》2008,14(6):1012-1017
MicroRNAs (miRNAs) are short noncoding RNAs that are involved in the regulation of thousands of gene targets. Recent studies indicate that miRNAs are likely to be master regulators of many important biological processes. Due to their functional importance, miRNAs are under intense study at present, and many studies have been published in recent years on miRNA functional characterization. The rapid accumulation of miRNA knowledge makes it challenging to properly organize and present miRNA function data. Although several miRNA functional databases have been developed recently, this remains a major bioinformatics challenge to the miRNA research community. Here, we describe a new online database system, miRDB, on miRNA target prediction and functional annotation. A flexible web search interface was developed for the retrieval of target prediction results, which were generated with a new bioinformatics algorithm we developed recently. Unlike most other miRNA databases, miRNA functional annotations in miRDB are presented with a primary focus on mature miRNAs, which are the functional carriers of miRNA-mediated gene expression regulation. In addition, a wiki editing interface was established to allow anyone with Internet access to make contributions on miRNA functional annotation. This is a new attempt to develop an interactive community-annotated miRNA functional catalog. All data stored in miRDB are freely accessible at http://mirdb.org.

8.
A methodological framework is presented for the graph theoretical interpretation of NMR data of protein interactions. The proposed analysis generalizes the idea of network representations of protein structures by expanding it to protein interactions. This approach is based on regularization of residue‐resolved NMR relaxation times and chemical shift data and subsequent construction of an adjacency matrix that represents the underlying protein interaction as a graph or network. The network nodes represent protein residues. Two nodes are connected if two residues are functionally correlated during the protein interaction event. The analysis of the resulting network enables the quantification of the importance of each amino acid of a protein for its interactions. Furthermore, the determination of the pattern of correlations between residues yields insights into the functional architecture of an interaction. This is of special interest for intrinsically disordered proteins, since the structural (three‐dimensional) architecture of these proteins and their complexes is difficult to determine. The power of the proposed methodology is demonstrated using the example of the interaction between the intrinsically disordered protein osteopontin and its natural ligand heparin.

9.
Reversible protein phosphorylation by protein kinases and phosphatases is a ubiquitous signaling mechanism in all eukaryotic cells. A multilevel hidden Markov model library is presented which is able to classify protein kinases into one of 12 families, with a misclassification rate of zero on the characterized kinomes of H. sapiens, M. musculus, D. melanogaster, C. elegans, S. cerevisiae, D. discoideum, and P. falciparum. The Library is shown to outperform BLASTP and a general Pfam hidden Markov model of the kinase catalytic domain in the retrieval and family-level classification of protein kinases. The application of the Library to the 38 unclassified kinases of yeast enriches the yeast kinome in protein kinases of the families AGC (5), CAMK (17), CMGC (4), and STE (1), thereby raising the family-level classification of yeast conventional protein kinases from 66.96 to 90.43%. The application of the Library to 21 eukaryotic genomes shows seven families (AGC, CAMK, CK1, CMGC, STE, PIKK, and RIO) to be present in all genomes analyzed; these families are therefore likely to be essential to eukaryotes. Putative tyrosine kinases (TKs) are found in the plants A. thaliana (2), O. sativa ssp. Indica (6), and O. sativa ssp. Japonica (7), and in the amoeba E. histolytica (7). To our knowledge, TKs have not been predicted in plants before. This also suggests that a primitive set of TKs might have predated the radiation of eukaryotes. Putative tyrosine kinase-like kinases (TKLs) are found in the fungi C. neoformans (2) and P. chrysosporium (4), in the Apicomplexans C. hominis (4), P. yoelii (4), and P. falciparum (6), in the amoeba E. histolytica (109), and in the alga T. pseudonana (6). TKLs are found to be abundant in plants (776 in A. thaliana, 1010 in O. sativa ssp. Indica, and 969 in O. sativa ssp. Japonica). TKLs might also have predated the radiation of eukaryotes and have been lost secondarily from some fungi.
The application of the Library facilitates the annotation of kinomes and has provided novel insights on the early evolution and subsequent adaptations of the various protein kinase families in eukaryotes.

10.
Sistla RK  K V B  Vishveshwara S 《Proteins》2005,59(3):616-626
We present a novel method for the identification of structural domains and domain interface residues in proteins by a graph spectral method. This method converts the three-dimensional structure of the protein into a graph by using atomic coordinates from the PDB file. Domain definitions are obtained by constructing either a protein backbone graph or a protein side-chain graph. The graph is constructed based on the interactions between amino acid residues in the three-dimensional structure of the proteins. The spectral parameters of such a graph contain information regarding the domains and subdomains in the protein structure. This is based on the fact that the interactions among amino acids are higher within a domain than across domains. This is evident in the spectra of the protein backbone and the side-chain graphs, thus differentiating the structural domains from one another. Further, residues that occur at the interface of two domains can also be easily identified from the spectra. This method is simple, elegant, and robust. Moreover, a single numeric computation yields both the domain definitions and the interface residues.
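
The general idea of reading a domain split off a graph spectrum can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes one representative coordinate per residue, an arbitrary distance cutoff, and a standard Laplacian bisection in which the sign of the Fiedler vector (the eigenvector of the second-smallest Laplacian eigenvalue) separates two densely connected groups. The function name `fiedler_partition` and the `cutoff` parameter are hypothetical.

```python
import numpy as np

def fiedler_partition(coords, cutoff=6.5):
    """Bisect residues by the sign of the Fiedler vector of a contact graph.

    coords: (n, 3) array-like with one representative point per residue
            (assumption: e.g. C-alpha atoms).
    cutoff: distance below which two residues count as interacting
            (assumption: illustrative value, not taken from the abstract).
    """
    coords = np.asarray(coords, dtype=float)
    # Pairwise Euclidean distances between all residues.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # Unweighted adjacency matrix of the contact graph (no self-loops).
    adjacency = (dist < cutoff).astype(float)
    np.fill_diagonal(adjacency, 0.0)
    # Graph Laplacian L = D - A.
    laplacian = np.diag(adjacency.sum(axis=1)) - adjacency
    # eigh returns eigenvalues in ascending order for symmetric matrices.
    _, eigenvectors = np.linalg.eigh(laplacian)
    fiedler = eigenvectors[:, 1]
    # Residues with the same sign fall into the same putative domain.
    labels = (fiedler > 0).astype(int)
    return labels, fiedler
```

Components of the Fiedler vector that lie close to zero belong to residues that sit between the two groups, which is one simple way to flag candidate interface residues in a sketch like this. The overall sign of an eigenvector is arbitrary, so only the grouping is meaningful, not which group is labelled 0 or 1.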

11.
Bovine seminal ribonuclease (BS-RNase) is a unique member of the pancreatic-like ribonuclease superfamily. The native enzyme is a mixture of two dimeric forms with distinct structural features. The most abundant form is characterized by the swapping of N-terminal fragments. In this paper, the crystal structure of the complex between the swapping dimer and uridylyl(2',5')adenosine is reported at 2.06 Å resolution. The refined model has a crystallographic R-factor of 0.184 and good stereochemistry. The quality of the electron density maps enables the structure of both the inhibitor and active site residues to be unambiguously determined. The overall architecture of the active site is similar to that of RNase A. The dinucleotide adopts an extended conformation with the pyrimidine and purine bases interacting with Thr45 and Asn71, respectively. Several residues (Gln11, His12, Lys41, His119, and Phe120) bind the oxygens of the phosphate group. The structural similarity of the active sites of BS-RNase and RNase A includes some specific water molecules believed to be relevant to catalytic activity. Upon binding of the dinucleotide, small but significant modifications of the tertiary and quaternary structure of the protein are observed. The ensuing correlation of these modifications with the catalytic activity of the enzyme is discussed.

12.
With the rapid growth of sequence databases, there is an increasing need for reliable functional characterisation and annotation of newly predicted proteins. To cope with such large data volumes, faster and more effective means of protein sequence characterisation and annotation are required. One promising approach is automatic large-scale functional characterisation and annotation, which is generated with limited human interaction. However, such an approach is heavily dependent on reliable data sources. The SWISS-PROT protein sequence database plays an essential role here owing to its high level of functional information.

13.
Using genetic engineering technologies, the chitin-binding domain (ChBD) of the human macrophage chitotriosidase has been inserted into the host protein BlaP, a class A beta-lactamase produced by Bacillus licheniformis. The product of this construction behaved as a soluble chimeric protein that retains the capacity both to bind chitin and to hydrolyze the beta-lactam moiety. Here we describe the biochemical and biophysical properties of this protein (BlaPChBD). This work contributes to a better understanding of the reciprocal structural and functional effects of the insertion on the host protein scaffold and the heterologous structured protein fragments. The use of BlaP as a protein carrier represents an efficient approach to the functional study of heterologous protein fragments.

14.
A consensus approach for the assignment of structural domains in proteins is presented. The approach combines a number of previously published algorithms, and takes advantage of the elevated accuracy obtained when assignments from the individual algorithms are in agreement. The consensus approach is tested on a data set of 55 protein chains, for which domain assignments from four automated methods were known, and for which crystallographers' assignments had been reported in the literature. Accuracy was found to increase in this test from 72% using individual algorithms to 100% when all four methods were in agreement. However, a consensus prediction using all four methods was only possible for 52% of the data set. The consensus approach [using three publicly available domain assignment algorithms (PUU, DETECTIVE, DOMAK)] was then used to make domain assignments for a data set of 787 protein chains from the Protein Data Bank. Analysis of the assignments showed 55.7% of assignments could be made automatically, and of these, 13.5% were multi-domain proteins. Of the remaining 44.3% that could not be assigned by the consensus procedure, 90.4% had their domain boundaries assigned correctly by at least one of the algorithms. Once identified, these domains were analyzed for trends in their size and secondary structure class. In addition, the discontinuity of each domain along the protein chain was considered.
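
The consensus idea described above can be sketched in a few lines. The abstract does not say how agreement between methods is defined, so the sketch assumes that two assignments agree when they report the same number of domain boundaries and each boundary lies within a fixed residue tolerance of the first method's boundary. The function name `consensus_assignment`, the `tolerance` parameter, and the boundary values in the usage example are hypothetical; only the method names PUU, DETECTIVE, and DOMAK come from the text.

```python
def consensus_assignment(predictions, tolerance=10):
    """Return averaged domain boundaries when all methods agree, else None.

    predictions: dict mapping a method name to a list of boundary residue
                 numbers; an empty list means "single-domain chain".
    tolerance:   maximum allowed difference (in residues) between matching
                 boundaries (assumption: illustrative value).
    """
    boundary_lists = [sorted(b) for b in predictions.values()]
    if not boundary_lists:
        return None
    reference = boundary_lists[0]
    for other in boundary_lists[1:]:
        # Methods must agree on the number of domains.
        if len(other) != len(reference):
            return None
        # Matching boundaries must fall within the tolerance window.
        if any(abs(x - y) > tolerance for x, y in zip(reference, other)):
            return None
    # Agreement: report the mean position of each boundary.
    count = len(boundary_lists)
    return [round(sum(column) / count) for column in zip(*boundary_lists)]


# Hypothetical boundary positions for one two-domain chain.
agreed = consensus_assignment({"PUU": [120], "DETECTIVE": [124], "DOMAK": [119]})
# A chain where one method reports an extra domain is left unassigned.
rejected = consensus_assignment({"PUU": [120], "DETECTIVE": [124], "DOMAK": [60, 119]})
```

Returning `None` on disagreement mirrors the trade-off reported in the abstract: the chains that reach consensus are assigned with higher accuracy, while the remainder are left for manual inspection.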

15.
The formation of the alpha(2) dimer in Escherichia coli core RNA polymerase (RNAP) is thought to be the first step toward the assembly of the functional enzyme. A large body of evidence indicates that the alpha-subunit dimerizes through its N-terminal domain (NTD). The crystal structures of the alpha-subunit NTD and that of a homologous Thermus aquaticus core RNAP are known. To identify the stabilizing interactions in the dimer interface of the alpha-NTD of E. coli RNAP, we identified side-chain clusters by using the crystal structure coordinates of E. coli alpha-NTD. A graph spectral algorithm was used to identify side-chain clusters. This algorithm considers the global nonbonded side-chain interactions of the residues for the clustering procedure and is unique in identifying, in a quantitative way, the residues that make the largest number of interactions among the residues that form clusters. By using this algorithm, a nine-residue cluster consisting of polar and hydrophobic residues was identified in the subunit interface adjacent to the hydrophobic core. The residues forming the cluster lie in relatively rigid regions of the interface, as measured by the thermal factors of the residues. Most of the cluster residues in the E. coli enzyme were topologically and sequentially conserved in the T. aquaticus RNAP crystal structure. Residues 35F and 46I were predicted to be important in the stability of the alpha-dimer interface, with 35F forming the center of the cluster. The predictions were tested by isolating single-point mutants alpha-F35A and alpha-I46S on the dimer interface, which were found to disrupt dimerization. Thus, the identified cluster at the edge of the dimer interface seems to be a vital component in stabilizing the alpha-NTD.

16.
We have previously attempted to simulate domain creation in early protein evolution by recombining polypeptide segments from non-homologous proteins, and we have described the structure of one such de novo protein, 1b11, a segment-swapped tetramer with novel architecture. Here, we have analyzed the thermodynamic stability and folding kinetics of the 1b11 tetramer and its monomeric and dimeric intermediates, and of 1b11 mutants with changes at the domain interface. Denatured 1b11 polypeptides fold into transient, folded monomers with marginal stability (ΔG < 1 kcal mol⁻¹) which convert rapidly (approximately 6 × 10⁴ M⁻¹ s⁻¹) into dimers (ΔG = 9.8 kcal mol⁻¹) and then more slowly (approximately 3 M⁻¹ s⁻¹) into tetramers (ΔG = 28 kcal mol⁻¹). Segment swapping takes place during dimerization, as suggested by mass spectroscopic analysis of covalently linked peptides derived from proteolysis of a disulfide-linked dimer. Our results confirm that segment swapping and associated oligomerization are both powerful ways of stabilizing proteins, and we suggest that this may have been a feature of early protein evolution.

17.
Mitochondria are organelles derived from α-proteobacteria over the course of one to two billion years. Mitochondria from the major eukaryotic lineages display some variation in functions and coding capacity but sequence analysis demonstrates them to be derived from a single common ancestral endosymbiont. The loss of assorted functions, the transfer of genes to the nucleus, and the acquisition of various ‘eukaryotic’ proteins have resulted in an organelle that contains approximately 1000 different proteins, with most of these proteins imported into the organelle across one or two membranes. A single translocase in the outer membrane and two translocases in the inner membrane mediate protein import. Comparative sequence analysis and functional complementation experiments suggest some components of the import pathways to be directly derived from the eubacterial endosymbiont's own proteins, and some to have arisen ‘de novo’ at the earliest stages of ‘mitochondrification’ of the endosymbiont. A third class of components appears lineage-specific, suggesting they were incorporated into the process of protein import long after the mitochondrion was established as an organelle and after the divergence of the various eukaryotic lineages. Protein sorting pathways inherited from the endosymbiont have been co-opted and play roles in intraorganelle protein sorting after import. The import apparatuses of animals and fungi show significant similarity to one another, but differ considerably from the plant apparatus. Increasing complexity in the eukaryotic lineage, i.e., from single-celled to multicellular life forms, has been accompanied by an expansion in genes encoding each component, resulting in small gene families encoding many components. The functional differences in these gene families remain to be elucidated, but point to a mosaic import apparatus that can be regulated by a variety of signals.

18.
Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the smallest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture "neighbors" identified in this way may lead to new insights about the evolution of protein function.

19.
20.
It is shown that complex adaptations are best modelled as discrete processes represented on directed weighted graphs. Such a representation captures the idea that problems of adaptation in evolutionary biology are problems in a discrete space, something that the conventional representations using continuous adaptive landscapes do not. Further, this representation allows the utilization of well-known algorithms for the computation of several biologically interesting results such as the accessibility of one allele from another by a specified number of point mutations, the accessibility of alleles at a local maximum of fitness, the accessibility of the allele with the globally maximum fitness, etc. A reduction of a model due to Kauffman and Levin to such a representation is explicitly carried out and it is shown how this reduction clarifies the biological questions that are of interest. Thanks are due to William Wimsatt, James F. Crow, and the referees for Biology and Philosophy for comments on an earlier version of this paper. Remarks by members of the audience, especially Abner Shimony, of a seminar at Boston University, February 19, 1988, were also very helpful. The diagrams were prepared with the assistance of Tracy Lubas.
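
The accessibility questions listed above map onto ordinary graph searches, as the following minimal sketch suggests. It is an illustration under stated assumptions rather than the paper's construction: alleles are equal-length strings, an edge runs from one allele to another when they differ by a single point mutation and the target is fitter, and the edge weight is the fitness gain. The function names and the toy fitness table are hypothetical.

```python
from collections import deque

def build_graph(fitness):
    """Directed weighted graph over alleles.

    An edge a -> b exists when b is one point mutation away from a and has
    higher fitness (assumption: adaptive steps only); its weight is the gain.
    """
    graph = {allele: {} for allele in fitness}
    for a in fitness:
        for b in fitness:
            if len(a) != len(b):
                continue
            hamming = sum(x != y for x, y in zip(a, b))
            if hamming == 1 and fitness[b] > fitness[a]:
                graph[a][b] = fitness[b] - fitness[a]
    return graph

def accessible_within(graph, start, max_steps):
    """Breadth-first search: alleles reachable in at most max_steps mutations.

    Returns a dict mapping each reachable allele to its minimum step count.
    """
    steps = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if steps[current] == max_steps:
            continue
        for neighbour in graph[current]:
            if neighbour not in steps:
                steps[neighbour] = steps[current] + 1
                queue.append(neighbour)
    return steps

def local_maxima(graph):
    """Alleles with no fitter one-mutation neighbour (no outgoing edges)."""
    return sorted(allele for allele, out in graph.items() if not out)


# Toy two-locus, two-state example with made-up fitness values.
toy_fitness = {"00": 1.0, "01": 2.0, "10": 0.5, "11": 3.0}
toy_graph = build_graph(toy_fitness)
```

With this toy table, `accessible_within(toy_graph, "00", 2)` reaches the fittest allele `"11"` through `"01"`, and `local_maxima(toy_graph)` returns the alleles from which no further adaptive step is possible.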
