Similar literature
 20 similar articles found (search time: 15 ms)
1.
A classification scheme for membrane proteins is proposed that clusters families of proteins into structural classes based on hydropathy profile analysis. The averaged hydropathy profiles of protein families are taken as fingerprints of the 3D structure of the proteins and, therefore, are able to detect more distant evolutionary relationships than amino acid sequences. A procedure was developed in which hydropathy profile analysis is used initially as a filter in a BLAST search of the NCBI protein database. The strength of the procedure is demonstrated by the classification of 29 families of secondary transporters into a single structural class, termed ST[3]. An exhaustive search of the database revealed that the 29 families contain 568 unique sequences. The proteins are predominantly of prokaryotic origin. Most of the characterized transporters in ST[3] transport organic and inorganic anions, and a smaller number are Na(+)/H(+) antiporters. All modes of energy coupling (symport, antiport, uniport) are found in structural class ST[3]. The relevance of the classification for structure/function prediction of uncharacterised transporters in the class is discussed.
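As an illustration of the kind of profile this abstract refers to, here is a minimal Python sketch of a single-sequence hydropathy profile. It assumes the Kyte-Doolittle scale and a plain sliding-window average; the abstract does not say which scale or window the authors used, and the function name `hydropathy_profile` and the default window of 19 are hypothetical choices. Averaging over an aligned protein family, as the abstract describes, is not shown.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not named in the abstract).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=19):
    """Return the mean hydropathy of every full window along the sequence.

    The window size is a hypothetical default; sequences shorter than the
    window give an empty profile.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    values = [KYTE_DOOLITTLE[residue] for residue in sequence.upper()]
    return [
        sum(values[start:start + window]) / window
        for start in range(len(values) - window + 1)
    ]
```

For instance, `hydropathy_profile("IIK", window=3)` returns a single value, the mean of 4.5, 4.5 and -3.9. Two such profiles could then be compared position by position, which is the general idea behind using them as structural fingerprints.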

2.
    
The sodium solute symporters (SSS) and neurotransmitter sodium symporters (NSS) are two families of secondary transporters that are not related in amino acid sequence. Nonetheless, recent crystal structures showed that the Na+/galactose (SSS) and Na+/leucine (NSS) transporters have similar core structures. The structural relatedness highlights the need for classification methods for membrane protein structures based on other criteria than amino acid similarity. Here, we demonstrate that a method based on hydropathy profile alignments convincingly identifies structural similarity between the NSS and SSS families. Most importantly, the method shows that one of the largest transporter families for which a crystal structure is elusive (the amino acid/polyamine/organocation or APC superfamily), also shares the similar core structure observed for the Na+/galactose and Na+/leucine transporters. The APC superfamily contains the major amino acid transporter families that are found throughout life. Insight into their structure will significantly facilitate the studies of this important group of transporters.  相似文献   

3.
    

4.
Much work has gone into understanding the nature of proteins. Hydrophobic interaction is the dominant force that drives proteins to carry out biochemical reactions in all living systems. Carbon is the only element that contributes to this hydrophobic interaction. Studies find that globular proteins prefer a carbon content of 31.45% for stability. Taking this as a standard, a carbon analysis program has been developed to study the carbon distribution profile of protein sequences. The program is available online at www.rajasekaran.net.in/tools/carbana.html. It is hoped that the program will help in the identification and development of active sites, the study of protein stability, the evolutionary understanding of proteins, gene identification, ligand binding site identification, and solving the long-standing problem of protein-protein and protein-DNA interactions.
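To make the idea of a carbon percentage concrete, the following Python sketch computes the fraction of atoms in a peptide that are carbon. It is not the program described in the abstract. It assumes standard free amino acid formulas, counts all C, H, N, O and S atoms, and removes one water molecule per peptide bond; the name `carbon_fraction` and these counting conventions are hypothetical.

```python
# Atom counts (C, H, N, O, S) of the free amino acids.
ATOMS = {
    "A": (3, 7, 1, 2, 0), "R": (6, 14, 4, 2, 0), "N": (4, 8, 2, 3, 0),
    "D": (4, 7, 1, 4, 0), "C": (3, 7, 1, 2, 1), "Q": (5, 10, 2, 3, 0),
    "E": (5, 9, 1, 4, 0), "G": (2, 5, 1, 2, 0), "H": (6, 9, 3, 2, 0),
    "I": (6, 13, 1, 2, 0), "L": (6, 13, 1, 2, 0), "K": (6, 14, 2, 2, 0),
    "M": (5, 11, 1, 2, 1), "F": (9, 11, 1, 2, 0), "P": (5, 9, 1, 2, 0),
    "S": (3, 7, 1, 3, 0), "T": (4, 9, 1, 3, 0), "W": (11, 12, 2, 2, 0),
    "Y": (9, 11, 1, 3, 0), "V": (5, 11, 1, 2, 0),
}


def carbon_fraction(sequence):
    """Return carbon atoms divided by all atoms for a linear peptide."""
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("sequence must not be empty")
    carbons = sum(ATOMS[residue][0] for residue in sequence)
    all_atoms = sum(sum(ATOMS[residue]) for residue in sequence)
    # Each peptide bond releases one water molecule (2 H + 1 O = 3 atoms).
    all_atoms -= 3 * (len(sequence) - 1)
    return carbons / all_atoms
```

Applying the same function to successive windows of a sequence would give a carbon distribution profile that could be compared against the 31.45% figure quoted above, under the stated assumptions.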

5.
    
Kinch LN  Grishin NV 《Proteins》2002,48(1):75-84
Nitrogen regulatory (PII) proteins are signal transduction molecules involved in controlling nitrogen metabolism in prokaryotes. PII proteins integrate the signals of intracellular nitrogen and carbon status into the control of enzymes involved in nitrogen assimilation. Using elaborate sequence similarity detection schemes, we show that five clusters of orthologs (COGs) and several small divergent protein groups belong to the PII superfamily and predict their structure to be a (βαβ)₂ ferredoxin-like fold. Proteins from the newly emerged PII superfamily are present in all major phylogenetic lineages. The PII homologs are quite diverse, with below random (as low as 1%) pairwise sequence identities between some members of distant groups. Despite this sequence diversity, evidence suggests that the different subfamilies retain the PII trimeric structure important for ligand-binding site formation and maintain a conservation of conservations at residue positions important for PII function. Because most of the orthologous groups within the PII superfamily are composed entirely of hypothetical proteins, our remote homology-based structure prediction provides the only information about them. Analogous to structural genomics efforts, such prediction gives clues to the biological roles of these proteins and allows us to hypothesize about locations of functional sites on model structures or rationalize about available experimental information. For instance, conserved residues in one of the families map in close proximity to each other on PII structure, allowing for a possible metal-binding site in the proteins coded by the locus known to affect sensitivity to divalent metal ions. The presented analysis pushes the limits of sequence similarity searches and exemplifies one of the extreme cases of reliable sequence-based structure prediction.
In conjunction with structural genomics efforts to shed light on protein function, our strategies make it possible to detect homology between highly diverse sequences and are aimed at understanding the most remote evolutionary connections in the protein world.

6.
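Clustering of amino acid composition, protein structural types, and prediction of structural types   Total citations: 11 (self-citations: 0, citations by others: 11)
The amino acid compositions of proteins were clustered using an information clustering method, which revealed hierarchical clustering (large clusters splitting into smaller ones). The 645 proteins can be divided into 15 small clusters. Each small cluster correlates to some degree with the structural type determined by secondary structure content, but shows no clear correlation with the five major protein structural types. Problems in schemes that predict structural type from amino acid composition and secondary structure content are pointed out. A new method for predicting protein structural type from the secondary structure sequence is proposed, and concise prediction rules are given.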

7.
    
In this study, we show that it is possible to increase the performance over PSI-BLAST by using evolutionary information for both query and target sequences. This information can be used in three different ways: by sequence linking, profile-profile alignments, and by combining sequence-profile and profile-sequence searches. If only PSI-BLAST is used, 16% of superfamily-related protein domains can be detected at 90% specificity, but if a sequence-profile and a profile-sequence search are combined, this increases to 20%; profile-profile searches detect 19%, whereas a linking procedure identifies 22% of these proteins. All three methods show equal performance, but the best combination of speed and accuracy seems to be obtained by the combined searches, because this method shows a good performance even at high specificity and the lowest computational cost. In addition, we show that the E-values reported by all these methods, including PSI-BLAST, underestimate the true rate of false positives. This behavior is seen even if a very strict E-value cutoff and a limited number of iterations are used. However, the difference is more pronounced with a looser E-value cutoff and more iterations.

8.
The European Bioinformatics Institute (EBI) provides numerous free-of-charge, publicly available bioinformatics services that can be divided into the following categories: ftp downloads; data submissions processing and biological database production; access to query, analysis and retrieval systems and tools; user support; training and education; and industry support through EBI's SME program. These services are all available at the website. It is imperative that EBI's data as well as the tools to analyse it efficiently are made available in a free and unambiguous way to the scientific community. An important part of the EBI's mission is to make this happen in a fast, reliable and efficient manner. This paper serves as a brief introduction to each of these services.

9.
    
S. Rackovsky 《Proteins》2013,81(10):1681-1685
Delineation of the relationship between sequence and structure in proteins has proven elusive. Most studies of this problem use alignment methods and other approaches based on the characteristics of individual residues. It is demonstrated herein that the sequence‐structure relationship is determined in significant part by global characteristics of sequence organization. Information encoded in complete sequences is required to distinguish proteins in different architectural groups. It is found that the statistically significant differences between sequences encoding different architectures are encoded in a surprisingly small set of low‐wave‐number sequence periodicities. It would therefore appear that unexpected simplicity in an appropriately defined Fourier space may be an inherent characteristic of the sequences of folded proteins. Proteins 2013; 81:1681–1685. © 2013 Wiley Periodicals, Inc.  相似文献   

10.
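Li Nan (李楠), Li Chun (李春) 《生物信息学》2012,10(4):238-240
Based on 16 classification models of amino acids, derived sequences of a protein sequence are constructed, and weighted quasi-entropy and LZ complexity are then combined to build a 34-dimensional feature vector that represents the protein sequence. Using a Bayesian classifier, protein structural class prediction on the 640 dataset, in which sequence homology does not exceed 25%, reaches an accuracy of 71.28%.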
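One ingredient named in this abstract, LZ complexity, can be sketched in a few lines of Python. The version below counts the components of the Lempel-Ziv (1976) exhaustive-history parsing of a string. Whether the authors used this exact variant is an assumption, the function name `lz_complexity` is hypothetical, and the weighted quasi-entropy and the mapping from amino acids to reduced alphabets are not shown.

```python
def lz_complexity(sequence):
    """Count components in the Lempel-Ziv (1976) parsing of a string.

    Each new component is the shortest substring, starting at the current
    position, that has not occurred earlier (overlap with the component
    itself is allowed, as in the original definition).
    """
    position = 0
    components = 0
    length = len(sequence)
    while position < length:
        size = 1
        # Extend the candidate while it can still be copied from the
        # text seen so far (everything before its own last character).
        while (position + size <= length and
               sequence[position:position + size]
               in sequence[:position + size - 1]):
            size += 1
        components += 1
        position += size
    return components
```

A derived sequence over a reduced amino acid alphabet (for example, a two-letter hydrophobic/polar coding, which is an illustrative choice here) could be passed to this function to obtain one entry of a feature vector.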

11.
The structure of the hemoglobin -chain of Rose-ringed Parakeet was determined by sequence degradations of the intact subunit, the CNBr fragments, and peptides obtained by digestion with staphylococcal Glu-specific protease and trypsin. Using this analysis, the complete -chain structure of 21 avian species is known, permitting comparisons of the protein structure and of avian relationships. The structure exhibits differences from previously established avian -chains at a total of 61 positions, five of which have residues unique to those of the parakeet (Ser-12, Gly-65, Ser-67, Ala-121, and Leu-134). The analysis defines hemoglobin variation within an additional avian order (Psittaciformes), demonstrates distant patterns for evaluation of relationships within other avian orders, and lends support to taxonomic conclusions from molecular data.

12.
A new method to detect remote relationships between protein sequences and known three-dimensional structures based on direct energy calculations and without reliance on statistics has been developed. The likelihood of a residue to occupy a given position on the structural template was represented by an estimate of the stabilization free energy made after explicit prediction of the substituted side chain conformation. The profile matrix derived from these energy values and modified by increasing the residue self-exchange values successfully predicted compatibility of heat shock protein and globin sequences with the three-dimensional structures of actin and phycocyanin, respectively, from a full protein sequence databank search. The high sensitivity of the method makes it a unique tool for predicting the three-dimensional fold for the rapidly growing number of protein sequences. © 1994 Wiley-Liss, Inc.

13.
Elementary flux mode analysis is a promising approach for a pathway-oriented perspective of metabolic networks. However, in larger networks it is hampered by the combinatorial explosion of possible routes. In this work we give some estimations on the combinatorial complexity including theoretical upper bounds for the number of elementary flux modes in a network of a given size. In a case study, we computed the elementary modes in the central metabolism of Escherichia coli while utilizing four different substrates. Interestingly, although the number of modes occurring in this complex network can exceed half a million, it is still far below the upper bound. Hence, to a certain extent, pathway analysis of central catabolism is feasible to assess network properties such as flexibility and functionality.  相似文献   

14.
1. Ship‐induced waves disturb benthic invertebrate assemblages colonizing littoral zones of lakes and rivers. However, the impact of ship‐induced waves on invertebrates has rarely been quantified, and the influencing factors have not been addressed. 2. In an experimental wave tank, five benthic invertebrate species, Bithynia tentaculata, Calopteryx splendens, Dikerogammarus villosus, Gammarus roeseli and Laccophilus hyalinus, were exposed to waves of increasing shear stress (0.43–2.19 N m⁻²). Mean number of detached individuals was recorded for five littoral habitats [coarse woody debris (CWD), reeds, sand, stones and tree roots], representing different levels of structural complexity as quantified by their fractal dimensions (FD). 3. Results showed that detachment of invertebrates was significantly related to shear stress in all habitats except tree roots. Detachments averaged for the five species were significantly lower in habitats with a high degree of structural complexity, decreasing in the habitat sequence: sand, CWD, stones, reeds and tree roots. 4. Consistent with their different morphologies and methods of attachment to substrates, the five species displayed differences in their response to hydraulic stress that were dependent on habitat. 5. The increasing sheltering effect of structural habitat complexity was mirrored by increasing dissipation of the kinetic energy of waves; i.e. the FD of the habitat was positively correlated with shear stress reduction due to the flow resistance of the habitat. 6. Network habitats such as tree roots provided the best sheltering conditions against hydraulic disturbance, because they combined good refuge availability for all studied invertebrate species and maximal dissipation of kinetic wave energy.
Consequently, persistent anthropogenic impacts, such as lakeshore modification or long‐term exposure to ship‐induced waves, which cause disappearance of complex littoral habitats such as tree roots or dense reed belts, will drastically increase the adverse effects of boating and ship traffic on littoral invertebrate assemblages.

15.
    
In the present study, trypsin from Plodia interpunctella (Hübner) is characterized to discover sequence, biochemical and structural features. This enzyme is purified by ion exchange chromatography using fast protein liquid chromatography on proteins from fifth‐instar larvae. The enzyme is optimally active at 50 °C and pH 11.0. The kinetic parameters (Km and Vmax) of the enzyme are 5.3 ± 0.6 µM and 31 ± 1.3 nmol min⁻¹ mg⁻¹, respectively (using Nα‐benzoyl‐L‐arginine p‐nitroanilide hydrochloride as substrate). The enzyme is inhibited by the addition of Cu2+ and Mn2+, whereas it is activated by Li+ at high concentrations. Moreover, the enzyme is almost completely inhibited in the presence of Nα‐tosyl‐L‐lysine chloromethyl ketone hydrochloride and phenylmethanesulphonyl fluoride. To understand some characteristics of P. interpunctella trypsin, including active site structure and alkaline pH profile, a reliable structural model of P. interpunctella trypsin is built based on the Fusarium oxysporum (Schlecht) trypsin crystal structure (Protein Data Bank code: 1GDU). The secondary structure content of the purified trypsin from near‐ultraviolet circular dichroism data shows considerable similarities with that of the predicted P. interpunctella trypsin structure. Analysis of pKa values of active site residues, the type of amino acid residues in the active site cleft and the surface charges of the model and of the Tribolium castaneum (Herbst) trypsin structure, an insect species from a different order, reveals some differences between them. These differences might affect the microenvironment of the active site cleft and consequently shift its pH profile. The application of multiple theoretical and experimental techniques is well adapted to predict the enzyme structure with high accuracy and this could help in the design of a powerful inhibitor for trypsin with ideal properties.
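The kinetic parameters quoted in this abstract can be read through the Michaelis-Menten rate law, v = Vmax·[S] / (Km + [S]). The short Python sketch below assumes the enzyme follows that law, which reporting Km and Vmax suggests but the abstract does not state outright; the function name `michaelis_menten_rate` is hypothetical, and the substrate concentration must be given in the same units as Km.

```python
def michaelis_menten_rate(substrate, km, vmax):
    """Initial rate v = Vmax * [S] / (Km + [S]).

    `substrate` and `km` must share the same concentration unit;
    the result carries the unit of `vmax`.
    """
    if substrate < 0:
        raise ValueError("substrate concentration must not be negative")
    if km <= 0:
        raise ValueError("Km must be positive")
    return vmax * substrate / (km + substrate)


# Central values reported in the abstract (error margins ignored).
KM = 5.3     # substrate concentration giving half-maximal rate
VMAX = 31.0  # nmol per min per mg

# By definition of Km, the rate at [S] = Km is half of Vmax.
half_maximal_rate = michaelis_menten_rate(KM, KM, VMAX)
```

At substrate concentrations far above Km the computed rate approaches Vmax, and at concentrations far below Km it grows almost linearly with [S].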

16.
    
Solute carriers are eukaryotic membrane proteins that control the uptake and efflux of solutes, including essential cellular compounds, environmental toxins, and therapeutic drugs. Solute carriers can share similar structural features despite weak sequence similarities. Identification of sequence relationships among solute carriers is needed to enhance our ability to model individual carriers and to elucidate the molecular mechanisms of their substrate specificity and transport. Here, we describe a comprehensive comparison of solute carriers. We link the proteins using sensitive profile–profile alignments and two classification approaches, including similarity networks. The clusters are analyzed in view of substrate type, transport mode, organism conservation, and tissue specificity. Solute carrier families with similar substrates generally cluster together, despite exhibiting relatively weak sequence similarities. In contrast, some families cluster together with no apparent reason, revealing unexplored relationships. We demonstrate computationally and experimentally the functional overlap between representative members of these families. Finally, we identify four putative solute carriers in the human genome. The solute carriers include a biomedically important group of membrane proteins that is diverse in sequence and structure. The proposed classification of solute carriers, combined with experiment, reveals new relationships among the individual families and identifies new solute carriers. The classification scheme will inform future attempts directed at modeling the structures of the solute carriers, a prerequisite for describing the substrate specificities of the individual families.  相似文献   

17.
    
Periplasmic binding protein-dependent transport systems are composed of a periplasmic substrate-binding protein, a set of 2 (sometimes 1) very hydrophobic integral membrane proteins, and 1 (sometimes 2) hydrophilic peripheral membrane protein that binds and hydrolyzes ATP. These systems are members of the superfamily of ABC transporters. We performed a molecular phylogenetic analysis of the sequences of 70 hydrophobic membrane proteins of these transport systems in order to investigate their evolutionary history. Proteins were grouped into 8 clusters. Within each cluster, protein sequences displayed significant similarities, suggesting that they derive from a common ancestor. Most clusters contained proteins from systems transporting analogous substrates such as monosaccharides, oligopeptides, or hydrophobic amino acids, but this was not a general rule. Proteins from diverse bacteria are found within each cluster, suggesting that the ancestors of current clusters were present before the divergence of bacterial groups. The phylogenetic trees computed for hydrophobic membrane proteins of these permeases are similar to those described for the periplasmic substrate-binding proteins. This result suggests that the genetic regions encoding binding protein-dependent permeases evolved as whole units. Based on the results of the classification of the proteins and on the reconstructed phylogenetic trees, we propose an evolutionary scheme for periplasmic permeases. According to this model, it is probable that these transport systems derive from an ancestral system having only 1 hydrophobic membrane protein.(ABSTRACT TRUNCATED AT 250 WORDS)  相似文献   

18.
    
Strong sequence similarity has been reported among WrbA (the Trp repressor-binding protein of Escherichia coli); Ycp4, a protein of unknown function from the budding yeast Saccharomyces cerevisiae; P25, the pap1-dependent protein of the fission yeast Schizosaccharomyces pombe; and the translation product of a partial cDNA sequence from rice seedling root (Oryza sativa, locus Ricr02421a; here referred to as RicR). Further homology search with the profile method indicates that all the above sequences are related to the flavodoxin family and, in turn, allows detection of the recently proposed flavodoxin-like proteins from E. coli, MioC and the hypothetical protein YihB. We discuss sequence conservation with reference to the known 3-dimensional structures of flavodoxins. Conserved sequence and hydrophobicity patterns, as well as residue-pair interaction potentials, strongly support the hypothesis that these proteins share the alpha/beta twisted open-sheet fold typical of flavodoxins, with an additional alpha/beta unit in the WrbA family. On the basis of the proposed structural homology, we discuss the details of the putative FMN-binding sites. Our analysis also suggests that the helix-turn-helix motif we identified previously in the C-terminal region of the WrbA family is unlikely to reflect a DNA-binding function of this new protein family.  相似文献   

19.
    
Protein fluorescence is a powerful tool for studying protein structure and dynamics if we have a means to interpret the spectral data in terms of protein structural properties. Our previous research successfully provided this support through the development of individual software modules implementing the algorithms for fluorescence and structural analyses. Now we have integrated the developed software modules, introduced a new program for the assignment of tryptophan residues to spectral-structural classes, and created a web-based toolkit PFAST: protein fluorescence and structural toolkit: http://pfast.phys.uri.edu/. PFAST contains three modules: (1) FCAT is a fluorescence-correlation analysis tool, which decomposes protein fluorescence spectra to reveal the spectral components of individual tryptophan residues or groups of tryptophan residues located close to each other, and assigns spectral components to one of five previously established spectral-structural classes. (2) SCAT is a structural-correlation analysis tool for the calculation of the structural parameters of the environment of tryptophan residues from the atomic structures of the proteins from the Protein Data Bank (PDB), and for the assignment of tryptophan residues to one of five spectral-structural classes. (3) The last module is a PFAST database that contains protein fluorescence and structural data obtained from results of the FCAT and SCAT analyses.  相似文献   

20.
Aggregate Measures of Complex Economic Structure and Evolution   Total citations: 1 (self-citations: 0, citations by others: 1)
It is perhaps in the nature of complex systems that they call for aggregate measures that enable analysts to grasp their structure and evolution without being overwhelmed by their very complexity. Complex interindustry theory and models are a typical case, where the underlying database—an input−output table—routinely contains thousands of data points for a single year. Within input−output analysis, quantitative measures have been developed that describe and characterize interindustry interactions and that have been used to compare economies, both in a static taxonomy and through their evolution over time. First, we review and critically discuss a number of concepts that have been proposed and applied to interindustry systems, such as interconnectedness, interrelatedness, linkages, and economic landscapes. Second, we apply these concepts to a case study of the Australian economy between 1975 and 1999 in terms of environmental headline indicators. Our results enable the reader to judge the usefulness and ability of the measures in capturing the key structural elements and evolutionary processes governing the interaction between the economy and the environment. For the Australian case study, the measures showed a diversifying economy occurring together with a specialization of environmental flows.  相似文献   
