首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A classification scheme for membrane proteins is proposed that clusters families of proteins into structural classes based on hydropathy profile analysis. The averaged hydropathy profiles of protein families are taken as fingerprints of the 3D structure of the proteins and, therefore, are able to detect more distant evolutionary relationships than amino acid sequences. A procedure was developed in which hydropathy profile analysis is used initially as a filter in a BLAST search of the NCBI protein database. The strength of the procedure is demonstrated by the classification of 29 families of secondary transporters into a single structural class, termed ST[3]. An exhaustive search of the database revealed that the 29 families contain 568 unique sequences. The proteins are predominantly from prokaryotic origin and most of the characterized transporters in ST[3] transport organic and inorganic anions and a smaller number are Na(+)/H(+) antiporters. All modes of energy coupling (symport, antiport, uniport) are found in structural class ST[3]. The relevance of the classification for structure/function prediction of uncharacterised transporters in the class is discussed.  相似文献   

2.
A novel alignment-free method for computing functional similarity of membrane proteins based on features of hydropathy distribution is presented. The features of hydropathy distribution are used to represent protein families as hydropathy profiles. The profiles statistically summarize the hydropathy distribution of member proteins. The summation is made by using hydropathy features that numerically represent structurally/functionally significant portions of protein sequences. The hydropathy profiles are numerical vectors that are points in a high dimensional ‘hydropathy’ space. Their similarities are identified by projection of the space onto principal axes. Here, the approach is applied to the secondary transporters. The analysis using the presented approach is validated by the standard classification of the secondary transporters. The presented analysis allows for prediction of function attributes for proteins of uncharacterized families of secondary transporters. The results obtained using the presented analysis may help to characterize unknown function attributes of secondary transporters. They also show that analysis of hydropathy distribution can be used for function prediction of membrane proteins.  相似文献   

3.
A novel alignment-free method for computing functional similarity of membrane proteins based on features of hydropathy distribution is presented. The features of hydropathy distribution are used to represent protein families as hydropathy profiles. The profiles statistically summarize the hydropathy distribution of member proteins. The summation is made by using hydropathy features that numerically represent structurally/functionally significant portions of protein sequences. The hydropathy profiles are numerical vectors that are points in a high dimensional 'hydropathy' space. Their similarities are identified by projection of the space onto principal axes. Here, the approach is applied to the secondary transporters. The analysis using the presented approach is validated by the standard classification of the secondary transporters. The presented analysis allows for prediction of function attributes for proteins of uncharacterized families of secondary transporters. The results obtained using the presented analysis may help to characterize unknown function attributes of secondary transporters. They also show that analysis of hydropathy distribution can be used for function prediction of membrane proteins.  相似文献   

4.
Pánek J  Eidhammer I  Aasland R 《Proteins》2005,58(4):923-934
Structural similarity among proteins is reflected in the distribution of hydropathicity along the amino acids in the protein sequence. Similarities in the hydropathy distributions are obvious for homologous proteins within a protein family. They also were observed for proteins with related structures, even when sequence similarities were undetectable. Here we present a novel method that employs the hydropathy distribution in proteins for identification of (sub)families in a set of (homologous) proteins. We represent proteins as points in a generalized hydropathy space, represented by vectors of specifically defined features. The features are derived from hydropathy of the individual amino acids. Projection of this space onto principal axes reveals groups of proteins with related hydropathy distributions. The groups identified correspond well to families of structurally and functionally related proteins. We found that this method accurately identifies protein families in a set of proteins, or subfamilies in a set of homologous proteins. Our results show that protein families can be identified by the analysis of hydropathy distribution, without the need for sequence alignment.  相似文献   

5.
Nucleic acid sequences from genome sequencing projects are submitted as raw data, from which biologists attempt to elucidate the function of the predicted gene products. The protein sequences are stored in public databases, such as the UniProt Knowledgebase (UniProtKB), where curators try to add predicted and experimental functional information. Protein function prediction can be done using sequence similarity searches, but an alternative approach is to use protein signatures, which classify proteins into families and domains. The major protein signature databases are available through the integrated InterPro database, which provides a classification of UniProtKB sequences. As well as characterization of proteins through protein families, many researchers are interested in analyzing the complete set of proteins from a genome (i.e. the proteome), and there are databases and resources that provide non-redundant proteome sets and analyses of proteins from organisms with completely sequenced genomes. This article reviews the tools and resources available on the web for single and large-scale protein characterization and whole proteome analysis.  相似文献   

6.
Intensive growth in 3D structure data on DNA-protein complexes as reflected in the Protein Data Bank (PDB) demands new approaches to the annotation and characterization of these data and will lead to a new understanding of critical biological processes involving these data. These data and those from other protein structure classifications will become increasingly important for the modeling of complete proteomes. We propose a fully automated classification of DNA-binding protein domains based on existing 3D-structures from the PDB. The classification, by domain, relies on the Protein Domain Parser (PDP) and the Combinatorial Extension (CE) algorithm for structural alignment. The approach involves the analysis of 3D-interaction patterns in DNA-protein interfaces, assignment of structural domains interacting with DNA, clustering of domains based on structural similarity and DNA-interacting patterns. Comparison with existing resources on describing structural and functional classifications of DNA-binding proteins was used to validate and improve the approach proposed here. In the course of our study we defined a set of criteria and heuristics allowing us to automatically build a biologically meaningful classification and define classes of functionally related protein domains. It was shown that taking into consideration interactions between protein domains and DNA considerably improves the classification accuracy. Our approach provides a high-throughput and up-to-date annotation of DNA-binding protein families which can be found at http://spdc.sdsc.edu.  相似文献   

7.
Membrane transporters constitute one of the largest functional categories of proteins in all organisms. In the yeast Saccharomyces cerevisiae, this represents about 300 proteins (∼ 5% of the proteome). We here present the Yeast Transport Protein database (YTPdb), a user-friendly collaborative resource dedicated to the precise classification and annotation of yeast transporters. YTPdb exploits an evolution of the MediaWiki web engine used for popular collaborative databases like Wikipedia, allowing every registered user to edit the data in a user-friendly manner. Proteins in YTPdb are classified on the basis of functional criteria such as subcellular location or their substrate compounds. These classifications are hierarchical, allowing queries to be performed at various levels, from highly specific (e.g. ammonium as a substrate or the vacuole as a location) to broader (e.g. cation as a substrate or inner membranes as location). Other resources accessible for each transporter via YTPdb include post-translational modifications, Km values, a permanently updated bibliography, and a hierarchical classification into families. The YTPdb concept can be extrapolated to other organisms and could even be applied for other functional categories of proteins. YTPdb is accessible at http://homes.esat.kuleuven.be/ytpdb/.  相似文献   

8.
9.
MOTIVATION: Increase the discriminatory power of PROSITE profiles to facilitate function determination and provide biologically relevant information about domains detected by profiles for the annotation of proteins. SUMMARY: We have created a new database, ProRule, which contains additional information about PROSITE profiles. ProRule contains notably the position of structurally and/or functionally critical amino acids, as well as the condition they must fulfill to play their biological role. These supplementary data should help function determination and annotation of the UniProt Swiss-Prot knowledgebase. ProRule also contains information about the domain detected by the profile in the Swiss-Prot line format. Hence, ProRule can be used to make Swiss-Prot annotation more homogeneous and consistent. The format of ProRule can be extended to provide information about combination of domains. AVAILABILITY: ProRule can be accessed through ScanProsite at http://www.expasy.org/tools/scanprosite. A file containing the rules will be made available under the PROSITE copyright conditions on our ftp site (ftp://www.expasy.org/databases/prosite/) by the next PROSITE release.  相似文献   

10.
11.
A technique has been developed to search a proteome database for new members of a functional class of membrane protein. It takes advantage of the highly conserved secondary structure of functionally related membrane proteins. Such proteins typically have the same number of transmembrane domains located at similar relative positions in their polypeptide sequence. This gives rise to a characteristic pattern of peaks in their hydropathy profiles. To conduct a search, each member of a polypeptide database is converted to a hydropathy profile, peaks are automatically detected, and the pattern of peaks is compared with a template. A template was designed for the acetylcholine (ACh) and glycine receptors of the cys-loop receptor superfamily. The key feature was a closely spaced triplet of hydropathy peaks bracketed by deep valleys. When applied to the human proteome the search procedure retrieved 153 profiles with a receptor-like triplet of peaks. The approach was highly selective with 70% of the retrieved profiles annotated as known or putative receptors. These included ACh, glycine, gamma-amino butyric acid and serotonin receptors, which are all related by sequence. However, ionotropic glutamate receptors, which have almost no sequence homology with ACh receptors, were also retrieved. Thus, the strategy can find members of a functional class that cannot be identified by sequence alignment. To demonstrate that the strategy can easily be extended to other membrane protein families, a template was developed for the neurotransmitter/Na+ symporter family, and similar results were obtained. This approach should prove a useful adjunct to sequence-based retrieval tools when searching for novel membrane proteins.  相似文献   

12.
In post-genomic era, a plethora of protein structures have been solved but the functions of some of them are unknown. In this context, the role of hydropathy index of amino acids in predicting the function of a structurally known and functionally unknown protein was explored. Initially serine protease class was taken for analysis. Various methodologies like calculation of average hydropathy index for a five-residue window of a given sequence, hydropathy cluster analyses, etc., were done. Among these, the distribution of hydropathy clusters seems to suggest that the location of these clusters is conserved for a given class of proteins. Hence, this methodology was extended to different classes of proteins and to a protein with unknown function.  相似文献   

13.
The Swiss-Prot protein knowledgebase provides manually annotated entries for all species, but concentrates on the annotation of entries from model organisms to ensure the presence of high quality annotation of representative members of all protein families. A specific Plant Protein Annotation Program (PPAP) was started to cope with the increasing amount of data produced by the complete sequencing of plant genomes. Its main goal is the annotation of proteins from the model plant organism Arabidopsis thaliana. In addition to bibliographic references, experimental results, computed features and sometimes even contradictory conclusions, direct links to specialized databases connect amino acid sequences with the current knowledge in plant sciences. As protein families and groups of plant-specific proteins are regularly reviewed to keep up with current scientific findings, we hope that the wealth of information of Arabidopsis origin accumulated in our knowledgebase, and the numerous software tools provided on the Expert Protein Analysis System (ExPASy) web site might help to identify and reveal the function of proteins originating from other plants. Recently, a single, centralized, authoritative resource for protein sequences and functional information, UniProt, was created by joining the information contained in Swiss-Prot, Translation of the EMBL nucleotide sequence (TrEMBL), and the Protein Information Resource-Protein Sequence Database (PIR-PSD). A rising problem is that an increasing number of nucleotide sequences are not being submitted to the public databases, and thus the proteins inferred from such sequences will have difficulties finding their way to the Swiss-Prot or TrEMBL databases.  相似文献   

14.
Yona G  Linial N  Linial M 《Proteins》1999,37(3):360-378
We investigate the space of all protein sequences in search of clusters of related proteins. Our aim is to automatically detect these sets, and thus obtain a classification of all protein sequences. Our analysis, which uses standard measures of sequence similarity as applied to an all-vs.-all comparison of SWISSPROT, gives a very conservative initial classification based on the highest scoring pairs. The many classes in this classification correspond to protein subfamilies. Subsequently we merge the subclasses using the weaker pairs in a two-phase clustering algorithm. The algorithm makes use of transitivity to identify homologous proteins; however, transitivity is applied restrictively in an attempt to prevent unrelated proteins from clustering together. This process is repeated at varying levels of statistical significance. Consequently, a hierarchical organization of all proteins is obtained. The resulting classification splits the protein space into well-defined groups of proteins, which are closely correlated with natural biological families and superfamilies. Different indices of validity were applied to assess the quality of our classification and compare it with the protein families in the PROSITE and Pfam databases. Our classification agrees with these domain-based classifications for between 64.8% and 88.5% of the proteins. It also finds many new clusters of protein sequences which were not classified by these databases. The hierarchical organization suggested by our analysis reveals finer subfamilies in families of known proteins as well as many novel relations between protein families.  相似文献   

15.
Neurotransmitter:sodium symporters (NSSs) mediate reuptake of neurotransmitters from the synaptic cleft and are targets for several therapeutics and psychostimulants. The prokaryotic NSS homologue, LeuT, represents a principal structural model for Na+-coupled transport catalyzed by these proteins. Here, we used site-directed fluorescence quenching spectroscopy to identify in LeuT a substrate-induced conformational rearrangement at the inner gate conceivably leading to formation of a structural intermediate preceding transition to the inward-open conformation. The substrate-induced, Na+-dependent change required an intact primary substrate-binding site and involved increased water exposure of the cytoplasmic end of transmembrane segment 5. The findings were supported by simulations predicting disruption of an intracellular interaction network leading to a discrete rotation of transmembrane segment 5 and the adjacent intracellular loop 2. The magnitude of the spectroscopic response correlated inversely with the transport rate for different substrates, suggesting that stability of the intermediate represents an unrecognized rate-limiting barrier in the NSS transport mechanism.  相似文献   

16.
Structural classification of families of membrane proteins by bioinformatics techniques has become a critical aspect of membrane protein research. We have proposed hydropathy profile alignment to identify structural homology between families of membrane proteins. Here, we demonstrate experimentally that two families of secondary transporters, the ESS and 2HCT families, indeed share similar folds. Members of the two families show highly similar hydropathy profiles but cannot be shown to be homologous by sequence similarity. A structural model was predicted for the ESS family transporters based upon an existing model of the 2HCT family transporters. In the model, the transporters fold into two domains containing five transmembrane segments and a reentrant or pore-loop each. The two pore-loops enter the membrane embedded part of the proteins from opposite sides of the membrane. The model was verified by accessibility studies of cysteine residues in single-Cys mutants of the Na+-glutamate transporter GltS of Escherichia coli, a member of the ESS family. Cysteine residues positioned in predicted periplasmic loops were accessible from the periplasm by a bulky, membrane-impermeable thiol reagent, while cysteine residues in cytoplasmic loops were not. Furthermore, two cysteine residues in the predicted pore-loop entering the membrane from the cytoplasmic side were shown to be accessible for small, membrane-impermeable thiol reagents from the periplasm, as was demonstrated before for the Na+-citrate transporter CitS of Klebsiella pneumoniae, a member of the 2HCT family. The data strongly suggests that GltS of the ESS family and CitS of the 2HCT family share the same fold as was predicted by comparing the averaged hydropathy profiles of the two families.  相似文献   

17.
Conformational flexibility between structural ensembles is an essential component of enzyme function. Although the broad dynamical landscape of proteins is known to promote a number of functional events on multiple time scales, it is yet unknown whether structural and functional enzyme homologues rely on the same concerted residue motions to perform their catalytic function. It is hypothesized that networks of contiguous and flexible residue motions occurring on the biologically relevant millisecond time scale evolved to promote and/or preserve optimal enzyme catalysis. In this study, we use a combination of NMR relaxation dispersion, model-free analysis, and ligand titration experiments to successfully capture and compare the role of conformational flexibility between two structural homologues of the pancreatic ribonuclease family: RNase A and eosinophil cationic protein (or RNase 3). In addition to conserving the same catalytic residues and structural fold, both homologues show similar yet functionally distinct clusters of millisecond dynamics, suggesting that conformational flexibility can be conserved among analogous protein folds displaying low sequence identity. Our work shows that the reduced conformational flexibility of eosinophil cationic protein can be dynamically and functionally reproduced in the RNase A scaffold upon creation of a chimeric hybrid between the two proteins. These results support the hypothesis that conformational flexibility is partly required for catalytic function in homologous enzyme folds, further highlighting the importance of dynamic residue sectors in the structural organization of proteins.  相似文献   

18.
Functional classification of proteins from sequences alone has become a critical bottleneck in understanding the myriad of protein sequences that accumulate in our databases. The great diversity of homologous sequences hides, in many cases, a variety of functional activities that cannot be anticipated. Their identification appears critical for a fundamental understanding of the evolution of living organisms and for biotechnological applications. ProfileView is a sequence-based computational method, designed to functionally classify sets of homologous sequences. It relies on two main ideas: the use of multiple profile models whose construction explores evolutionary information in available databases, and a novel definition of a representation space in which to analyze sequences with multiple profile models combined together. ProfileView classifies protein families by enriching known functional groups with new sequences and discovering new groups and subgroups. We validate ProfileView on seven classes of widespread proteins involved in the interaction with nucleic acids, amino acids and small molecules, and in a large variety of functions and enzymatic reactions. ProfileView agrees with the large set of functional data collected for these proteins from the literature regarding the organization into functional subgroups and residues that characterize the functions. In addition, ProfileView resolves undefined functional classifications and extracts the molecular determinants underlying protein functional diversity, showing its potential to select sequences towards accurate experimental design and discovery of novel biological functions. On protein families with complex domain architecture, ProfileView functional classification reconciles domain combinations, unlike phylogenetic reconstruction. ProfileView proves to outperform the functional classification approach PANTHER, the two k-mer-based methods CUPP and eCAMI and a neural network approach based on Restricted Boltzmann Machines. It overcomes time complexity limitations of the latter.  相似文献   

19.
The sodium solute symporters (SSS) and neurotransmitter sodium symporters (NSS) are two families of secondary transporters that are not related in amino acid sequence. Nonetheless, recent crystal structures showed that the Na+/galactose (SSS) and Na+/leucine (NSS) transporters have similar core structures. The structural relatedness highlights the need for classification methods for membrane protein structures based on other criteria than amino acid similarity. Here, we demonstrate that a method based on hydropathy profile alignments convincingly identifies structural similarity between the NSS and SSS families. Most importantly, the method shows that one of the largest transporter families for which a crystal structure is elusive (the amino acid/polyamine/organocation or APC superfamily), also shares the similar core structure observed for the Na+/galactose and Na+/leucine transporters. The APC superfamily contains the major amino acid transporter families that are found throughout life. Insight into their structure will significantly facilitate the studies of this important group of transporters.  相似文献   

20.
In Corynebacterium glutamicum the LysE carrier protein exhibits the unique function of exporting L-lysine. We here analyze the membrane topology of LysE, a protein of 236 amino acyl residues, using PhoA- and LacZ-fusions. The amino-terminal end of LysE is located in the cytoplasm whereas the carboxy-terminal end is found in the periplasm. Although 6 hydrophobic domains were identified based on hydropathy analyses, only five transmembrane spanning helices appear to be present. The additional hydrophobic segment may dip into the membrane or be surface localized. We show that LysE is a member of a family of proteins found, for example, in Escherichia coil, Bacillus subtilis, Mycobacterium tuberculosis and Helicobacter pylori. This family, which we have designated the LysE family, is distantly related to two additional protein families which we have designated the YahN and CadD families. These three families, the members of which exhibit similar sizes, hydropathy profiles, and sequence motifs comprise the LysE superfamily. Functionally characterized members of the LysE superfamily export L-lysine, cadmium and possibly quarternary amines. We suggest that LysE superfamily members will prove to catalyze export of a variety of biologically important solutes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号