首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 316 毫秒
1.
Detection of similarity is particularly difficult for small proteins and thus connections between many of them remain unnoticed. Structure and sequence analysis of several metal-binding proteins reveals unexpected similarities in structural domains classified as different protein folds in SCOP and suggests unification of seven folds that belong to two protein classes. The common motif, termed treble clef finger in this study, forms the protein structural core and is 25-45 residues long. The treble clef motif is assembled around the central zinc ion and consists of a zinc knuckle, loop, beta-hairpin and an alpha-helix. The knuckle and the first turn of the helix each incorporate two zinc ligands. Treble clef domains constitute the core of many structures such as ribosomal proteins L24E and S14, RING fingers, protein kinase cysteine-rich domains, nuclear receptor-like fingers, LIM domains, phosphatidylinositol-3-phosphate-binding domains and His-Me finger endonucleases. The treble clef finger is a uniquely versatile motif adaptable for various functions. This small domain with a 25 residue structural core can accommodate eight different metal-binding sites and can have many types of functions from binding of nucleic acids, proteins and small molecules, to catalysis of phosphodiester bond hydrolysis. Treble clef motifs are frequently incorporated in larger structures or occur in doublets. Present analysis suggests that the treble clef motif defines a distinct structural fold found in proteins with diverse functional properties and forms one of the major zinc finger groups.  相似文献   

2.
The zinc finger HIT domain is a sequence motif found in many proteins, including thyroid hormone receptor interacting protein 3 (TRIP-3), which is possibly involved in maturity-onset diabetes of the young (MODY). Novel zinc finger motifs are suggested to play important roles in gene regulation and chromatin remodeling. Here, we determined the high-resolution solution structure of the zinc finger HIT domain in ZNHIT2 (protein FON) from Homo sapiens, by an NMR method based on 567 upper distance limits derived from NOE intensities measured in three-dimensional NOESY spectra. The structure yielded a backbone RMSD to the mean coordinates of 0.19 A for the structured residues 12-48. The fold consists of two consecutive antiparallel beta-sheets and two short C-terminal helices packed against the second beta-sheet, and binds two zinc ions. Both zinc ions are coordinated tetrahedrally via a CCCC-CCHC motif to the ligand residues of the zf-HIT domain in an interleaved manner. The tertiary structure of the zinc finger HIT domain closely resembles the folds of the B-box, RING finger, and PHD domains with a cross-brace zinc coordination mode, but is distinct from them. The unique three-dimensional structure of the zinc finger HIT domain revealed a novel zinc-binding fold, as a new member of the treble clef domain family. On the basis of the structural data, we discuss the possible functional roles of the zinc finger HIT domain.  相似文献   

3.
RNA structural motifs are recurrent three-dimensional (3D) components found in the RNA architecture. These RNA structural motifs play important structural or functional roles and usually exhibit highly conserved 3D geometries and base-interaction patterns. Analysis of the RNA 3D structures and elucidation of their molecular functions heavily rely on efficient and accurate identification of these motifs. However, efficient RNA structural motif search tools are lacking due to the high complexity of these motifs. In this work, we present RNAMotifScanX, a motif search tool based on a base-interaction graph alignment algorithm. This novel algorithm enables automatic identification of both partially and fully matched motif instances. RNAMotifScanX considers noncanonical base-pairing interactions, base-stacking interactions, and sequence conservation of the motifs, which leads to significantly improved sensitivity and specificity as compared with other state-of-the-art search tools. RNAMotifScanX also adopts a carefully designed branch-and-bound technique, which enables ultra-fast search of large kink-turn motifs against a 23S rRNA. The software package RNAMotifScanX is implemented using GNU C++, and is freely available from http://genome.ucf.edu/RNAMotifScanX.  相似文献   

4.
Recent studies point to a diverse assemblage of prokaryotic cognates of the eukaryotic ubiquitin (Ub) system. These systems span an entire spectrum, ranging from those catalyzing cofactor and amino acid biosynthesis, with only adenylating E1-like enzymes and ubiquitin-like proteins (Ubls), to those that are closer to eukaryotic systems by virtue of possessing E2 enzymes. Until recently E3 enzymes were unknown in such prokaryotic systems. Using contextual information from comparative genomics, we uncover a diverse group of RING finger E3s in prokaryotes that are likely to function with E1s, E2s, JAB domain peptidases and Ubls. These E1s, E2s and RING fingers suggest that features hitherto believed to be unique to eukaryotic versions of these proteins emerged progressively in such prokaryotic systems. These include the specific configuration of residues associated with oxyanion-hole formation in E2s and the C-terminal UFD in the E1 enzyme, which presents the E2 to its active site. Our study suggests for the first time that YukD-like Ubls might be conjugated by some of these systems in a manner similar to eukaryotic Ubls. We also show that prokaryotic RING fingers possess considerable functional diversity and that not all of them are involved in Ub-related functions. In eukaryotes, other than RING fingers, a number of distinct binuclear (chelating two Zn atoms) and mononuclear (chelating one zinc atom) treble clef domains are involved in Ub-related functions. Through detailed structural analysis we delineated the higher order relationships and interaction modes of binuclear treble clef domains. This indicated that the FYVE domain acquired the binuclear state independently of the other binuclear forms and that different treble clef domains have convergently acquired Ub-related functions independently of the RING finger. Among these, we uncover evidence for notable prokaryotic radiations of the ZF-UBP, B-box, AN1 and LIM clades of treble clef domains and present contextual evidence to support their role in functions unrelated to the Ub-system in prokaryotes. In particular, we show that bacterial ZF-UBP domains are part of a novel cyclic nucleotide-dependent redox signaling system, whereas prokaryotic B-box, AN1 and LIM domains have related functions as partners of diverse membrane-associated peptidases in processing proteins. This information, in conjunction with structural analysis, suggests that these treble clef domains might have been independently recruited to the eukaryotic Ub-system due to an ancient conserved mode of interaction with peptides.  相似文献   

5.
Brakoulias A  Jackson RM 《Proteins》2004,56(2):250-260
A method is described for the rapid comparison of protein binding sites using geometric matching to detect similar three-dimensional structure. The geometric matching detects common atomic features through identification of the maximum common sub-graph or clique. These features are not necessarily evident from sequence or from global structural similarity giving additional insight into molecular recognition not evident from current sequence or structural classification schemes. Here we use the method to produce an all-against-all comparison of phosphate binding sites in a number of different nucleotide phosphate-binding proteins. The similarity search is combined with clustering of similar sites to allow a preliminary structural classification. Clustering by site similarity produces a classification of binding sites for the 476 representative local environments producing ten main clusters representing half of the representative environments. The similarities make sense in terms of both structural and functional classification schemes. The ten main clusters represent a very limited number of unique structural binding motifs for phosphate. These are the structural P-loop, di-nucleotide binding motif [FAD/NAD(P)-binding and Rossman-like fold] and FAD-binding motif. Similar classification schemes for nucleotide binding proteins have also been arrived at independently by others using different methods.  相似文献   

6.
ClpX (423 amino acids), a member of the Clp/Hsp100 family of molecular chaperones and the protease, ClpP, comprise a multimeric complex supporting targeted protein degradation in Escherichia coli. The ClpX sequence consists of an NH2-terminal zinc binding domain (ZBD) and a COOH-terminal ATPase domain. Earlier, we have demonstrated that the zinc binding domain forms a constitutive dimer that is essential for the degradation of some ClpX substrates such as gammaO and MuA but is not required for the degradation of other substrates such as green fluorescent protein-SsrA. In this report, we present the NMR solution structure of the zinc binding domain dimer. The monomer fold reveals that ZBD is a member of the treble clef zinc finger family, a motif known to facilitate protein-ligand, protein-DNA, and protein-protein interactions. However, the dimeric ZBD structure is not related to any protein structure in the Protein Data Bank. A trimer-of-dimers model of ZBD is presented, which might reflect the closed state of the ClpX hexamer.  相似文献   

7.
Complex brains have evolved a highly efficient network architecture whose structural connectivity is capable of generating a large repertoire of functional states. We detect characteristic network building blocks (structural and functional motifs) in neuroanatomical data sets and identify a small set of structural motifs that occur in significantly increased numbers. Our analysis suggests the hypothesis that brain networks maximize both the number and the diversity of functional motifs, while the repertoire of structural motifs remains small. Using functional motif number as a cost function in an optimization algorithm, we obtain network topologies that resemble real brain networks across a broad spectrum of structural measures, including small-world attributes. These results are consistent with the hypothesis that highly evolved neural architectures are organized to maximize functional repertoires and to support highly efficient integration of information.  相似文献   

8.
An  J.  Wako  H.  Sarai  A. 《Molecular Biology》2001,35(6):905-910
An amino acid sequence pattern conserved among a family of proteins is called motif. It is usually related to the specific function of the family. On the other hand, functions of proteins are realized through their 3D structures. Specific local structures, called structural motifs, are considered as related to their functions. However, searching for common structural motifs in different proteins is much more difficult than for common sequence motifs. We are attempting in this study to convert the information about the structural motifs into a set of one-dimensional digital strings, i.e., a set of codes, to compare them more easily by computer and to investigate their relationship to functions more quantitatively. By applying the Delaunay tessellation to a 3D structure of a protein, we can assign each local structure to a unique code that is defined so as to reflect its structural feature. Since a structural motif is defined as a set of the local structures in this paper, the structural motif is represented by a set of the codes. In order to examine the ability of the set of the codes to distinguish differences among the sets of local structures with a given PROSITE pattern that contain both true and false positives, we clustered them by introducing a similarity measure among the set of the codes. The obtained clustering shows a good agreement with other results by direct structural comparison methods such as a superposition method. The structural motifs in homologous proteins are also properly clustered according to their sources. These results suggest that the structural motifs can be well characterized by these sets of the codes, and that the method can be utilized in comparing structural motifs and relating them with function.  相似文献   

9.
An amino acid sequence pattern conserved among a family of proteins is called motif. It is usually related to the specific function of the family. On the other hand, functions of proteins are achieved by their 3D structures. Specific local structures, called structural motifs, are considered related to their functions. However, searching for common structural motifs in different proteins is much more difficult than for common sequence motifs. We are attempting in this study to convert the information about the structural motifs into a set of one-dimensional digital strings, i.e., a set of codes, to compare them more easily by computer and to investigate their relationship to functions more quantitatively. By applying the Delaunay tessellation to a 3D structure of a protein, we can assign each local structure to a unique code that is defined so as to reflect its structural feature. Since a structural motif is defined as a set of the local structures in this paper, the structural motif is represented by a set of the codes. In order to examine the ability of the set of the codes to distinguish differences among the sets of local structures with a given PROSITE pattern that contain both true and false positives, we clustered them by introducing a similarity measure among the set of the codes. The obtained clustering shows a good agreement with other results by direct structural comparison methods such as a superposition method. The structural motifs in homologous proteins are also properly clustered according to their sources. These results suggest that the structural motifs can be well characterized by these sets of the codes, and that the method can be utilized in comparing structural motifs and relating them with function.  相似文献   

10.
RNA binding proteins recognize RNA targets in a sequence specific manner. Apart from the sequence, the secondary structure context of the binding site also affects the binding affinity. Binding sites are often located in single-stranded RNA regions and it was shown that the sequestration of a binding motif in a double-strand abolishes protein binding. Thus, it is desirable to include knowledge about RNA secondary structures when searching for the binding motif of a protein. We present the approach MEMERIS for searching sequence motifs in a set of RNA sequences and simultaneously integrating information about secondary structures. To abstract from specific structural elements, we precompute position-specific values measuring the single-strandedness of all substrings of an RNA sequence. These values are used as prior knowledge about the motif starts to guide the motif search. Extensive tests with artificial and biological data demonstrate that MEMERIS is able to identify motifs in single-stranded regions even if a stronger motif located in double-strand parts exists. The discovered motif occurrences in biological datasets mostly coincide with known protein-binding sites. This algorithm can be used for finding the binding motif of single-stranded RNA-binding proteins in SELEX or other biological sequence data.  相似文献   

11.
12.
An automatic procedure is proposed to identify, from the protein sequence database, conserved amino acid patterns (or sequence motifs) that are exclusive to a group of functionally related proteins. This procedure is applied to the PIR database and a dictionary of sequence motifs that relate to specific superfamilies constructed. The motifs have a practical relevance in identifying the membership of specific superfamilies without the need to perform sequence database searches in 20% of newly determined sequences. The sequence motifs identified represent functionally important sites on protein molecules. When multiple blocks exist in a single motif they are often close together in the 3-D structure. Furthermore, occasionally these motif blocks were found to be split by introns when the correlation with exon structures was examined.  相似文献   

13.
Computational methods such as sequence alignment and motif construction are useful in grouping related proteins into families, as well as helping to annotate new proteins of unknown function. These methods identify conserved amino acids in protein sequences, but cannot determine the specific functional or structural roles of conserved amino acids without additional study. In this work, we present 3MATRIX (http://3matrix.stanford.edu) and 3MOTIF (http://3motif.stanford.edu), a web-based sequence motif visualization system that displays sequence motif information in its appropriate three-dimensional (3D) context. This system is flexible in that users can enter sequences, keywords, structures or sequence motifs to generate visualizations. In 3MOTIF, users can search using discrete sequence motifs such as PROSITE patterns, eMOTIFs, or any other regular expression-like motif. Similarly, 3MATRIX accepts an eMATRIX position-specific scoring matrix, or will convert a multiple sequence alignment block into an eMATRIX for visualization. Each query motif is used to search the protein structure database for matches, in which the motif is then visually highlighted in three dimensions. Important properties of motifs such as sequence conservation and solvent accessible surface area are also displayed in the visualizations, using carefully chosen color shading schemes.  相似文献   

14.
Short motifs are known to play diverse roles in proteins, such as in mediating the interactions with other molecules, binding to membranes, or conducting a specific biological function. Standard approaches currently employed to detect short motifs in proteins search for enrichment of amino acid motifs considering mostly the sequence information. Here, we presented a new approach to search for common motifs (protein signatures) which share both physicochemical and structural properties, looking simultaneously at different features. Our method takes as an input an amino acid sequence and translates it to a new alphabet that reflects its intrinsic structural and chemical properties. Using the MEME search algorithm, we identified the proteins signatures within subsets of protein which encompass common sequence and structural information. We demonstrated that we can detect enriched structural motifs, such as the amphipathic helix, from large datasets of linear sequences, as well as predicting common structural properties (such as disorder, surface accessibility, or secondary structures) of known functional‐motifs. Finally, we applied the method to the yeast protein interactome and identified novel putative interacting motifs. We propose that our approach can be applied for de novo protein function prediction given either sequence or structural information. Proteins 2013; © 2012 Wiley Periodicals, Inc.  相似文献   

15.
16.
Lack of crystal structure data of folate binding proteins has left so many questions unanswered (for example, important residues in active site, binding domain, important amino acid residues involved in interactions between ligand and receptor). With sequence alignment and PROSITE motif identification, we attempted to answer evolutionarily significant residues that are of functional importance for ligand binding and that form catalytic sites. We have analyzed 46 different FRs and FBP sequences of various organisms obtained from Genbank. Multiple sequence alignment identified 44 highly conserved identical amino acid residues with 10 cysteine residues and 12 motifs including ECSPNLGPW (which might help in the structural stability of FR).  相似文献   

17.
Beta-barrel membrane proteins are found in the outer membrane of gram-negative bacteria, mitochondria, and chloroplasts. Although sequence motifs have been studied in alpha-helical membrane proteins and have been shown to play important roles in their assembly, it is not clear whether over-represented motifs and under-represented anti-motifs exist in beta-barrel membrane proteins. We have developed probabilistic models to identify sequence motifs of residue pairs on the same strand separated by an arbitrary number of residues. A rigorous statistical model is essential for this study because of the difficulty associated with the short length of the strands and the small amount of structural data. By comparing to the null model of exhaustive permutation of residues within the same beta-strand, propensity values of sequence patterns of two residues and p-values measuring statistical significance are calculated exactly by several analytical formulae we have developed or by enumeration. We find that there are characteristic sequence motifs and antimotifs in transmembrane (TM) beta-strands. The amino acid Tyr plays an important role in several such motifs. We find a general dichotomy consisting of favorable Aliphatic-Tyr sequence motifs and unfavorable Tyr-Aliphatic antimotifs. Tyr is also part of a terminal motif, YxF, which is likely to be important for chaperone binding. Our results also suggest several experiments that can help to elucidate the mechanisms of in vitro and in vivo folding of beta-barrel membrane proteins.  相似文献   

18.
19.
This review summarizes and analyzes data on structural and functional relationships between cell adhesion proteins and alpha-fetoprotein (AFP), which play an important role in embryo- and carcinogenesis and act in synergism with growth factors. These two groups of proteins are mosaic, multimodular, and polyfunctional, and each of their modules can function independently through binding with its specific membrane receptor. Most cell adhesion proteins contain modules similar to epidermal growth factor (EGF) and also their repeats, which determine the involvement of these proteins in regulation of cell proliferation, differentiation, and apoptosis. These EGF-like modules are found to include short motifs similar to the fragment LDSYQCT of human AFP. Both direct and inverted AFP-like motifs are linked through a consensus octapeptide motif CXXGY/FXGX. Such AFP-like motifs of cell adhesion proteins and the tripeptide RGD found in AFP may be structural prerequisites for common functions of these groups of nonhomologous and unrelated proteins.  相似文献   

20.
Polycomb group proteins are epigenetic regulators that maintain patterns of gene expression over multiple rounds of cell division. Many of these proteins, including polyhomeotic and the MBT repeat containing proteins SCM and dSfmbt, contain an atypical C2C2 zinc finger with a characteristic phenylalanine–cysteine–serine sequence motif. The reoccurrence of this so‐called FCS zinc finger in a variety of polycomb group proteins suggests that it has an important regulatory function. We have determined the solution structure of the FCS zinc finger of the human dSfmbt homologue L(3)mbt‐like 2 (L3MBTL2). The structure consists of a β‐hairpin followed by an α‐helix. The zinc ligands are situated in the β‐hairpin and at the N‐terminus of the α‐helix an arrangement typical of the treble clef class of zinc fingers. The structure is consistent with the proposal that FCS zinc fingers bind to regulatory RNAs.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号