Similar articles
Found 20 similar articles (search time: 15 ms)
1.
Comparisons of the primary structures of yeast and horse liver alcohol dehydrogenases reveal that the enzymes are homologous but distantly related. The overall positional identity is 25% between common regions, and several deletions/insertions occur in either enzyme, the longest apparently corresponding to 21 residues, showing that the different subunit sizes are largely explained by internal differences. Variabilities in the structural similarities can be coupled with functional requirements but not directly with whole domains in the previously known tertiary structure of the horse protein. The two most similar regions of the enzymes affect active-site segments and the two most dissimilar regions seem to affect a loop structure without known function, and a segment participating in subunit interactions. The dissimilarities may be correlated with changes in zinc-binding properties and quaternary structures. The extra region corresponding to the large internal chain-length difference shows an apparent coincidence in sequence to a following segment of the horse enzyme, and additional elements of internal coincidences, or superficial similarities with other dehydrogenases, are noticed. These characteristics are not fully distinguishable from chance distributions but in view of the extensive species variations in alcohol dehydrogenases some evolutionary considerations may not be excluded, in which case a model relating all regions of these and associated enzymes to a common ancestor is shown to be compatible with all known observations.
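
As an illustration of the "positional identity between common regions" quoted in this abstract, the sketch below counts identical residues over the aligned positions where neither sequence has a gap. The abstract does not state its exact counting convention, so excluding gapped positions is an assumption here, and the function name `positional_identity` and the toy sequences are hypothetical.

```python
def positional_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Fraction of identical residues over positions where neither sequence is gapped.

    Assumption: gapped columns (deletions/insertions) are excluded from the
    denominator, i.e. only "common regions" are compared.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only the columns in which both sequences carry a residue.
    common = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not common:
        return 0.0
    matches = sum(1 for x, y in common if x == y)
    return matches / len(common)


# Toy alignment: four common positions, three of them identical.
identity = positional_identity("AC-DE", "ACGDF")
print(f"{identity:.0%}")
```

Under this convention a long internal insertion, such as the 21-residue one mentioned above, changes the chain length but does not lower the identity of the common regions.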

2.
A mathematical method has been developed in order to search for latent periodicity in protein amino-acid and other symbolical sequences using dynamic programming and random matrices. The method allows the detection of the latent periodicity with insertions and deletions at positions that are unknown beforehand. The developed method has been applied to search for the periodicity in the amino-acid sequences of several proteins and in the euro/dollar exchange rate since 2001. The presence of a long period with insertions and deletions in amino-acid sequences is shown. The period length of seven amino acids is observed in the proteins that contain supercoiled regions (a coiled-coil structure) as well as of six, five, or more amino acids. The existence of the period length of 6 and 7 days, as well as 24 and 25 h in the analyzed financial time series is observed; note that this periodicity is detectable only for insertions and deletions. The causes that underlie the occurrence of the latent periodicity with insertions and deletions in amino-acid sequences and financial time series are discussed.

3.
A method for comparing amino acid compositions of proteins (Cornish-Bowden, 1977) has been extended to allow proteins of unequal lengths to be compared. The method has been tested by applying it to proteins of known sequence. It tends to exaggerate the amount of difference between unrelated proteins. It is therefore a reliable guide to possible sequence similarities, in that it does not suggest that sequences are similar when they are not, though it sometimes fails to detect genuine similarities. When applied to related proteins the method gives results in good agreement with those predicted. A phylogenetic tree for 37 snake venom toxins has been constructed from their compositions and is similar in most important respects to one constructed from the corresponding sequences.
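
The general idea of comparing proteins by composition rather than by sequence can be sketched as follows. This is not the published Cornish-Bowden index or its extension, whose exact formula is not given in the abstract. It is a simplified stand-in that assumes compositions are normalized to fractions so that proteins of unequal length can be compared; the names `composition_fractions` and `composition_distance` are hypothetical.

```python
from collections import Counter
from typing import Dict


def composition_fractions(seq: str) -> Dict[str, float]:
    """Return the fraction of each residue type in a sequence."""
    if not seq:
        raise ValueError("sequence must not be empty")
    counts = Counter(seq)
    return {residue: n / len(seq) for residue, n in counts.items()}


def composition_distance(seq_a: str, seq_b: str) -> float:
    """Half the sum of squared differences between fractional compositions.

    Assumption: normalizing to fractions is one simple way to make proteins
    of unequal length comparable; it is not the published index.
    """
    fa, fb = composition_fractions(seq_a), composition_fractions(seq_b)
    residues = set(fa) | set(fb)
    return 0.5 * sum((fa.get(r, 0.0) - fb.get(r, 0.0)) ** 2 for r in residues)


# Same composition at different lengths gives zero distance.
print(composition_distance("AAGG", "AG"))
# Completely different compositions give the maximum distance.
print(composition_distance("AAAA", "GGGG"))
```

A pairwise distance of this kind could feed a standard distance-based tree-building method, which is the sort of use the abstract describes for the snake venom toxins.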

4.
A sensitive procedure to compare amino acid sequences
Methods are discussed that provide sensitive criteria for detection of weak sequence homologies. They are based on the Dayhoff relatedness odds amino acid exchange matrix and certain residue physical characteristics. The search procedure uses several residue probe lengths in comparing all possible segments of two protein sequences, and search plots are shown with peak values displayed over the entire search length. Alignments are automatically effected using the highest search matrix values and without the necessity of gap penalties. Tests for significance are derived from actual protein sequences rather than a random shuffling procedure.

5.
A new, simple quantitative representation of the three-dimensional structure of globular proteins is proposed which is useful for comparison of distantly related problems, computer sorting of large sets of conformations, and search for structurally similar domains in protein databases. The folding course of the polypeptide backbone is approximated by a set of successive vectors corresponding to the elements of regular secondary structure (e.g. alpha-helices, strands of beta-sheets) and non-regular segments. The parameters specifying the spatial organization of segments in this vector model are internal coordinates, namely, lengths of the vectors, planar and dihedral angles. The proposed quantitative representation makes it possible to circumvent the problem of insertions/deletions and to avoid the stage of best superposition during protein comparison. An application was made to the comparison of three-dimensional structures of scorpion toxins Centruroides sculpturatus Ewing v-3, Buthus eupeus M9 and I5A, which have different chain lengths and low sequence similarity.
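
The internal coordinates named in this abstract (vector lengths, planar angles, dihedral angles) can be computed from a chain of segment end points as in the sketch below. It assumes each structural element is already reduced to a straight segment between consecutive points; how the authors define those end points is not stated in the abstract. The function `internal_coordinates` and the example points are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Vec = Tuple[float, float, float]


def sub(a: Vec, b: Vec) -> Vec:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def dot(a: Vec, b: Vec) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def cross(a: Vec, b: Vec) -> Vec:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def norm(a: Vec) -> float:
    return math.sqrt(dot(a, a))


def internal_coordinates(points: Sequence[Vec]) -> Tuple[List[float], List[float], List[float]]:
    """Return (lengths, planar angles, dihedral angles) for successive vectors.

    Angles are in degrees. The planar angle is taken between the directions of
    two successive vectors. Assumes no zero-length and no collinear segments.
    """
    vectors = [sub(q, p) for p, q in zip(points, points[1:])]
    lengths = [norm(v) for v in vectors]
    planar = []
    for u, v in zip(vectors, vectors[1:]):
        cosine = dot(u, v) / (norm(u) * norm(v))
        planar.append(math.degrees(math.acos(max(-1.0, min(1.0, cosine)))))
    dihedral = []
    for u, v, w in zip(vectors, vectors[1:], vectors[2:]):
        n1, n2 = cross(u, v), cross(v, w)
        # Signed torsion angle about the middle vector v.
        dihedral.append(math.degrees(math.atan2(norm(v) * dot(u, n2), dot(n1, n2))))
    return lengths, planar, dihedral


lengths, planar, dihedral = internal_coordinates(
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0), (1.0, 1.0, 1.0)]
)
print(lengths, planar, dihedral)
```

Because these quantities do not depend on the position or orientation of the molecule, two such descriptions can be compared directly, which is why the abstract notes that a best-superposition step is avoided.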

6.
During membrane fusion, the influenza A virus hemagglutinin (HA) adopts an extended helical structure that contains the viral transmembrane and fusion peptide domains at the same end of the molecule. The peptide segments that link the end of this rod-like structure to the membrane-associating domains are approximately 10 amino acids in each case, and their structure at the pH of fusion is currently unknown. Here, we examine mutant HAs and influenza viruses containing such HAs to determine whether these peptide linkers are subject to specific length requirements for the proper folding of native HA and for membrane fusion function. Using pairwise deletions and insertions, we show that the region flanking the fusion peptide appears to be important for the folding of the native HA structure but that mutant proteins with small insertions can be expressed on the cell surface and are functional for membrane fusion. HA mutants with deletions of up to 10 residues and insertions of as many as 12 amino acids were generated for the peptide linker to the viral transmembrane domain, and all folded properly and were expressed on the cell surface. For these mutants, it was possible to designate length restrictions for efficient membrane fusion, as functional activity was observed only for mutants containing linkers with insertions or deletions of eight residues or less. The linker peptide mutants are discussed with respect to requirements for the folding of native HAs and length restrictions for membrane fusion activity.

7.
Insertions and deletions of nucleotides in the genes encoding the variable domains of antibodies are natural components of the hypermutation process, which may expand the available repertoire of hypervariable loop lengths and conformations. Although insertion of amino acids has also been utilized in antibody engineering, little is known about the functional consequences of such modifications. To investigate this further, we have introduced single-codon insertions and deletions as well as more complex modifications in the complementarity-determining regions of human antibody fragments with different specificities. Our results demonstrate that single amino acid insertions and deletions are generally well tolerated and permit production of stably folded proteins, often with retained antigen recognition, despite the fact that the thus modified loops carry amino acids that are disallowed at key residue positions in canonical loops of the corresponding length or are of a length not associated with a known canonical structure. We have thus shown that single-codon insertions and deletions can efficiently be utilized to expand structure and sequence space of the antigen-binding site beyond what is encoded by the germline gene repertoire.

8.
We have recently developed a fast approach to comparisons of 3-dimensional structures. Our method is unique, treating protein structures as collections of unconnected points (atoms) in space. It is completely independent of the amino acid sequence order. It is unconstrained by insertions, deletions, and chain directionality. It matches single, isolated amino acids between 2 different structures strictly by their spatial positioning regardless of their relative sequential position in the amino acid chain. It automatically detects a recurring 3D motif in protein molecules. No predefinition of the motif is required. The motif can be either in the interior of the proteins or on their surfaces. In this work, we describe an enhancement over our previously developed technique, which considerably reduces the complexity of the algorithm. This results in an extremely fast technique. A typical pairwise comparison of 2 protein molecules requires less than 3 s on a workstation. We have scanned the structural database with dozens of probes, successfully detecting structures that are similar to the probe. To illustrate the power of this method, we compare the structure of a trypsin-like serine protease against the structural database. Besides detecting homologous trypsin-like proteases, we automatically obtain 3D, sequence order-independent, active-site similarities with subtilisin-like and sulfhydryl proteases. These similarities equivalence isolated residues, not conserving the linear order of the amino acids in the chains. The active-site similarities are well known and have been detected by manually inspecting the structures in a time-consuming, laborious procedure. This is the first time such equivalences are obtained automatically from the comparison of full structures. The far-reaching advantages and the implications of our novel algorithm to studies of protein folding, to evolution, and to searches for pharmacophoric patterns are discussed.

9.
Nucleotide sequence analysis has demonstrated that interspecific size variation in the YP2 yolk protein among Hawaiian Drosophila is due to in-frame insertions and deletions in two repetitive segments of the coding region of the Yp2 gene. Sequence comparisons of the complex repetitive region close to the 5′ end of this gene across 34 endemic Hawaiian taxa revealed five length morphs, spanning a length difference of 21 nucleotides (nt). A phylogenetic character reconstruction of the length mutations on an independently derived molecular phylogeny showed clade-specific length variants arising from six ancient events: two identical insertions of 6 nt, and four deletions, one of 6 nt, one of 12 nt, and two identical but independent deletions of 15 nt. These mutations can be attributed to replication slippage with nontandem trinucleotide repeats playing a major role in the slipped-strand mispairing. Geographic analysis suggests that the 15 nt deletion which distinguishes the planitibia subgroup from the cyrtoloma subgroup occurred on Oahu about 3 million years ago. The homoplasies observed caution against relying too heavily on nucleotide insertions/deletions for phylogenetic inference. In contrast to the extensive repeat polymorphisms within other Drosophila and the human species, the more complex 5′ Yp2 repetitive region analyzed here appears to lack polymorphism among Hawaiian Drosophila, perhaps due to founder effects, low population sizes, and hitchhiking effects of selection on the immediately adjacent 5′ region. Correspondence to: M.P. Kambysellis
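
The term "in-frame" in this abstract means that each insertion or deletion is a whole number of codons, so the downstream reading frame is preserved. The sketch below checks this for the six event lengths listed in the abstract; the function names `is_in_frame` and `codons_changed` are hypothetical.

```python
def is_in_frame(indel_length_nt: int) -> bool:
    """An indel preserves the reading frame when its length is a multiple of 3."""
    return indel_length_nt % 3 == 0


def codons_changed(indel_length_nt: int) -> int:
    """Number of whole codons gained or lost by an in-frame indel."""
    if not is_in_frame(indel_length_nt):
        raise ValueError("indel shifts the reading frame")
    return indel_length_nt // 3


# Event lengths (nt) as listed in the abstract: two insertions, four deletions.
insertions = [6, 6]
deletions = [6, 12, 15, 15]

for length in insertions + deletions:
    print(length, "nt:", codons_changed(length), "codons")
```

All six events are multiples of three, which is consistent with the abstract's attribution of the mutations to slippage involving trinucleotide repeats.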

10.
The Escherichia coli btuB product is an outer membrane protein that mediates the TonB-coupled active transport of cobalamins and the uptake of the E colicins and bacteriophage BF23. The roles of various segments of the BtuB protein in its function or cellular localization were investigated by analysis of several genetic constructs. Hybrid proteins in which various lengths from the amino terminus of BtuB were linked to alkaline phosphatase (btuB::phoA genes) were all secreted across the cytoplasmic membrane. The BtuB-PhoA proteins that carried up to 327 amino acids of BtuB appeared to reside in the periplasmic space, whereas hybrid proteins containing at least 399 amino acids of BtuB were associated with the outer membrane. Eleven in-frame internal deletion mutations that spanned more than half of the mature sequence were prepared by combining appropriate restriction fragments from btuB variants with 6-bp linker insertions. None of the deleted proteins was able to complement any BtuB functions, and only three of them were detectable in the outer membrane, suggesting that most of the deletions affected sequences needed for stable association with the outer membrane. Duplications covering the same portions of BtuB were prepared in the same manner. All of these partial duplication variants complemented all BtuB functions, although some gave substantially reduced levels of activity. These proteins were found in the outer membrane, although some were subject to proteolytic cleavage within or near the duplicated segment. These results indicate that the insertion of BtuB into the outer membrane requires the presence of several regions of the BtuB protein and that the presence of extra or redundant segments of the protein can be tolerated during its insertion and function.

11.
A new, simple quantitative representation of the three-dimensional structure of globular proteins is proposed which is useful for comparison of distantly related problems, computer sorting of large sets of conformations, and search for structurally similar domains in protein databases. The folding course of the polypeptide backbone is approximated by a set of successive vectors corresponding to the elements of regular secondary structure (e.g. α-helices, strands of β-sheets) and non-regular segments. The parameters specifying the spatial organization of segments in this vector model are internal coordinates, namely, lengths of the vectors, planar and dihedral angles. The proposed quantitative representation makes it possible to circumvent the problem of insertions/deletions and to avoid the stage of best superposition during protein comparison. An application was made to the comparison of three-dimensional structures of scorpion toxins Centruroides sculpturatus Ewing v-3, Buthus eupeus M9 and I5A, which have different chain lengths and low sequence similarity.

12.
Sequence comparisons of the genomes of white spot syndrome virus (WSSV) strains have identified regions containing variable-length insertions/deletions (i.e. indels). Indel-I and Indel-II, positioned between open reading frames (ORFs) 14/15 and 23/24, respectively, are the largest and the most variable. Here we examined the nature of these 2 indel regions in 313 WSSV-infected Penaeus monodon shrimp collected between 2006 and 2009 from 76 aquaculture ponds in the Mekong Delta region of Vietnam. In the Indel-I region, 2 WSSV genotypes with deletions of either 5950 or 6031 bp in length compared with that of a reference strain from Thailand (WSSV-TH-96-II) were detected. In the Indel-II region, 4 WSSV genotypes with deletions of 8539, 10970, 11049 or 11866 bp in length compared with that of a reference strain from Taiwan (WSSV-TW) were detected, and the 8539 and 10970 bp genotypes predominated. Indel-II variants with longer deletions were found to correlate statistically with WSSV-diseased shrimp originating from more intensive farming systems. Like Indel-I lengths, Indel-II lengths also varied based on the Mekong Delta province from which farmed shrimp were collected.

13.
Coiled‐coils are essential components of many protein complexes. First discovered in structural proteins such as keratins, they have since been found to figure largely in the assembly and dynamics required for diverse functions, including membrane fusion, signal transduction and motors. Coiled‐coils have a characteristic repeating seven‐residue geometric and sequence motif, which is sometimes interrupted by the insertion of one or more residues. Such insertions are often highly conserved and critical to interdomain communication in signaling proteins such as bacterial histidine kinases. Here we develop the “accommodation index” as a parameter that allows automatic detection and classification of insertions based on the three dimensional structure of a protein. This method allows precise identification of the type of insertion and the “accommodation length” over which the insertion is structurally accommodated. A simple theory is presented that predicts the structural perturbations of 1, 3, 4 residue insertions as a function of the length over which the insertion is accommodated. Analysis of experimental structures is in good agreement with theory, and shows that short accommodation lengths give rise to greater perturbation of helix packing angles, changes in local helical phase, and increased structural asymmetry relative to long accommodation lengths. Cytoplasmic domains of histidine kinases in different signaling states display large changes in their accommodation lengths, which can now be seen to underlie diverse structural transitions including symmetry/asymmetry and local variations in helical phase that accompany signal transduction.

14.
15.
An automated algorithm is presented that delineates protein sequence fragments which display similarity. The method incorporates a selection of a number of local nonoverlapping sequence alignments with the highest similarity scores and a graph-theoretical approach to elucidate the consistent start and end points of the fragments comprising one or more ensembles of related subsequences. The procedure allows the simultaneous identification of different types of repeats within one sequence. A multiple alignment of the resulting fragments is performed and a consensus sequence derived from the ensemble(s). Finally, a profile is constructed from the multiple alignment to detect possible and more distant members within the sequence. The method tolerates mutations in the repeats as well as insertions and deletions. The sequence spans between the various repeats or repeat clusters may be of different lengths. The technique has been applied to a number of proteins where the repeating fragments have been derived from information additional to the protein sequences. © 1993 Wiley-Liss, Inc.

16.
Comparison of super-secondary structures in proteins
A method of comparing the conformations of different, but structurally related proteins is described. Local variations, such as the systematic translation of a helix, or the position of deletions and insertions can be detected, and the correspondence of only marginally similar structures can be measured. The occurrence of larger continuous folds (“super-secondary structures”) has been detected in the comparison of lactate dehydrogenase with itself and with other protein structures.

17.
Qian B, Goldstein RA. Proteins 2001, 45(1):102-104
Protein sequence alignment has become a widely used method in the study of newly sequenced proteins. Most sequence alignment methods use an affine gap penalty to assign scores to insertions and deletions. Although affine gap penalties represent the relative ease of extending a gap compared with initializing a gap, they are still an obvious oversimplification of the real processes that occur during sequence evolution. To improve the efficiency of sequence alignment methods and to obtain a better understanding of the process of sequence evolution, we wanted to find a more accurate model of insertions and deletions in homologous proteins. In this work, we extract the probability of a gap occurrence and the resulting gap length distribution in distantly related proteins (sequence identity < 25%) using alignments based on their common structures. We observe a distribution of gaps that can be fitted with a multiexponential with four distinct components. The results suggest new approaches to modeling insertions and deletions in sequence alignments.
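
The two gap models contrasted in this abstract can be written down as in the sketch below. The affine penalty uses the common convention of an opening cost for the first gapped position plus an extension cost for each further position; other tools define it slightly differently. The numeric parameters and the function names `affine_gap_penalty` and `multiexponential` are illustrative assumptions, not values or code from the paper.

```python
import math
from typing import Sequence, Tuple


def affine_gap_penalty(length: int, gap_open: float = 10.0, gap_extend: float = 0.5) -> float:
    """Affine gap cost: gap_open for the first position, gap_extend for each extra one.

    The default values are placeholders chosen for illustration only.
    """
    if length <= 0:
        return 0.0
    return gap_open + gap_extend * (length - 1)


def multiexponential(length: float, components: Sequence[Tuple[float, float]]) -> float:
    """Sum of exponential terms, each given as (amplitude, decay_length).

    A gap-length distribution "fitted with a multiexponential with four
    distinct components" would use four such (amplitude, decay_length) pairs.
    """
    return sum(a * math.exp(-length / tau) for a, tau in components)


# Affine cost grows linearly after the first gapped position.
print([affine_gap_penalty(k) for k in (1, 2, 3)])

# Hypothetical two-component example: one fast and one slow decay.
print(multiexponential(0, [(0.5, 1.0), (0.5, 10.0)]))
```

An affine penalty corresponds to a single exponential decay of gap probability with length, so a sum of several exponentials gives more weight to long gaps than any single affine parameter pair can.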

18.
The set of "expansion segments" of any eukaryotic 26S/28S ribosomal RNA (rRNA) gene is responsible for the bulk of the difference in length between the prokaryotic 23S rRNA gene and the eukaryotic 26S/28S rRNA gene. The expansion segments are also responsible for interspecific fluctuations in length during eukaryotic evolution. They show a consistent bias in base composition in any species; for example, they are AT rich in Drosophila melanogaster and GC rich in vertebrate species. Dot-matrix comparisons of sets of expansion segments reveal high similarities between members of a set within any 28S rRNA gene of a species, in contrast to the little or spurious similarity that exists between sets of expansion segments from distantly related species. Similarities among members of a set of expansion segments within any 28S rRNA gene cannot be accounted for by their base-compositional bias alone. In contrast, no significant similarity exists within a set of "core" segments (regions between expansion segments) of any 28S rRNA gene, although core segments are conserved between species. The set of expansion segments of a 26S/28S gene is coevolving as a unit in each species, at the same time as the family of 28S rRNA genes, as a whole, is undergoing continual homogenization, making all sets of expansion segments from all ribosomal DNA (rDNA) arrays in a species similar in sequence. Analysis of DNA simplicity of 26S/28S rRNA genes shows a direct correlation between significantly high relative simplicity factors (RSFs) and sequence similarity among a set of expansion segments. A similar correlation exists between RSF values, overall rDNA lengths, and the lengths of individual expansion segments. Such correlations suggest that most length fluctuations reflect the gain and loss of simple sequence motifs by slippage-like mechanisms. 
We discuss the molecular coevolution of expansion segments, which takes place against a background of slippage-like and unequal crossing-over mechanisms of turnover that are responsible for the accumulation of interspecific differences in rDNA sequences.

19.
Structural comparisons of sorbitol dehydrogenase with zinc-containing 'long' alcohol dehydrogenases reveal distant but clear relationships. An alignment suggests 93 positional identities with horse liver alcohol dehydrogenase (25% of 374 positions) and 73 identities with yeast alcohol dehydrogenase (20%). Sorbitol dehydrogenase forms a link between these distantly related alcohol dehydrogenases and is in some regions more similar to one of them than they are to each other. 43 residues (11%) are common to all three enzymes and include a heavy over-representation of glycine (half of all glycine residues in sorbitol dehydrogenase), showing the importance of space restrictions in protein structures. Four regions are well conserved, two in each domain of horse liver alcohol dehydrogenase. They are two segments close to the active-site zinc atom of the catalytic domain, and two in the central beta-pleated sheet strands of the coenzyme-binding domain. These similarities demonstrate the general importance of internal and central building units in proteins. Large variations affect a region adjacent to the third protein ligand to the active-site zinc atom in horse liver alcohol dehydrogenase. Such changes at active sites of related enzymes are unusual. Other large differences concern the segment around the non-catalytic zinc atom of horse liver alcohol dehydrogenase; three of its four cysteine ligands are absent from sorbitol dehydrogenase. Three segments with several exchanges correspond to a continuous region with superficial areas, inter-domain contacts and inter-subunit interactions in the catalytic domain of alcohol dehydrogenase. They may correlate with the altered quaternary structure of sorbitol dehydrogenase. Regions corresponding to top and bottom beta-strands in the coenzyme-binding domain of the alcohol dehydrogenase are also little conserved. Within sorbitol dehydrogenase, a large segment shows an internal similarity.
The two distantly related alcohol dehydrogenases and sorbitol dehydrogenase form a triplet of enzymes illustrating basic protein relationships. They are ancestrally close enough to establish similarities, yet sufficiently divergent to illustrate changes in all but fundamental properties.

20.
Although oligonucleotide probes complementary to single nucleotide substitutions are commonly used in microarray-based screens for genetic variation, little is known about the hybridization properties of probes complementary to small insertions and deletions. It is necessary to define the hybridization properties of these latter probes in order to improve the specificity and sensitivity of oligonucleotide microarray-based mutational analysis of disease-related genes. Here, we compare and contrast the hybridization properties of oligonucleotide microarrays consisting of 25mer probes complementary to all possible single nucleotide substitutions and insertions, and one and two base deletions in the 9168 bp coding region of the ATM (ataxia telangiectasia mutated) gene. Over 68 different dye-labeled single-stranded nucleic acid targets representing all ATM coding exons were applied to these microarrays. We assess hybridization specificity by comparing the relative hybridization signals from probes perfectly matched to ATM sequences to those containing mismatches. Probes complementary to two base substitutions displayed the highest average specificity followed by those complementary to single base substitutions, single base deletions and single base insertions. In all the cases, hybridization specificity was strongly influenced by sequence context and possible intra- and intermolecular probe and/or target structure. Furthermore, single nucleotide substitution probes displayed the most consistent hybridization specificity data followed by single base deletions, two base deletions and single nucleotide insertions. Overall, these studies provide valuable empirical data that can be used to more accurately model the hybridization properties of insertion and deletion probes and improve the design and interpretation of oligonucleotide microarray-based resequencing and mutational analysis.
