首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A data base was compiled containing the amino acid sequences of 12 aspartate aminotransferases and 11 other aminotransferases. A comparison of these sequences by a standard alignment method confirmed the previously reported homology of all aspartate aminotransferases and Escherichia coli tyrosine aminotransferase. However, no significant similarity between these proteins and any of the other aminotransferases was detected. A more rigorous analysis, focusing on short sequence segments rather than the total polypeptide chain, revealed that rat tyrosine aminotransferase and Saccharomyces cerevisiae and Escherichia coli histidinol-phosphate aminotransferase share several homologous sequence segments with aspartate aminotransferases. For comparison of the complete sequences, a multiple sequence editor was developed to display the whole set of amino acid sequences in parallel on a single work-sheet. The editor allows gaps in individual sequences or a set of sequences to be introduced and thus facilitates their parallel analysis and alignment. Several clusters of invariant residues at corresponding positions in the amino acid sequences became evident, clearly establishing that the cytosolic and the mitochondrial isoenzyme of vertebrate aspartate aminotransferase, E. coli aspartate aminotransferase, rat and E. coli tyrosine aminotransferase, and S. cerevisiae and E. coli histidinol-phosphate aminotransferase are homologous proteins. Only 12 amino acid residues out of a total of about 400 proved to be invariant in all sequences compared; they are either involved in the binding of pyridoxal 5'-phosphate and the substrate, or appear to be essential for the conformation of the enzymes.  相似文献   

2.
An extensive search for internal regularities in amino acid sequences has been made, using both the genetic code and the relative frequencies of amino acid alternatives in homologous proteins. The two methods give very similar results and strongly suggest the occurrence of significant linear and inverted repetitions (similar sequences of opposite polarity) in several proteins. A hypothesis is developed to explain the occurrence of such internal regularities in proteins. This hypothesis is based on a process of duplication of an ancestral loop in which a symmetrical arrangement of amino acid allows stabilization by interaction between the amino acid side chains.  相似文献   

3.
The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45–49%) than to the eubacterial counterparts (35%)  相似文献   

4.
The primary structure of the alpha subunit of elongation factor 1 (EF-1 alpha) from human MOLT 4 cells was determined by cDNA sequencing. The data show that the conservation of the amino acid sequence is more than 80% when compared with yeast and Artemia EF-1 alpha. An inventory of amino acid sequences around the guanine-nucleotide-binding site in elongation factor Tu from Escherichia coli and homologous amino acid sequences in G proteins, initiation and elongation factors and proteins from the RAS family shows two regions containing conserved sequence elements. Region I has the sequence apolar-Xaa-Xaa-Xaa-Gly-Xaa-Xaa-Yaa-Xaa-Gly-LYs-Thr(Ser)- -Xaa-Xaa-Xaa-Xaa-X-apolar. Except for RAS proteins, Yaa is always an acidic amino acid residue. Region II is characterized by the invariant sequence apolar-apolar-Xaa-Xaa-Asn-Lys-Xaa-Asp. In order to facilitate sequence comparison we have used a graphic display, which is based on the hydrophilicity values of individual amino acids in a sequence.  相似文献   

5.
Summary The observed gene overlays in the viruses X174 and SV40 show a surprising economy of information storage; two different amino acid sequences are read in different frames from the same stretch of DNA. This phenomenon appears contradictory in that the information in the two overlaid amino acid sequences is strongly interdependent, yet each of the two proteins has evolved to its own well-defined function. The contradiction can be resolved by assuming sufficiently large degeneracy of the information contents of amino acid sequences with respect to function. Such a degeneracy is familiar from homologous proteins where a given biological function is implemented by many different amino acid sequences. It is shown that the very existence of viral overlays allows to derive a lower limit for the magnitude of this degeneracy: The degeneracy is equal to, or greater than fourfold; on the average, at each position of the chain a choice of 1 out of 5 or less amino acids, and not a choice of 1 out of 20 is necessary for constructing a protein with a specified function. In addition, the strong dependence of overlay probabilities on chain length allows the definition of a maximal length of overlays; in bacterial viruses overlay regions should be shorter than about 150 residues.  相似文献   

6.
Ribosomal proteins were extracted from 50S ribosomal subunits of the archaebacterium Halobacterium marismortui by decreasing the concentration of Mg2+ and K+, and the proteins were separated and purified by ion-exchange column chromatography on DEAE-cellulose. Ten proteins were purified to homogeneity and three of these proteins were subjected to sequence analysis. The complete amino acid sequences of the ribosomal proteins L25, L29 and L31 were established by analyses of the peptides obtained by enzymatic digestion with trypsin, Staphylococcus aureus protease, chymotrypsin and lysylendopeptidase. Proteins L25, L29 and L31 consist of 84, 115 and 95 amino acid residues with the molecular masses of 9472 Da, 12293 Da and 10418 Da respectively. A comparison of their sequences with those of other large-ribosomal-subunit proteins from other organisms revealed that protein L25 from H. marismortui is homologous to protein L23 from Escherichia coli (34.6%), Bacillus stearothermophilus (41.8%), and tobacco chloroplasts (16.3%) as well as to protein L25 from yeast (38.0%). Proteins L29 and L31 do not appear to be homologous to any other ribosomal proteins whose structures are so far known.  相似文献   

7.
An algorithm is presented for localizing variable and constant regions in homologous protein sequences. A set of aligned protein sequences is divided into two groups consisting of m and n sequences. Each group contains sequences of most related species. Value of the position dissimilarity of proteins from different groups of m and n sequences is defined as a number of failures to coincide in comparison with all possible mXn pairs of amino acid residues in the position (each from different group) divided by mXn. The position dissimilarity value of m protein sequences within a group is defined as the number of failures to coincide in comparison with all possible mX X(m-1)/2 pairs of amino acid residues divided by mX(m-1)/2. Ten position average of dissimilarity values is plotted vs. the first position number. Area of the figure included between the profile of dissimilarity values and its mean value line characterizes the overall irregularity of amino acid substitutions along the protein sequences. If the area value is greater than the average area for 1000 random profile by more than two standard deviation units, the profile extrema containing the "surplus" of area are cut off. The cut off stretches are likely to be variable and constant regions. In case of "between groups" comparisons it is found that the overall irregularity of amino acid substitutions is very high for all considered families of proteins; phospholipases A2, aspartate aminotransferases, alpha-subunits of Na+,K(+)-ATPase, L- and M-subunits of photosynthetic bacteria photoreaction centre, human rhodopsins.  相似文献   

8.
The complete amino acid sequences of ribosomal proteins L16, L23 and L33 from the archaebacterium Halobacterium marismortui were determined. The sequences were established by manual sequencing of peptides produced with several proteases as well as by cleavage with dilute HCl. Proteins L16, L23 and L33 consist of 119, 154 and 69 amino acid residues, and their molecular masses are 13538, 16812 and 7620 Da, respectively. The comparison of their sequences with those of ribosomal proteins from other organisms revealed that L23 and L33 are related to eubacterial ribosomal proteins from Escherichia coli and Bacillus stearothermophilus, while protein L16 was found to be homologous to a eukaryotic ribosomal protein from yeast. These results provide information about the special phylogenetic position of archaebacteria.  相似文献   

9.
The complete amino acid sequences of ribosomal proteins L9, L20, L21/22, L24 and L32 from the archaebacterium Halobacterium marismortui were determined. The comparison of the sequences of these proteins with those from other organisms revealed that proteins L21/22 and L24 are homologous to ribosomal protein Yrp29 from yeast and L19 from rat, respectively, and that H. marismortui L20 is homologous to L30 from eubacteria. H. marismortui ribosomal protein L9 showed sequence homology to both L29 from yeast and L15 from eubacteria. No homologous protein was found for H. marismortui L32. These results are discussed with respect to the phylogenetic relationship between eubacteria, archaebacteria and eukaryotes.  相似文献   

10.
11.
An examination of the available amino acid sequences of GTP-binding proteins has revealed that each contains a polypeptide essentially homologous for all of them. These sequences for elongation factor-Tu (EF-Tu) and the human bladder protein p21 exhibit a singular degree of homology (50%). Chemical and structural evidence indicates that this sequence in EF-Tu constitutes part of the nucleotide-binding site. The homologous sequences may therefore contribute to the GTP-binding sites of the other proteins.  相似文献   

12.
A set of aligned homologous protein sequences is divided into two groups consisting of m and n sequences. Each group contains sequences from the most related organisms. Value of the position dissimilarity of proteins from different groups of m and n sequences is defined as a number of mismatches in comparison of all possible m X n pairs of amino acid residues in the position (each from different group) divided by m X n. Ten position average of dissimilarity values is plotted vs. the first position number. Area of the figure between the profile of dissimilarity values and its mean value line characterizes the overall irregularity of amino acid substitutions along the protein sequences. If the area is greater than the average area for 1000 random profiles by more than two standard deviation units, the profile extrema containing the "surplus" of area are cut off. The cut-off stretches are likely to be variable and constant regions. If necessary, each of stretches may be separately tested and statistically estimated using a standard size sample of artificial protein families. Intergroup comparison of protein sequences reveals high overall irregularity of amino acid substitutions and identifies variable and conservative regions for all considered families of proteins: phospholipases A2, aspartate aminotransferases, alpha-subunits of Na+, K(+)-ATPase, L- and M-subunits of photosynthetic bacteria photoreaction centre, human rhodopsins.  相似文献   

13.
Using a maximum-likelihood formalism, we have developed a method with which to reconstruct the sequences of ancestral proteins. Our approach allows the calculation of not only the most probable ancestral sequence but also of the probability of any amino acid at any given node in the evolutionary tree. Because we consider evolution on the amino acid level, we are better able to include effects of evolutionary pressure and take advantage of structural information about the protein through the use of mutation matrices that depend on secondary structure and surface accessibility. The computational complexity of this method scales linearly with the number of homologous proteins used to reconstruct the ancestral sequence.  相似文献   

14.
We have conducted a human cDNA project to predict protein-coding sequences (CDSs) in large cDNAs (> 4 kb) since 1994, and the number of newly identified genes, known as KIAA genes, already exceeds 2000. The ultimate goal of this project is to clarify the physiological functions of the proteins encoded by KIAA genes. To this end, the project has recently been expanded to include isolation and characterization of mouse KIAA-counterpart genes. We herein present the entire sequences and the chromosome loci of 500 mKIAA cDNA clones and 13 novel cDNA clones that were incidentally identified during this project. The average size of the 513 cDNA sequences reached 4.3 kb and that of the deduced amino acid sequences from these cDNAs was 816 amino acid residues. By comparison of the predicted CDSs between mouse and human KIAAs, 12 mKIAA cDNA clones were assumed to be differently spliced isoforms of the human cDNA clones. The comparison of mouse and human sequences also revealed that four pairs of human KIAA cDNAs are derived from single genes. Notably, a homology search against the public database indicated that 4 out of 13 novel cDNA clones were homologous to the disease-related genes.  相似文献   

15.
Identification of functional open reading frames in chloroplast genomes   总被引:7,自引:0,他引:7  
K H Wolfe  P M Sharp 《Gene》1988,66(2):215-222
We have used a rapid computer dot-matrix comparison method to identify all DNA regions which have been evolutionarily conserved between the completely sequenced chloroplast genomes of tobacco and a liverwort. Analysis of these regions reveals 74 homologous open reading frames (ORFs) which have been conserved as to length and amino acid sequence; these ORFs also have an excess of nucleotide substitutions at silent sites of codons. Since the nonfunctional parts of these genomes have become saturated with mutations and show no sequence similarity whatsoever, the homologous ORFs are almost certainly functional. A further four pairs of ORFs show homology limited to only a short part of their putative gene products. Amino acid sequence identities range between 50 and 99%; some chloroplast proteins are seen to be among the most slowly evolving of all known proteins. A search of the nucleotide and amino acid sequence databanks has revealed several previously unidentified genes in chloroplast sequences from other species, but no new homologies to prokaryotic genes.  相似文献   

16.
The complete primary structures of proteins L17, L27 and S9 extracted from the Bacillus stearothermophilus ribosomes with 1 M NaCl and purified to homogeneity by column chromatography have been determined. The amino acid sequences of these proteins are compared to those of the homologous ribosomal proteins from Escherichia coli. The number of identical amino acid residues between the homologous proteins lies between 33-55%.  相似文献   

17.
We developed novel programs for displaying and analyzing the transmembrane alpha-helical segments (TMSs) in the aligned sequences of homologous integral membrane proteins. TMS_ALIGN predicts the positions of putative TMSs in multiply aligned protein sequences and graphically shows the TMSs in the alignment. TMS_SPLIT (1). predicts the positions of TMSs for each sequence; (2). allows a user to select proteins with a specified number of TMSs, and (3). splits the sequences into groups of TMSs of equal numbers. TMS_CUT works like TMS_SPLIT, but it can cut sequences with any combination of TMSs. The BASS program similarly allows comparison of protein repeat elements, equivalent to TMS_SPLIT plus IC, but it provides the comparison data expressed in BLAST E values. These programs, together with the IntraCompare program, facilitate the identification of repeat sequences in integral membrane proteins. They also facilitate the estimation of protein topology and the determination of evolutionary pathways.  相似文献   

18.
Summary Completion of the sequence determination of all 52 Escherichia coli ribosomal proteins enabled a final comparison of their sequences. Similarities in amino acid compositions were compared to the relatedness of the sequences, which was analyzed statistically with the aid of the computer programs RELATE and ALIGN.Among the examined 52×52 possible protein pairs at least 40 pairs were found that can be regarded as distantly related (showing segment comparison score values slightly above 3.0 S.D. units). These protein pairs were further examined with the programs ALIGN and SEEK to locate homologous sequence stretches. In no case were two complete homologous sequences found (with the exception of the known identical pairs L7/L12 and S20/L26). However, short homologous sequence regions were observed. Beside those protein pairs that show significant although distant relatedness, other pairs were slightly below the threshold value of 3.0 S.D. units.Those pairs observed to be distantly related consisted either of two proteins from the same subunit or of one protein from each of the different subunits. A further analysis of these pairs revealed a correlation between their relatedness and their time of incorporation into the ribosome during assembly.  相似文献   

19.
Mammalian hibernation is a unique physiological adaptation that allows the sustainment of life under extremely low body temperatures. In the chipmunk, we found four proteins related specifically to hibernation. These proteins started to diminish in concentration in the blood before and disappeared during hibernation. These proteins reappeared in the blood as hibernation ceased and remained during nonhibernation. The complete or partial amino acid sequences of the four proteins showed that three (27-, 25-, and 20-kDa) were previously unknown, whereas another (55-kDa) is highly homologous with alpha 1-antitrypsin. The three novel proteins are homologous, indicating that they are a family. In the NH2-terminal regions of these proteins, a collagen-like amino acid sequence is present, whereas in their COOH-terminal regions, two sequences, Ser-Ala-Phe-Ala-Val-Lys and Val-Trp-Leu-Glu, are conserved. Analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis under nonreducing conditions and gel permeation chromatography under denaturating conditions revealed that the four proteins form a 140-kDa complex in the plasma fraction. The novel proteins were detected in blood of another hibernator, the ground squirrel, but not in rodent nonhibernators, namely tree squirrels and rats. The present finding is the first identification of a hibernation-specific protein. The presence of specific proteins in hibernators suggests the involvement of genetic factors in the control of hibernation. These proteins provide valuable tools for understanding molecular mechanisms of mammalian hibernation.  相似文献   

20.
Internal homologies in an amino acid sequence of a protein and in amino acid sequences of two different proteins are examined, using correlation coefficients calculated from the sequences when residues are replaced by various quantitative properties of the amino acids such as hydrophobicity. To improve the signal-noise ratio the average correlation coefficient is used to detect homology because the correlation depends on the property considered. In this way, any sequence repetition in a protein and the extent of the similarity and difference among proteins can be estimated quantitatively. The procedure was applied first to the sequences of proteins which have been assumed on other grounds to contain some internal sequence repetitions, α-tropomyosin from rabbit skeletal muscle, calmodulin from bovine brain, troponin C from skeletal and cardiac muscle, and then to the sequences of calcium binding proteins, calmodulin, troponin C, and L2 light chain of myosin. The results show that α-tropomyosin has a markedly periodic sequence at intervals of multiples of seven residues throughout the whole sequence, and calmodulin and skeletal troponin C contain two homologous sequences, the homology of troponin C being weaker than that of calmodulin. Candidates for the calcium binding regions of both troponin C, calmodulin, and L2 light chain are the homologous parts having a high average correlation coefficient (about 0·5) with respect to the sequences of the CD and EF hand regions of carp parvalbumin. The procedure may be a useful method for searching for homologous segments in amino acid sequences.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号