首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The concept of consensus in multiple sequence alignments (MSAs) has been used to design and engineer proteins previously with some success. However, consensus design implicitly assumes that all amino acid positions function independently, whereas in reality, the amino acids in a protein interact with each other and work cooperatively to produce the optimum structure required for its function. Correlation analysis is a tool that can capture the effect of such interactions. In a previously published study, we made consensus variants of the triosephosphate isomerase (TIM) protein using MSAs that included sequences form both prokaryotic and eukaryotic organisms. These variants were not completely native-like and were also surprisingly different from each other in terms of oligomeric state, structural dynamics, and activity. Extensive correlation analysis of the TIM database has revealed some clues about factors leading to the unusual behavior of the previously constructed consensus proteins. Among other things, we have found that the more ill-behaved consensus mutant had more broken correlations than the better-behaved consensus variant. Moreover, we report three correlation and phylogeny-based consensus variants of TIM. These variants were more native-like than the previous consensus mutants and considerably more stable than a wild-type TIM from a mesophilic organism. This study highlights the importance of choosing the appropriate diversity of MSA for consensus analysis and provides information that can be used to engineer stable enzymes.  相似文献   

2.
Base sequence studies of 300 nucleotide renatured repeated human DNA clones   总被引:117,自引:0,他引:117  
A band of 300 nucleotide long duplex DNA is released by treating renatured repeated human DNA with the single strand-specific endonuclease S1. Since many of the interspersed repeated sequences in human DNA are 300 nucleotides long, this band should be enriched in such repeats. We have determined the nucleotide sequences of 15 clones constructed from these 300 nucleotide S1-resistant repeats. Ten of these cloned sequences are members of the Alu family of interspersed repeats. These ten sequences share a recognizable consensus sequence from which individual clones have an average divergence of 12.8%. The 300 nucleotide Alu family consensus sequence has a dimeric structure and was evidently formed from a head to tail duplication of an ancestral monomeric sequence. Three of the remaining clones are variations on a simple pentanucleotide sequence previously reported for human satellite III DNA. Two of the 15 clones have distinct and complex sequences and may represent other families of interspersed repeated sequences.  相似文献   

3.
Recent investigations into the translation termination sites of various organisms have revealed that not only stop codons but also sequences around stop codons have an effect on translation termination. To investigate the relationship between these sequence patterns and translation as well as its termination efficiency, we analysed the correlation between strength of consensus and translation efficiency, as predicted according to Codon Adaptation Index (CAI) value. We used RIKEN full-length mouse cDNA sequences and ten other eukaryotic UniGene datasets from NCBI for the analyses. First, we conducted sequence profile analyses following translation termination sites. We found base G and A at position +1 as a strong consensus for mouse cDNA. A similar consensus was found for other mammals, such as Homo sapiens, Rattus norvegicus and Bos taurus. However, some plants had different consensus sequences. We then analysed the correlation between the strength of consensus at each position and the codon biases of whole coding regions, using information content and CAI value. The results showed that in mouse cDNA, CAI value had a positive correlation with information content at positions +1. We also found that, for positions with strong consensus, the strength of the consensus is likely to have a positive correlation with CAI value in some other eukaryotes. Along with these observations, biological insights into the relationship between gene expression level, codon biases and consensus sequence around stop codons will be discussed.  相似文献   

4.
Species-specific patterns of DNA bending and sequence.   总被引:16,自引:6,他引:10       下载免费PDF全文
Nucleotide sequences in the GenEMBL database were analyzed using strategies designed to reveal species-specific patterns of DNA bending and DNA sequence. The results uncovered striking species-dependent patterns of bending with more variations among individual organisms than between prokaryotes and eukaryotes. The frequency of bent sites in sequences from different bacteria was related to genomic A + T content and this relationship was confirmed by electrophoretic analysis of genomic DNA. However, base composition was not an accurate predictor for DNA bending in eukaryotes. Sequences from C. elegans exhibited the highest frequency of bent sites in the database and the RNA polymerase II locus from the nematode was the most bent gene in GenEMBL. Bent DNA extended throughout most introns and gene flanking segments from C.elegans while exon regions lacked A-tract bending characteristics. Independent evidence for the strong bending character of this genome was provided by electrophoretic studies which revealed that a large number of the fragments from C.elegans DNA exhibited anomalous gel mobilities when compared to genomic fragments from over 20 other organisms. The prevalence of bent sites in this genome enabled us to detect selectively C.elegans sequences in a computer search of the database using as probes C.elegans introns, bending elements, and a 20 nucleotide consensus sequence for bent DNA. This approach was also used to provide additional examples of species-specific sequence patterns in eukaryotes where it was shown that (A) greater than or equal to 10 and (A.T) greater than or equal to 5 tracts are prevalent throughout the untranslated DNA of D.discodium and P.falciparum, respectively. These results provide new insight into the organization of eukaryotic DNA because they show that species-specific patterns of simple sequences are found in introns and in other untranslated regions of the genome.  相似文献   

5.
H G Griffin  S R Swindell  M J Gasson 《Gene》1992,122(1):193-197
Lactate dehydrogenase (LDH; EC1.1.1.27) is a key enzyme in the fermentation of milk by lactic acid bacteria used in the dairy industry. An 800-bp DNA fragment containing part of the gene (ldh) encoding LDH was amplified from Lactococcus lactis in a polymerase chain reaction using primers designed from the partial amino acid sequence of a lactococcal LDH. This fragment was radioactively labelled and used to probe a phage lambda library of Lc. lactis genomic DNA. Fragments containing ldh were subcloned from lambda to pUC13 and pUC18 and a 1.2-kb region was sequenced. The deduced aa sequence reveals that the lactococcal LDH is highly homologous to the LDHs of other organisms. The active site and several other domains of unknown function are highly conserved between all LDH enzymes (prokaryotic and eukaryotic). An evolutionary study of LDH sequences clearly divides the prokaryotic from the eukaryotic enzymes except for the Bifidobacterium longum LDH which anomalously groups with the eukaryotic enzymes. The LDHs from Gram-positive bacteria form a separate group from the enzymes from the Gram-negative organisms. The lactococcal LDH is phylogenetically closest to the streptococcal LDH.  相似文献   

6.
7.
Summary A database search has revealed significant and extensive sequence similarities among prokaryotic and eukaryotic pyridoxal phosphate (PLP)-dependent decarboxylases, includingDrosophila glutamic acid decarboxylase (GAD) and bacterial histidine decarboxylase (HDC). Based on these findings, the sequences of seven PLP-dependent decarboxylases from five different organisms have been aligned to derive a consensus sequence for this family of enzymes. In addition, quantitative methods have been employed to calculate the relative evolutionary distances between pairs of the decarboxylases comprising this family. The multiple sequence analysis together with the quantitative results strongly suggest an ancient and common origin for all PLP-dependent decarboxylases. This analysis also indicates that prokaryotic and eukaryotic HDC activities evolved independently. Finally, a sensitive search algorithm (PROFILE) was unable to detect additional members of this decarboxylase family in protein sequence databases.  相似文献   

8.
Cytochromes P-450 are extremely important in the oxidative metabolism of a variety of endogenous and exogenous compounds in pro- and eukaryotic organisms. Progress in understanding the structure and mechanism of action of this superfamily of enzymes has been hampered by the properties of the eukaryotic enzymes and the availability of only one well-characterized prokaryotic enzyme as a model. We report here the isolation of a Pseudomonas species which will utilize a monoterpene natural product, alpha-terpineol, as its sole source of carbon and energy. Approximately 1% of the soluble protein in the cell-free extract is a novel cytochrome P-450 (P-450terp). This enzyme and its associated iron sulfur protein electron carrier (terpredoxin) have been purified to homogeneity and their NH2-terminal amino acid sequences determined. The amino acid sequences of six tryptic peptide fragments of cytochrome P-450terp have also been determined. This sequence information was used to clone the gene encoding cytochrome P-450terp. Three clones representing approximately 8 kilobase pairs of unique sequences were selected and sequenced. Five non-overlapping open reading frames (ORFs) were found in the sequences, and the translated sequences were used to search the Protein Identification Resource for comparable proteins. The ORFs were identified as: 1) an alcohol dehydrogenase, 2) an aldehyde dehydrogenase, 3) cytochrome P-450terp, 4) terpredoxin reductase, and 5) terpredoxin. The identification of both the cytochrome P-450terp and terpredoxin DNA sequence was confirmed by the presence of each of the corresponding amino acid sequences found in the purified proteins. The five ORFs were bounded on both the 5' and 3' ends by consensus factor-independent terminator sequences. A consensus promoter sequence was found immediately 5' to the first ORF. These results indicate that we have sequenced the complete terp operon. Comparison of the amino acid sequence of cytochrome P-450terp to that of all other cytochromes P-450 has shown that it is the first member of the gene family CYP108. Preliminary characterization of the chemical and physical properties and the preparation of crystals of this new cytochrome P-450, suitable for x-ray diffraction analysis, indicate that it will be useful in comparison studies with other members of this class of proteins.  相似文献   

9.
A complete, high-quality reference sequence of a dog genome was recently produced by a team of researchers led by the Broad Institute, achieving another major milestone in deciphering the genomic landscape of mammalian organisms. The genome sequence provides an indispensable resource for comparative analysis and novel insights into dog and human evolution and history. Together with the survey sequence of a poodle previously published in 2003, the two dog genome sequences allowed identification of more than 2.5 million single nucleotide polymorphisms within and between dog breeds, which can be used in evolutionary analysis, behavioral studies and disease gene mapping.(1)  相似文献   

10.
Organization and sequence of the human alpha-lactalbumin gene.   总被引:10,自引:1,他引:9       下载免费PDF全文
  相似文献   

11.
The CCA-adding enzyme (ATP:tRNA adenylyltransferase or CTP:tRNA cytidylyltransferase (EC )) generates the conserved CCA sequence responsible for the attachment of amino acid at the 3' terminus of tRNA molecules. It was shown that enzymes from various organisms strictly recognize the elbow region of tRNA formed by the conserved D- and T-loops. However, most of the mammalian mitochondrial (mt) tRNAs lack consensus sequences in both D- and T-loops. To characterize the mammalian mt CCA-adding enzymes, we have partially purified the enzyme from bovine liver mitochondria and determined cDNA sequences from human and mouse dbESTs by mass spectrometric analysis. The identified sequences contained typical amino-terminal peptides for mitochondrial protein import and had characteristics of the class II nucleotidyltransferase superfamily that includes eukaryotic and eubacterial CCA-adding enzymes. The human recombinant enzyme was overexpressed in Escherichia coli, and its CCA-adding activity was characterized using several mt tRNAs as substrates. The results clearly show that the human mt CCA-adding enzyme can efficiently repair mt tRNAs that are poor substrates for the E. coli enzyme although both enzymes work equally well on cytoplasmic tRNAs. This suggests that the mammalian mt enzymes have evolved so as to recognize mt tRNAs with unusual structures.  相似文献   

12.
13.
The nucleotide sequence of the human fibroblast (beta 1) interferon chromosomal gene and its flanking regions was determined. These results confirm the absence of intervening sequences in the gene. The presence of some sequences in the upstream flanking region homologous to similar features for other eukaryotic genes was revealed: these include not only the TATAAAT sequence and the consensus sequence (reported by Benoist et al., 1980) but also two additional motifs, one of which is so far present only in inducible genes. Furthermore, a striking similarity between the upstream flanking regions of the human beta 1 and alpha 1 interferon genes is observed.  相似文献   

14.
Summary The DNA sequence of the small-subunit ribosomal RNA coding region for the cycadZamia pumila L. was determined. TheZamia smallsubunit rRNA was found to be 1813 nucleotides in length and approximately 92% identical to published angiosperm small-subunit rRNA sequences. Conserved regions interspersed with variable regions are observed corresponding to those found in other eukaryotic small-subunit sequences. Using representatives from protist, fungal, plant, and animal groups, a distance matrix was constructed of average nucleotide substitution rates for pairs of organisms. Phylogenetic trees were inferred from similarities between sequences. The sequence ofZamia represents the earliest divergence from the higher plant lineage reported to date for small-subunit rRNA data. Inferred phylogenies also support a monophyletic origin for the angiosperms consistent with studies citing phenotypic characters.  相似文献   

15.
L F Wu  A Reizer  J Reizer  B Cai  J M Tomich    M H Saier  Jr 《Journal of bacteriology》1991,173(10):3117-3127
The fruK gene encoding fructose-1-phosphate kinase (FruK), located within the fructose (fru)-catabolic operon of Rhodobacter capsulatus, was sequenced. FruK of R. capsulatus (316 amino acids; molecular weight = 31,232) is the same size as and is homologous to FruK of Escherichia coli, phosphofructokinase B (PfkB) of E. coli, phosphotagatokinase of Staphylococcus aureus, and ribokinase of E. coli. These proteins therefore make up a family of homologous proteins, termed the PfkB family. A phylogenetic tree for this new family was constructed. Sequence comparisons plus chemical inactivation studies suggested the lack of involvement of specific residues in catalysis. Although the Rhodobacter FruK differed markedly from the other enzymes within the PfkB family with respect to amino acid composition, these enzymes exhibited similar predicted secondary structural features. A large internal segment of the Rhodobacter FruK was found to be similar in sequence to the domain bearing the sugar bisphosphate-binding region of the large subunit of ribulose 1,5-bisphosphate carboxylase/oxygenase of plants and bacteria. Proteins of the PfkB family did not exhibit statistically significant sequence identity with PfkA of E. coli. PfkA, however, is homologous to other prokaryotic and eukaryotic ATP- and PPi-dependent Pfks (the PfkA family). These eukaryotic, ATP-dependent enzymes each consist of a homotetramer (mammalian) or a heterooctamer (yeasts), with each subunit containing an internal duplication of the size of the entire PfkA protein of E. coli. In some of these enzymes, additional domains are present. A phylogenetic tree was constructed for the PfkA family and revealed that the bacterial enzymes closely resemble the N-terminal domains of the eukaryotic enzyme subunits whereas the C-terminal domains have diverged more extensively. The PPi-dependent Pfk of potato is only distantly related to the ATP-dependent enzymes. On the basis of their similar functions, sizes, predicted secondary structures, and sequences, we suggest that the PfkA and PfkB families share a common evolutionary origin.  相似文献   

16.
A 1.1-kb human DNA fragment (ARSH1) capable of functioning as a putative origin of replication in yeast cells has been characterized both by in situ hybridization to human metaphase chromosomes and by DNA sequencing. Our hybridization studies show a preferential localization of ARSH1 in chromosome regions 1p34-36 and 2q34-37. DNA sequence analysis indicates that in addition to the consensus sequence required for ARS function in yeast cells, nuclear matrix-associated DNA motifs are also present in the 1.1-kb fragment. These results suggest that ARSH1 sequences may serve as points of anchorage to the nuclear matrix for chromosomes 1 and 2.  相似文献   

17.
S Fabry  R Hensel 《Gene》1988,64(2):189-197
The gene for the glycolytic enzyme glyceraldehyde-3-phosphate dehydrogenase (GAPDH) from the thermophilic methanogenic archaebacterium Methanothermus fervidus (growth optimum at 84 degrees C) was cloned in Escherichia coli and the nucleotide sequence was determined. A striking preference for adenine and thymidine bases was found in the gene, which is in agreement with the low G + C content of the M. fervidus DNA. The deduced amino acid sequence indicates an Mr of 37,500 for the protein subunit. Alignment with the amino acid sequences of GAPDHs from other organisms shows that the archaebacterial GAPDH is homologous to the respective eubacterial and eukaryotic enzymes, but the similarity between the archaebacterial enzyme and the eubacterial or eukaryotic GAPDHs is much less than that between the latter two.  相似文献   

18.
Structural analysis of a phage lambda Charon 4A clone carrying one of the human nuclear mitochondrial(mut)-DNA-like sequences revealed that a KpnI-family member (KpnI 5.5-kb DNA) is inserted within this sequence. The inserted KpnI 5.5-kb DNA contains several possible polyadenylation signal sequences followed by an A-rich sequence at its 3' end and is flanked by perfect 13-bp direct repeats of the duplicated mtDNA-like sequences. These structures strongly suggest that the KpnI 5.5-kb DNA is a mobile element. Comparison of the 5' terminal sequences of the KpnI 5.5-kb DNA and four other long KpnI-family DNAs so far examined, using the predicted general promoter sequence for eukaryotic tRNAs, indicates that they contain the consensus sequences for the split internal RNA polymerase III control region.  相似文献   

19.
The previously presented consensus sequence for eukaryotic translation initiation sites by Kozak was derived substantially from vertebrate mRNA sequences. Drosophila nuclear genes exhibit a significantly different translation start consensus sequence. These differences probably do not represent mechanistic differences in translation initiation inasmuch as both taxa exhibit identical preferences and restrictions at the crucial -3 position. Using more conservative criteria for the assignment of consensus the following consensus sequences were derived: vertebrate--CANCAUG and Drosophila--CAAAACAUG.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号