Similar literature
 20 similar articles found (search time: 593 ms)
1.
Recent efforts to design de novo or redesign the sequence and structure of proteins using computational techniques have met with significant success. Most, if not all, of these computational methodologies attempt to model atomic-level interactions, and hence high-resolution structural characterization of the designed proteins is critical for evaluating the atomic-level accuracy of the underlying design force-fields. We previously used our computational protein design protocol RosettaDesign to completely redesign the sequence of the activation domain of human procarboxypeptidase A2. With 68% of the wild-type sequence changed, the designed protein, AYEdesign, is over 10 kcal/mol more stable than the wild-type protein. Here, we describe the high-resolution crystal structure and solution NMR structure of AYEdesign, which show that the experimentally determined backbone and side-chain conformations are effectively superimposable with the computational model at atomic resolution. To isolate the origins of the remarkable stabilization, we have designed and characterized a new series of procarboxypeptidase mutants that gain significant thermodynamic stability with a minimal number of mutations; one mutant gains more than 5 kcal/mol of stability over the wild-type protein with only four amino acid changes. We explore the relationship between force-field smoothing and conformational sampling by comparing the experimentally determined free energies of the overall design and these focused subsets of mutations to those predicted using modified force-fields, and both fixed and flexible backbone sampling protocols.

2.
The bacterial protein streptokinase (SK) contains three independently folded domains (α, β and γ), interconnected by two flexible linkers with noticeable sequence homology. To investigate their primary structure requirements, the linkers were swapped, i.e. linker 1 (between the α and β domains) was exchanged with linker 2 (between the β and γ domains) and vice versa. The resultant construct exhibited very low activity, essentially due to an enhanced proteolytic susceptibility. However, an SK mutant with two linker 1 sequences, which was proteolytically as stable as WT-rSK, retained about 10% of the plasminogen activator activity of rSK. When the native sequence of each linker was substituted with nine consecutive glycine residues, the linker 1 substitution mutant retained substantial activity, whereas the linker 2 mutant lost nearly all its activity. The optimal length of the linkers was then studied through deletion mutagenesis experiments, which showed that deletion beyond three residues in either of the linkers resulted in virtually complete loss of activator activity. The effect of linker length was also examined by insertion of extraneous pentapeptide sequences having a propensity for adopting either an extended conformation or a relatively rigid conformation. The insertion of poly-Pro sequences into the native linker 2 sequence caused up to a 10-fold reduction in activity, whereas its effect in linker 1 was relatively minor. Interestingly, most of the linker mutants could form stable 1:1 complexes with human plasminogen.
Taken together, these observations suggest that (i) the functioning of the inter-domain linkers of SK requires a critical minimal length, (ii) linker 1 is relatively more tolerant to insertions and sequence alterations, and appears to function primarily as a covalent connector between the α and β domains, and (iii) the native linker 2 sequence is virtually indispensable for the activity of SK, probably because of structural and/or flexibility requirements in SK action during catalysis.

3.
We have developed a phylogeny-based design method that has been used to produce mutated proteins with enhanced thermal stabilities. We previously validated the predictive worth of the method by producing and characterizing mutants in which one original residue or a small number of the original residues had been replaced with the residue or residues found in the phylogenetically predicted “ancestral” sequence. For the current study, this method was used to design a sequence for the deepest nodal position of a phylogenetic tree composed of 16 gyrase B-subunit sequences, which was then synthesized and characterized. The sequence was inferred from the sequences of 16 extant DNA gyrases and 3 extant type VI DNA topoisomerases. Genes encoding the inferred sequence and its N-terminal ATPase domain were PCR constructed and expressed in Escherichia coli. The full-length designed protein is slightly less thermally stable than is subunit B from the extant thermophilic Thermus thermophilus DNA gyrase, whereas the thermal stability of the designed ATPase domain is more similar to that of the T. thermophilus ATPase domain. Moreover, the designed ATPase domain has significant catalytic activity. Therefore, even a small set of homologous amino acid sequences contains sufficient information to design a thermally stable and functional protein. Because the isolated designed ATPase domain is more thermally stable and catalytically active than is the sequence containing the most frequently occurring amino acids among the 16 gyrases, the phylogenetic approach was superior (in this case, at least) to the consensus approach when the same data set was used to predict the two sequences.

4.
One of the classical DNA-binding proteins, bacteriophage lambda Cro, forms a homodimer with a unique fold of alpha-helices and beta-sheets. We have computationally designed an artificial sequence of 60 amino acid residues to stabilize the backbone tertiary structure of the lambda Cro dimer by simulated annealing using knowledge-based structure-sequence compatibility functions. The designed amino acid sequence has 25% identity with that of natural lambda Cro and preserves Phe58, which is important for formation of the stably folded structure of lambda Cro. The designed dimer protein and its monomeric variant, which was redesigned by the insertion of a beta-hairpin sequence at the C-terminal region to prevent dimerization, were synthesized and shown by biochemical characterization to be well folded. The designed protein was monomeric under a wide range of protein concentrations and its solution structure was determined by NMR spectroscopy. The solved structure is similar to that of a monomeric variant of natural lambda Cro, with a root-mean-square deviation of the polypeptide backbones of 2.1 Å, and has a well-packed protein core. Thus, our knowledge-based functions provide approximate but essential relationships between amino acid sequences and protein structures, and are useful for finding novel sequences that are foldable into a given target structure.

5.
Using oligonucleotide primers designed to the known gene sequence of an (E)-β-farnesene (EβF) synthase, two cDNA sequences (MxpSS1 and MxpSS2) were cloned from a black peppermint (Mentha × piperita) plant. MxpSS1 encoded a protein with 96% overall amino acid sequence identity with the EβF synthase. Recombinant MxpSS1 produced in Escherichia coli, after removal of an N-terminal thioredoxin fusion, had a Km for FPP of 1.91 ± 0.1 µM and a kcat of 0.18 s⁻¹, and converted farnesyl diphosphate (FPP) into four products, the major two being cis-muurola-3,5-diene (45%) and cis-muurola-4(14),5-diene (43%). This is the first cis-muuroladiene synthase to be characterised. MxpSS2 encoded a protein with only two amino acids differing from EβF synthase. Recombinant MxpSS2 protein showed no activity towards FPP. One of the two mutations, at position 531 (leucine in MxpSS2 and serine in EβF synthase), was shown by structural modelling to occur in the J-K loop, an element of the structure of sesquiterpene synthases known to be important in the reaction mechanism. Reintroduction of the serine at position 531 into MxpSS2 by site-directed mutagenesis restored EβF synthase activity (Km for FPP 0.98 ± 0.12 µM, kcat 0.1 s⁻¹), demonstrating the crucial role of this residue in the enzyme activity. Analysis by GC-MS of the sesquiterpene profile of the plant used for the cloning revealed that EβF was not present, confirming that this particular mint chemotype had lost EβF synthase activity due to the observed mutations.
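
The kinetic constants quoted above can be compared through the catalytic efficiency kcat/Km. The following is a minimal Python sketch, assuming simple Michaelis-Menten kinetics; the function names and the 1 µM enzyme concentration are illustrative assumptions and do not come from the abstract.

```python
def catalytic_efficiency(kcat_per_s: float, km_micromolar: float) -> float:
    """Return kcat/Km in M^-1 s^-1, given kcat in s^-1 and Km in micromolar."""
    return kcat_per_s / (km_micromolar * 1e-6)


def mm_rate(substrate_um: float, kcat_per_s: float, km_um: float,
            enzyme_um: float = 1.0) -> float:
    """Michaelis-Menten rate in µM/s: v = kcat * [E] * [S] / (Km + [S]).

    enzyme_um is a hypothetical enzyme concentration chosen for illustration.
    """
    return kcat_per_s * enzyme_um * substrate_um / (km_um + substrate_um)


if __name__ == "__main__":
    # Values reported in the abstract for the two active enzymes.
    mxpss1 = catalytic_efficiency(0.18, 1.91)    # recombinant MxpSS1
    restored = catalytic_efficiency(0.10, 0.98)  # MxpSS2 with serine 531 restored
    print(f"MxpSS1 kcat/Km:   {mxpss1:.3g} M^-1 s^-1")
    print(f"restored kcat/Km: {restored:.3g} M^-1 s^-1")
```

Under these assumptions the two efficiencies come out within about 10% of each other (roughly 9.4 × 10⁴ and 1.0 × 10⁵ M⁻¹ s⁻¹), although the abstract itself does not make that comparison.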

6.
Xu L, Zhang Z, Zhou X, Yin X, Yang L, Zhao D. Gene 2011, 485(2): 102-105
The resistance or susceptibility of sheep to scrapie is associated with polymorphisms of the prion protein gene (PRNP), particularly single nucleotide polymorphisms (SNPs) at amino acid positions 136, 154 and 171. The prion protein (PrP) gene sequence and the deduced amino acid alignment of prion protein in Tan sheep, a local Chinese sheep breed traditionally raised in Ningxia, northwestern China, were determined, and the variability of the PrP amino acid sequence was analyzed in this study. The PrP nucleic acid and amino acid sequences of 112 Tan sheep were highly homogeneous, although polymorphism of the PrP gene was detected at several sites, particularly codons 106, 154, and 171. The analysis of both sequences revealed that the most predominant allele at codons 136, 154 and 171 in Tan sheep was ARQ, which is known to be associated with high susceptibility to scrapie in sheep. The result suggests that Tan sheep are potentially susceptible to scrapie. Our findings provide valuable information for future breeding projects for scrapie resistance in Tan sheep.

7.
Rubisco is a very large, complex protein, one of the most abundant in the world, and comprises up to 50% of all soluble protein in plants. The activity of Rubisco, the enzyme that catalyzes CO2 assimilation in photosynthesis, is regulated by Rubisco activase (Rca). In the present study, we searched for hypothetical proteins of Vitis vinifera with putative Rubisco activase function. The Arabidopsis and tobacco Rubisco activase protein sequences were used as seed sequences to search against Vitis vinifera in the UniProtKB database. The selected hypothetical proteins of Vitis vinifera were subjected to sequence, structural and functional annotation. Subcellular localization predictions suggested that they are cytoplasmic proteins. Homology modelling was used to define the three-dimensional (3D) structure of the selected hypothetical proteins of Vitis vinifera. A template search revealed that all the hypothetical proteins share more than 80% sequence identity with the structure of green-type Rubisco activase from tobacco, indicating that the proteins are evolutionarily conserved. The homology models were generated using SWISS-MODEL. Several quality assessment and validation parameters indicated that the homology models are reliable. Further, functional annotation through PFAM, CATH, SUPERFAMILY and CDART suggested that the selected hypothetical proteins of Vitis vinifera contain the ATPase family associated with various cellular activities (AAA) domain and belong to the AAA+ superfamily of ring-shaped P-loop containing nucleoside triphosphate hydrolases. This study will lead to research on optimizing the functionality of Rubisco, which has large implications for the improvement of plant productivity and resource use efficiency.

8.
Bovine viral diarrhea virus (BVDV) nonstructural protein 5B is an RNA-dependent RNA polymerase, essential for viral replication. Initial attempts to crystallize a soluble form of the 695-residue BVDV polymerase did not produce any crystals. Limited proteolysis, homology modeling, and mutagenesis data were used to aid the design of polymerase constructs that might crystallize more readily. Limited proteolysis of the polymerase with trypsin identified a domain boundary within the protein. Homology modeling of the polymerase, based on the structure of hepatitis C virus polymerase, indicated that the two polymerases share a 23% identical "core," although overall sequence identity is low. Eighty-four expression clones of the BVDV polymerase were designed by fine-sampling of chain termini at the boundaries of domain and of active truncated forms of the polymerase. The resulting constructs were expressed in Escherichia coli and purified using high-throughput methods. Soluble truncated proteins were subjected to crystallization trials in a 96-well format, and two of these proteins were successfully crystallized.

9.
Kuzuwa S, Yokoi KJ, Kondo M, Kimoto H, Yamakawa A, Taketo A, Kodaira K. Gene 2012, 495(2): 154-162
Although some genetic features of lactobacillar fructan hydrolases have been elucidated, information about their enzymology and mutational analysis is scarce. Lactobacillus casei IAM1045 exhibits extracellular activity degrading inulin. After partial purification of the inulin-degrading protein from the spent culture medium, several fragments were obtained by protease digestion. Based on their partial amino acid sequences, oligonucleotide primers were designed, and the structural gene (levH1) was determined using a gene library constructed in the E. coli system. The levH1 gene encoded a protein (designated LevH1) whose calculated molecular mass and pI were 138.8 kDa and 4.66, respectively. LevH1 (1296 amino acids long) was predicted to have a four-domain structure, containing (i) an N-terminal secretion signal of 40 amino acids, (ii) a variable domain of about 140 residues whose function is unclear, (iii) a catalytic domain of about 630 residues with glycoside-hydrolase activity consisting of two modules, a five-blade β-propeller module linked to a β-sandwich module, and (iv) a C-terminal domain of about 490 residues comprising five nearly perfect repeat sequences of 80 residues homologous to equivalents of other hypothetical cell surface proteins, followed by 37 residues rich in Ser/Thr/Pro/Gly, and a pentad LPQAG (the LPXTG homologue). When overproduced in E. coli, the putative variable-catalytic domain region of about 770 residues exhibited exo-inulinase activity. Deletion analyses demonstrated that the variable-catalytic domain region containing two modules is important for enzymatic activity. The presence of eight conserved motifs (I-VIII) in the catalytic domain was suggested by comparative analysis, among which motif VIII was newly identified in the β-sandwich module in this study. Site-directed mutagenesis of conserved amino acids in these motifs revealed that D198, R388, D389 and E440 were crucial for inulinase activity.
Moreover, the mutations D502A and D683A, in motifs VI and VIII respectively, caused a significant decrease in activity. These results suggested that the variable domain and the β-sandwich module, besides the β-propeller module, are important for the inulin-degrading activity of LevH1.

10.
Functional genes of HIV-1 such as tat express proteins essential for viral survival and propagation. Variations have been reported in the levels of Tat transactivation among the different subtypes of HIV-1. This study looked at the amino acid differences in the different regions of the Tat protein (exon 1) of subtype B and C strains of HIV-1 and sought a molecular basis for protein function. HIV-1 sequences of subtype B (n=30) and C (n=60) strains were downloaded from the Los Alamos HIV-1 database. Among the 60 subtype C strain sequences, 30 each were from India and Africa. HIV-1 Tat protein (exon 1) consensus sequences for subtypes B and C were obtained from the 'sequence search interface' of the Los Alamos HIV-1 sequence database. The sequences were visualized using Weblogo, and the RNA binding regions of the three consensus sequences were determined using the BindN software program. Compared to subtype B, there was a high level of divergence in the auxiliary domain of tat exon 1 (amino acid positions 58-69). The net charge of the subtype C (Indian) Tat protein (exon 1) auxiliary domain was -1.9 at pH 7 and it had an isoelectric point of 4.1. The net charge of the subtype C (African) auxiliary domain was -2.9 at pH 7 and it had an isoelectric point of 3.7, while the net charge of the same region in subtype B was -0.9 at pH 7 with an isoelectric point of 4.9. The ratio of hydrophilic residues to the total number of residues in the auxiliary domain was 60% in both the Indian and African subtype C, while it was 50% in subtype B. The consensus subtype B sequence was found to have 36 RNA binding sites, while subtype C (India) had 33 and subtype C (Africa) had 32 RNA binding sites. The HIV-1 Tat-TAR interaction is a potential target for inhibitors and is being considered for potential use in HIV-1 vaccines.
Development of such inhibitors or vaccines would have to take into consideration the variation in amino acid sequence analyzed in this study, as this could determine epitope presentation on MHC class I antigen for the afferent immune response.
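
Net charge at a given pH and the isoelectric point, as reported above for the auxiliary domain, are commonly estimated from side-chain pKa values with the Henderson-Hasselbalch relation. The sketch below illustrates that calculation in Python. The pKa table is an assumed, textbook-style set, and the example peptide is hypothetical rather than the Tat auxiliary domain, so the output is not expected to reproduce the figures in the abstract, which depend on the tool and pKa set the authors used.

```python
# Assumed pKa values (illustrative; published tables differ slightly).
POSITIVE_PKA = {"K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}
N_TERM_PKA = 9.0
C_TERM_PKA = 2.0


def net_charge(seq: str, ph: float) -> float:
    """Estimate the net charge of a peptide at the given pH."""
    seq = seq.upper()
    pos = [N_TERM_PKA] + [POSITIVE_PKA[a] for a in seq if a in POSITIVE_PKA]
    neg = [C_TERM_PKA] + [NEGATIVE_PKA[a] for a in seq if a in NEGATIVE_PKA]
    # Fraction protonated (charged +1) for each basic group.
    charge = sum(1.0 / (1.0 + 10 ** (ph - pk)) for pk in pos)
    # Fraction deprotonated (charged -1) for each acidic group.
    charge -= sum(1.0 / (1.0 + 10 ** (pk - ph)) for pk in neg)
    return charge


def isoelectric_point(seq: str, tol: float = 1e-4) -> float:
    """Find the pH of zero net charge by bisection on [0, 14]."""
    lo, hi = 0.0, 14.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        # Net charge falls monotonically as pH rises.
        if net_charge(seq, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    peptide = "SEEDQKPR"  # hypothetical 8-residue peptide, for illustration only
    print(f"net charge at pH 7: {net_charge(peptide, 7.0):+.2f}")
    print(f"estimated pI:       {isoelectric_point(peptide):.2f}")
```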

11.
Humulus lupulus, commonly known as hops, is a member of the family Moraceae. Many projects are currently underway, leading to the accumulation of voluminous genomic and expressed sequence tag (EST) sequences in public databases. The genetically characterized domains in these databases are limited due to the non-availability of reliable molecular markers. A large body of EST sequence data is available for hops. In the present study, simple sequence repeat (SSR) markers extracted from EST data are used as molecular markers for genetic characterization. A total of 25,495 EST sequences were examined and assembled to obtain full-length sequences. The maximum frequency was shown by mononucleotide SSR motifs, i.e. 60.44% in contigs and 62.16% in singletons, whereas the minimum frequencies were observed for hexanucleotide SSRs in contigs (0.09%) and pentanucleotide SSRs in singletons (0.12%). Most trinucleotide motifs code for glutamic acid (GAA), while AT/TA were the most frequent repeats among dinucleotide SSRs. Flanking primer pairs were designed in silico for the SSR-containing sequences. Functional categorization of SSR-containing sequences was done through gene ontology terms such as biological process, cellular component and molecular function.
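
SSR mining of the kind described above amounts to scanning each sequence for short motifs repeated in tandem. The following is a minimal regex-based Python sketch of that idea. The minimum repeat thresholds are assumptions chosen for illustration, since the abstract does not state the criteria or the software used.

```python
import re

# Assumed minimum tandem-repeat counts per motif length (1 to 6 nt).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_primitive(motif: str) -> bool:
    """True if the motif is not itself a repetition of a shorter unit."""
    n = len(motif)
    return not any(
        n % d == 0 and motif[:d] * (n // d) == motif for d in range(1, n)
    )


def find_ssrs(seq: str):
    """Return sorted (start, motif, repeat_count) tuples for SSRs in seq."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-mer followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip e.g. "AA" reported as a dinucleotide repeat.
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical EST fragment containing an (AT)7 and a (GAA)5 repeat.
    est = "GG" + "AT" * 7 + "CC" + "GAA" * 5 + "T"
    for start, motif, count in find_ssrs(est):
        print(f"pos {start}: ({motif}){count}")
```

The primitive-motif check keeps a mononucleotide run from also being counted as a di- or trinucleotide SSR, which would otherwise distort the per-class frequencies.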

12.
Previously, tubulin has been purified from Leishmania amazonensis and used to identify novel molecules with selective antimitotic activity. However, L. amazonensis is pathogenic and requires a relatively expensive medium for large-scale cultivation. Herein, the purification and characterization of tubulin from the non-pathogenic Leishmania tarentolae is reported, together with the sequence of alpha- and beta-tubulin from this organism. This protein was purified by sonication, diethylaminoethyl-Sepharose chromatography, and one assembly-disassembly cycle in 1% overall recovery based on total cellular protein. Leishmania tarentolae tubulin was indistinguishable from the corresponding L. amazonensis protein in terms of binding affinity for dinitroaniline sulfanilamides and sensitivity to assembly inhibition by these compounds. The amino acid sequences derived from the L. tarentolae alpha- and beta-tubulin genes were 99.6 and 99.4% identical to the corresponding amino acid sequences from the Leishmania major Friedlin strain. These results indicate that tubulin from L. tarentolae is suitable for use in drug screening.

13.
The genetic heterogeneity of two M. arenaria race 2 populations (designated Pelion and Govan) was examined using RFLP analysis of 12 clonal lines established from single egg masses (six distinct clonal lines from each population). These populations are essentially identical by traditional biochemical and race identification schemes; however, the Govan population is more aggressive than the Pelion population, producing larger galls and exhibiting greater reproductive capabilities on many soybean cultivars and experimental accessions. Variation at the genomic DNA level was examined using probes representative of expressed DNA sequences present in the eukaryotic genome. Ribosomal DNA, interspersed repeated sequences, and cDNA probes were tested for detection of polymorphism within and between single egg mass lines of each population. Cloned cDNAs and ribosomal intergenic spacer sequences detect polymorphism both within and between populations, demonstrating the usefulness of these sequence classes for molecular genetic analysis of population structure and genome evolution.

14.
15.
αB-crystallin, a small heat-shock protein, has been shown to prevent the aggregation of other proteins under various stress conditions. Here we have cloned the cDNA and the genomic sequence of the CRYAB gene from the Giant Panda (Ailuropoda melanoleuca) using RT-PCR and touchdown PCR, respectively. The cloned cDNA fragment contains an open reading frame of 528 bp encoding 175 amino acids, and the genomic sequence is 3189 bp long, containing three exons and two introns. Alignment analysis indicated that the nucleotide sequence and the deduced amino acid sequence are highly conserved relative to the four other species studied, namely Homo sapiens, Mus musculus, Rattus norvegicus and Bos taurus. The homologies of the Giant Panda CRYAB nucleotide sequence to those of these species are 93.9%, 91.5%, 91.5% and 95.3%, respectively, and the homologies of the amino acid sequences are 98.3%, 97.1%, 97.7% and 99.4%, respectively. Topology prediction shows that there are only four casein kinase II phosphorylation sites in the CRYAB protein of the Giant Panda. The cDNA of CRYAB was transfected into E. coli, and the CRYAB fused with the N-terminal His-tag gave rise to the accumulation of an expected 24 kDa polypeptide, in accord with the predicted protein. The expression product obtained could be used for purification and further study of its function.
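
The figures above can be checked with simple arithmetic: a 528 bp open reading frame holds 176 codons, which gives 175 amino acids if the last codon is a stop. The Python sketch below shows that calculation together with one common way of computing percent identity from a pairwise alignment. Identity conventions vary (the denominator in particular), so the definition used here is an assumption and may differ from the authors' method.

```python
def orf_protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of orf_bp nucleotides."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - (1 if includes_stop else 0)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over alignment columns where neither sequence has a gap.

    Assumes the two strings are already aligned (equal length, '-' for gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if x != "-" and y != "-"]
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    print(orf_protein_length(528))            # codons minus the stop codon
    # Hypothetical toy alignment, not real CRYAB sequence.
    print(percent_identity("MDIAIHHPW", "MDIAIQHPW"))
```

If identity is taken over all 175 residues, 174 matching positions would round to 99.4% and 172 to 98.3%, which is consistent with the reported values differing by only a few residues; the abstract does not give the actual counts.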

16.
Protein design aims at designing new protein molecules of desired structure and functionality. One of the major obstacles to large-scale protein design is the extensive time and manpower required for experimental validation of designed sequences. Recent advances in protein structure prediction have provided the potential for an automated assessment of the designed sequences via folding simulations. We present a new protocol for protein design and validation. The sequence space is initially searched by Monte Carlo sampling guided by a public atomic potential, with candidate sequences selected by the clustering of sequence decoys. The designed sequences are then assessed by I-TASSER folding simulations, which generate full-length atomic structural models by the iterative assembly of threading fragments. The protocol is tested on 52 nonhomologous single-domain proteins, with an average sequence identity of 24% between the designed sequences and the native sequences. Despite this low sequence identity, three-dimensional models predicted for the first designed sequence have an RMSD of < 2 Å to the target structure in 62% of cases. This percentage increases to 77% if we consider the three-dimensional models from the top 10 designed sequences. Such a striking consistency between the target structure and the structural prediction from nonhomologous sequences, despite the fact that the design and folding algorithms adopt completely different force fields, indicates that the design algorithm captures the features essential to the global fold of the target. On average, the designed sequences have a free energy that is 0.39 kcal/(mol residue) lower than that of the native sequences, potentially affording a greater stability to synthesized target folds.
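
The Monte Carlo sequence search mentioned above can be illustrated with a generic Metropolis sampler. The Python sketch below is a toy, not the authors' protocol: the energy function is a made-up hydrophobic/polar burial score standing in for the atomic potential, and the burial pattern, temperature and step count are all assumed values.

```python
import math
import random

ALPHABET = "ACDEFGHIKLMNPQRSTVWY"
HYDROPHOBIC = set("AVLIMFW")  # assumed hydrophobic set for the toy score


def toy_energy(seq, burial: str) -> int:
    """Count positions whose residue class mismatches the burial pattern.

    burial is a string of 'b' (buried, wants hydrophobic) and
    'e' (exposed, wants polar). This is a stand-in for a real potential.
    """
    return sum((aa in HYDROPHOBIC) != (site == "b")
               for aa, site in zip(seq, burial))


def metropolis_design(burial: str, steps: int = 2000, kT: float = 0.5,
                      seed: int = 0):
    """Return (best_sequence, best_energy) from a Metropolis search."""
    rng = random.Random(seed)
    seq = [rng.choice(ALPHABET) for _ in burial]
    energy = toy_energy(seq, burial)
    best_seq, best_energy = seq[:], energy
    for _ in range(steps):
        pos = rng.randrange(len(seq))
        old = seq[pos]
        seq[pos] = rng.choice(ALPHABET)  # propose a single-site mutation
        new_energy = toy_energy(seq, burial)
        delta = new_energy - energy
        # Accept downhill moves always, uphill moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / kT):
            energy = new_energy
            if energy < best_energy:
                best_seq, best_energy = seq[:], energy
        else:
            seq[pos] = old  # reject: restore the previous residue
    return "".join(best_seq), best_energy


if __name__ == "__main__":
    pattern = "bbeebbeebe"  # hypothetical 10-residue burial pattern
    designed, score = metropolis_design(pattern)
    print(designed, score)
```

Running the sampler from several seeds and clustering the resulting sequences would loosely mirror the decoy-clustering step the abstract describes, though that step is not implemented here.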

17.
18.
Triticale cv. Beagle 82, cotton cv. McNair 235, and soybean cv. Twiggs were arranged in three cropping sequences to determine the effects of fenamiphos and cropping sequence on nematode population densities and crop yields under conservation tillage for 4 years. The cropping sequences were triticale (T)-cotton (C)-T-C, T-soybean (S)-T-S, and T-C-T-S. Numbers of Meloidogyne incognita second-stage juveniles declined on triticale but increased on cotton and soybean each year. Root-gall indices of cotton and soybean ranged from 1.00 to 1.08 (1 to 5 scale: 1 = 0%, 2 = 1% to 25%, 3 = 26% to 50%, 4 = 51% to 75%, and 5 = 76% to 100% of roots galled) each year and were not affected by fenamiphos treatment or cropping sequence. Numbers of Pratylenchus brachyurus were maintained on triticale and generally increased more on soybean than on cotton. Population densities of Helicotylenchus dihystera were near or below detection levels in all plots during the first year and increased thereafter in untreated plots in the T-C-T-C and T-S-T-S sequences. Generally, yields of triticale in all cropping sequences declined over the years. Yields of cotton and soybean were not affected by fenamiphos at 6.7 kg a.i./ha. Cotton and soybean were grown successfully with little or no suppression in yields caused by nematodes in conservation tillage following triticale harvested for grain.

19.
20.

Background and Aims

Molecular phylogenetic studies of palms (Arecaceae) have not yet provided a fully resolved phylogeny of the family. There is a need to increase the current set of markers to resolve difficult groups such as the Neotropical subtribe Bactridinae (Arecoideae: Cocoseae). We propose the use of two single-copy nuclear genes as valuable tools for palm phylogenetics.

Methods

New primers were developed for the amplification of the AGAMOUS 1 (AG1) and PHYTOCHROME B (PHYB) genes. For the AGAMOUS gene, the paralogue 1 of Elaeis guineensis (EgAG1) was targeted. The region amplified contained coding sequences between the MIKC K and C MADS-box domains. For the PHYB gene, exon 1 (partial sequence) was first amplified in palm species using published degenerate primers for Poaceae, and then specific palm primers were designed. The two gene portions were sequenced in 22 species of palms representing all genera of Bactridinae, with emphasis on Astrocaryum and Hexopetion, the status of the latter genus still being debated.

Key Results

The new primers designed allow consistent amplification and high-quality sequencing within the palm family. The two loci studied produced more variability than chloroplast loci and equally or less variability than PRK, RPBII and ITS nuclear markers. The phylogenetic structure obtained with AG1 and PHYB genes provides new insights into intergeneric relationships within the Bactridinae and the intrageneric structure of Astrocaryum. The Hexopetion clade was recovered as monophyletic with both markers and was weakly supported as sister to Astrocaryum sensu stricto in the combined analysis. The rare Astrocaryum minus formed a species complex with Astrocaryum gynacanthum. Moreover, both AG1 and PHYB contain a microsatellite that could have further uses in species delimitation and population genetics.

Conclusions

AG1 and PHYB provide additional phylogenetic information within the palm family, and should prove useful in combination with other genes to improve the resolution of palm phylogenies.
