共查询到20条相似文献,搜索用时 0 毫秒
1.
蛋白质工程:从定向进化到计算设计 总被引:1,自引:0,他引:1
定向进化通过建立突变体文库与高通量筛选方法,快速提升蛋白的特定性质,是目前蛋白质工程最为常用的蛋白质设计改造策略。近十年随着计算机运算能力大幅提升以及先进算法不断涌现,计算机辅助蛋白质设计改造得到了极大的重视和发展,成为蛋白质工程新开辟的重要方向。以结构模拟与能量计算为基础的蛋白质计算设计不但能改造酶的底物特异性与热稳定性,还可从头设计具有特定功能的人工酶。近年来机器学习等人工智能技术也被应用于计算机辅助蛋白质设计改造,并取得瞩目的成绩。文中介绍了蛋白质工程的发展历程,重点评述当前计算机辅助蛋白质设计改造方面的进展与应用,并展望其未来发展方向。 相似文献
2.
Directed evolution experiments rely on the cyclical application of mutagenesis, screening and amplification in a test tube. They have led to the creation of novel proteins for a wide range of applications. However, directed evolution currently requires an uncertain, typically large, number of labor intensive and expensive experimental cycles before proteins with improved function are identified. This paper introduces predictive models for quantifying the outcome of the experiments aiding in the setup of directed evolution for maximizing the chances of obtaining DNA sequences encoding enzymes with improved activities. Two methods of DNA manipulation are analysed: error-prone PCR and DNA recombination. Error-prone PCR is a DNA replication process that intentionally introduces copying errors by imposing mutagenic reaction conditions. The proposed model calculates the probability of producing a specific nucleotide sequence after a number of PCR cycles. DNA recombination methods rely on the mixing and concatenation of genetic material from a number of parent sequences. This paper focuses on modeling a specific DNA recombination protocol, DNA shuffling. Three aspects of the DNA shuffling procedure are modeled: the fragment size distribution after random fragmentation by DNase I, the assembly of DNA fragments, and the probability of assembling specific sequences or combinations of mutations. Results obtained with the proposed models compare favorably with experimental data. 相似文献
3.
SCUMBLE: a method for systematic and accurate detection of codon usage bias by maximum likelihood estimation 下载免费PDF全文
The genetic code is degenerate—most amino acids can be encoded by from two to as many as six different codons. The synonymous codons are not used with equal frequency: not only are some codons favored over others, but also their usage can vary significantly from species to species and between different genes in the same organism. Known causes of codon bias include differences in mutation rates as well as selection pressure related to the expression level of a gene, but the standard analysis methods can account for only a fraction of the observed codon usage variation. We here introduce an explicit model of codon usage bias, inspired by statistical physics. Combining this model with a maximum likelihood approach, we are able to clearly identify different sources of bias in various genomes. We have applied the algorithm to Saccharomyces cerevisiae as well as 325 prokaryote genomes, and in most cases our model explains essentially all observed variance. 相似文献
4.
ACUA: a software tool for automated codon usage analysis 总被引:1,自引:0,他引:1
Currently available codon usage analysis tools lack intuitive graphical user interface and are limited to inbuilt calculations. ACUA (Automated Codon Usage Tool) has been developed to perform high throughput sequence analysis aiding statistical profiling of codon usage. The results of ACUA are presented in a spreadsheet with all perquisite codon usage data required for statistical analysis, displayed in a graphical interface. The package is also capable of on-click sequence retrieval from the results interface, and this feature is unique to ACUA. AVAILABILITY: The package is available for non-commercial purposes and can be downloaded from: http://www.bioinsilico.com/acua. 相似文献
5.
Yang Huang Eugene V. Koonin David J. Lipman Teresa M. Przytycka 《Nucleic acids research》2009,37(20):6799-6810
In a wide range of genomes, it was observed that the usage of synonymous codons is biased toward specific codons and codon patterns. Factors that are implicated in the selection for codon usage include facilitation of fast and accurate translation. There are two types of translational errors: missense errors and processivity errors. There is considerable evidence in support of the hypothesis that codon usage is optimized to minimize missense errors. In contrast, little is known about the relationship between codon usage and frameshifting errors, an important form of processivity errors, which appear to occur at frequencies comparable to the frequencies of missense errors. Based on the recently proposed pause-and-slip model of frameshifting, we developed Frameshifting Robustness Score (FRS). We used this measure to test if the pattern of codon usage indicates optimization against frameshifting errors. We found that the FRS values of protein-coding sequences from four analyzed genomes (the bacteria Bacillus subtilis and Escherichia coli, and the yeasts Saccharomyces cerevisiae and Schizosaccharomyce pombe) were typically higher than expected by chance. Other properties of FRS patterns observed in B. subtilis, S. cerevisiae and S. pombe, such as the tendency of FRS to increase from the 5′- to 3′-end of protein-coding sequences, were also consistent with the hypothesis of optimization against frameshifting errors in translation. For E. coli, the results of different tests were less consistent, suggestive of a much weaker optimization, if any. Collectively, the results fit the concept of selection against mistranslation-induced protein misfolding being one of the factors shaping the evolution of both coding and non-coding sequences. 相似文献
6.
7.
Two gene classes characterized by high and low GC content have been found in rice and other cereals, but not dicot genomes. We used paralogs with high and low GC contents in rice and found: (a) a greater increase in GC content at exonic fourfold-redundant sites than at flanking introns; (b) with reference to their orthologs in Arabidopsis, most substitution sites between the two kinds of paralogs are found at 2- and 4-degenerate sites with a T-->C mode, while A-->C and A-->G play major roles at 0-degenerate sites; and (c) high-GC genes have greater bias and codon usage is skewed toward codons that are preferred in highly expressed genes. We believe this is strong evidence for selectively driven codon usage in rice. Another cereal, maize, also showed the same trend as in rice. This represents a potential evolutionary process for the origin of genes with a high GC content in rice and other cereals. 相似文献
8.
Despite the degeneracy of the genetic code, whereby different codons encode the same amino acid, alternative codons and amino acids are utilized nonrandomly within and between genomes. Such biases in codon and amino acid usage have been demonstrated extensively in prokaryote genomes and likely reflect a balance between the action of mutation, selection, and genetic drift. Here, we quantify the effects of selection and mutation drift as causes of codon and amino acid-usage bias in a large collection of nematode partial genomes from 37 species spanning approximately 700 Myr of evolution, as inferred from expressed sequence tag (EST) measures of gene expression and from base composition variation. Average G + C content at silent sites among these taxa ranges from 10% to 63%, and EST counts range more than 100-fold, underlying marked differences between the identities of major codons and optimal codons for a given species as well as influencing patterns of amino acid abundance among taxa. Few species in our sample demonstrate a dominant role of selection in shaping intragenomic codon-usage biases, and these are principally free living rather than parasitic nematodes. This suggests that deviations in effective population size among species, with small effective sizes among parasites, are partly responsible for species differences in the extent to which selection shapes patterns of codon usage. Nevertheless, a consensus set of optimal codons emerges that is common to most taxa, indicating that, with some notable exceptions, selection for translational efficiency and accuracy favors similar sets of codons regardless of the major codon-usage trends defined by base compositional properties of individual nematode genomes. 相似文献
9.
Subramanian S 《Genetics》2008,178(4):2429-2432
Here I show that the mean codon usage bias of a genome, and of the lowly expressed genes in a genome, is largely similar across eukaryotes ranging from unicellular protists to vertebrates. Conversely, this bias in housekeeping genes and in highly expressed genes has a remarkable inverse relationship with species generation time that varies by more than four orders of magnitude. The relevance of these results to the nearly neutral theory of molecular evolution is discussed. 相似文献
10.
Laccases are blue multicopper oxidases that couple the four-electron reduction of oxygen with the oxidation of a broad range of aromatic substrates. These fungal enzymes can be used for many applications such as bleaching, organic synthesis, bioremediation, and in laundry detergents. Laccases from Pleurotus ostreatus have been successfully heterologously expressed in yeasts. The availability of established recombinant expression systems has allowed the construction of mutated, "better performing" enzymes through molecular evolution techniques. In the present work, random mutagenesis experiments on poxc and poxa1b cDNAs, using error prone PCR (EP-PCR) have been performed. By screening a library of 1100 clones the mutant 1M9B was selected, it shows a single mutation (L112F) leading to an enzyme more active but less stable with respect to the wild-type enzyme (POXA1b) in all the analyzed conditions. This mutant has been used as a template for a second round of EP-PCR. From this second generation library of 1200 clones, three mutants have been selected. Properties of the four mutants, 1M9B screened from the first library, and 1L2B, 1M10B, and 3M7C from the second library, were analyzed. The better performing mutant 3M7C presents, besides L112F, only one substitution (P494T) responsible both for the significantly increased stability and for the exhibited higher activity of this mutant. Molecular dynamics simulations have been performed on three-dimensional models of POXA1b, 1M9B, and 3M7C, and hypotheses on the structure-function relationships of these proteins have been formulated. 相似文献
11.
Several primer prediction programs have been developed for a variety of applications. However none of these tools allows the prediction of a large set of primers for whole gene site-directed mutagenesis experiments using the megaprimer method. We report a novel primer prediction tool (insilico.mutagenesis), accessible at www.insilico.uni-duesseldorf.de, developed for the application to high-throughput mutagenesis used in directed evolution or structure-function dependency projects, which involve the subsequent mutagenesis of a large number of amino acid positions (e.g., in whole gene saturation or gene scanning mutagenesis experiments). Furthermore, the program is suitable for all site-directed (saturation) mutagenesis approaches, such as saturation mutagenesis of promoter sequences and other types of untranslated intergenic regions. In anticipation of downstream cloning steps, the primer design tool also includes a restriction site control feature alerting the user if unwanted restriction sites have been introduced within the mutagenesis primer. The use of our tool promises to speed up the process of site-directed mutagenesis, as it instantly allows predicting a large set of primers. 相似文献
12.
13.
Preferential codon usage in genes 总被引:1,自引:0,他引:1
We present a method which permits comparison of the preferential use of degenerate codons within any gene. The method makes use of the triplet frequencies in the noncoding frames to assess whether a preference is specific to the reading frame. Preference is given a statistical meaning by use of the analysis of variance coupled to Duncan's multiple range test.Preferential use of degenerate codons is gene-specific and independent of gene size. The data suggest that any correlation between codon frequency distribution and tRNA levels is unreliable. In those animal genes examined, codons ending in C or G are preferred; in animal viruses tested, codons ending in U or A are preferred. Similarly, the bacterial genes and the genes of single-stranded DNA phages that we analyzed differed from each other as well as from eukaryotic genes in the third base of the codon. 相似文献
14.
Summer B. Thyme Sandrine J. S. Boissel S. Arshiya Quadri Tony Nolan Dean A. Baker Rachel U. Park Lara Kusak Justin Ashworth David Baker 《Nucleic acids research》2014,42(4):2564-2576
Homing endonucleases (HEs) can be used to induce targeted genome modification to reduce the fitness of pathogen vectors such as the malaria-transmitting Anopheles gambiae and to correct deleterious mutations in genetic diseases. We describe the creation of an extensive set of HE variants with novel DNA cleavage specificities using an integrated experimental and computational approach. Using computational modeling and an improved selection strategy, which optimizes specificity in addition to activity, we engineered an endonuclease to cleave in a gene associated with Anopheles sterility and another to cleave near a mutation that causes pyruvate kinase deficiency. In the course of this work we observed unanticipated context-dependence between bases which will need to be mechanistically understood for reprogramming of specificity to succeed more generally. 相似文献
15.
Gabriel A. Schachtel Philipp Bucher Edward S. Mocarski B. Edwin Blaisdell Samuel Karlin 《Journal of molecular evolution》1991,33(6):483-494
Summary The genomes of human viruses herpes simplex 1 (HSV1) and varicella zoster (VZV), although similar in biology, largely concordant in gene order, and identical in many amino acid segments, differ widely in their genomic G+C (abbreviated S) content, which is high in HSV1 (68%) and low in VZV (46%). This paper analyzes several striking codon usage contrasts. The S difference in coding regions is dramatically large in codon site 3, S3, about 42%. The large difference in S3 is maintained at the same level in a subset of closely similar genes and even in corresponding identical amino acid blocks. A similar difference in S levels in silent site 1 (S1) is found in leucine and arginine. The difference in S3 levels occurs in every gene and in every multicodon amino acid form. The S difference also exists in amino acid usage, with HSV1 using significantly more codon types SSN, while VZV uses more codon types WWN (where W stands for A or T). The nonoverlapping and narrow histograms of S3 gene frequencies in both viruses suggest that the difference has arisen and been maintained by a process of selective rather than nonselective effects. This is in sharp contrast to the relatively large variance seen for highly similar genes in the human versus yeast analysis. Interpretations and hypotheses to explain the HSV1 vs VZV condon usage disparity relate to virus-host interactions, to the role of viral genes in DNA metabolism, to availability of molecular resources (molecular Gause exclusion principle), and to differences in genomic structure. 相似文献
16.
17.
Codon usage in Clonorchis sinensis was analyzed using 12,515 codons from 38 coding sequences. Total GC content was 49.83%, and GC1, GC2 and GC3 contents were 56.32%, 43.15% and 50.00%, respectively. The effective number of codons converged at 51-53 codons. When plotted against total GC content or GC3, codon usage was distributed in relation to GC3 biases. Relative synonymous codon usage for each codon revealed a single major trend, which was highly correlated with GC content at the third position when codons began with A or U at the first two positions. In codons beginning with G or C base at the first two positions, the G or C base rarely occurred at the third position. These results suggest that codon usage is shaped by a bias towards G or C at the third base, and that this is affected by the first and second bases. 相似文献
18.
19.
In many organisms, selection acts on synonymous codons to improve translation. However, the precise basis of this selection remains unclear in the majority of species. Selection could be acting to maximize the speed of elongation, to minimize the costs of proofreading, or to maximize the accuracy of translation. Using several data sets, we find evidence that codon use in Escherichia coli is biased to reduce the costs of both missense and nonsense translational errors. Highly conserved sites and genes have higher codon bias than less conserved ones, and codon bias is positively correlated to gene length and production costs, both indicating selection against missense errors. Additionally, codon bias increases along the length of genes, indicating selection against nonsense errors. Doublet mutations or replacement substitutions do not explain our observations. The correlations remain when we control for expression level and for conflicting selection pressures at the start and end of genes. Considering each amino acid by itself confirms our results. We conclude that selection on synonymous codon use in E. coli is largely due to selection for translational accuracy, to reduce the costs of both missense and nonsense errors. 相似文献