Similar articles
 Found 20 similar articles; search took 468 ms
1.
Tang L, Gao H, Zhu X, Wang X, Zhou M, Jiang R. BioTechniques 2012, 52(3):149-158
Site-saturation mutagenesis is a powerful tool for protein optimization due to its efficiency and simplicity. A degenerate codon such as NNN or NNS (or NNK) is often used to encode the 20 standard amino acids, but this produces redundant codons and causes an uneven distribution of amino acids in the constructed library. Here we present a novel "small-intelligent" strategy to construct mutagenesis libraries that have a minimal gene library size without inherent amino acid biases, stop codons, or rare codons of Escherichia coli by coupling well-designed combinatorial degenerate primers with suitable PCR-based mutagenesis methods. The designed primer mixture contains exactly one codon per amino acid and thus allows the construction of small-intelligent mutagenesis libraries with one gene per protein. In addition, the software tool DC-Analyzer was developed to assist in primer design according to the user-defined randomization scheme for library construction. This small-intelligent strategy was successfully applied to the randomization of halohydrin dehalogenases with one or two randomized sites. With the help of DC-Analyzer, the strategy was proven to be as simple as NNS randomization and could serve as a general tool to efficiently randomize target genes at positions of interest.
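The redundancy and bias described in this abstract can be checked by expanding each degenerate codon against the standard genetic code. The following is a minimal sketch, not the DC-Analyzer tool; the helper names `expand` and `amino_acid_counts` are hypothetical and introduced only for illustration.

```python
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Only the IUPAC ambiguity letters needed for the schemes named in the abstract.
IUPAC = {"N": "ACGT", "S": "CG", "K": "GT"}


def expand(degenerate):
    """List every concrete codon covered by a degenerate codon (hypothetical helper)."""
    pools = [IUPAC.get(base, base) for base in degenerate]
    return ["".join(c) for c in product(*pools)]


def amino_acid_counts(degenerate):
    """Count how many codons of the scheme encode each amino acid or stop."""
    return Counter(CODON_TABLE[c] for c in expand(degenerate))


for scheme in ("NNN", "NNS", "NNK"):
    counts = amino_acid_counts(scheme)
    stops = counts.pop("*", 0)
    print(scheme, "codons:", len(expand(scheme)), "stops:", stops,
          "most/least codons per amino acid:",
          max(counts.values()), "/", min(counts.values()))
```

A scheme with exactly one codon per amino acid, as the abstract proposes, would report 20 codons, no stops, and a most/least ratio of 1/1.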

2.
Saturation mutagenesis is a widely used directed evolution technique, in which a large number of protein variants, each having random amino acids in certain predetermined positions, are screened in order to discover high-fitness variants among them. Several metrics for determining the library size (the number of variants screened) have been suggested in the literature, but none of them incorporates the actual fitness of the variants discovered in the experiment. We present the results of an extensive simulation study, which is based on probabilistic models for protein fitness landscape, and which investigates how the result of a saturation mutagenesis experiment – the fitness of the best variant discovered – varies as a function of the library size. In particular, we study the loss of fitness in the experiment: the difference between the fitness of the best variant discovered and the fitness of the best variant in variant space. Our results show that the existing criteria for determining the library size are conservative, so smaller libraries are often satisfactory. Reducing the library size can save labor, time, and expenses in the laboratory.
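The abstract does not name the library-size metrics it compares. As an illustration only, one widely used kind of criterion is expected coverage: the fraction of distinct variants expected to appear at least once when clones are drawn uniformly at random. The sketch below assumes equiprobable variants, and the function names are hypothetical.

```python
from math import ceil, log


def expected_coverage(n_variants, n_clones):
    """Expected fraction of distinct variants seen at least once.

    Assumes every variant is equally likely in each of n_clones independent draws.
    """
    return 1.0 - (1.0 - 1.0 / n_variants) ** n_clones


def clones_for_coverage(n_variants, target):
    """Smallest number of clones whose expected coverage reaches `target`.

    Assumes n_variants > 1 and 0 < target < 1.
    """
    return ceil(log(1.0 - target) / log(1.0 - 1.0 / n_variants))


# One NNK-randomized site has 32 codon variants; two sites have 32 ** 2.
for sites in (1, 2, 3):
    v = 32 ** sites
    print(sites, "site(s):", v, "variants,",
          clones_for_coverage(v, 0.95), "clones for 95% expected coverage")
```

A coverage criterion of this kind ignores fitness entirely, which is the gap the simulation study addresses.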

3.
We determined the essentiality of all amino acid replacements within an 11-codon sequence in the putative nucleoside-binding site of thymidine kinase encoded by herpes simplex virus type 1. This involved partial randomization of 11 codons in the gene to create a degenerate library, followed by genetic complementation using a tk- Escherichia coli strain and selection of unnatural active enzymes. We produced and tested 53,000 variants, of which 190 were found to be biologically active. Sequence analyses of functional variants revealed a high degree of flexibility in accommodating different types of amino acid substitutions in this region. However, no replacement was tolerated at proline-173, whereas tyrosine-172 could be replaced only by phenylalanine. To further define permissible substitutions at specified positions, we constructed a library with randomization at only four test codons. We produced and tested 600,000 variants, of which only 5 were active. Again proline-173 was conserved, and only tyrosine and phenylalanine were found at position 172. The identification of these conserved amino acids should provide important insights into the understanding of the structural basis of catalysis by this enzyme.

4.
Amino acid substitution plays a vital role in both the molecular engineering of proteins and analysis of structure-activity relationships. High-throughput substitution is achieved by codon randomisation, which generates a library of mutants (a randomised gene library) in a single experiment. For full randomisation, key codons are typically replaced with NNN (64 sequences) or with NN(G/C) or NN(G/T) (32 sequences each). This obligates cloning of redundant codons alongside those required to encode the 20 amino acids. As the number of randomised codons increases, there is therefore a progressive loss of randomisation efficiency; the number of genes required per protein rises exponentially. The redundant codons cause amino acids to be represented unevenly; for example, methionine is encoded just once within NNN, whilst arginine is encoded six times. Finally, the organisation of the genetic code makes it impossible to encode functional subsets of amino acids (e.g. polar residues only) in a single experiment. Here, we present a novel solution to randomisation where genetic redundancy is eliminated; the number of different genes equals the number of encoded proteins, regardless of codon number. There is no inherent amino acid bias and any required subset of amino acids may be encoded in one experiment. This generic approach should be widely applicable in studies involving randomisation of proteins.
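The exponential rise in genes per protein follows directly from the codon counts quoted in the abstract: with c codons per randomised position and 20 amino acids, n positions give c^n genes for 20^n proteins. A minimal sketch of that arithmetic, with a hypothetical function name and ignoring the fact that some NNN and NN(G/T) genes contain stop codons:

```python
def genes_per_protein(codons_per_site, n_sites, amino_acids=20):
    """Ratio of distinct genes to distinct full-length proteins.

    Simplifying assumption: stop codons are counted like any other codon.
    """
    return (codons_per_site / amino_acids) ** n_sites


print("sites  NNN(64)  32-codon  non-redundant(20)")
for n in (1, 2, 4, 6):
    print(n,
          round(genes_per_protein(64, n), 1),
          round(genes_per_protein(32, n), 1),
          genes_per_protein(20, n))
```

The last column stays at 1.0 for any number of sites, which is the property the abstract claims for the non-redundant approach.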

5.
The universal genetic code links the 20 naturally occurring amino acids to the 61 sense codons. Previously, the UAG amber stop codon (a nonsense codon) has been used as a blank in the code to insert natural and unnatural amino acids via nonsense suppression. We have developed a selection methodology to investigate whether the unnatural amino acid biocytin could be incorporated into an mRNA display library at sense codons. In these experiments we probed a single randomized NNN codon with a library of 16 orthogonal, biocytin-acylated tRNAs. In vitro selection for efficient incorporation of the unnatural amino acid resulted in templates containing the GUA codon at the randomized position. This sense suppression occurs via Watson-Crick pairing with similar efficiency to UAG-mediated nonsense suppression. These experiments suggest that sense codon suppression is a viable means to expand the chemical and functional diversity of the genetic code.

6.
The efficiency of translation termination at NNN NNN UGA A stop codon contexts has been determined in Escherichia coli. No general effects are found which can be attributed directly to the mRNA sequence itself. Instead, termination is influenced primarily by the amino acids at the C-terminal end of the nascent peptide, which are specified by the two codons at the 5' side of UGA. For the penultimate amino acid (-2 location), charge and hydrophobicity are important. For the last amino acid (-1 location), alpha-helical, beta-strand and reverse turn propensities are determining factors. The van der Waals volume of the last amino acid can affect the relative efficiency of stop codon readthrough by the wild-type and suppressor forms of tRNA(Trp) (CAA). The influence of the -1 and -2 amino acids is cooperative. Accumulation of an mRNA degradation intermediate indicates mRNA protection by pausing ribosomes at contexts which give inefficient UGA termination. Highly expressed E. coli genes with the UGA A termination signal encode C-terminal amino acids which favour efficient termination. This restriction is not found for poorly expressed genes.

7.
Xia X. Gene 2005, 345(1):13-20
The H-strand of vertebrate mitochondrial DNA is left single-stranded for hours during the slow DNA replication. This facilitates C→U mutations on the H-strand (and consequently G→A mutations on the L-strand) via spontaneous deamination which occurs much more frequently on single-stranded than on double-stranded DNA. For the 12 coding sequences (CDS) collinear with the L-strand, NNY synonymous codon families (where N stands for any of the four nucleotides and Y stands for either C or U) end mostly with C, and NNR and NNN codon families (where R stands for either A or G) end mostly with A. For the lone ND6 gene on the other strand, the codon bias is the opposite, with NNY codon families ending mostly with U and NNR and NNN codon families ending mostly with G. These patterns are consistent with the strand-specific mutation bias. The codon usage biased towards C-ending and A-ending in the 12 CDS sequences affects the codon-anticodon adaptation. The wobble site of the anticodon is always G for NNY codon families dominated by C-ending codons and U for NNR and NNN codon families dominated by A-ending codons. The only, but consistent, exception is the anticodon of tRNA-Met which consistently has a 5'-CAU-3' anticodon base-pairing with the AUG codon (the translation initiation codon) instead of the more frequent AUA. The observed CAU anticodon (matching AUG) would increase the rate of translation initiation but would reduce the rate of peptide elongation because most methionine codons are AUA, whereas the unobserved UAU anticodon (matching AUA) would increase the elongation rate at the cost of translation initiation rate. The consistent CAU anticodon in tRNA-Met suggests the importance of maximizing the rate of translation initiation.

8.
Many native proteins are multi-specific and interact with numerous partners, which can confound analysis of their functions. Protein design provides a potential route to generating synthetic variants of native proteins with more selective binding profiles. Redesigned proteins could be used as research tools, diagnostics or therapeutics. In this work, we used a library screening approach to reengineer the multi-specific anti-apoptotic protein Bcl-xL to remove its interactions with many of its binding partners, making it a high-affinity and selective binder of the BH3 region of pro-apoptotic protein Bad. To overcome the enormity of the potential Bcl-xL sequence space, we developed and applied a computational/experimental framework that used protein structure information to generate focused combinatorial libraries. Sequence features were identified using structure-based modeling, and an optimization algorithm based on integer programming was used to select degenerate codons that maximally covered these features. A constraint on library size was used to ensure thorough sampling. Using yeast surface display to screen a designed library of Bcl-xL variants, we successfully identified a protein with ~1000-fold improvement in binding specificity for the BH3 region of Bad over the BH3 region of Bim. Although negative design was targeted only against the BH3 region of Bim, the best redesigned protein was globally specific against binding to 10 other peptides corresponding to native BH3 motifs. Our design framework demonstrates an efficient route to highly specific protein binders and may readily be adapted for application to other design problems.
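The codon-selection step can be illustrated at a single position. The sketch below is not the authors' integer program: it swaps in an exhaustive search over all 3,375 IUPAC degenerate codons and picks the smallest stop-free codon that encodes every wanted amino acid. The function names and the tie-breaking rule are assumptions made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def encoded(degenerate):
    """Return (set of encoded amino acids including '*', number of codons)."""
    codons = ["".join(c) for c in product(*(IUPAC[b] for b in degenerate))]
    return {CODON_TABLE[c] for c in codons}, len(codons)


def smallest_covering_codon(wanted):
    """Smallest stop-free degenerate codon encoding all `wanted` amino acids.

    Ties are broken by fewest unwanted amino acids, then alphabetically.
    Returns (codon count, unwanted amino acid count, codon) or None.
    """
    wanted = set(wanted)
    best = None
    for letters in product(sorted(IUPAC), repeat=3):
        degenerate = "".join(letters)
        amino_acids, size = encoded(degenerate)
        if "*" in amino_acids or not wanted <= amino_acids:
            continue
        key = (size, len(amino_acids - wanted), degenerate)
        if best is None or key < best:
            best = key
    return best


print(smallest_covering_codon("AV"))
print(smallest_covering_codon("ACDEFGHIKLMNPQRSTVWY"))
```

A multi-position library design would additionally need the library-size constraint mentioned in the abstract, since the total size is the product of the per-position codon counts.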

9.
In systematics, parsimony methods construct phylogenies, or evolutionary trees, in which characters evolve with the least evolutionary change. The chromosome inversion, or polymorphism, parsimony criterion is used when each character of a population may exhibit homozygous or heterozygous states, but when the heterozygous state must evolve uniquely. Variations of the criterion concern whether or not the ancestral states of characters are specified. We establish that problems of inferring phylogenies by these criteria are NP-complete and thus are so difficult computationally that efficient optimal algorithms for them are unlikely to exist.

10.
With the three-letter alphabet [R,Y,N] (R = purine, Y = pyrimidine, N = R or Y), there are 26 codons (NNN being excluded): RNN,...,NNY (six codons with two unspecified bases N), RRN,...,NYY (12 codons with one unspecified base N), RRR,...,YYY (eight specified codons). A statistical methodology that uses the codon frequency and a reduced centered variable leads to similar results for a codon occurrence study, regardless of gene function and regardless of a particular protein coding gene taxonomic population. Therefore, this variable can be considered a new codon usage index, whose use removes certain nonsignificant results found with the frequency statistic. This methodology identifies the common and rare codons (i.e., the codons having the highest and lowest occurrence) and leads to a model of codon evolution at three successive states: RNN, then RNY, and finally RYY. Some biological relations between this model and the YRY(N)6YRY preferential occurrence are also presented.
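The codon counts quoted at the start of this abstract can be reproduced by direct enumeration. A short sketch, with variable names chosen only for illustration:

```python
from collections import Counter
from itertools import product

# All triplets over the alphabet [R, Y, N], excluding the fully unspecified NNN.
codons = ["".join(c) for c in product("RYN", repeat=3)]
codons.remove("NNN")

# Group the codons by how many unspecified bases (N) they contain.
by_unspecified = Counter(codon.count("N") for codon in codons)

print(len(codons), "codons in total")
for n_unspecified in (2, 1, 0):
    members = [c for c in codons if c.count("N") == n_unspecified]
    print(n_unspecified, "unspecified:", len(members), members)
```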

11.
Exhaustive mutagenesis studies of the reactive centre loop (RCL), a key structural component of proteins belonging to the serpin superfamily of protease inhibitors, are complicated by the size of the RCL, serpin conformational complexity, and, for most serpins, the lack of a serpin-dependent phenotype of expressing cells. Here, we describe a thrombin capture assay that distinguished thrombin-inhibitory recombinant human alpha-1 proteinase inhibitor (API M358R) from non-inhibitory API variants in Escherichia coli lysates prepared from either single clones or pools. Binding of API proteins in the lysates to thrombin immobilized on microtiter plate wells was quantified via colour generated by a peroxidase-coupled anti-API antibody. Bacterial expression plasmids encoding inhibitory API M358R were mixed 1:99 with plasmids encoding non-inhibitory API T345R/M358R and the resulting library screened in pools of 10. All above-background signals arising from pools or subsequently re-probed single clones were linked to the presence of plasmids encoding API M358R. Screening of a portion of another expression library encoding hypervariable API with all possibilities at codons 352–358 also yielded only novel, thrombin-inhibitory variants. Probing a smaller library expressing all possible codons at Ala347 yielded the wild type, 6 different functional variants, one partially active variant, and two variants with no thrombin-inhibitory activity. API antigen levels varied considerably less among Ala347 variants than activity levels, and comparison of rate constants of inhibition of purified API variants to their corresponding thrombin capture assay lysate values was used to establish the sensitivity and specificity of the assay. The results indicate that the approach is sufficiently robust to correctly identify functional versus non-functional candidates in API expression libraries, and could be of value in systematically probing structure/function relationships not only in the API RCL, but also in that of other serpins.

12.
There is an increasing demand for efficient and effective methods to engineer protein variants for industrial applications, structural biology and drug development. We describe a PCR-based strategy that produces a multi-site-saturation mutagenic expression library using a circular plasmid carrying the wild-type gene. This restriction digestion- and ligation-independent method involves three steps: 1) synthesis of the degenerate oligonucleotide primers, 2) incorporation of the mutations through PCR, 3) transformation into the expression host. Our strategy is demonstrated through successful construction of an E. coli K12 malic enzyme expression library that contains members with simultaneous mutations on amino acid residues G311, D345 and G397. This method is in principle compatible with any circular vector that can be propagated in a dam+ E. coli host to generate a protein variant library with multiple changes, including mutation, short sequence deletion and insertion, or any mix of them.

13.
The randomization scheme of the hypervariable region plays a crucial role in the construction of a synthetic antibody library. The codon bias and unavoidable stop codon of the conventional 'NNK' and 'NNS' codons limit their applications. Here we report a split-mix-split DNA synthesis method that can effectively control the amino acid composition and distribution of randomized sequences. A fully synthetic human antibody library with a diversity of 1.56 × 10^9 was successfully generated with complementarity determining region 3 (CDR3) randomized by this strategy. Sequencing analysis indicated that >60% of colonies had completely correct scFv genes and that the amino acid composition and distribution were in good accordance with the design. The utility was demonstrated by screening of scFv clones against BHL (anti-CD3 x anti-ovarian carcinoma bispecific antibody). These results proved the feasibility of the split-mix-split DNA randomization strategy in library construction and site-directed mutagenesis.

14.
There is an increasing demand for efficient and effective methods to engineer protein variants for industrial applications, structural biology and drug development. We describe a PCR-based strategy that produces a multi-site-saturation mutagenic expression library using a circular plasmid carrying the wild-type gene. This restriction digestion- and ligation-independent method involves three steps: 1) synthesis of the degenerate oligonucleotide primers, 2) incorporation of the mutations through PCR, 3) transformation into the expression host. Our strategy is demonstrated through successful construction of an E. coli K12 malic enzyme expression library that contains members with simultaneous mutations on amino acid residues G311, D345 and G397. This method is in principle compatible with any circular vector that can be propagated in a dam+ E. coli host to generate a protein variant library with multiple changes, including mutation, short sequence deletion and insertion, or any mix of them.

15.
Site-directed mutagenesis has greatly helped researchers both to understand the precise role of specific residues in coding sequences and to generate variants of proteins that have acquired new characteristics. Today's demands for more complete functional cartographies of proteins and advances in selection and screening technologies require that site-directed mutagenesis be adapted for high-throughput applications. We describe here the first generation of a library of single and multiple site-directed mutants using a mixture of oligonucleotides synthesized on DNA chips. We have used the human interleukin 15 (IL15) gene as a model, of which 37 codons were simultaneously targeted for substitution by any of eight possible codons. Ninety-six clones were sequenced, exhibiting a broad spectrum of targeted substitutions over the whole gene length with no unwanted mutations. Libraries produced using such pools of oligonucleotides open new perspectives to direct the evolution of proteins in vitro, by enabling the simple, rapid, and cost-effective generation of large tailor-made genetic diversities from any gene.

16.
The codon-reading properties of wobble-position variants of the unmodified form of Escherichia coli tRNASer1 (the UGA anticodon) were measured in a cell-free translation system. Two variants, with the AGA and CGA anticodons, each exclusively read a single codon, UCU and UCG, respectively. The only case of efficient wobbling occurred with the variant with the GGA anticodon, which reads the UCU codon in addition to the UCC codon. Surprisingly, this wobble reading is more efficient than the Watson-Crick reading by the variant with the AGA anticodon. Furthermore, we prepared tRNA variants with AA, UC, and CU, instead of GA, in the second and third positions and measured their relative efficiencies in the reading of codons starting with UU, GA, and AG, respectively. The specificity concerning the wobble position is essentially the same as that in the case of the codons starting with UC.

17.
The search for the association of rare genetic variants with common diseases is of high interest, yet challenging because of cost considerations. We present an efficient two-stage design that uses diseased cases to first screen for rare variants at stage-1. If too few cases are found to carry any variants, the study stops. Otherwise, the selected variants are screened at stage-2 in a larger set of cases and controls, and the frequency of variants is compared between cases and controls by an exact test that corrects for the stage-1 ascertainment. Simulations show that our new method provides conservative Type-I error rates, similar to the conservative aspect of Fisher’s exact test. We show that the probability of stopping at stage-1 increases with a smaller number of cases screened at stage-1, a larger stage-1 continuation threshold, or a smaller carrier probability. Our simulations also show how these factors impact the power at stage-2. To balance stopping early when there are few variant carriers versus continuation to stage-2 when the variants have a reasonable effect size on the phenotype, we provide guidance on designing an optimal study that minimizes the expected sample size when the null hypothesis is true, yet achieves the desired power.
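The three dependencies stated for the stage-1 stopping probability can be seen in a simple binomial model. The sketch below is an assumption-laden simplification, not the authors' design: it supposes that each screened case carries a variant independently with the same probability, and that the study stops when fewer than `threshold` carriers are observed. The function name is hypothetical.

```python
from math import comb


def prob_stop_stage1(n_cases, threshold, carrier_prob):
    """P(fewer than `threshold` carriers among `n_cases`) under a binomial model.

    Assumes independent cases with a common carrier probability.
    """
    return sum(
        comb(n_cases, k) * carrier_prob ** k * (1 - carrier_prob) ** (n_cases - k)
        for k in range(threshold)
    )


# Fewer cases, a higher threshold, or a lower carrier probability
# each raise the chance of stopping at stage 1.
print(prob_stop_stage1(100, 3, 0.05))
print(prob_stop_stage1(50, 3, 0.05))
print(prob_stop_stage1(100, 5, 0.05))
print(prob_stop_stage1(100, 3, 0.02))
```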

18.
Background: cDNA libraries are widely used to identify genes and splice variants, and as a physical resource for full-length clones. Conventionally-generated cDNA libraries contain a high percentage of 5'-truncated clones. Current library construction methods that enrich for full-length mRNA are laborious, and involve several enzymatic steps performed on mRNA, which renders them sensitive to RNA degradation. The SMART technique for full-length enrichment is robust but results in limited cDNA insert size of the library.

19.
A computer program (PINCERS) is described for use in the design of synthetic genes and mixed-probe DNA sequences. A protein sequence is reverse translated with generation of synonymous codons at each position, producing a degenerate sequence. In order to locate potential restriction enzyme sites, the degenerate sequence is searched with a library of restriction enzymes for sites that utilize any combination of synonymous codons. These sites are indicated in a map so that they may be incorporated into the synthetic gene sequence. The program allows the user to select the appropriate codon usage table for the organism of interest and then to set a threshold usage frequency below which codons are not generated. PINCERS may also be used to assist in planning the synthesis of mixed-probe DNA sequences for cross-hybridization experiments. It can identify regions of specified length within the protein sequence that have the least overall degeneracy, thereby minimizing the number of probes to be synthesized and, therefore, maximizing the concentration of a given probe sequence.
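The search for the least degenerate region can be sketched as a sliding window over the protein sequence, scoring each window by the product of the codon counts of its residues (the number of distinct sequences a mixed probe would need). This is an illustration under the standard genetic code, not the PINCERS implementation; it ignores the codon-usage threshold, and the function name is hypothetical.

```python
from collections import Counter
from math import prod

# Standard genetic code amino acids, one letter per codon ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Number of synonymous codons for each amino acid.
DEGENERACY = Counter(AMINO)


def least_degenerate_window(protein, length):
    """Return (start index, degeneracy) of the first window with minimal degeneracy.

    Returns (None, None) if the protein is shorter than `length`.
    """
    best_start, best_degeneracy = None, None
    for start in range(len(protein) - length + 1):
        degeneracy = prod(DEGENERACY[aa] for aa in protein[start:start + length])
        if best_degeneracy is None or degeneracy < best_degeneracy:
            best_start, best_degeneracy = start, degeneracy
    return best_start, best_degeneracy


# Toy sequence: leucine, serine and arginine (six codons each) are costly;
# methionine and tryptophan (one codon each) are cheap.
print(least_degenerate_window("LSRMWFMKQ", 3))
```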

20.
The Red List Categories and the accompanying five criteria developed by the International Union for Conservation of Nature (IUCN) provide an authoritative and comprehensive methodology to assess the conservation status of organisms. Red List criterion B, which principally uses distribution data, is the most widely used to assess conservation status, particularly of plant species. No software package has previously been available to perform large-scale multispecies calculations of the three main criterion B parameters [extent of occurrence (EOO), area of occupancy (AOO) and an estimate of the number of locations] and provide preliminary conservation assessments using an automated batch process. We developed ConR, a dedicated R package, as a rapid and efficient tool to conduct large numbers of preliminary assessments, thereby facilitating complete Red List assessment. ConR (1) calculates key geographic range parameters (AOO and EOO) and estimates the number of locations sensu IUCN needed for an assessment under criterion B; (2) uses this information in a batch process to generate preliminary assessments of multiple species; (3) summarizes the parameters and preliminary assessments in a spreadsheet; and (4) provides a visualization of the results by generating maps suitable for the submission of full assessments to the IUCN Red List. ConR can be used for any living organism for which reliable georeferenced distribution data are available. As distributional data for taxa become increasingly available via large open access datasets, ConR provides a novel, timely tool to guide and accelerate the work of the conservation and taxonomic communities by enabling practitioners to conduct preliminary assessments simultaneously for hundreds or even thousands of species in an efficient and time-saving way.
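The two range parameters can be illustrated with a small planar computation: EOO as the area of the convex hull of the occurrence points, and AOO as the number of occupied grid cells times the cell area. This is a sketch in Python rather than R and is not the ConR implementation. It assumes coordinates already projected to kilometres, a 2 km grid with a fixed origin, and hypothetical function names; real occurrence data in latitude and longitude would need projecting first.

```python
from math import floor


def aoo_km2(points_km, cell_km=2.0):
    """Area of occupancy: occupied grid cells times cell area (fixed grid origin)."""
    cells = {(floor(x / cell_km), floor(y / cell_km)) for x, y in points_km}
    return len(cells) * cell_km ** 2


def convex_hull(points):
    """Convex hull by Andrew's monotone chain; returns vertices in order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def eoo_km2(points_km):
    """Extent of occurrence: convex hull area via the shoelace formula."""
    hull = convex_hull(points_km)
    twice_area = sum(
        x1 * y2 - x2 * y1
        for (x1, y1), (x2, y2) in zip(hull, hull[1:] + hull[:1])
    )
    return abs(twice_area) / 2.0


# Hypothetical occurrence records (x, y) in kilometres.
records = [(0, 0), (10, 0), (10, 10), (0, 10), (5, 5), (5.5, 5.5)]
print("EOO (km^2):", eoo_km2(records))
print("AOO (km^2):", aoo_km2(records))
```

In a batch setting the same two functions would simply be applied to each species' records in turn.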
