首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The extent to which natural selection shapes diversity within populations is a key question for population genetics. Thus, there is considerable interest in quantifying the strength of selection. A full likelihood approach for inference about selection at a single site within an otherwise neutral fully linked sequence of sites is described here. A coalescent model of evolution is used to model the ancestry of a sample of DNA sequences which have the selected site segregating. The mutation model, for the selected and neutral sites, is the infinitely many-sites model where there is no back or parallel mutation at sites. A unique perfect phylogeny, a gene tree, can be constructed from the configuration of mutations on the sample sequences under this model of mutation. The approach is general and can be used for any bi-allelic selection scheme. Selection is incorporated through modelling the frequency of the selected and neutral allelic classes stochastically back in time, then using a subdivided population model considering the population frequencies through time as variable population sizes. An importance sampling algorithm is then used to explore over coalescent tree space consistent with the data. The method is applied to a simulated data set and the gene tree presented in Verrelli et al. (2002).  相似文献   

2.
K Zeng 《Heredity》2013,110(4):363-371
There is increasing evidence that background selection, the effects of the elimination of recurring deleterious mutations by natural selection on variability at linked sites, may be a major factor shaping genome-wide patterns of genetic diversity. To accurately quantify the importance of background selection, it is vital to have computationally efficient models that include essential biological features. To this end, a structured coalescent procedure is used to construct a model of background selection that takes into account the effects of recombination, recent changes in population size and variation in selection coefficients against deleterious mutations across sites. Furthermore, this model allows a flexible organization of selected and neutral sites in the region concerned, and has the ability to generate sequence variability at both selected and neutral sites, allowing the correlation between these two types of sites to be studied. The accuracy of the model is verified by checking against the results of forward simulations. These simulations also reveal several patterns of diversity that are in qualitative agreement with observations reported in recent studies of DNA sequence polymorphisms. These results suggest that the model should be useful for data analysis.  相似文献   

3.
Simulating Evolution by Gene Duplication   总被引:14,自引:5,他引:14       下载免费PDF全文
Tomoko Ohta 《Genetics》1987,115(1):207-213
By considering the recent finding that unequal crossing over and other molecular interactions are contributing to the evolution of multigene families, a model of the origin of repetitive genes was studied by Monte Carlo simulations. Starting from a single gene copy, how genetic systems evolve was examined under unequal crossing over, random drift and natural selection. Both beneficial and deteriorating mutations were incorporated, and the latter were assumed to occur ten times more frequently than the former. Positive natural selection favors those chromosomes with more beneficial mutations in redundant copies than others in the population, but accumulation of deteriorating mutations (pseudogenes) have no effect on fitness so long as there remains a functional gene. The results imply the following: Positive natural selection is needed in order to acquire gene families with new functions. Without it, too many pseudogenes accumulate before attaining a functional gene family. There is a large fluctuation in the outcome even if parameters are the same. When unequal crossing over occurs more frequently, the system evolves more rapidly. It was also shown, under realistic values of parameters, that the genetic load for acquiring a new gene is not as large as J.B.S. Haldane suggested, but not so small as in a model in which a system for selection started from already redundant genes.  相似文献   

4.
Summary The role of mutations in finite populations on response to artificial selection was investigated by a computer simulation model designed to mimic the biological model of pupal weight of Tribolium. Given the model, the results showed that with selection about 25–55 generations were needed for genetic variances to reach a maximum value depending on population size, selection intensity, and gene number. When effective population size was larger than 40 or the intensity of selection was high (less than 50% selected), selection had a dramatic effect in reducing the time to approach the maximum point of genetic variance. Furthermore, the genetic variance after that point often declined as a function of selection instead of remaining at a steady state in the subsequent generations.  相似文献   

5.
PoPMuSiC is an efficient tool for rational computer-aided design of single-site mutations in proteins and peptides. Two types of queries can be submitted. The first option allows to estimate the changes in folding free energy for specific point mutations given by the user. In the second option, all possible point mutations in a given protein or protein region are performed and the most stabilizing or destabilizing mutations, or the neutral mutations with respect to thermodynamic stability, are selected. For each sequence position or secondary structure the deviation from the most stable sequence is moreover evaluated, which helps to identify the most suitable sites for the introduction of mutations.  相似文献   

6.
Immune escape is considered to be the driving force behind structural variability of major antigens on the surface of bacterial pathogens, such as fimbriae. In the Dr family of Escherichia coli adhesins, structural and adhesive functions are carried out by the same subunit. Dr adhesins have been shown to bind decay-accelerating factor (DAF), collagen IV, and carcinoembryonic antigen-related cell adhesion molecules (CEACAMs). We show that genes encoding Dr adhesins from 100 E. coli strains form eight structural groups with a high level of amino acid sequence diversity between them. However, genes comprising each group differ from each other by only a small number of point mutations. Out of 66 polymorphisms identified within the groups, only three were synonymous mutations, indicating strong positive selection for amino acid replacements. Functional analysis of intragroup variants comprising the Dr haemagglutinin (DraE) group revealed that the point mutations result in distinctly different binding phenotypes, with a tendency of increased affinity to DAF, decreased sensitivity of DAF binding to inhibition by chloramphenicol, and loss of binding capability to collagen, CEACAM3 and CEACAM6. Thus, variability by point mutation of major antigenic proteins on the bacterial surface can be a signature of selection for functional modification.  相似文献   

7.
CRISPR-based homing gene drives can be designed to disrupt essential genes whilst biasing their own inheritance, leading to suppression of mosquito populations in the laboratory. This class of gene drives relies on CRISPR-Cas9 cleavage of a target sequence and copying (‘homing’) therein of the gene drive element from the homologous chromosome. However, target site mutations that are resistant to cleavage yet maintain the function of the essential gene are expected to be strongly selected for. Targeting functionally constrained regions where mutations are not easily tolerated should lower the probability of resistance. Evolutionary conservation at the sequence level is often a reliable indicator of functional constraint, though the actual level of underlying constraint between one conserved sequence and another can vary widely. Here we generated a novel adult lethal gene drive (ALGD) in the malaria vector Anopheles gambiae, targeting an ultra-conserved target site in a haplosufficient essential gene (AGAP029113) required during mosquito development, which fulfils many of the criteria for the target of a population suppression gene drive. We then designed a selection regime to experimentally assess the likelihood of generation and subsequent selection of gene drive resistant mutations at its target site. We simulated, in a caged population, a scenario where the gene drive was approaching fixation, where selection for resistance is expected to be strongest. Continuous sampling of the target locus revealed that a single, restorative, in-frame nucleotide substitution was selected. Our findings show that ultra-conservation alone need not be predictive of a site that is refractory to target site resistance. Our strategy to evaluate resistance in vivo could help to validate candidate gene drive targets for their resilience to resistance and help to improve predictions of the invasion dynamics of gene drives in field populations.  相似文献   

8.
It has become clear that a large proportion of functional DNA in the human genome does not code for protein. Identification of this non-coding functional sequence using comparative approaches is proving difficult and has previously been thought to require deep sequencing of multiple vertebrates. Here we introduce a new model and comparative method that, instead of nucleotide substitutions, uses the evolutionary imprint of insertions and deletions (indels) to infer the past consequences of selection. The model predicts the distribution of indels under neutrality, and shows an excellent fit to human–mouse ancestral repeat data. Across the genome, many unusually long ungapped regions are detected that are unaccounted for by the neutral model, and which we predict to be highly enriched in functional DNA that has been subject to purifying selection with respect to indels. We use the model to determine the proportion under indel-purifying selection to be between 2.56% and 3.25% of human euchromatin. Since annotated protein-coding genes comprise only 1.2% of euchromatin, these results lend further weight to the proposition that more than half the functional complement of the human genome is non-protein-coding. The method is surprisingly powerful at identifying selected sequence using only two or three mammalian genomes. Applying the method to the human, mouse, and dog genomes, we identify 90 Mb of human sequence under indel-purifying selection, at a predicted 10% false-discovery rate and 75% sensitivity. As expected, most of the identified sequence represents unannotated material, while the recovered proportions of known protein-coding and microRNA genes closely match the predicted sensitivity of the method. The method's high sensitivity to functional sequence such as microRNAs suggest that as yet unannotated microRNA genes are enriched among the sequences identified. Futhermore, its independence of substitutions allowed us to identify sequence that has been subject to heterogeneous selection, that is, sequence subject to both positive selection with respect to substitutions and purifying selection with respect to indels. The ability to identify elements under heterogeneous selection enables, for the first time, the genome-wide investigation of positive selection on functional elements other than protein-coding genes.  相似文献   

9.
Summary The population dynamics of nearly neutral mutations are studied using a single-site and a multisite model. In the latter model, the nucleotides in a sequence are completely linked and the selection schemes employed are additive, multiplicative, and additive with a threshold. Although the third selection scheme is very different from the first two, the three schemes produce identical results for a wide range of parameter values. Thus the present study provides a general theory for the population dynamics of nearly neutral mutations because the results can also be used to draw inferences about other selection schemes such as stabilizing selection and synergistic selection. It is shown that the number of slightly deleterious mutations accumulated in a sequence can be considerably larger under the multisite model than under the single-site model, particularly if the sequence is long or if the mutation rate per site is high. The results show that even a very slight selective difference between synonymous codons can produce a strong bias in codon usage. Three alternative explanations for the strong bias in codon usage in bacterial and yeast genes are considered. The implications of the present results for molecular evolution are discussed.  相似文献   

10.
A population genetic model with a single locus at which balancing selection acts and many linked loci at which neutral mutations can occur is analysed using the coalescent approach. The model incorporates geographic subdivision with migration, as well as mutation, recombination, and genetic drift of neutral variation. It is found that geographic subdivision can affect genetic variation even with high rates of migration, providing that selection is strong enough to maintain different allele frequencies at the selected locus. Published sequence data from the alcohol dehydrogenase locus of Drosophila melanogaster are found to fit the proposed model slightly better than a similar model without subdivision.  相似文献   

11.
A computer program was developed to automate the selection of DNA fragments for detecting mutations within a long DNA sequence by denaturing gel electrophoresis methods. The program, MELTSCAN, scans through a user specified DNA sequence calculating the melting behavior of overlapping DNA fragments covering the sequence. Melting characteristics of the fragments are analyzed to determine the best fragment for detecting mutations at each base pair position in the sequence. The calculation also determines the optimal fragment for detecting mutations within a user specified mutational hot spot region. The program is built around the statistical mechanical model of the DNA melting transition. The optimal fragment for a given position is selected using the criteria that its melting curve has at least two steps, the base pair position is in the fragment's lowest melting domain, and the melting domain has the smallest number of base pairs among fragments that meet the first two criteria. The program predicted fragments for detecting mutations in the cDNA and genomic DNA of the human p53 gene.  相似文献   

12.
We examine the behavior of sexual and asexual populations in modular multipeaked fitness landscapes and show that sexuals can systematically reach different, higher fitness adaptive peaks than asexuals. Whereas asexuals must move against selection to escape local optima, sexuals reach higher fitness peaks reliably because they create specific genetic variants that "skip over" fitness valleys, moving from peak to peak in the fitness landscape. This occurs because recombination can supply combinations of mutations in functional composites or "modules," that may include individually deleterious mutations. Thus when a beneficial module is substituted for another less-fit module by sexual recombination it provides a genetic variant that would require either several specific simultaneous mutations in an asexual population or a sequence of individual mutations some of which would be selected against. This effect requires modular genomes, such that subsets of strongly epistatic mutations are tightly physically linked. We argue that such a structure is provided simply by virtue of the fact that genomes contain many genes each containing many strongly epistatic nucleotides. We briefly discuss the connections with "building blocks" in the evolutionary computation literature. We conclude that there are conditions in which sexuals can systematically evolve high-fitness genotypes that are essentially unevolvable for asexuals.  相似文献   

13.
14.
15.
Antiviral CD8(+) T cells are a key component of the adaptive immune system against hepatitis C virus (HCV). For the development of immune therapies, it is essential to understand how CD8(+) T cells contribute to clearance of infection and why they fail so often. A mechanism for secondary failure is mutational escape of the virus. However, some substitutions in viral epitopes are associated with fitness costs and often require compensatory mutations. We hypothesized that compensatory mutations may point toward epitopes under particularly strong selection pressure that may be beneficial for vaccine design because of a higher genetic barrier to escape. We previously identified two HLA-B*15-restricted CD8(+) epitopes in NS5B (LLRHHNMVY(2450-2458) and SQRQKKVTF(2466-2474)), based on sequence analysis of a large HCV genotype 1b outbreak. Both epitopes are targeted in about 70% of HLA-B*15-positive individuals exposed to HCV. Reproducible selection of escape mutations was confirmed in an independent multicenter cohort in the present study. Interestingly, mutations were also selected in the epitope flanking region, suggesting that compensatory evolution may play a role. Covariation analysis of sequences from the database confirmed a significant association between escape mutations inside one of the epitopes (H2454R and M2456L) and substitutions in the epitope flanking region (S2439T and K2440Q). Functional analysis with the subgenomic replicon Con1 confirmed that the primary escape mutations impaired viral replication, while fitness was restored by the additional substitutions in the epitope flanking region. We concluded that selection of escape mutations inside an HLA-B*15 epitope requires secondary substitutions in the epitope flanking region that compensate for fitness costs.  相似文献   

16.
In natural systems, selection acts on both protein sequence and expression level, but it is unclear how selection integrates over these two dimensions. We recently developed the EMPIRIC approach to systematically determine the fitness effects of all possible point mutants for important regions of essential genes in yeast. Here, we systematically investigated the fitness effects of point mutations in a putative substrate binding loop of yeast Hsp90 (Hsp82) over a broad range of expression strengths. Negative epistasis between reduced expression strength and amino acid substitutions was common, and the endogenous expression strength frequently obscured mutant defects. By analyzing fitness effects at varied expression strengths, we were able to uncover all mutant effects on function. The majority of mutants caused partial functional defects, consistent with this region of Hsp90 contributing to a mutation sensitive and critical process. These results demonstrate that important functional regions of proteins can tolerate mutational defects without experimentally observable impacts on fitness.  相似文献   

17.
M. S. Ciampi  J. R. Roth 《Genetics》1988,118(2):193-202
A single site in the middle of the coding sequence of the hisG gene of Salmonella is required for most of the polar effect of mutations in this gene. Nonsense and insertion mutations mapping upstream of this point in the hisG gene all have strong polar effects on expression of downstream genes in the operon; mutations mapping promotor distal to this site have little or no polar effect. Two previously known hisG mutations, mapping in the region of the polarity site, abolish the polarity effect of insertion mutations mapping upstream of this region. New polarity site mutations have been selected which have lost the polar effect of upstream nonsense mutations. All mutations abolishing the function of the site are small deletions; three are identical, 28-bp deletions which have arisen independently. A fourth mutation is a deletion of 16 base pairs internal to the larger deletion. Several point mutations within this 16-bp region have no effect on the function of the polarity site. We believe that a small number of polarity sites of this type are responsible for polarity in all genes. The site in the hisG gene is more easily detected than most because it appears to be the only such site in the hisG gene and because it maps in the center of the coding sequence.  相似文献   

18.
Recombination, the swapping of large portions of genetic information between and among parental genotypes, can be applied to in vitro evolution experiments on functional nucleic acids. Both homologous and heterologous recombination can be achieved using standard laboratory techniques. In many cases, recombination can allow for the discovery of a ribozyme or DNAzyme phenotype that would not likely be encountered by reliance on point mutations alone. In addition, recombination can often aid in the discovery of global optima in sequence space and/or lessen the number of generations it would take to reach optima. Recombination is most efficiently used in combination with point mutations and applied after the first couple of rounds of selection but before high-fitness genotypes dominate the selection. The “recombination zone” describes that region of sequence space—defined by the residues that will ultimately participate in the function of the winning nucleic acid(s)—where recombination is expected to be the most beneficial in the search for high-fitness genotypes.[Reviewing Editor: Martin Kreitman]Author order determined by a single Bernoulli trial as implemented by RPS.  相似文献   

19.
Exploring the sequence space of a DNA aptamer using microarrays   总被引:2,自引:1,他引:1  
The relationship between sequence and binding properties of an aptamer for immunoglobulin E (IgE) was investigated using custom DNA microarrays. Single, double and some triple mutations of the aptamer sequence were created to evaluate the importance of specific base composition on aptamer binding. The majority of the positions in the aptamer sequence were found to be immutable, with changes at these positions resulting in more than a 100-fold decrease in binding affinity. Improvements in binding were observed by altering the stem region of the aptamer, suggesting that it plays a significant role in binding. Results obtained for the various mutations were used to estimate the information content and the probability of finding a functional aptamer sequence by selection from a random library. For the IgE-binding aptamer, this probability is on the order of 1010 to 109. Results obtained for the double and triple mutations also show that there are no compensatory mutations within the space defined by those mutations. Apparently, at least for this particular aptamer, the functional sequence space can be represented as a rugged landscape with sharp peaks defined by highly constrained base compositions. This makes the rational optimization of aptamer sequences using step-wise mutagenesis approaches very challenging.  相似文献   

20.
Recombineering allows DNA cloned in Escherichia coli to be modified via lambda (lambda) Red-mediated homologous recombination, obviating the need for restriction enzymes and DNA ligases to modify DNA. Here, we describe the construction of three new recombineering strains (SW102, SW105 and SW106) that allow bacterial artificial chromosomes (BACs) to be modified using galK positive/negative selection. This two-step selection procedure allows DNA to be modified without introducing an unwanted selectable marker at the modification site. All three strains contain an otherwise complete galactose operon, except for a precise deletion of the galK gene, and a defective temperature-sensitive lambda prophage that makes recombineering possible. SW105 and SW106 cells in addition carry l-arabinose-inducible Cre or Flp genes, respectively. The galK function can be selected both for and against. This feature greatly reduces the background seen in other negative-selection schemes, and galK selection is considerably more efficient than other related selection methods published. We also show how galK selection can be used to rapidly introduce point mutations, deletions and loxP sites into BAC DNA and thus facilitate functional studies of SNP and/or disease-causing point mutations, the identification of long-range regulatory elements and the construction of conditional targeting vectors.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号