首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The evolution of homologous sequences affected by recombination or gene conversion cannot be adequately explained by a single phylogenetic tree. Many tree-based methods for sequence analysis, for example, those used for detecting sites evolving nonneutrally, have been shown to fail if such phylogenetic incongruity is ignored. However, it may be possible to propose several phylogenies that can correctly model the evolution of nonrecombinant fragments. We propose a model-based framework that uses a genetic algorithm to search a multiple-sequence alignment for putative recombination break points, quantifies the level of support for their locations, and identifies sequences or clades involved in putative recombination events. The software implementation can be run quickly and efficiently in a distributed computing environment, and various components of the methods can be chosen for computational expediency or statistical rigor. We evaluate the performance of the new method on simulated alignments and on an array of published benchmark data sets. Finally, we demonstrate that prescreening alignments with our method allows one to analyze recombinant sequences for positive selection.  相似文献   

2.
Wilson DJ  McVean G 《Genetics》2006,172(3):1411-1425
Models of molecular evolution that incorporate the ratio of nonsynonymous to synonymous polymorphism (dN/dS ratio) as a parameter can be used to identify sites that are under diversifying selection or functional constraint in a sample of gene sequences. However, when there has been recombination in the evolutionary history of the sequences, reconstructing a single phylogenetic tree is not appropriate, and inference based on a single tree can give misleading results. In the presence of high levels of recombination, the identification of sites experiencing diversifying selection can suffer from a false-positive rate as high as 90%. We present a model that uses a population genetics approximation to the coalescent with recombination and use reversible-jump MCMC to perform Bayesian inference on both the dN/dS ratio and the recombination rate, allowing each to vary along the sequence. We demonstrate that the method has the power to detect variation in the dN/dS ratio and the recombination rate and does not suffer from a high false-positive rate. We use the method to analyze the porB gene of Neisseria meningitidis and verify the inferences using prior sensitivity analysis and model criticism techniques.  相似文献   

3.
Throughout the living world, genetic recombination and nucleotide substitution are the primary processes that create the genetic variation upon which natural selection acts. Just as analyses of substitution patterns can reveal a great deal about evolution, so too can analyses of recombination. Evidence of genetic recombination within the genomes of apparently asexual species can equate with evidence of cryptic sexuality. In sexually reproducing species, nonrandom patterns of sequence exchange can provide direct evidence of population subdivisions that prevent certain individuals from mating. Although an interesting topic in its own right, an important reason for analysing recombination is to account for its potentially disruptive influences on various phylogenetic-based molecular evolution analyses. Specifically, the evolutionary histories of recombinant sequences cannot be accurately described by standard bifurcating phylogenetic trees. Taking recombination into account can therefore be pivotal to the success of selection, molecular clock and various other analyses that require adequate modelling of shared ancestry and draw increased power from accurately inferred phylogenetic trees. Here, we review various computational approaches to studying recombination and provide guidelines both on how to gain insights into this important evolutionary process and on how it can be properly accounted for during molecular evolution studies.  相似文献   

4.
MOTIVATION: Recombination plays an important role in the evolution of many pathogens, such as HIV or malaria. Despite substantial prior work, there is still a pressing need for efficient and effective methods of detecting recombination and analyzing recombinant sequences. RESULTS: We introduce Recco, a novel fast method that, given a multiple sequence alignment, scores the cost of obtaining one of the sequences from the others by mutation and recombination. The algorithm comes with an illustrative visualization tool for locating recombination breakpoints. We analyze the sequence alignment with respect to all choices of the parameter alpha weighting recombination cost against mutation cost. The analysis of the resulting cost curve yields additional information as to which sequence might be recombinant. On random genealogies Recco is comparable in its power of detecting recombination with the algorithm Geneconv (Sawyer, 1989). For specific relevant recombination scenarios Recco significantly outperforms Geneconv.  相似文献   

5.
Hao W 《Gene》2011,481(2):57-64
The evolution of influenza viruses is remarkably dynamic. Influenza viruses evolve rapidly in sequence and undergo frequent reassortment of different gene segments. Homologous recombination, although commonly seen as an important component of dynamic genome evolution in many other organisms, is believed to be rare in influenza. In this study, 256 gene segments from 32 influenza A genomes were examined for homologous recombination, three recombinant H1N1 strains were detected and they most likely resulted from one recombination event between two closely rated parental sequences. These findings suggest that homologous recombination in influenza viruses tends to take place between strains sharing high sequence similarity. The three recombinant strains were isolated at different time periods and they form a clade, indicating that recombinant strains could circulate. In addition, the simulation results showed that many recombinant sequences might not be detectable by currently existing recombinant detection programs when the parental sequences are of high sequence similarity. Finally, possible ways were discussed to improve the accuracy of the detection for recombinant sequences in influenza.  相似文献   

6.
Anisimova M  Nielsen R  Yang Z 《Genetics》2003,164(3):1229-1236
Maximum-likelihood methods based on models of codon substitution accounting for heterogeneous selective pressures across sites have proved to be powerful in detecting positive selection in protein-coding DNA sequences. Those methods are phylogeny based and do not account for the effects of recombination. When recombination occurs, such as in population data, no unique tree topology can describe the evolutionary history of the whole sequence. This violation of assumptions raises serious concerns about the likelihood method for detecting positive selection. Here we use computer simulation to evaluate the reliability of the likelihood-ratio test (LRT) for positive selection in the presence of recombination. We examine three tests based on different models of variable selective pressures among sites. Sequences are simulated using a coalescent model with recombination and analyzed using codon-based likelihood models ignoring recombination. We find that the LRT is robust to low levels of recombination (with fewer than three recombination events in the history of a sample of 10 sequences). However, at higher levels of recombination, the type I error rate can be as high as 90%, especially when the null model in the LRT is unrealistic, and the test often mistakes recombination as evidence for positive selection. The test that compares the more realistic models M7 (beta) against M8 (beta and omega) is more robust to recombination, where the null model M7 allows the positive selection pressure to vary between 0 and 1 (and so does not account for positive selection), and the alternative model M8 allows an additional discrete class with omega = d(N)/d(S) that could be estimated to be >1 (and thus accounts for positive selection). Identification of sites under positive selection by the empirical Bayes method appears to be less affected than the LRT by recombination.  相似文献   

7.
J S Lopes  M Arenas  D Posada  M A Beaumont 《Heredity》2014,112(3):255-264
The estimation of parameters in molecular evolution may be biased when some processes are not considered. For example, the estimation of selection at the molecular level using codon-substitution models can have an upward bias when recombination is ignored. Here we address the joint estimation of recombination, molecular adaptation and substitution rates from coding sequences using approximate Bayesian computation (ABC). We describe the implementation of a regression-based strategy for choosing subsets of summary statistics for coding data, and show that this approach can accurately infer recombination allowing for intracodon recombination breakpoints, molecular adaptation and codon substitution rates. We demonstrate that our ABC approach can outperform other analytical methods under a variety of evolutionary scenarios. We also show that although the choice of the codon-substitution model is important, our inferences are robust to a moderate degree of model misspecification. In addition, we demonstrate that our approach can accurately choose the evolutionary model that best fits the data, providing an alternative for when the use of full-likelihood methods is impracticable. Finally, we applied our ABC method to co-estimate recombination, substitution and molecular adaptation rates from 24 published human immunodeficiency virus 1 coding data sets.  相似文献   

8.
Allelic variation in the major histocompatibility class (MHC) IIB gene of salmonids is analyzed for patterns indicative of natural selection acting at the molecular level. Sequence data for the second exon of this MHC gene were generated for 11 species in three salmonid genera: Oncorhynchus, Salmo, and Salvelinus. Phylogenetic analysis of nucleotide sequences revealed: (1) monophyletic grouping of alleles from each genus, (2) transspecies evolution of alleles within Salmo and Salvelinus, and (3) differential patterns of transspecies evolution within the genus Oncorhynchus. Within Oncorhynchus, five of seven species had alleles that were species-specific or nearly so, while the remaining two, O. mykiss and O. clarkii, retained ancestral polymorphisms. The different patterns in Oncorhynchus and the other two genera could be due to historical demographic effects or functional differences in MHC molecules in the three genera, but the two hypotheses could not be distinguished with the current dataset. An analysis of recombination/gene conversion identified numerous recombinant alleles, which is consistent with what has been found in other vertebrate taxa. However, these gene conversion events could not account for the species-specific allelic lineages observed in five of the Oncorhynchus species. Analyses of the relative rates of nonsynonymous and synonymous substitutions revealed the signature of selection on the class IIB gene in all 11 of the salmonid species for both the ABS and the non-ABS codons. Codon-based analyses of selection identified seven codons that have experienced selection in the majority of the species. More than half of these sites were mammalian ABS codons, but several were not, suggesting subtle functional differences in the mammalian and teleost fish MHC molecules.  相似文献   

9.
Hasselmann M  Beye M 《Genetics》2006,174(3):1469-1480
Recombination decreases the association of linked nucleotide sites and can influence levels of polymorphism in natural populations. When coupled with selection, recombination may relax potential conflict among linked genes, a concept that has played a central role in research on the evolution of recombination. The sex determination locus (SDL) of the honeybee is an informative example for exploring the combined forces of recombination, selection, and linkage on sequence evolution. Balancing selection at SDL is very strong and homozygous individuals at SDL are eliminated by worker bees. The recombination rate is increased up to four times that of the genomewide average in the region surrounding SDL. Analysis of nucleotide diversity (pi) reveals a sevenfold increase of polymorphism within the sex determination gene complementary sex determiner (csd) that rapidly declines within 45 kb to levels of genomewide estimates. Although no recombination was observed within SDL, which contains csd, analyses of heterogeneity, shared polymorphic sites, and linkage disequilibrium (LD) show that recombination has contributed to the evolution of the 5' part of some csd sequences. Gene conversion, however, has not obviously contributed to the evolution of csd sequences. The local control of recombination appears to be related to SDL function and mode of selection. The homogenizing force of recombination is reduced within SDL, which preserves allelic differences and specificity, while the increase of recombination activity around SDL relaxes conflict between SDL and linked genes.  相似文献   

10.
Isolating the core functional elements of an RNA is normally performed during the characterization of a new RNA in order to simplify further biochemical analysis. The removal of extraneous sequence is challenging and can lead to biases that result from the incomplete sampling of deletion variants. An impartial solution to this problem is to construct a library containing a large number of deletion constructs and to select functional RNA isolates that are at least as efficient as their full-length progenitors. Here, we use nonhomologous recombination and selection to isolate the catalytic core of a pyrimidine nucleotide synthase ribozyme. A variable-length pool of approximately 10(8) recombinant molecules that included deletions, inversions, and translocations of a 271-nucleotide-long ribozyme isolate was constructed by digesting and randomly religating its DNA genome. In vitro selection for functional ribozymes was then performed in a size-dependent and a size-independent manner. The final pools had nearly equivalent catalytic rates even though their length distributions were completely different, indicating that a diverse range of deletion constructs were functionally active. Four short sequence islands, requiring as little as 81 nt of sequence, were found within all of the truncated ribozymes and could be folded into a secondary structure consisting of three helix-loops. Our findings suggest that nonhomologous recombination is a highly efficient way to isolate a ribozyme's core motif and could prove to be a useful method for evolving new ribozyme functions from pre-existing sequences in a manner that may have played an important role early in evolution.  相似文献   

11.
A Graded Autocatalysis Replication Domain (GARD) model is proposed, which provides a rigorous kinetic analysis of simple chemical sets that manifest mutual catalysis. It is shown that catalytic closure can sustain self-replication up to a critical dilution rate, c, related to the graded extent of mutual catalysis. We explore the behavior of vesicles containing GARD species whose mutual catalysis is governed by a previously published statistical distribution. In the population thus generated, some GARD vesicles display a significantly higher replication efficiency than most others. GARD thus represents a simple model for primordial chemical selection of mutually catalytic sets.  相似文献   

12.
The incidence of syphilis has risen worldwide in the last decade in spite of being an easily treated infection. The causative agent of this sexually transmitted disease is the bacterium Treponema pallidum subspecies pallidum (TPA), very closely related to subsp. pertenue (TPE) and endemicum (TEN), responsible for the human treponematoses yaws and bejel, respectively. Although much focus has been placed on the question of the spatial and temporary origins of TPA, the processes driving the evolution and epidemiological spread of TPA since its divergence from TPE and TEN are not well understood. Here, we investigate the effects of recombination and selection as forces of genetic diversity and differentiation acting during the evolution of T. pallidum subspecies. Using a custom-tailored procedure, named phylogenetic incongruence method, with 75 complete genome sequences, we found strong evidence for recombination among the T. pallidum subspecies, involving 12 genes and 21 events. In most cases, only one recombination event per gene was detected and all but one event corresponded to intersubspecies transfers, from TPE/TEN to TPA. We found a clear signal of natural selection acting on the recombinant genes, which is more intense in their recombinant regions. The phylogenetic location of the recombination events detected and the functional role of the genes with signals of positive selection suggest that these evolutionary processes had a key role in the evolution and recent expansion of the syphilis bacteria and significant implications for the selection of vaccine candidates and the design of a broadly protective syphilis vaccine.  相似文献   

13.
MOTIVATION: Accurate detection of positive Darwinian selection can provide important insights to researchers investigating the evolution of pathogens. However, many pathogens (particularly viruses) undergo frequent recombination and the phylogenetic methods commonly applied to detect positive selection have been shown to give misleading results when applied to recombining sequences. We propose a method that makes maximum likelihood inference of positive selection robust to the presence of recombination. This is achieved by allowing tree topologies and branch lengths to change across detected recombination breakpoints. Further improvements are obtained by allowing synonymous substitution rates to vary across sites. RESULTS: Using simulation we show that, even for extreme cases where recombination causes standard methods to reach false positive rates >90%, the proposed method decreases the false positive rate to acceptable levels while retaining high power. We applied the method to two HIV-1 datasets for which we have previously found that inference of positive selection is invalid owing to high rates of recombination. In one of these (env gene) we still detected positive selection using the proposed method, while in the other (gag gene) we found no significant evidence of positive selection. AVAILABILITY: A HyPhy batch language implementation of the proposed methods and the HIV-1 datasets analysed are available at http://www.cbio.uct.ac.za/pub_support/bioinf06. The HyPhy package is available at http://www.hyphy.org, and it is planned that the proposed methods will be included in the next distribution. RDP2 is available at http://darwin.uvigo.es/rdp/rdp.html  相似文献   

14.
Since recombination leads to the generation of mosaic genomes that violate the assumption of traditional phylogenetic methods that sequence evolution can be accurately described by a single tree, results and conclusions based on phylogenetic analysis of data sets including recombinant sequences can be severely misleading. Many methods are able to adequately detect recombination between diverse sequences, for example between different HIV-1 subtypes. More problematic is the identification of recombinants among closely related sequences such as a viral population within a host. We describe a simple algorithmic procedure that enables detection of intra-host recombinants based on split-decomposition networks and a robust statistical test for recombination. By applying this algorithm to several published HIV-1 data sets we conclude that intra-host recombination was significantly underestimated in previous studies and that up to one-third of the env sequences longitudinally sampled from a given subject can be of recombinant origin. The results show that our procedure can be a valuable exploratory tool for detection of recombinant sequences before phylogenetic analysis, and also suggest that HIV-1 recombination in vivo is far more frequent and significant than previously thought.  相似文献   

15.
The nature and rate of recombination can be studied by comparing the sequences of multiple genes across a set of strains. When this approach is applied to Borrelia burgdorferi, four results emerge: (1) chromosomal genes are clonal; (2) there is little or no plasmid exchange; (3) the major mode of horizontal transfer of genetic material inserts a small fragment of DNA, typically <1 kb, during recombination; and (4) the level of horizontal transfer in Borrelia is so low that there is evidence for horizontal transfer only in genes where there is positive selection for diversity, that is, positive selection for the recombinant. Thus, Borrelia can serve as a model of a low recombination taxon. The implications of these results lead us to postulate that an unknown agent that is part of the Borrelia genome mediates the horizontal transfer of small fragments of DNA; the rare transfer of small fragments of DNA excludes both DNA parasites and virulence factors from the genome.  相似文献   

16.
Many viruses, including retroviruses, undergo frequent recombination, a process which can increase their rate of adaptive evolution. In the case of HIV, recombination has been responsible for the generation of numerous intersubtype recombinant variants with epidemiological importance in the AIDS pandemic. Although it is known that fragments of genetic material do not combine randomly during the generation of recombinant viruses, the mechanisms that lead to preferential recombination at specific sites are not fully understood. Here we reanalyze recent independent data defining (i) the structure of a complete HIV-1 RNA genome and (ii) favorable sites for recombination. We show that in the absence of selection acting on recombinant genomes, regions harboring RNA structures in the NL4-3 model strain are strongly predictive of recombination breakpoints in the HIV-1 env genes of primary isolates. In addition, we found that breakpoints within recombinant HIV-1 genomes sampled from human populations, which have been acted upon extensively by natural selection, also colocalize with RNA structures. Critically, junctions between genes are enriched in structured RNA elements and are also preferred sites for generating functional recombinant forms. These data suggest that RNA structure-mediated recombination allows the virus to exchange intact genes rather than arbitrary subgene fragments, which is likely to increase the overall viability and replication success of the recombinant HIV progeny.  相似文献   

17.
Homologous recombination in GB virus C/hepatitis G virus   总被引:3,自引:0,他引:3  
Analysis of 33 GB virus C/hepatitis G virus (GBV-C/HGV) full or nearly full genome sequences revealed several putative inter- and intrasubtype recombinants. The breakpoints of the recombinant regions were mapped using a maximum-likelihood method, and the statistical significance for each region was tested using Monte Carlo simulation. The results were highly significant and provided evidence for the existence of complex mosaic genomes showing as many as nine recombination events, with breakpoints in the 5' UTR and in all of the coding regions except the short NS4b gene. Recombination was confirmed by separate phylogenetic analysis of the various recombinant regions and by Sawyer's runs test. Taken together, these findings demonstrate for the first time that recombination is common in natural populations of GBV-C and that it takes place both within and between subtypes. The wide-ranging implications of such nonclonal history for reconstructing the spread and timescale of GBV-C evolution are discussed.  相似文献   

18.
A simple nearly neutral mutation model of protein evolution was studied using computer simulation assuming a constant population size. In this model, a gene consists of a finite number of codons and there is no recombination within a gene. Each codon has two replacement and one silent sites. The fitness of a gene was determined multiplicatively by amino acids specified by codons (the independent multicodon model). Nucleotide diversity at replacement sites decreases as selection becomes stronger. A reduction of nucleotide diversity at silent sites also occurs as selection intensifies but the magnitude of the reduction is not a monotone function of the intensity of selection. The dispersion index is close to one. The average value of Tajima's and Fu and Li's statistics are negative and their absolute values increases as selection intensifies. However, their powers of detecting selection under the present model were not high unless the number of sites is large or mutation rate is high. The MK test was shown to detect intermediate selection fairly well. For comparison, the house-of-cards model was also investigated and its behavior was shown to be more sensitive to changes of population size than that of the independent multicodon model. The relevance of the present model for explaining protein evolution was discussed comparing its prediction and recent DNA data. Received: 24 May 1999 / Accepted: 17 August 1999  相似文献   

19.
The major histocompatibility complex (MHC) plays a central role in the adaptive immune system and provides a good model with which to understand the evolutionary processes underlying functional genes. Trans-species polymorphism and orthology are both commonly found in MHC genes; however, mammalian MHC class I genes tend to cluster by species. Concerted evolution has the potential to homogenize different loci, whereas birth-and-death evolution can lead to the loss of orthologs; both processes result in monophyletic groups within species. Studies investigating the evolution of MHC class I genes have been biased toward a few particular taxa and model species. We present the first study of MHC class I genes in a species from the superfamily Musteloidea. The European badger (Meles meles) exhibits moderate variation in MHC class I sequences when compared to other carnivores. We identified seven putatively functional sequences and nine pseudogenes from genomic (gDNA) and complementary (cDNA) DNA, signifying at least two functional class I loci. We found evidence for separate evolutionary histories of the α1 and α2/α3 domains. In the α1 domain, several sequences from different species were more closely related to each other than to sequences from the same species, resembling orthology or trans-species polymorphism. Balancing selection and probable recombination maintain genetic diversity in the α1 domain, evidenced by the detection of positive selection and a recombination event. By comparison, two recombination breakpoints indicate that the α2/α3 domains have most likely undergone concerted evolution, where recombination has homogenized the α2/α3 domains between genes, leading to species-specific clusters of sequences. Our findings highlight the importance of analyzing MHC domains separately.  相似文献   

20.
MOTIVATION: Recombination can be a prevailing drive in shaping genome evolution. RAT (Recombination Analysis Tool) is a Java-based tool for investigating recombination events in any number of aligned sequences (protein or DNA) of any length (short viral sequences to full genomes). It is an uncomplicated and intuitive application and allows the user to view only the regions of sequence alignments they are interested in. RESULTS: RAT was applied to viral sequences. Its utility was demonstrated through the detection of a known recombinant of HIV and a detailed analysis of Noroviruses, the most common cause of viral gastroenteritis in humans. AVAILABILITY: RAT, along with a user's guide, is freely available from http://jic-bioinfo.bbsrc.ac.uk/bioinformatics-research/staff/graham_etherington/RAT.htm.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号