首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
MOTIVATION: We present a statistical method for detecting recombination, whose objective is to accurately locate the recombinant breakpoints in DNA sequence alignments of small numbers of taxa (4 or 5). Our approach explicitly models the sequence of phylogenetic tree topologies along a multiple sequence alignment. Inference under this model is done in a Bayesian way, using Markov chain Monte Carlo (MCMC). The algorithm returns the site-dependent posterior probability of each tree topology, which is used for detecting recombinant regions and locating their breakpoints. RESULTS: The method was tested on a synthetic and three real DNA sequence alignments, where it was found to outperform the established detection methods PLATO, RECPARS, and TOPAL.  相似文献   

2.
Stepwise detection of recombination breakpoints in sequence alignments   总被引:1,自引:0,他引:1  
MOTIVATION: We propose a stepwise approach to identify recombination breakpoints in a sequence alignment. The approach can be applied to any recombination detection method that uses a permutation test and provides estimates of breakpoints. RESULTS: We illustrate the approach by analyses of a simulated dataset and alignments of real data from HIV-1 and human chromosome 7. The presented simulation results compare the statistical properties of one-step and two-step procedures. More breakpoints are found with a two-step procedure than with a single application of a given method, particularly for higher recombination rates. At higher recombination rates, the additional breakpoints were located at the cost of only a slight increase in the number of falsely declared breakpoints. However, a large proportion of breakpoints still go undetected. AVAILABILITY: A makefile and C source code for phylogenetic profiling and the maximum chi2 method, tested with the gcc compiler on Linux and WindowsXP, are available at http://stat-db.stat.sfu.ca/stepwise/ CONTACT: jgraham@stat.sfu.ca.  相似文献   

3.
The mammalian tick-borne flavivirus group (MTBFG) contains viruses associated with important human and animal diseases such as encephalitis and hemorrhagic fever. In contrast to mosquito-borne flaviviruses where recombination events are frequent, the evolutionary dynamic within the MTBFG was believed to be essentially clonal. This assumption was challenged with the recent report of several homologous recombinations within the Tick-borne encephalitis virus (TBEV). We performed a thorough analysis of publicly available genomes in this group and found no compelling evidence for the previously identified recombinations. However, our results show for the first time that demonstrable recombination (i.e., with large statistical support and strong phylogenetic evidences) has occurred in the MTBFG, more specifically within the Louping ill virus lineage. Putative parents, recombinant strains and breakpoints were further tested for statistical significance using phylogenetic methods. We investigated the time of divergence between the recombinant and parental strains in a Bayesian framework. The recombination was estimated to have occurred during a window of 282 to 76 years before the present. By unravelling the temporal setting of the event, we adduce hypotheses about the ecological conditions that could account for the observed recombination.  相似文献   

4.
An analysis of the genome structure of soybean cultivars was conducted to determine if cultivars are composed of large regions of chromosomes inherited intact from one parent (indicative of minimal recombination) or if the chromosomes are a mixture of one parent's DNA interspersed with the DNA from the other parent (indicative of maximal recombination). Twenty-one single-cross-derived and 5 single-backcross-derived soybean cultivars and their immediate parents (47 genotypes) were analyzed at 89 RFLP loci to determine the minimal number and distribution of recombination events detected. Cultivars derived from single-cross and single-backcross breeding programs showed an average of 5.2 and 8.0 recombination events per cultivar, respectively. A homogeneity Chi-square test based upon a Poisson distribution of recombination events across 13 linkage groups indicated that the number of recombinations observed among linkage groups was random for the single-cross cultivars, but not for the single-backcross-derived cultivars. A twotailed t-test demonstrated that for some linkage groups, the number of recombinations per map unit exceeded the confidence interval developed from a t-distribution of recombinations standardized for map unit distance. Paired t-tests of the number of recombinations observed between linkage-group ends and the mid-portion of the linkage groups indicated that during the development of the cultivars analyzed in this study more recombinations were associated with the ends of linkage groups than with the middle region. Detailed analysis of each linkage group revealed that large portions of linkage groups D, F, and G were inherited intact from one parent in several cultivars. A portion of linkage group G, in contrast, showed more recombination events than expected, based on genetic distance. These analyses suggest that breeders may have selected against recombination events where agronomically favorable combinations of alleles are present in one parent, and for recombination in areas where agronomically favorable combinations of alleles are not present in either parent.Names are necessary to report factually on the available data; however, the USDA neither guarantees nor warrants the standard of the product, and the use of the name by the USDA implies no approval of the product to the exclusion of others that may also be available. Contribution of the Midwest Area, USDA-ARS, Project No. 3236 of the Iowa Agriculture and Home Economics Experiment Station, Ames, IA 50011. Journal Paper No. J-16533  相似文献   

5.
In Lao PDR, where more than 8% of the population are chronic carriers of HBsAg, multiple genotypes and subgenotypes co-circulate and are prone to generate recombinant viruses. Phylogenetic analyses of multiple clones per donor revealed mixed infections of subgenotypes B1, B2, B4, C1, C5, I1 and I2 in almost 6% of HBsAg positive rejected blood donors. Recombination analyses and distance calculations furthermore showed that about 65% (17/26) of the mixed infected donors showed recombinations in the S-gene alone, involving the predominant genotypes B and C. These results suggest that, at least in Laos, hepatitis B virus (HBV) mixed infections lead to frequent recombinations. In many donors with recombinant strains, the recombinant fragment and a non-recombinant strain of the same genotype co-existed (127/185 analysed recombinant fragments). For a large proportion of these (60/127), the most closely related known virus was found, although not always exclusively, in the same donor. Recombinant virus strains are largely distinct. This is reflected in an unexpected diversity in recombination breakpoints and the relatively rare recombinations with identical recombination patterns of the same genotypes in different donors. Recent recombination events would explain the limited spread of each of the recombinants. Using a published mutation rate of 4.2 × 10(-5) mutations per site and year, the observed minimum genetic distances of 0-0.60% between parent strain and recombinant fragment would correspond to 0-71 years of evolution from a most recent common ancestor (MRCA). Thus several lines of evidence are suggestive of recent independent recombination events, a proportion of these even occurring within the same donors. In conclusion, our analyses revealed a high variability of mixed infections as a very probable breeding ground of multiple variable recombination events in Laos that so far have not led to new dominant strains.  相似文献   

6.
Recombination is an important evolutionary mechanism responsible for creating the patterns of haplotype variation observable in human populations. Recently, there has been extensive research on understanding the fine-scale variation in recombination across the human genome using DNA polymorphism data. Historical recombination events leave signature patterns in haplotype data. A nonparametric approach for estimating the number of historical recombination events is to compute the minimum number of recombination events in the history of a set of haplotypes. In this paper, we provide new and improved methods for computing lower bounds on the minimum number of recombination events. These methods are shown to detect a higher number of recombination events for a haplotype dataset from a region in the lipoprotein lipase gene than previous lower bounds. We apply our methods to two datasets for which recombination hotspots have been experimentally determined and demonstrate a high density of detectable recombination events in the regions annotated as recombination hotspots. The programs implementing the methods in this paper are available at www.cs.ucsd.edu/users/vibansal/RecBounds/.  相似文献   

7.
Lefebvre JF  Labuda D 《Genetics》2008,178(4):2069-2079
In this article we present a new heuristic approach (informative recombinations, InfRec) to analyze recombination density at the sequence level. InfRec is intuitive and easy and combines previously developed methods that (i) resolve genotypes into haplotypes, (ii) estimate the minimum number of recombinations, and (iii) evaluate the fraction of informative recombinations. We tested this approach in its sliding-window version on 117 genes from the SeattleSNPs program, resequenced in 24 African-Americans (AAs) and 23 European-Americans (EAs). We obtained population recombination rate estimates (rho(obs)) of 0.85 and 0.37 kb(-1) in AAs and EAs, respectively. Coalescence simulations indicated that these values account for both the recombinations and the gene conversions in the history of the sample. The intensity of rho(obs) varied considerably along the sequence, revealing the presence of recombination hotspots. Overall, we observed approximately 80% of recombinations in one-third and approximately 50% in only 10% of the sequence. InfRec performance, tested on published simulated and additional experimental data sets, was similar to that of other hotspot detection methods. Fast, intuitive, and visual, InfRec is not constrained by sample size limitations. It facilitates understanding data and provides a simple and flexible tool to analyze recombination intensity along the sequence.  相似文献   

8.
Most phylogenetic tree estimation methods assume that there is a single set of hierarchical relationships among sequences in a data set for all sites along an alignment. Mosaic sequences produced by past recombination events will violate this assumption and may lead to misleading results from a phylogenetic analysis due to the imposition of a single tree along the entire alignment. Therefore, the detection of past recombination is an important first step in an analysis. A Bayesian model for the changes in topology caused by recombination events is described here. This model relaxes the assumption of one topology for all sites in an alignment and uses the theory of Hidden Markov models to facilitate calculations, the hidden states being the underlying topologies at each site in the data set. Changes in topology along the multiple sequence alignment are estimated by means of the maximum a posteriori (MAP) estimate. The performance of the MAP estimate is assessed by application of the model to data sets of four sequences, both simulated and real.  相似文献   

9.
Advances in high-throughput DNA sequencing technologies have determined an explosion in the number of sequenced bacterial genomes. Comparative sequence analysis frequently reveals evidences of homologous recombination occurring with different mechanisms and rates in different species, but the large-scale use of computational methods to identify recombination events is hampered by their high computational costs. Here, we propose a new method to identify recombination events in large datasets of whole genome sequences. Using a filtering procedure of the gene conservation profiles of a test genome against a panel of strains, this algorithm identifies sets of contiguous genes acquired by homologous recombination. The locations of the recombination breakpoints are determined using a statistical test that is able to account for the differences in the natural rate of evolution between different genes. The algorithm was tested on a dataset of 75 genomes of Staphylococcus aureus and 50 genomes comprising different streptococcal species, and was able to detect intra-species recombination events in S. aureus and in Streptococcus pneumoniae. Furthermore, we found evidences of an inter-species exchange of genetic material between S. pneumoniae and Streptococcus mitis, a closely related commensal species that colonizes the same ecological niche. The method has been implemented in an R package, Reco, which is freely available from supplementary material, and provides a rapid screening tool to investigate recombination on a genome-wide scale from sequence data.  相似文献   

10.
11.
Genetic recombination is considered to be a very frequent phenomenon among enteroviruses (Family Picornaviridae, Genus Enterovirus). However, the recombination patterns may differ between enterovirus species and between types within species. Enterovirus C (EV-C) species contains 21 types. In the capsid coding P1 region, the types of EV-C species cluster further into three sub-groups (designated here as A–C). In this study, the recombination pattern of EV-C species sub-group B that contains types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99 was determined using partial 5′UTR and VP1 sequences of enterovirus strains isolated during poliovirus surveillance and previously published complete genome sequences. Several inter-typic recombination events were detected. Furthermore, the analyses suggested that inter-typic recombination events have occurred mainly within the distinct sub-groups of EV-C species. Only sporadic recombination events between EV-C species sub-group B and other EV-C sub-groups were detected. In addition, strict recombination barriers were inferred for CVA-21 genotype C and CVA-24 variant strains. These results suggest that the frequency of inter-typic recombinations, even within species, may depend on the phylogenetic position of the given viruses.  相似文献   

12.
Phylogenetic studies based on DNA sequences typically ignore the potential occurrence of recombination, which may produce different alignment regions with different evolutionary histories. Traditional phylogenetic methods assume that a single history underlies the data. If recombination is present, can we expect the inferred phylogeny to represent any of the underlying evolutionary histories? We examined this question by applying traditional phylogenetic reconstruction methods to simulated recombinant sequence alignments. The effect of recombination on phylogeny estimation depended on the relatedness of the sequences involved in the recombinational event and on the extent of the different regions with different phylogenetic histories. Given the topologies examined here, when the recombinational event was ancient, or when recombination occurred between closely related taxa, one of the two phylogenies underlying the data was generally inferred. In this scenario, the evolutionary history corresponding to the majority of the positions in the alignment was generally recovered. Very different results were obtained when recombination occurred recently among divergent taxa. In this case, when the recombinational breakpoint divided the alignment in two regions of similar length, a phylogeny that was different from any of the true phylogenies underlying the data was inferred.  相似文献   

13.
Volvocales forms a species-rich clade with wide morphological variety and is regarded as an ideal model for tracing the evolutionary transitions in multicellularity. The phylogenetic relationships among the colonial volvocine algae and its relatives are important for investigating the origin of multicellularity in the clade Reinhardtinia. Therefore, a robust phylogenetic framework of the unicellular and colonial volvocine algae with broad taxon and gene sampling is essential for illuminating the evolution of multicellularity. Recent chloroplast phylogenomic studies have uncovered five major orders in the Chlorophyceae, but the family-level relationships within Sphaeropleales and Volvocales remain elusive due to the uncertain positions of some incertae sedis taxa. In this study, we contributed six newly sequenced chloroplast genomes in the Volvocales and analyzed a dataset with 91 chlorophycean taxa and 58 protein-coding genes. Conflicting phylogenetic signals were detected among chloroplast genes that resulted in discordant tree topologies among different analyses. We compared the phylogenetic trees inferred from original nucleotide, RY-coding, codon-degenerate, and amino acid datasets, and improved the robustness of phylogenetic inference in the Chlorophyceae by reducing base compositional bias. Our analyses indicate that the unicellular Chlamydomonas and Vitreochlamys are close to or nested within the colonial taxa, and all the incertae sedis taxa are nested within the monophyletic Sphaeropleales s.l. We propose that the colonial taxa in the Reinhardtinia are paraphyletic and multicellularity evolved once in the volvocine green algae and might be lost in Chlamydomonas and Vitreochlamys.  相似文献   

14.
对河南省2008~2010年河南省人肠道病毒71型进行基因特征及重组特点研究。对河南省2008~2010年分离的5株肠道病毒EV71型构建VP1序列系统进化树并分析其全基因组序列的重组特点。VP1序列系统进化分析显示2008~2010年河南株均属于C4基因型的C4a亚群,Bootscan分析和5’NCR、P1、P2、P3区的进化树证实C4基因型在2A~2B处存在EV71的B基因型和C基因型的型内重组及在3B~3C处存在EV71的B基因型和CA16/G-10间的型间重组。2008~2010年河南EV71分离株为C4基因型的C4a亚群,与2004年以来的中国大陆优势株流行趋势完全一致,EV71C4基因型存在基因型内和型间双重组现象。  相似文献   

15.
Sequencing of five late genes from 18 isolates of P2-like bacteriophages showed that these are at least 96% identical to the genes of phage P2. A maximum-parsimony phylogenetic analysis of these genes showed excess homoplasy of a magnitude three to six times higher than that expected. Examination of the distribution of the number of homoplasies at parsimoniously informative sites and incompatibility matrices of such sites revealed a pattern typical for extensive recombination. It has been shown that phage P2 probably incorporated some functionally complete genes or gene modules by recombination with other phages or with different hosts, but homologous recombination within genes has previously not been shown. In this paper we demonstrate that homologous recombination between P2-like bacteriophages occurs randomly at multiple breakpoints in five late genes. The rate of recombination is high but, since some phages were sampled decades apart and in different parts of the world, this has to be viewed on an evolutionary time scale. The applicability of different methods used for detection of recombination breakpoints and estimation of rates of recombination in bacteriophages is discussed.  相似文献   

16.
RAPD problems in phylogenetics   总被引:1,自引:0,他引:1  
This paper is intended to clarify some of the questions related with the application of RAPD for phylogenetic reconstruction purposes. Using different specimens of mammals selected across various taxonomic levels, we assessed the validity of RAPD to recover a known phylogeny, using four distance coefficients (simple matching, Russell & Rao, Jaccard, and Dice). We assessed the minimum number of primers required in the computations to obtain stable results in terms of distance estimates and/or topologies of the derived trees. These results based on distance methods were compared with those obtained with parsimony analyses of RAPD markers. Both approaches have shown to be equally problematic for comparing taxa above the family level. On the basis of these comparisons among various indices and methods, we recommend the use of Jaccard or Dice coefficients, with no less than twelve primers. We also suggest validation of any phylogeny based on RAPD data with a resampling procedure (i.e. the bootstrap or the jackknife) before any sound conclusion can be drawn.  相似文献   

17.
In resolving the vertebrate tree of life, two fundamental questions remain: 1) what is the phylogenetic position of turtles within amniotes, and 2) what are the relationships between the three major lissamphibian (extant amphibian) groups? These relationships have historically been difficult to resolve, with five different hypotheses proposed for turtle placement, and four proposed branching patterns within Lissamphibia. We compiled a large cDNA/EST dataset for vertebrates (75 genes for 129 taxa) to address these outstanding questions. Gene-specific phylogenetic analyses revealed a great deal of variation in preferred topology, resulting in topologically ambiguous conclusions from the combined dataset. Due to consistent preferences for the same divergent topologies across genes, we suspected systematic phylogenetic error as a cause of some variation. Accordingly, we developed and tested a novel statistical method that identifies sites that have a high probability of containing biased signal for a specific phylogenetic relationship. After removing putatively biased sites, support emerged for a sister relationship between turtles and either crocodilians or archosaurs, as well as for a caecilian-salamander sister relationship within Lissamphibia, with Lissamphibia potentially paraphyletic.  相似文献   

18.
A graphical method for detecting recombination in phylogenetic data sets   总被引:9,自引:3,他引:6  
Current phylogenetic tree reconstruction methods assume that there is a single underlying tree topology for all sites along the sequence. The presence of mosaic sequences due to recombination violates this assumption and will cause phylogenetic methods to give misleading results due to the imposition of a single tree topology on all sites. The detection of mosaic sequences caused by recombination is therefore an important first step in phylogenetic analysis. A graphical method for the detection of recombination, based on the least squares method of phylogenetic estimation, is presented here. This method locates putative recombination breakpoints by moving a window along the sequence. The performance of the method is assessed by simulation and by its application to a real data set.   相似文献   

19.

Background

Tomato-infecting begomoviruses are widely distributed across the world and cause diseases of high economic impact on wide range of agriculturally important crops. Though recombination plays a pivotal role in diversification and evolution of these viruses, it is currently unknown whether there are differences in the number and quality of recombination events amongst different tomato-infecting begomovirus species. To examine this we sought to characterize the recombination events, estimate the frequency of recombination, and map recombination hotspots in tomato-infecting begomoviruses of South and Southeast Asia.

Results

Different methods used for recombination breakpoint analysis provided strong evidence for presence of recombination events in majority of the sequences analyzed. However, there was a clear evidence for absence or low Recombination events in viruses reported from North India. In addition, we provide evidence for non-random distribution of recombination events with the highest frequency of recombination being mapped in the portion of the N-terminal portion of Rep.

Conclusion

The variable recombination observed in these viruses signified that all begomoviruses are not equally prone to recombination. Distribution of recombination hotspots was found to be reliant on the relatedness of the genomic region involved in the exchange. Overall the frequency of phylogenetic violations and number of recombination events decreased with increasing parental sequence diversity. These findings provide valuable new information for understanding the diversity and evolution of tomato-infecting begomoviruses in Asia.  相似文献   

20.
Parkin mutations are responsible for the pathogenesis of autosomal-recessive juvenile parkinsonism (AR-JP). On initial screening of Japanese patients with AR-JP, we had found that approximately half of the parkin mutations are deletions occurring between exons 2 and 5, forming a deletion hot spot. In this study, we investigated the deletion breakpoints of the parkin mutations in 22 families with AR-JP and examined the possible association between these deletion events and meiotic recombinations. We identified 18 deletion breakpoints at the DNA nucleotide sequence level. Almost all these deletions were different, indicating that the deletion hot spot was generated by recurrent but independent events. We found no association between the deletions and specific DNA elements. Recent copy number variation (CNV) data from various ethnic groups showed that the deletion hot spot is overlapped by a highly polymorphic CNV region, indicating that the recurrent deletion mutation or CNV is observable worldwide. By comparing Marshfield and deCODE linkage maps, we found that the parkin deletion hot spot may be associated with a meiotic recombination hot spot, although such association was not found on comparison with recent high-resolution genetic maps generated from the International HapMap project. Here, we discuss the possible mechanisms for deletion hot spot formation and its effects on human genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号