Similar literature
1.
Besides the complete genome, different partial genomic sequences of Hepatitis E virus (HEV) have been used in genotyping studies, making it difficult to compare the results based on them. No commonly agreed partial region for HEV genotyping has been determined. In this study, we used a statistical method to evaluate the phylogenetic performance of each partial genomic sequence on a genome-wide scale, comparing evolutionary distances between genomic regions and the full-length genomes of 101 HEV isolates to identify short genomic regions that can reproduce HEV genotype assignments based on full-length genomes. Several genomic regions, especially one at the 3′ terminus of the papain-like cysteine protease domain, were found to have relatively high phylogenetic correlations with the full-length genome. Phylogenetic analyses confirmed that these regions performed identically to the full-length genome in genotyping, dividing the HEV isolates involved into reasonable genotypes. This analysis may be of value in developing a partial sequence-based consensus classification of HEV species.
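The comparison described in this abstract, correlating pairwise distances from a short region with those from the full-length alignment, can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not name its distance measure or statistic, so the uncorrected p-distance, the Pearson correlation, and all function names here are hypothetical choices.

```python
from itertools import combinations
from math import sqrt


def p_distance(a, b):
    """Proportion of differing sites, skipping columns with a gap in either sequence."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return sum(x != y for x, y in pairs) / len(pairs)


def pairwise_distances(seqs, start=0, end=None):
    """p-distances for every pair of aligned sequences, restricted to [start, end)."""
    return [p_distance(a[start:end], b[start:end]) for a, b in combinations(seqs, 2)]


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either list has no variance."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def window_correlations(seqs, window, step):
    """Correlate each window's pairwise distances with the full-length distances."""
    full = pairwise_distances(seqs)
    length = len(seqs[0])
    results = []
    for start in range(0, length - window + 1, step):
        part = pairwise_distances(seqs, start, start + window)
        results.append((start, pearson(part, full)))
    return results


# Toy alignment (made-up data, not HEV sequences).
toy = ["ACGTACGTACGT", "ACGTACGTACGA", "TCGAACGTTCGA", "TCGAACCTTCGA"]
scores = window_correlations(toy, window=4, step=4)
```

Windows whose correlation is close to 1 would be the candidates for a short genotyping region; a real analysis would use a substitution-model distance and many more isolates.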

2.
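Li Zhe  Yang Yao  Wei Jing  Feng Yi  Xing Hui  He Xiang  Shao Yiming 《病毒学报》(Chinese Journal of Virology) 2012,28(4):366-371
This study aimed to explore how the choice of gene region affects the results of phylogenetic analysis of HIV-1 subtype B' strains. A total of 47 previously published near-full-length genome sequences of B' strains, from multiple regions of Thailand, Myanmar, and China and from different transmission routes, were split by gene region into separate data sets (gag, pol, vif, vpr, vpu, env, nef), and each data set was analyzed phylogenetically. Comparing the results across gene regions showed that the pol gene of B' strains had the lowest complexity and evolutionary rate of the regions analyzed, distinguished B'TH from B'YN strains well, and reproduced the results obtained with near-full-length genome sequences. The env gene had the highest complexity and evolutionary rate but could not produce comparable results. By comparing the effect of different gene regions on phylogenetic analysis of HIV-1 B' strains, this study lays a foundation for further molecular epidemiological investigation of HIV and for analyzing the spread of B' strains in China.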

3.
Pan XL  Liu H  Wang HY  Fu SH  Liu HZ  Zhang HL  Li MH  Gao XY  Wang JL  Sun XH  Lu XJ  Zhai YG  Meng WS  He Y  Wang HQ  Han N  Wei B  Wu YG  Feng Y  Yang DJ  Wang LH  Tang Q  Xia G  Kurane I  Rayner S  Liang GD 《Journal of virology》2011,85(19):9847-9853
Japanese encephalitis virus (JEV), a mosquito-borne zoonotic pathogen, is one of the major causes of viral encephalitis worldwide. Previous phylogenetic studies based on the envelope protein indicated that there are four genotypes, and surveillance data suggest that genotype I is gradually replacing genotype III as the dominant strain. Here we report an evolutionary analysis based on 98 full-length genome sequences of JEV, including 67 new samples isolated from humans, pigs, mosquitoes, midges, and bats in affected areas. To investigate the relationships between the genotypes and the significance of genotype I in recent epidemics, we estimated evolutionary rates, ages of common ancestors, and population demographics. Our results indicate that the genotypes diverged in the order IV, III, II, and I and that the genetic diversity of genotype III has decreased rapidly while that of genotype I has increased gradually, consistent with its emergence as the dominant genotype.

4.
We determined the complete mitochondrial DNA (mtDNA) sequence of a fluke, Paramphistomum cervi (Digenea: Paramphistomidae). This genome (14,014 bp) is slightly larger than that of Clonorchis sinensis (13,875 bp), but smaller than those of other digenean species. The mt genome of P. cervi contains 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 2 non-coding regions (NCRs), a complement consistent with those of other digeneans. The arrangement of protein-coding and ribosomal RNA genes in the P. cervi mitochondrial genome is identical to that of other digeneans except for a group of Schistosoma species that exhibit a derived arrangement. The positions of some transfer RNA genes differ. Bayesian phylogenetic analyses, based on concatenated nucleotide sequences and amino-acid sequences of the 12 protein-coding genes, placed P. cervi within the Order Plagiorchiida, but relationships depicted within that order were not quite as expected from previous studies. The complete mtDNA sequence of P. cervi provides important genetic markers for diagnostics, ecological and evolutionary studies of digeneans.

5.
The ability to infer relationships between groups of sequences, either by searching for their evolutionary history or by comparing their sequence similarity, can be a crucial step in hypothesis testing. Interpreting relationships of human immunodeficiency virus type 1 (HIV-1) sequences can be challenging because of their rapidly evolving genomes, but it may also lead to a better understanding of the underlying biology. Several studies have focused on the evolution of HIV-1, but there is little information to link sequence similarities and evolutionary histories of HIV-1 to the epidemiological information of the infected individual. Our goal was to correlate patterns of HIV-1 genetic diversity with epidemiological information, including risk and demographic factors. These correlations were then used to predict epidemiological information through analyzing short stretches of HIV-1 sequence. Using standard phylogenetic and phenetic techniques on 100 HIV-1 subtype B sequences, we were able to show some correlation between the viral sequences and the geographic area of infection and the risk of men who engage in sex with men. To help identify more subtle relationships between the viral sequences, the method of multidimensional scaling (MDS) was performed. That method identified statistically significant correlations between the viral sequences and the risk factors of men who engage in sex with men and individuals who engage in sex with injection drug users or use injection drugs themselves. Using tree construction, MDS, and newly developed likelihood assignment methods on the original 100 samples we sequenced, and also on a set of blinded samples, we were able to predict demographic/risk group membership at a rate statistically better than by chance alone. Such methods may make it possible to identify viral variants belonging to specific demographic groups by examining only a small portion of the HIV-1 genome. 
Such predictions of demographic epidemiology based on sequence information may become valuable in assigning different treatment regimens to infected individuals.

6.
Isolates of cauliflower mosaic virus (CaMV) differ in host range and symptomatology. Knowledge of their sequence relationships should assist in identifying nucleotide sequences responsible for isolate-specific characters. Complete nucleotide sequences of the DNAs of eight isolates of CaMV were aligned and the aligned sequences were used to analyze phylogenetic relationships by maximum likelihood, bootstrapped parsimony, and distance methods. Isolates found in North America clustered separately from those isolated from other parts of the world. Additional isolates, for which partial sequences were available, were incorporated into phylogenetic analysis of the sequences of genome segments corresponding to individual protein coding regions or the large intergenic region of CaMV DNA. The analysis revealed several instances where the position of an isolate on a tree for one coding region did not agree with the position of the isolate on the tree for the complete genome or with its position on trees for other coding regions. Examination of the distribution of shared residue types of phylogenetically informative positions in anomalous regions suggested that most of the anomalies were due to recombination events during the evolution of the isolates. Application of an algorithm that searches for segments of significant length that are identical between pairs of isolates or contain a significantly high concentration of polymorphisms suggested two additional recombination events between progenitors of the isolates studied and an event between the XinJing isolate and a CaMV not represented in the data set. An earlier phylogenetic origin for CaMV than for carnation etched ring virus, the caulimovirus used as outgroup in these analyses, was deduced from the position of the outgroup with North American isolates in some trees, but with non-North American isolates in other trees. Correspondence to: U. Melcher

7.
Mitochondrial DNA sequences can be used to estimate phylogenetic relationships among animal taxa and for molecular phylogenetic evolution analysis. With the development of sequencing technology, more and more mitochondrial sequences have been made available in public databases, including whole mitochondrial DNA sequences. These data have been used for phylogenetic analysis of animal species, and for studies of evolutionary processes. We performed phylogenetic analyses of 19 species of Cervidae, with Bos taurus as the outgroup. We applied neighbor joining, maximum likelihood, maximum parsimony, and Bayesian inference methods to whole mitochondrial genome sequences. The consensus phylogenetic trees supported monophyly of the family Cervidae; it was divided into two subfamilies, Plesiometacarpalia and Telemetacarpalia, and four tribes, Cervinae, Muntiacinae, Hydropotinae, and Odocoileinae. The divergence times in these families were estimated by Bayesian phylogenetic analysis with a relaxed molecular clock; the results were consistent with those of previous studies. We concluded that the evolutionary structure of the family Cervidae can be reconstructed by phylogenetic analysis based on whole mitochondrial genomes; this method could be used broadly in phylogenetic evolutionary analysis of animal taxa.

8.
The arenavirus Lassa virus causes Lassa fever, a viral hemorrhagic fever that is endemic in the countries of Nigeria, Sierra Leone, Liberia, and Guinea and perhaps elsewhere in West Africa. To determine the degree of genetic diversity among Lassa virus strains, partial nucleoprotein (NP) gene sequences were obtained from 54 strains and analyzed. Phylogenetic analyses showed that Lassa viruses comprise four lineages, three of which are found in Nigeria and the fourth in Guinea, Liberia, and Sierra Leone. Overall strain variation in the partial NP gene sequence was found to be as high as 27% at the nucleotide level and 15% at the amino acid level. Genetic distance among Lassa strains was found to correlate with geographic distance rather than time, and no evidence of a "molecular clock" was found. A method for amplifying and cloning full-length arenavirus S RNAs was developed and used to obtain the complete NP and glycoprotein gene (GP1 and GP2) sequences for two representative Nigerian strains of Lassa virus. Comparison of full-length gene sequences for four Lassa virus strains representing the four lineages showed that the NP gene (up to 23.8% nucleotide difference and 12.0% amino acid difference) is more variable than the glycoprotein genes. Although the evolutionary order of descent within Lassa virus strains was not completely resolved, the phylogenetic analyses of full-length NP, GP1, and GP2 gene sequences suggested that Nigerian strains of Lassa virus were ancestral to strains from Guinea, Liberia, and Sierra Leone. Compared to the New World arenaviruses, Lassa and the other Old World arenaviruses have either undergone a shorter period of diversification or are evolving at a slower rate. This study represents the first large-scale examination of Lassa virus genetic variation.

9.
Partial E1 envelope glycoprotein gene sequences and complete structural polyprotein sequences were used to compare divergence and construct phylogenetic trees for the genus Alphavirus. Tree topologies indicated that the mosquito-borne alphaviruses could have arisen in either the Old or the New World, with at least two transoceanic introductions to account for their current distribution. The time frame for alphavirus diversification could not be estimated because maximum-likelihood analyses indicated that the nucleotide substitution rate varies considerably across sites within the genome. While most trees showed evolutionary relationships consistent with current antigenic complexes and species, several changes to the current classification are proposed. The recently identified fish alphaviruses salmon pancreas disease virus and sleeping disease virus appear to be variants or subtypes of a new alphavirus species. Southern elephant seal virus is also a new alphavirus distantly related to all of the others analyzed. Tonate virus and Venezuelan equine encephalitis virus strain 78V3531 also appear to be distinct alphavirus species based on genetic, antigenic, and ecological criteria. Trocara virus, isolated from mosquitoes in Brazil and Peru, also represents a new species and probably a new alphavirus complex.

10.
A hepatitis B virus (HBV) genome was cloned from human liver. Numerous mutations in all viral genes define this HBV DNA as a mutant, divergent from all known HBV DNA sequences. Functional analyses of this mutant demonstrated a defect blocking viral DNA synthesis. The genetic basis of this defect was identified as a single missense mutation in the 5' region of the viral polymerase gene, resulting in the inability to package pregenomic RNA into core particles. The replication defect could be trans-complemented by a full-length wild-type, but not by a full-length mutant or 3'-truncated wild-type, polymerase gene construct. Our findings indicate a critical role of the 5' polymerase gene region in the life cycle of the virus and suggest that introducing missense mutations in this region can be a strategy to terminate viral replication in vivo.

11.
With the development of genome sequencing, more whole genomes of microorganisms have been completed, and many methods have been introduced to reconstruct the phylogenetic tree of those microorganisms with information extracted from the whole genomes, through various ways of transforming or mapping the whole genome sequences into other forms that can describe the evolutionary distance in a new way. We think it might be possible that there exists information buried in the whole genome, transferred along lineage, which remains stable and is more essential than sequence conservation of individual genes or the arrangement of some genes of a selected set. We need to find one measurement that can involve as many phylogenetic features as possible that are beyond the genome sequence itself. We converted each genome sequence of the microorganisms into another linear sequence to represent the functional structure of the sequence, used a new information function to calculate the discrepancy of sequences and obtain a distance matrix of the genomes, and built a phylogenetic tree with a neighbor joining method. The resulting tree shows that the major lineages are consistent with the result based on their 16S rRNA sequences. Our method discovered one phylogenetic feature derived from the genome sequences and the encoded genes that can rebuild the phylogenetic tree correctly. The mapping of one genome sequence to its new form representing the relative positions of the functional genes provides a new way to measure the phylogenetic relationships, and with a more specific classification of gene functions the result could be more sensitive.

12.
Bifurcating phylogenies are frequently used to describe the evolutionary history of groups of related species. However, simple bifurcating models may poorly represent the evolutionary history of species that have been exchanging genes. Here, we show that the history of three well-known closely related species, Drosophila pseudoobscura, D. persimilis and D. p. bogotana, is not well represented by a bifurcating phylogenetic tree. The phylogenetic relationships among these species vary widely between different genomic regions. Much of this phylogenetic variation can be explained by the potential of different genomic regions to introgress between species, as measured in laboratory studies. We argue that the utility of multiple markers in species-level phylogenetic studies can be greatly enhanced by knowledge of genomic location and, in the case of hybridizing species, by knowledge of the functional or linkage relationships among the markers and regions of the genome that reduce hybrid fitness.

13.
Zhang YJ  Ma PF  Li DZ 《PloS one》2011,6(5):e20596

Background

Bambusoideae is the only subfamily that contains woody members in the grass family, Poaceae. In phylogenetic analyses, Bambusoideae, Pooideae and Ehrhartoideae formed the BEP clade, yet the internal relationships of this clade are controversial. The distinctive life history (infrequent flowering and predominance of asexual reproduction) of woody bamboos makes them an interesting but taxonomically difficult group. Phylogenetic analyses based on large DNA fragments could only provide a moderate resolution of woody bamboo relationships, although a robust phylogenetic tree is needed to elucidate their evolutionary history. Phylogenomics is an alternative choice for resolving difficult phylogenies.

Methodology/Principal Findings

Here we present the complete nucleotide sequences of six woody bamboo chloroplast (cp) genomes using Illumina sequencing. These genomes are similar to those of other grasses and rather conservative in evolution. We constructed a phylogeny of Poaceae from 24 complete cp genomes including 21 grass species. Within the BEP clade, we found strong support for a sister relationship between Bambusoideae and Pooideae. In a substantial improvement over prior studies, all six nodes within Bambusoideae were supported with ≥0.95 posterior probability from Bayesian inference and 5/6 nodes resolved with 100% bootstrap support in maximum parsimony and maximum likelihood analyses. We found that repeats in the cp genome could provide phylogenetic information, while caution is needed when using indels in phylogenetic analyses based on few selected genes. We also identified relatively rapidly evolving cp genome regions that have the potential to be used for further phylogenetic study in Bambusoideae.

Conclusions/Significance

The cp genome of Bambusoideae evolved slowly, and phylogenomics based on whole cp genome could be used to resolve major relationships within the subfamily. The difficulty in resolving the diversification among three clades of temperate woody bamboos, even with complete cp genome sequences, suggests that these lineages may have diverged very rapidly.

14.
An increasing number of complete sequences of mitochondrial (mt) genomes provides the opportunity to optimise the choice of molecular markers for phylogenetic and ecological studies. This is particularly the case where mt genomes from closely related taxa have been sequenced; e.g., within Schistosoma. These blood flukes include species that are the causative agents of schistosomiasis, where there has been a need to optimise markers for species and strain recognition. For many phylogenetic and population genetic studies, the choice of nucleotide sequences depends primarily on suitable PCR primers. Complete mt genomes allow individual gene or other mt markers to be assessed relative to one another for potential information content, prior to broad-scale sampling. We assess the phylogenetic utility of individual genes and identify regions that contain the greatest interspecific variation for molecular ecological and diagnostic markers. We show that variable characters are not randomly distributed along the genome and there is a positive correlation between polymorphism and divergence. The mt genomes of African and Asian schistosomes were compared with the available intraspecific dataset of Schistosoma mansoni through sliding window analyses, in order to assess whether the observed polymorphism was at a level predicted from interspecific comparisons. We found a positive correlation except for the two genes (cox1 and nad1) adjoining the putative control region in S. mansoni. The genes nad1, nad4, nad5, cox1 and cox3 resolved phylogenies that were consistent with a benchmark phylogeny and in general, longer genes performed better in phylogenetic reconstruction. Considering the information content of entire mt genome sequences, partial cox1 would not be the ideal marker for either species identification (barcoding) or population studies with Schistosoma species. Instead, we suggest the use of cox3 and nad5 for both phylogenetic and population studies. 
Five primer pairs designed against Schistosoma mekongi and Schistosoma malayensis were tested successfully against Schistosoma japonicum. In combination, these fragments encompass 20-27% of the variation amongst the genomes (average total length approximately 14,000bp), thus providing an efficient means of encapsulating the greatest amount of variation within the shortest sequence. Comparative mitogenomics provides the basis of a rational approach to molecular marker selection and optimisation.
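A sliding window analysis of the kind mentioned in this abstract can be illustrated with a small sketch. It is not the authors' procedure: the window statistic (fraction of variable alignment columns), the gap handling, and the function names are all assumptions made for illustration.

```python
def variable_sites(seqs):
    """Flag each alignment column that contains more than one distinct base (gaps ignored)."""
    flags = []
    for column in zip(*seqs):
        bases = {c for c in column if c != "-"}
        flags.append(len(bases) > 1)
    return flags


def sliding_window_variation(seqs, window, step):
    """Return (window start, fraction of variable sites) for each window along the alignment."""
    flags = variable_sites(seqs)
    result = []
    for start in range(0, len(flags) - window + 1, step):
        count = sum(flags[start:start + window])
        result.append((start, count / window))
    return result
```

Windows with the highest fraction of variable sites would be the first places to look for primers that capture the most variation in the shortest fragment.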

15.
Ribosomal DNA: molecular evolution and phylogenetic inference.
Ribosomal DNA (rDNA) sequences have been aligned and compared in a number of living organisms, and this approach has provided a wealth of information about phylogenetic relationships. Studies of rDNA sequences have been used to infer phylogenetic history across a very broad spectrum, from studies among the basal lineages of life to relationships among closely related species and populations. The reasons for the systematic versatility of rDNA include the numerous rates of evolution among different regions of rDNA (both among and within genes), the presence of many copies of most rDNA sequences per genome, and the pattern of concerted evolution that occurs among repeated copies. These features facilitate the analysis of rDNA by direct RNA sequencing, DNA sequencing (either by cloning or amplification), and restriction enzyme methodologies. Constraints imposed by secondary structure of rRNA and concerted evolution need to be considered in phylogenetic analyses, but these constraints do not appear to impede seriously the usefulness of rDNA. An analysis of aligned sequences of the four nuclear and two mitochondrial rRNA genes identified regions of these genes that are likely to be useful to address phylogenetic problems over a wide range of levels of divergence. In general, the small subunit nuclear sequences appear to be best for elucidating Precambrian divergences, the large subunit nuclear sequences for Paleozoic and Mesozoic divergences, and the organellar sequences of both subunits for Cenozoic divergences. Primer sequences were designed for use in amplifying the entire nuclear rDNA array in 15 sections by use of the polymerase chain reaction; these "universal" primers complement previously described primers for the mitochondrial rRNA genes. Pairs of primers can be selected in conjunction with the analysis of divergence of the rRNA genes to address systematic problems throughout the hierarchy of life.

16.
In addition to immunodeficiency, human immunodeficiency virus type 1 (HIV-1) can cause cognitive impairment and dementia through direct infection of the brain. To investigate the adaptive process and timing of HIV-1 entry into the central nervous system, we carried out an extensive genetic characterization of variants amplified from different regions of the brain and determined their relatedness to those in lymphoid tissue. HIV-1 genomes infecting different regions of the brain of one study subject with HIV encephalitis (HIVE) had a mosaic structure, being assembled from different combinations of evolutionarily distinct lineages in p17(gag), pol, individual hypervariable regions of gp120 (V1/V2, V3, V4, and V5), and gp41/nef. Similar discordant phylogenetic relationships were observed between p17(gag) and V3 sequences of brain and lymphoid tissue from three other individuals with HIVE. The observation that different parts of the genome of HIV infecting a particular tissue can have different evolutionary histories necessarily limits the conclusions that can be drawn from previous studies of the compartmentalization of distinct HIV populations in different tissues, as these have been generally restricted to sequence comparisons of single subgenomic regions. The complexity of viral populations in the brain produced by recombination could provide a powerful adaptive mechanism for the spread of virus with new phenotypes, such as antiviral resistance or escape from cytotoxic T-cell recognition into existing tissue-adapted virus populations.

17.
Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke was not previously known. The present study determined the complete mt genome sequence of E. hortense and assessed its phylogenetic relationships with other digenean species for which complete mt genome sequences are available in GenBank, using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.

18.
Pyrosequencing of PCR-amplified fragments that target variable regions within the 16S rRNA gene has quickly become a powerful method for analyzing the membership and structure of microbial communities. This approach has revealed and introduced questions that were not fully appreciated by those carrying out traditional Sanger sequencing-based methods. These include the effects of alignment quality, the best method of calculating pairwise genetic distances for 16S rRNA genes, whether it is appropriate to filter variable regions, and how the choice of variable region relates to the genetic diversity observed in full-length sequences. I used a diverse collection of 13,501 high-quality full-length sequences to assess each of these questions. First, alignment quality had a significant impact on distance values and downstream analyses. Specifically, the greengenes alignment, which does a poor job of aligning variable regions, predicted higher genetic diversity, richness, and phylogenetic diversity than the SILVA and RDP-based alignments. Second, the effect of different gap treatments in determining pairwise genetic distances was strongly affected by the variation in sequence length for a region; however, the effect of different calculation methods was subtle when determining the sample's richness or phylogenetic diversity for a region. Third, applying a sequence mask to remove variable positions had a profound impact on genetic distances by muting the observed richness and phylogenetic diversity. Finally, the genetic distances calculated for each of the variable regions did a poor job of correlating with the full-length gene. Thus, while it is tempting to apply traditional cutoff levels derived for full-length sequences to these shorter sequences, it is not advisable. Analysis of β-diversity metrics showed that each of these factors can have a significant impact on the comparison of community membership and structure. Taken together, these results urge caution in the design and interpretation of analyses using pyrosequencing data.
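To make the point about gap treatments concrete, the sketch below computes an uncorrected pairwise distance under three gap conventions. The abstract does not spell out which treatments were compared, so the three modes and their names ("ignore", "each", "one") are assumptions chosen for illustration, not a reproduction of the study's calculations.

```python
def gap_distance(a, b, gaps="ignore"):
    """Uncorrected distance between two aligned sequences under a chosen gap treatment.

    gaps="ignore": columns with a gap in either sequence are skipped.
    gaps="each":   every column with a gap in exactly one sequence counts as a mismatch.
    gaps="one":    each run of consecutive such columns counts as a single mismatch.
    Columns where both sequences have a gap are always skipped.
    """
    if gaps not in ("ignore", "each", "one"):
        raise ValueError("gaps must be 'ignore', 'each', or 'one'")
    diffs = 0
    compared = 0
    in_gap_run = False
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            continue
        if x == "-" or y == "-":
            if gaps == "each" or (gaps == "one" and not in_gap_run):
                diffs += 1
                compared += 1
            in_gap_run = True
            continue
        in_gap_run = False
        compared += 1
        if x != y:
            diffs += 1
    return diffs / compared if compared else 0.0
```

For the toy pair `ACGT--AC` and `ACGTTTAG`, the three treatments give different distances from the same alignment, which is the kind of sensitivity the abstract cautions about when sequence lengths vary.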

19.
Traditional phylogenetic analysis is based on multiple sequence alignment. With the progress of genome sequencing projects worldwide, more and more completely sequenced genomes have become available. However, traditional sequence alignment tools cannot handle large-scale genome sequences. The development of new algorithms that infer phylogenetic relationships from whole-genome information without alignment therefore represents a new direction of phylogenetic study in the post-genome era. In the present study, a novel algorithm based on BBC (base-base correlation) is proposed to analyze the phylogenetic relationships of HEV (Hepatitis E virus). When 48 HEV genome sequences were analyzed, the phylogenetic tree constructed with the BBC algorithm was consistent with that of a previous study. Compared with sequence alignment methods, the BBC algorithm appears to be faster at calculating evolutionary distances between whole genome sequences and does not require human intervention such as gene identification or parameter selection. The BBC algorithm can serve as an alternative for rapidly constructing phylogenetic trees and inferring evolutionary relationships.
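The abstract does not define BBC, so the following is only a sketch of one plausible alignment-free feature in that spirit: a 16-dimensional vector that sums, over lags 1..K, a mutual-information-style term for each ordered base pair, with genomes then compared by Euclidean distance. The formula, the `max_lag` parameter, and the function names are assumptions for illustration and may differ from the published algorithm.

```python
from itertools import product
from math import log2, sqrt

BASES = "ACGT"


def bbc_vector(seq, max_lag):
    """16-dimensional base-base correlation-style vector for one sequence.

    For each ordered base pair (i, j), sums p_ij(l) * log2(p_ij(l) / (p_i * p_j))
    over lags l = 1..max_lag, where p_ij(l) is the frequency of base i followed
    by base j at distance l. Characters other than A, C, G, T are dropped.
    """
    seq = [c for c in seq.upper() if c in BASES]
    n = len(seq)
    if n == 0:
        raise ValueError("sequence contains no A/C/G/T characters")
    freq = {b: seq.count(b) / n for b in BASES}
    vec = {pair: 0.0 for pair in product(BASES, repeat=2)}
    for lag in range(1, max_lag + 1):
        total = n - lag
        if total <= 0:
            break
        counts = {}
        for t in range(total):
            key = (seq[t], seq[t + lag])
            counts[key] = counts.get(key, 0) + 1
        for (i, j), c in counts.items():
            p = c / total
            vec[(i, j)] += p * log2(p / (freq[i] * freq[j]))
    return [vec[pair] for pair in product(BASES, repeat=2)]


def bbc_distance(seq_a, seq_b, max_lag):
    """Euclidean distance between the two sequences' feature vectors."""
    va, vb = bbc_vector(seq_a, max_lag), bbc_vector(seq_b, max_lag)
    return sqrt(sum((x - y) ** 2 for x, y in zip(va, vb)))
```

A matrix of such distances over all genome pairs could then be passed to any distance-based tree builder, such as neighbor joining; no alignment step is involved, which is the advantage the abstract emphasizes.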

20.
Phytophthora species are devastating plant pathogens in both agricultural and natural environments. Due to their significant economic and environmental impact, there has been increasing interest in Phytophthora genetics and genomics, culminating in the recent release of three complete genome sequences (P. ramorum, P. sojae, and P. infestans). In this study, genome and other large sequence databases were used to identify over 225 potential genetic markers for phylogenetic analyses. Here, we present a genus-wide phylogeny for 82 Phytophthora species using seven of the most informative loci (approximately 8700 nucleotide sites). Our results support the division of the genus into 10 well-supported clades. The relationships among these clades were rigorously evaluated using a number of phylogenetic methods. This is the most comprehensive study of Phytophthora relationships to date, and many newly discovered species have been included. A more resolved phylogeny of Phytophthora species will allow for better interpretations of the overall evolutionary history of the genus.
