首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Phylogenetic methods have been widely used to detect the evolution of influenza viruses.However,previous phylogenetic studies of influenza viruses do not make full use of the genetic information at the protein level and therefore cannot distinguish the subtle differences among viral genes.Proteotyping is a new approach to study influenza virus evolution.It aimed at mining the potential genetic information of the viral gene at the protein level by visualizing unique amino acid signatures(proteotypes).Neuraminidase gene fragments of some H5N1 avian influenza viruses were used as an example to illustrate how the proteotyping method worked.Bayesian analysis confirmed that the NA gene tree was mainly divided into three lineages.The NA proteotype analysis further suggested there might be multiple proteotypes within these three lineages and even within single genotypes.At the same time,some proteotypes might even involve more than one genotype.In particular,it also discovered some amino acids of viruses of some genotypes might co-reassort.All these results proved this approach could provide additional information in contrast to results from standard phylogenetic tree analysis.  相似文献   

2.
Dong G  Luo J  Zhang H  Wang C  Duan M  Deliberto TJ  Nolte DL  Ji G  He H 《PloS one》2011,6(2):e17212
H9N2 influenza A viruses have become established worldwide in terrestrial poultry and wild birds, and are occasionally transmitted to mammals including humans and pigs. To comprehensively elucidate the genetic and evolutionary characteristics of H9N2 influenza viruses, we performed a large-scale sequence analysis of 571 viral genomes from the NCBI Influenza Virus Resource Database, representing the spectrum of H9N2 influenza viruses isolated from 1966 to 2009. Our study provides a panoramic framework for better understanding the genesis and evolution of H9N2 influenza viruses, and for describing the history of H9N2 viruses circulating in diverse hosts. Panorama phylogenetic analysis of the eight viral gene segments revealed the complexity and diversity of H9N2 influenza viruses. The 571 H9N2 viral genomes were classified into 74 separate lineages, which had marked host and geographical differences in phylogeny. Panorama genotypical analysis also revealed that H9N2 viruses include at least 98 genotypes, which were further divided according to their HA lineages into seven series (A-G). Phylogenetic analysis of the internal genes showed that H9N2 viruses are closely related to H3, H4, H5, H7, H10, and H14 subtype influenza viruses. Our results indicate that H9N2 viruses have undergone extensive reassortments to generate multiple reassortants and genotypes, suggesting that the continued circulation of multiple genotypical H9N2 viruses throughout the world in diverse hosts has the potential to cause future influenza outbreaks in poultry and epidemics in humans. We propose a nomenclature system for identifying and unifying all lineages and genotypes of H9N2 influenza viruses in order to facilitate international communication on the evolution, ecology and epidemiology of H9N2 influenza viruses.  相似文献   

3.

Background

Influenza neuraminidase (NA) is an important surface glycoprotein and plays a vital role in viral replication and drug development. The NA is found in influenza A and B viruses, with nine subtypes classified in influenza A. The complete knowledge of influenza NA evolutionary history and phylodynamics, although critical for the prevention and control of influenza epidemics and pandemics, remains lacking.

Methodology/Principal findings

Evolutionary and phylogenetic analyses of influenza NA sequences using Maximum Likelihood and Bayesian MCMC methods demonstrated that the divergence of influenza viruses into types A and B occurred earlier than the divergence of influenza A NA subtypes. Twenty-three lineages were identified within influenza A, two lineages were classified within influenza B, and most lineages were specific to host, subtype or geographical location. Interestingly, evolutionary rates vary not only among lineages but also among branches within lineages. The estimated tMRCAs of influenza lineages suggest that the viruses of different lineages emerge several months or even years before their initial detection. The d N /d S ratios ranged from 0.062 to 0.313 for influenza A lineages, and 0.257 to 0.259 for influenza B lineages. Structural analyses revealed that all positively selected sites are at the surface of the NA protein, with a number of sites found to be important for host antibody and drug binding.

Conclusions/Significance

The divergence into influenza type A and B from a putative ancestral NA was followed by the divergence of type A into nine NA subtypes, of which 23 lineages subsequently diverged. This study provides a better understanding of influenza NA lineages and their evolutionary dynamics, which may facilitate early detection of newly emerging influenza viruses and thus improve influenza surveillance.  相似文献   

4.
The evolution and population dynamics of human influenza in Taiwan is a microcosm of the viruses circulating worldwide, which has not yet been studied in detail. We collected 343 representative full genome sequences of human influenza A viruses isolated in Taiwan between 1979 and 2009. Phylogenetic and antigenic data analysis revealed that H1N1 and H3N2 viruses consistently co-circulated in Taiwan, although they were characterized by different temporal dynamics and degrees of genetic diversity. Moreover, influenza A viruses of both subtypes underwent internal gene reassortment involving all eight segments of the viral genome, some of which also occurred during non-epidemic periods. The patterns of gene reassortment were different in the two subtypes. The internal genes of H1N1 viruses moved as a unit, separately from the co-evolving HA and NA genes. On the other hand, the HA and NA genes of H3N2 viruses tended to segregate consistently with different sets of internal gene segments. In particular, as reassortment occurred, H3HA always segregated as a group with the PB1, PA and M genes, while N2NA consistently segregated with PB2 and NP. Finally, the analysis showed that new phylogenetic lineages and antigenic variants emerging in summer were likely to be the progenitors of the epidemic strains in the following season. The synchronized seasonal patterns and high genetic diversity of influenza A viruses observed in Taiwan make possible to capture the evolutionary dynamic and epidemiological rules governing antigenic drift and reassortment and may serve as a “warning” system that recapitulates the global epidemic.  相似文献   

5.
Phylogenetic profiles of the genes coding for the hemagglutinin (HA) protein, nucleoprotein (NP), matrix (M) protein, and nonstructural (NS) proteins of influenza B viruses isolated from 1940 to 1998 were analyzed in a parallel manner in order to understand the evolutionary mechanisms of these viruses. Unlike human influenza A (H3N2) viruses, the evolutionary pathways of all four genes of recent influenza B viruses revealed similar patterns of genetic divergence into two major lineages. Although evolutionary rates of the HA, NP, M, and NS genes of influenza B viruses were estimated to be generally lower than those of human influenza A viruses, genes of influenza B viruses demonstrated complex phylogenetic patterns, indicating alternative mechanisms for generation of virus variability. Topologies of the evolutionary trees of each gene were determined to be quite distinct from one another, showing that these genes were evolving in an independent manner. Furthermore, variable topologies were apparently the result of frequent genetic exchange among cocirculating epidemic viruses. Evolutionary analysis done in the present study provided further evidence for cocirculation of multiple lineages as well as sequestering and reemergence of phylogenetic lineages of the internal genes. In addition, comparison of deduced amino acid sequences revealed a novel amino acid deletion in the HA1 domain of the HA protein of recent isolates from 1998 belonging to the B/Yamagata/16/88-like lineage. It thus became apparent that, despite lower evolutionary rates, influenza B viruses were able to generate genetic diversity among circulating viruses through a combination of evolutionary mechanisms involving cocirculating lineages and genetic reassortment by which new variants with distinct gene constellations emerged.  相似文献   

6.
Shi W  Lei F  Zhu C  Sievers F  Higgins DG 《PloS one》2010,5(12):e14454

Background

More and more nucleotide sequences of type A influenza virus are available in public databases. Although these sequences have been the focus of many molecular epidemiological and phylogenetic analyses, most studies only deal with a few representative sequences. In this paper, we present a complete analysis of all Haemagglutinin (HA) and Neuraminidase (NA) gene sequences available to allow large scale analyses of the evolution and epidemiology of type A influenza.

Methodology/Principal Findings

This paper describes an analysis and complete classification of all HA and NA gene sequences available in public databases using multivariate and phylogenetic methods.

Conclusions/Significance

We analyzed 18975 HA sequences and divided them into 280 subgroups according to multivariate and phylogenetic analyses. Similarly, we divided 11362 NA sequences into 202 subgroups. Compared to previous analyses, this work is more detailed and comprehensive, especially for the bigger datasets. Therefore, it can be used to show the full and complex phylogenetic diversity and provides a framework for studying the molecular evolution and epidemiology of type A influenza virus. For more than 85% of type A influenza HA and NA sequences into GenBank, they are categorized in one unambiguous and unique group. Therefore, our results are a kind of genetic and phylogenetic annotation for influenza HA and NA sequences. In addition, sequences of swine influenza viruses come from 56 HA and 45 NA subgroups. Most of these subgroups also include viruses from other hosts indicating cross species transmission of the viruses between pigs and other hosts. Furthermore, the phylogenetic diversity of swine influenza viruses from Eurasia is greater than that of North American strains and both of them are becoming more diverse. Apart from viruses from human, pigs, birds and horses, viruses from other species show very low phylogenetic diversity. This might indicate that viruses have not become established in these species. Based on current evidence, there is no simple pattern of inter-hemisphere transmission of avian influenza viruses and it appears to happen sporadically. However, for H6 subtype avian influenza viruses, such transmissions might have happened very frequently and multiple and bidirectional transmission events might exist.  相似文献   

7.
Domestic ducks in southern China act as an important reservoir for influenza viruses and have also facilitated the establishment of multiple H6 influenza virus lineages. To understand the continuing evolution of these established lineages, 297 H6 viruses isolated from domestic ducks during 2006 and 2007 were genetically and antigenically analyzed. Phylogenetic analyses showed that group II duck H6 viruses had replaced the previously predominant group I lineage and extended their geographic distribution from coastal to inland regions. Group II H6 virus showed that the genesis and development of multiple types of deletions in the neuraminidase (NA) stalk region could occur in the influenza viruses from domestic ducks. A gradual replacement of the N2 NA subtype with N6 was observed. Significant antigenic changes occurred within group II H6 viruses so that they became antigenically distinguishable from group I and gene pool viruses. Gene exchange between group II H6 viruses and the established H5N1, H9N2, or H6N1 virus lineages in poultry in the region was very limited. These findings suggest that domestic ducks can facilitate significant genetic and antigenic changes in viruses established in this host and highlight gaps in our knowledge of influenza virus ecology and even the evolutionary behavior of this virus family in its aquatic avian reservoirs.  相似文献   

8.
Multiple genotypes of influenza B virus circulated between 1979 and 2003   总被引:4,自引:0,他引:4  
The segmented genome of influenza B virus allows exchange of gene segments between cocirculating strains. Through this process of reassortment, diversity is generated by the mixing of genes between viruses that differ in one or more gene segments. Phylogenetic and evolutionary analyses of all 11 genes of 31 influenza B viruses isolated from 1979 to 2003 were used to study the evolution of whole genomes. All 11 genes diverged into two new lineages prior to 1987. All genes except the NS1 gene were undergoing linear evolution, although the rate of evolution and the degree to which nucleotide changes translated into amino acid changes varied between lineages and by gene. Frequent reassortment generated 14 different genotypes distinct from the gene constellation of viruses circulating prior to 1979. Multiple genotypes cocirculated in some locations, and a sequence of reassortment events over time could not be established. The surprising diversity of the viruses, unrestricted mixing of lineages, and lack of evidence for coevolution of gene segments do not support the hypothesis that the reassortment process is driven by selection for functional differences.  相似文献   

9.
Influenza B virus remains a major contributor to the seasonal influenza outbreak and its prevalence has increased worldwide. We investigated the epidemiology and analyzed the full genome sequences of influenza B virus strains in Thailand between 2010 and 2014. Samples from the upper respiratory tract were collected from patients diagnosed with influenza like-illness. All samples were screened for influenza A/B viruses by one-step multiplex real-time RT-PCR. The whole genome of 53 influenza B isolates were amplified, sequenced, and analyzed. From 14,418 respiratory samples collected during 2010 to 2014, a total of 3,050 tested positive for influenza virus. Approximately 3.27% (471/14,418) were influenza B virus samples. Fifty three isolates of influenza B virus were randomly chosen for detailed whole genome analysis. Phylogenetic analysis of the HA gene showed clusters in Victoria clades 1A, 1B, 3, 5 and Yamagata clades 2 and 3. Both B/Victoria and B/Yamagata lineages were found to co-circulate during this time. The NA sequences of all isolates belonged to lineage II and consisted of viruses from both HA Victoria and Yamagata lineages, reflecting possible reassortment of the HA and NA genes. No significant changes were seen in the NA protein. The phylogenetic trees generated through the analysis of the PB1 and PB2 genes closely resembled that of the HA gene, while trees generated from the analysis of the PA, NP, and M genes showed similar topology. The NS gene exhibited the pattern of genetic reassortment distinct from those of the PA, NP or M genes. Thus, antigenic drift and genetic reassortment among the influenza B virus strains were observed in the isolates examined. Our findings indicate that the co-circulation of two distinct lineages of influenza B viruses and the limitation of cross-protection of the current vaccine formulation provide support for quadrivalent influenza vaccine in this region.  相似文献   

10.
Sun S  Wang Q  Zhao F  Chen W  Li Z 《PloS one》2012,7(2):e32119
Protein glycosylation alteration is typically employed by various viruses for escaping immune pressures from their hosts. Our previous work had shown that not only the increase of glycosylation sites (glycosites) numbers, but also glycosite migration might be involved in the evolution of human seasonal influenza H1N1 viruses. More importantly, glycosite migration was likely a more effectively alteration way for the host adaption of human influenza H1N1 viruses. In this study, we provided more bioinformatics and statistic evidences for further predicting the significant biological functions of glycosite migration in the host adaptation of human influenza H1N1 viruses, by employing homology modeling and in silico protein glycosylation of representative HA and NA proteins as well as amino acid variability analysis at antigenic sites of HA and NA. The results showed that glycosite migrations in human influenza viruses have at least five possible functions: to more effectively mask the antigenic sites, to more effectively protect the enzymatic cleavage sites of neuraminidase (NA), to stabilize the polymeric structures, to regulate the receptor binding and catalytic activities and to balance the binding activity of hemagglutinin (HA) with the release activity of NA. The information here can provide some constructive suggestions for the function research related to protein glycosylation of influenza viruses, although these predictions still need to be supported by experimental data.  相似文献   

11.
Networks of evolving genotypes can be constructed from the worldwide time-resolved genotyping of pathogens like influenza viruses. Such genotype networks are graphs where neighbouring vertices (viral strains) differ in a single nucleotide or amino acid. A rich trove of network analysis methods can help understand the evolutionary dynamics reflected in the structure of these networks. Here, I analyse a genotype network comprising hundreds of influenza A (H3N2) haemagglutinin genes. The network is rife with cycles that reflect non-random parallel or convergent (homoplastic) evolution. These cycles also show patterns of sequence change characteristic for strong and local evolutionary constraints, positive selection and mutation-limited evolution. Such cycles would not be visible on a phylogenetic tree, illustrating that genotype network analysis can complement phylogenetic analyses. The network also shows a distinct modular or community structure that reflects temporal more than spatial proximity of viral strains, where lowly connected bridge strains connect different modules. These and other organizational patterns illustrate that genotype networks can help us study evolution in action at an unprecedented level of resolution.  相似文献   

12.
Despite their close phylogenetic relationship, type A and B influenza viruses exhibit major epidemiological differences in humans, with the latter both less common and less often associated with severe disease. However, it is unclear what processes determine the evolutionary dynamics of influenza B virus, and how influenza viruses A and B interact at the evolutionary scale. To address these questions we inferred the phylogenetic history of human influenza B virus using complete genome sequences for which the date (day) of isolation was available. By comparing the phylogenetic patterns of all eight viral segments we determined the occurrence of segment reassortment over a 30-year sampling period. An analysis of rates of nucleotide substitution and selection pressures revealed sporadic occurrences of adaptive evolution, most notably in the viral hemagglutinin and compatible with the action of antigenic drift, yet lower rates of overall and nonsynonymous nucleotide substitution compared to influenza A virus. Overall, these results led us to propose a model in which evolutionary changes within and between the antigenically distinct 'Yam88' and 'Vic87' lineages of influenza B virus are the result of changes in herd immunity, with reassortment continuously generating novel genetic variation. Additionally, we suggest that the interaction with influenza A virus may be central in shaping the evolutionary dynamics of influenza B virus, facilitating the shift of dominance between the Vic87 and the Yam88 lineages.  相似文献   

13.
The novel swine-origin influenza A/H1N1 virus (S-OIV) first detected in April 2009 has been identified to transmit from human to human directly and is the cause of currently emerged pandemic. In this study, nucleotide and deduced amino acid sequences of hemagglutinin (HA) and neuraminidase (NA) of the S-OIV and other influenza A viruses were analyzed through bioinformatic tools for phylogenetic analysis, genetic recombination and point mutation to investigate the emergence and adaptation of the S-OIV in human. The phylogenetic analysis showed that the HA comes from triple reassortant influenza A/H1N2 and the NA from Eurasian swine influenza A/H1N1 indicating HA and NA to descend from different lineages during the genesis of the S-OIV. Recombination analysis nullified the possibility of occurrence of recombination in HA and NA denoting the role of reassortment in the outbreak. Several conservative mutations are observed in the amino acid sequences of the HA and NA and this mutated residues are identical in the S-OIV. The results reported herein suggested the notion that the recent pandemic is the result of reassortment of different genes from different lineages of two envelope proteins, HA and NA which are responsible for antigenic activity of virus. This study further suggests that the adaptive capability of the S-OIV in human is acquired by the unique mutations generated during emergence.  相似文献   

14.
The epidemiological and evolutionary dynamics of the two cocirculating lineages of influenza B virus, Victoria and Yamagata, are poorly understood, especially in tropical or subtropical areas of Southeast Asia. We performed a phylogenetic analysis of the hemagglutinin (HA) and neuraminidase (NA) sequences of influenza B viruses isolated in Guangzhou, a southern Chinese city, during 2009 to 2010 and compared the demographic and clinical features of infected patients. We identified multiple viral introductions of Victoria strains from both Chinese and international sources, which formed two phylogenetically and antigenically distinct clades (Victoria 1 and 2), some of which persisted between seasons. We identified one dominant Yamagata introduction from outside China during 2009. Our phylogenetic analysis reveals the occurrence of reassortment events among the Victoria and Yamagata lineages and also within the Victoria lineage. We found no significant difference in clinical severity by influenza B lineage, with the exceptions that (i) the Yamagata lineage infected older people than either Victoria lineage and (ii) fewer upper respiratory tract infections were caused by the Victoria 2 than the Victoria 1 clade. Overall, our study reveals the complex epidemiological dynamics of different influenza B lineages within a single geographic locality and has implications for vaccination policy in southern China.  相似文献   

15.
A quantitative genotype algorithm reflecting H5N1 Avian influenza niches   总被引:1,自引:0,他引:1  
MOTIVATION: Computational genotyping analyses are critical for characterizing molecular evolutionary footprints, thus providing important information for designing the strategies of influenza prevention and control. Most of the current methods that are available are based on multiple sequence alignment and phylogenetic tree construction, which are time consuming and limited by the number of taxa. Arbitrarily defining genotypes further complicates the interpretation of genotyping results. METHODS: In this study, we describe a quantitative influenza genotyping algorithm based on the theory of quasispecies. First, the complete composition vector (CCV) was utilized to calculate the pairwise evolutionary distance between genotypes. Next, Hierarchical Bayesian Modeling using the Gibbs Sampling algorithm was applied to identify the segment genotype threshold, which is used to identify influenza segment genotype through a modularity calculation. The viral genotype was defined by combining eight segment genotypes based on the genetic reassortment feature of influenza A viruses. RESULTS: We applied this method for H5N1 avian influenza viruses and identified 107 niches among 283 viruses with a complete genome set. The diversity of viral genotypes, and their correlation with geographic locations suggests that these viruses form local niches after being introduced to a new ecological environment through poultry trade or bird migration. This novel method allows us to define genotypes in a robust, quantitative as well as hierarchical manner. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

16.
Understanding the evolutionary dynamics of influenza A virus is central to its surveillance and control. While immune-driven antigenic drift is a key determinant of viral evolution across epidemic seasons, the evolutionary processes shaping influenza virus diversity within seasons are less clear. Here we show with a phylogenetic analysis of 413 complete genomes of human H3N2 influenza A viruses collected between 1997 and 2005 from New York State, United States, that genetic diversity is both abundant and largely generated through the seasonal importation of multiple divergent clades of the same subtype. These clades cocirculated within New York State, allowing frequent reassortment and generating genome-wide diversity. However, relatively low levels of positive selection and genetic diversity were observed at amino acid sites considered important in antigenic drift. These results indicate that adaptive evolution occurs only sporadically in influenza A virus; rather, the stochastic processes of viral migration and clade reassortment play a vital role in shaping short-term evolutionary dynamics. Thus, predicting future patterns of influenza virus evolution for vaccine strain selection is inherently complex and requires intensive surveillance, whole-genome sequencing, and phenotypic analysis.  相似文献   

17.
肠道病毒是我国病毒性脑炎(Viral encephalitis,VE)的主要病原体。本文研究对4株引起VE的天津柯萨奇病毒B组5型(Coxsackievirus B5,CV-B5)分离株进行Illumina MiniSeq高通量测序,并对其全基因组特征、进化及重组特点进行分析。结果提示,4株CV-B5天津分离株的全基因组核苷酸和氨基酸序列同源性分别为84.5%~100.0%和98.1%~100.0%,与国内流行株的全基因组核苷酸序列同源性为83.2%~96.5%,氨基酸序列同源性为96.4%~99.4%。基于全基因组的系统进化分析将CV-B5流行株分为A-D四个基因型,其中天津与国内流行株均属于C基因型。C基因型进一步分为3个进化分支,而天津分离株处在两个不同的分支上。基于基因组各区段序列的系统进化与SimPlot重组分析结果显示,天津分离株15-39N、15-41N与埃可病毒30型(Echovirus 30,E-30)原型株在P3区3B、3C、3D区域均检测到重组信号。本研究有助于了解CV-B5的全基因组特点和重组规律,为相关疾病的防控提供依据。  相似文献   

18.
Phylogenetic analysis of 42 membrane protein (M) genes of influenza A viruses from a variety of hosts and geographic locations showed that these genes have evolved into at least four major host-related lineages: (i) A/Equine/prague/56, which has the most divergent M gene; (ii) a lineage containing only H13 gull viruses; (iii) a lineage containing both human and classical swine viruses; and (iv) an avian lineage subdivided into North American avian viruses (including recent equine viruses) and Old World avian viruses (including avianlike swine strains). The M gene evolutionary tree differs from those published for other influenza virus genes (e.g., PB1, PB2, PA, and NP) but shows the most similarity to the NP gene phylogeny. Separate analyses of the M1 and M2 genes and their products revealed very different patterns of evolution. Compared with other influenza virus genes (e.g., PB2 and NP), the M1 and M2 genes are evolving relatively slowly, especially the M1 gene. The M1 and M2 gene products, which are encoded in different but partially overlapping reading frames, revealed that the M1 protein is evolving very slowly in all lineages, whereas the M2 protein shows significant evolution in human and swine lineages but virtually none in avian lineages. The evolutionary rates of the M1 proteins were much lower than those of M2 proteins and other internal proteins of influenza viruses (e.g., PB2 and NP), while M2 proteins showed less rapid evolution compared with other surface proteins (e.g., H3HA). Our results also indicate that for influenza A viruses, the evolution of one protein of a bicistronic gene can affect the evolution of the other protein.(ABSTRACT TRUNCATED AT 400 WORDS)  相似文献   

19.
Wild aquatic birds are the primary reservoir of influenza A viruses, but little is known about the viruses' gene pool in wild birds. Therefore, we investigated the ecology and emergence of influenza viruses by conducting phylogenetic analysis of 70 matrix (M) genes of influenza viruses isolated from shorebirds and gulls in the Delaware Bay region and from ducks in Alberta, Canada, during >18 years of surveillance. In our analysis, we included 61 published M genes of isolates from various hosts. We showed that M genes of Canadian duck viruses and those of shorebird and gull viruses in the Delaware Bay shared ancestors with the M genes of North American poultry viruses. We found that North American and Eurasian avian-like lineages are divided into sublineages, indicating that multiple branches of virus evolution may be maintained in wild aquatic birds. The presence of non-H13 gull viruses in the gull-like lineage and of H13 gull viruses in other avian lineages suggested that gulls' M genes do not preferentially associate with the H13 subtype or segregate into a distinct lineage. Some North American avian influenza viruses contained M genes closely related to those of Eurasian avian viruses. Therefore, there may be interregional mixing of the two clades. Reassortment of shorebird M and HA genes was evident, but there was no correlation among the HA or NA subtype, M gene sequence, and isolation time. Overall, these results support the hypothesis that influenza viruses in wild waterfowl contain distinguishable lineages of M genes.  相似文献   

20.
The evolutionary classification of influenza genes into lineages is a first step in understanding their molecular epidemiology and can inform the subsequent implementation of control measures. We introduce a novel approach called Lineage Assignment By Extended Learning (LABEL) to rapidly determine cladistic information for any number of genes without the need for time-consuming sequence alignment, phylogenetic tree construction, or manual annotation. Instead, LABEL relies on hidden Markov model profiles and support vector machine training to hierarchically classify gene sequences by their similarity to pre-defined lineages. We assessed LABEL by analyzing the annotated hemagglutinin genes of highly pathogenic (H5N1) and low pathogenicity (H9N2) avian influenza A viruses. Using the WHO/FAO/OIE H5N1 evolution working group nomenclature, the LABEL pipeline quickly and accurately identified the H5 lineages of uncharacterized sequences. Moreover, we developed an updated clade nomenclature for the H9 hemagglutinin gene and show a similarly fast and reliable phylogenetic assessment with LABEL. While this study was focused on hemagglutinin sequences, LABEL could be applied to the analysis of any gene and shows great potential to guide molecular epidemiology activities, accelerate database annotation, and provide a data sorting tool for other large-scale bioinformatic studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号