首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
3.
4.

Background

Multiple genome alignment remains a challenging problem. Effects of recombination including rearrangement, segmental duplication, gain, and loss can create a mosaic pattern of homology even among closely related organisms.

Methodology/Principal Findings

We describe a new method to align two or more genomes that have undergone rearrangements due to recombination and substantial amounts of segmental gain and loss (flux). We demonstrate that the new method can accurately align regions conserved in some, but not all, of the genomes, an important case not handled by our previous work. The method uses a novel alignment objective score called a sum-of-pairs breakpoint score, which facilitates accurate detection of rearrangement breakpoints when genomes have unequal gene content. We also apply a probabilistic alignment filtering method to remove erroneous alignments of unrelated sequences, which are commonly observed in other genome alignment methods. We describe new metrics for quantifying genome alignment accuracy which measure the quality of rearrangement breakpoint predictions and indel predictions. The new genome alignment algorithm demonstrates high accuracy in situations where genomes have undergone biologically feasible amounts of genome rearrangement, segmental gain and loss. We apply the new algorithm to a set of 23 genomes from the genera Escherichia, Shigella, and Salmonella. Analysis of whole-genome multiple alignments allows us to extend the previously defined concepts of core- and pan-genomes to include not only annotated genes, but also non-coding regions with potential regulatory roles. The 23 enterobacteria have an estimated core-genome of 2.46Mbp conserved among all taxa and a pan-genome of 15.2Mbp. We document substantial population-level variability among these organisms driven by segmental gain and loss. Interestingly, much variability lies in intergenic regions, suggesting that the Enterobacteriacae may exhibit regulatory divergence.

Conclusions

The multiple genome alignments generated by our software provide a platform for comparative genomic and population genomic studies. Free, open-source software implementing the described genome alignment approach is available from http://gel.ahabs.wisc.edu/mauve.  相似文献   

5.
6.
A chemical arms race at sea mediates algal host-virus interactions   总被引:1,自引:0,他引:1  
Despite the critical importance of viruses in shaping marine microbial ecosystems and lubricating upper ocean biogeochemical cycles, relatively little is known about the molecular mechanisms mediating phytoplankton host-virus interactions. Recent work in algal host-virus systems has begun to shed novel insight into the elegant strategies of viral infection and subcellular regulation of cell fate, which not only reveal tantalizing aspects of viral replication and host resistance strategies but also provide new diagnostic tools toward elucidating the impact of virus-mediated processes in the ocean. Widespread lateral gene transfer between viruses and their hosts plays a prominent role in host-virus diversification and in the regulation of host-virus infection mechanisms by allowing viruses to manipulate and 'rewire' host metabolic pathways to facilitate infection.  相似文献   

7.
8.
HIV-1 entry into host cells is mediated by interactions between the V3-loop of viral glycoprotein gp120 and chemokine receptor CCR5 or CXCR4, collectively known as HIV-1 coreceptors. Accurate genotypic prediction of coreceptor usage is of significant clinical interest and determination of the factors driving tropism has been the focus of extensive study. We have developed a method based on nonlinear support vector machines to elucidate the interacting residue pairs driving coreceptor usage and provide highly accurate coreceptor usage predictions. Our models utilize centroid-centroid interaction energies from computationally derived structures of the V3-loop:coreceptor complexes as primary features, while additional features based on established rules regarding V3-loop sequences are also investigated. We tested our method on 2455 V3-loop sequences of various lengths and subtypes, and produce a median area under the receiver operator curve of 0.977 based on 500 runs of 10-fold cross validation. Our study is the first to elucidate a small set of specific interacting residue pairs between the V3-loop and coreceptors capable of predicting coreceptor usage with high accuracy across major HIV-1 subtypes. The developed method has been implemented as a web tool named CRUSH, CoReceptor USage prediction for HIV-1, which is available at http://ares.tamu.edu/CRUSH/.  相似文献   

9.
We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an “encyclopedia” of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.  相似文献   

10.
Human immunodeficiency virus type 1 (HIV-1) continues to be a major cause of disease and premature death. As with all viruses, HIV-1 exploits a host cell to replicate. Improving our understanding of the molecular interactions between virus and human host proteins is crucial for a mechanistic understanding of virus biology, infection and host antiviral activities. This knowledge will potentially permit the identification of host molecules for targeting by drugs with antiviral properties. Here, we propose a data-driven approach for the analysis and prediction of the HIV-1 interacting proteins (VIPs) with a focus on the directionality of the interaction: host-dependency versus antiviral factors. Using support vector machine learning models and features encompassing genetic, proteomic and network properties, our results reveal some significant differences between the VIPs and non-HIV-1 interacting human proteins (non-VIPs). As assessed by comparison with the HIV-1 infection pathway data in the Reactome database (sensitivity > 90%, threshold = 0.5), we demonstrate these models have good generalization properties. We find that the ‘direction’ of the HIV-1-host molecular interactions is also predictable due to different characteristics of ‘forward’/pro-viral versus ‘backward’/pro-host proteins. Additionally, we infer the previously unknown direction of the interactions between HIV-1 and 1351 human host proteins. A web server for performing predictions is available at http://hivpre.cvr.gla.ac.uk/.  相似文献   

11.
Viruses are extremely abundant in seawater and are believed to be significant pathogens to photosynthetic protists (microalgae). Recently, several novel RNA viruses were found to infect marine photosynthetic protists; one of them is HcRNAV, which infects Heterocapsa circularisquama (Dinophyceae). There are two distinct ecotypes of HcRNAV with complementary intraspecies host ranges. Nucleotide sequence comparison between them revealed remarkable differences in the coat protein coding gene resulting in a high frequency of amino acid substitutions. However, the detailed mechanism supporting this intraspecies host specificity is still unknown. In this study, virus inoculation experiments were conducted with compatible and incompatible host-virus combinations to investigate the mechanism determining intraspecies host specificity. Cells were infected by adding a virus suspension directly to a host culture or by transfecting viral RNA into host cells by particle bombardment. Virus propagation was monitored by Northern blot analysis with a negative-strand-specific RNA probe, transmission electron microscopy, and a cell lysis assay. With compatible host-virus combinations, propagation of infectious progeny occurred regardless of the inoculation method used. When incompatible combinations were used, direct addition of a virus suspension did not even result in viral RNA replication, while in host cells transfected with viral RNA, infective progeny virus particles with a host range encoded by the imported viral RNA were propagated. This indicates that the intraspecies host specificity of HcRNAV is determined by the upstream events of virus infection. This is the first report describing the reproductive steps of an RNA virus infecting a photosynthetic protist at the molecular level.  相似文献   

12.
13.
14.
A Bayesian network approach to operon prediction   总被引:5,自引:0,他引:5  
  相似文献   

15.

Background

The host response to influenza A infections is strongly influenced by host genetic factors. Animal models of genetically diverse mouse strains are well suited to identify host genes involved in severe pathology, viral replication and immune responses. Here, we have utilized a dual RNAseq approach that allowed us to investigate both viral and host gene expression in the same individual mouse after H1N1 infection.

Results

We performed a detailed expression analysis to identify (i) correlations between changes in expression of host and virus genes, (ii) host genes involved in viral replication, and (iii) genes showing differential expression between two mouse strains that strongly differ in resistance to influenza infections. These genes may be key players involved in regulating the differences in pathogenesis and host defense mechanisms after influenza A infections. Expression levels of influenza segments correlated well with the viral load and may thus be used as surrogates for conventional viral load measurements. Furthermore, we investigated the functional role of two genes, Reg3g and Irf7, in knock-out mice and found that deletion of the Irf7 gene renders the host highly susceptible to H1N1 infection.

Conclusions

Using RNAseq analysis we identified novel genes important for viral replication or the host defense. This study adds further important knowledge to host-pathogen-interactions and suggests additional candidates that are crucial for host susceptibility or survival during influenza A infections.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1867-8) contains supplementary material, which is available to authorized users.  相似文献   

16.
17.
18.
19.
Li  Fangfang  Xu  Xiongbiao  Li  Zhenghe  Wang  Yaqin  Zhou  Xueping 《中国病毒学》2020,35(1):120-123
正Dear Editor,The geminiviruses are small single-stranded plant DNA viruses belonging to the family Geminiviridae, which cause serious diseases in many economically important  相似文献   

20.
Translating a set of disease regions into insight about pathogenic mechanisms requires not only the ability to identify the key disease genes within them, but also the biological relationships among those key genes. Here we describe a statistical method, Gene Relationships Among Implicated Loci (GRAIL), that takes a list of disease regions and automatically assesses the degree of relatedness of implicated genes using 250,000 PubMed abstracts. We first evaluated GRAIL by assessing its ability to identify subsets of highly related genes in common pathways from validated lipid and height SNP associations from recent genome-wide studies. We then tested GRAIL, by assessing its ability to separate true disease regions from many false positive disease regions in two separate practical applications in human genetics. First, we took 74 nominally associated Crohn''s disease SNPs and applied GRAIL to identify a subset of 13 SNPs with highly related genes. Of these, ten convincingly validated in follow-up genotyping; genotyping results for the remaining three were inconclusive. Next, we applied GRAIL to 165 rare deletion events seen in schizophrenia cases (less than one-third of which are contributing to disease risk). We demonstrate that GRAIL is able to identify a subset of 16 deletions containing highly related genes; many of these genes are expressed in the central nervous system and play a role in neuronal synapses. GRAIL offers a statistically robust approach to identifying functionally related genes from across multiple disease regions—that likely represent key disease pathways. An online version of this method is available for public use (http://www.broad.mit.edu/mpg/grail/).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号