首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
The Network Makeup Artist (NORMA) is a web tool for interactive network annotation visualization and topological analysis, able to handle multiple networks and annotations simultaneously. Precalculated annotations (e.g., Gene Ontology, Pathway enrichment, community detection, or clustering results) can be uploaded and visualized in a network, either as colored pie-chart nodes or as color-filled areas in a 2D/3D Venn-diagram-like style. In the case where no annotation exists, algorithms for automated community detection are offered. Users can adjust the network views using standard layout algorithms or allow NORMA to slightly modify them for visually better group separation. Once a network view is set, users can interactively select and highlight any group of interest in order to generate publication-ready figures. Briefly, with NORMA, users can encode three types of information simultaneously. These are 1) the network, 2) the communities or annotations of interest, and 3) node categories or expression values. Finally, NORMA offers basic topological analysis and direct topological comparison across any of the selected networks. NORMA service is available at http://norma.pavlopouloslab.info, whereas the code is available at https://github.com/PavlopoulosLab/NORMA.  相似文献   

2.
It is computationally challenging to detect variation by aligning single-molecule sequencing (SMS) reads, or contigs from SMS assemblies. One approach to efficiently align SMS reads is sparse dynamic programming (SDP), where optimal chains of exact matches are found between the sequence and the genome. While straightforward implementations of SDP penalize gaps with a cost that is a linear function of gap length, biological variation is more accurately represented when gap cost is a concave function of gap length. We have developed a method, lra, that uses SDP with a concave-cost gap penalty, and used lra to align long-read sequences from PacBio and Oxford Nanopore (ONT) instruments as well as de novo assembly contigs. This alignment approach increases sensitivity and specificity for SV discovery, particularly for variants above 1kb and when discovering variation from ONT reads, while having runtime that are comparable (1.05-3.76×) to current methods. When applied to calling variation from de novo assembly contigs, there is a 3.2% increase in Truvari F1 score compared to minimap2+htsbox. lra is available in bioconda (https://anaconda.org/bioconda/lra) and github (https://github.com/ChaissonLab/LRA).  相似文献   

3.
Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools–Lumpy, Delly and SoftSearch–and demonstrate Wham’s ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from github (https://github.com/zeeev/wham), with documentation on a wiki (http://zeeev.github.io/wham/). For community support please post questions to https://www.biostars.org/.
This is PLOS Computational Biology software paper.
  相似文献   

4.
5.
Rapidly improving high-throughput sequencing technologies provide unprecedented opportunities for carrying out population-genomic studies with various organisms. To take full advantage of these methods, it is essential to correctly estimate allele and genotype frequencies, and here we present a maximum-likelihood method that accomplishes these tasks. The proposed method fully accounts for uncertainties resulting from sequencing errors and biparental chromosome sampling and yields essentially unbiased estimates with minimal sampling variances with moderately high depths of coverage regardless of a mating system and structure of the population. Moreover, we have developed statistical tests for examining the significance of polymorphisms and their genotypic deviations from Hardy–Weinberg equilibrium. We examine the performance of the proposed method by computer simulations and apply it to low-coverage human data generated by high-throughput sequencing. The results show that the proposed method improves our ability to carry out population-genomic analyses in important ways. The software package of the proposed method is freely available from https://github.com/Takahiro-Maruki/Package-GFE.  相似文献   

6.
Lipids play a pivotal role in embryogenesis as structural components of cellular membranes, as a source of energy, and as signaling molecules. On the basis of a collection of temperature-sensitive embryonic lethal mutants, a systematic database search, and a subsequent microscopic analysis of >300 interference RNA (RNAi)–treated/mutant worms, we identified a couple of evolutionary conserved genes associated with lipid storage in Caenorhabditis elegans embryos. The genes include cpl-1 (cathepsin L–like cysteine protease), ccz-1 (guanine nucleotide exchange factor subunit), and asm-3 (acid sphingomyelinase), which is closely related to the human Niemann-Pick disease–causing gene SMPD1. The respective mutant embryos accumulate enlarged droplets of neutral lipids (cpl-1) and yolk-containing lipid droplets (ccz-1) or have larger genuine lipid droplets (asm-3). The asm-3 mutant embryos additionally showed an enhanced resistance against C band ultraviolet (UV-C) light. Herein we propose that cpl-1, ccz-1, and asm-3 are genes required for the processing of lipid-containing droplets in C. elegans embryos. Owing to the high levels of conservation, the identified genes are also useful in studies of embryonic lipid storage in other organisms.  相似文献   

7.
PHA-1 encodes a cytoplasmic protein that is required for embryonic morphogenesis and attachment of the foregut (pharynx) to the mouth (buccal capsule). Previous reports have in some cases suggested that PHA-1 is essential for the differentiation of most or all pharyngeal cell types. By performing mosaic analysis with a recently acquired pha-1 null mutation (tm3671), we found that PHA-1 is not required within most or all pharyngeal cells for their proper specification, differentiation, or function. Rather, our evidence suggests that PHA-1 acts in the arcade or anterior epithelial cells of the pharynx to promote attachment of the pharynx to the future buccal capsule. In addition, PHA-1 appears to be required in the epidermis for embryonic morphogenesis, in the excretory system for osmoregulation, and in the somatic gonad for normal ovulation and fertility. PHA-1 activity is also required within at least a subset of intestinal cells for viability. To better understand the role of PHA-1 in the epidermis, we analyzed several apical junction markers in pha-1(tm3671) homozygous embryos. PHA-1 regulates the expression of several components of two apical junction complexes including AJM-1DLG-1/discs large complex and the classical cadherin–catenin complex, which may account for the role of PHA-1 in embryonic morphogenesis.  相似文献   

8.
ChIP-seq is a powerful method for obtaining genome-wide maps of protein-DNA interactions and epigenetic modifications. CHANCE (CHip-seq ANalytics and Confidence Estimation) is a standalone package for ChIP-seq quality control and protocol optimization. Our user-friendly graphical software quickly estimates the strength and quality of immunoprecipitations, identifies biases, compares the user''s data with ENCODE''s large collection of published datasets, performs multi-sample normalization, checks against quantitative PCR-validated control regions, and produces informative graphical reports. CHANCE is available at https://github.com/songlab/chance.  相似文献   

9.
Many layouts exist for visualizing phylogenetic trees, allowing to display the same information (evolutionary relationships) in different ways. For large phylogenies, the choice of the layout is a key element, because the printable area is limited, and because interactive on-screen visualizers can lead to unreadable phylogenetic relationships at high zoom levels. A visual inspection of available layouts for rooted trees reveals large empty areas that one may want to fill in order to use less drawing space and eventually gain readability. This can be achieved by using the nonlayered tidy tree layout algorithm that was proposed earlier but was never used in a phylogenetic context so far. Here, we present its implementation, and we demonstrate its advantages on simulated and biological data (the measles virus phylogeny). Our results call for the integration of this new layout in phylogenetic software. We implemented the nonlayered tidy tree layout in R language as a stand-alone function (available at https://github.com/damiendevienne/non-layered-tidy-trees), as an option in the tree plotting function of the R package ape, and in the recent tool for visualizing reconciled phylogenetic trees thirdkind (https://github.com/simonpenel/thirdkind/wiki).  相似文献   

10.
Rhizobium leguminosarum bv. trifolii SRDI943 (strain syn. V2-2) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Trifolium michelianum Savi cv. Paradana that had been grown in soil collected from a mixed pasture in Victoria, Australia. This isolate was found to have a broad clover host range but was sub-optimal for nitrogen fixation with T. subterraneum (fixing 20-54% of reference inoculant strain WSM1325) and was found to be totally ineffective with the clover species T. polymorphum and T. pratense. Here we describe the features of R. leguminosarum bv. trifolii strain SRDI943, together with genome sequence information and annotation. The 7,412,387 bp high-quality-draft genome is arranged into 5 scaffolds of 5 contigs, contains 7,317 protein-coding genes and 89 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.  相似文献   

11.
We describe MetAMOS, an open source and modular metagenomic assembly and analysis pipeline. MetAMOS represents an important step towards fully automated metagenomic analysis, starting with next-generation sequencing reads and producing genomic scaffolds, open-reading frames and taxonomic or functional annotations. MetAMOS can aid in reducing assembly errors, commonly encountered when assembling metagenomic samples, and improves taxonomic assignment accuracy while also reducing computational cost. MetAMOS can be downloaded from: https://github.com/treangen/MetAMOS.  相似文献   

12.
13.
The Saccharomyces cerevisiae type 2C protein phosphatase Ptc1 is required for a wide variety of cellular functions, although only a few cellular targets have been identified. A genetic screen in search of mutations in protein kinase–encoding genes able to suppress multiple phenotypic traits caused by the ptc1 deletion yielded a single gene, MKK1, coding for a MAPK kinase (MAPKK) known to activate the cell-wall integrity (CWI) Slt2 MAPK. In contrast, mutation of the MKK1 paralog, MKK2, had a less significant effect. Deletion of MKK1 abolished the increased phosphorylation of Slt2 induced by the absence of Ptc1 both under basal and CWI pathway stimulatory conditions. We demonstrate that Ptc1 acts at the level of the MAPKKs of the CWI pathway, but only the Mkk1 kinase activity is essential for ptc1 mutants to display high Slt2 activation. We also show that Ptc1 is able to dephosphorylate Mkk1 in vitro. Our results reveal the preeminent role of Mkk1 in signaling through the CWI pathway and strongly suggest that hyperactivation of Slt2 caused by upregulation of Mkk1 is at the basis of most of the phenotypic defects associated with lack of Ptc1 function.  相似文献   

14.
Lactobacillus rhamnosus is a facultative, lactic acid bacterium in the phylum Firmicutes. Lactobacillus spp. are generally considered beneficial, and specific strains of L. rhamnosus are validated probiotics. We describe the draft genomes of three L. rhamnosus strains (L31, L34, and L35) isolated from the feces of Thai breastfed infants, which exhibit anti-inflammatory properties in vitro. The three genomes range between 2.8 – 2.9 Mb, and contain approximately 2,700 protein coding genes.  相似文献   

15.
Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters.  相似文献   

16.
17.
Microvirga lotononidis is a recently described species of root-nodule bacteria that is an effective nitrogen- (N2) fixing microsymbiont of the symbiotically specific African legume Listia angolensis (Welw. ex Bak.) B.-E. van Wyk & Boatwr. M. lotononidis possesses several properties that are unusual in root-nodule bacteria, including pigmentation and the ability to grow at temperatures of up to 45°C. Strain WSM3557T is an aerobic, motile, Gram-negative, non-spore-forming rod isolated from a L. angolensis root nodule collected in Chipata, Zambia in 1963. This is the first report of a complete genome sequence for the genus Microvirga. Here we describe the features of Microvirga lotononidis strain WSM3557T, together with genome sequence information and annotation. The 7,082,538 high-quality-draft genome is arranged in 18 scaffolds of 104 contigs, contains 6,956 protein-coding genes and 84 RNA-only encoding genes, and is one of 20 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Community Sequencing Program.  相似文献   

18.
Genomic stability, stress response, and nutrient signaling all play critical, evolutionarily conserved roles in lifespan determination. However, the molecular mechanisms coordinating these processes with longevity remain unresolved. Here we investigate the involvement of the yeast anaphase promoting complex (APC) in longevity. The APC governs passage through M and G1 via ubiquitin-dependent targeting of substrate proteins and is associated with cancer and premature aging when defective. Our two-hybrid screen utilizing Apc5 as bait recovered the lifespan determinant Fob1 as prey. Fob1 is unstable specifically in G1, cycles throughout the cell cycle in a manner similar to Clb2 (an APC target), and is stabilized in APC (apc5CA) and proteasome (rpn10) mutants. Deletion of FOB1 increased replicative lifespan (RLS) in wild type (WT), apc5CA, and apc10 cells, and suppressed apc5CA cell cycle progression and rDNA recombination defects. Alternatively, increased FOB1 expression decreased RLS in WT cells, but did not reduce the already short apc5CA RLS, suggesting an epistatic interaction between apc5CA and fob1. Mutation to a putative L-Box (Fob1E420V), a Destruction Box-like motif, abolished Fob1 modifications, stabilized the protein, and increased rDNA recombination. Our work provides a mechanistic role played by the APC to promote replicative longevity and genomic stability in yeast.  相似文献   

19.
Recurrent neural networks with memory and attention mechanisms are widely used in natural language processing because they can capture short and long term sequential information for diverse tasks. We propose an integrated deep learning model for microbial DNA sequence data, which exploits convolutional neural networks, recurrent neural networks, and attention mechanisms to predict taxonomic classifications and sample-associated attributes, such as the relationship between the microbiome and host phenotype, on the read/sequence level. In this paper, we develop this novel deep learning approach and evaluate its application to amplicon sequences. We apply our approach to short DNA reads and full sequences of 16S ribosomal RNA (rRNA) marker genes, which identify the heterogeneity of a microbial community sample. We demonstrate that our implementation of a novel attention-based deep network architecture, Read2Pheno, achieves read-level phenotypic prediction. Training Read2Pheno models will encode sequences (reads) into dense, meaningful representations: learned embedded vectors output from the intermediate layer of the network model, which can provide biological insight when visualized. The attention layer of Read2Pheno models can also automatically identify nucleotide regions in reads/sequences which are particularly informative for classification. As such, this novel approach can avoid pre/post-processing and manual interpretation required with conventional approaches to microbiome sequence classification. We further show, as proof-of-concept, that aggregating read-level information can robustly predict microbial community properties, host phenotype, and taxonomic classification, with performance at least comparable to conventional approaches. An implementation of the attention-based deep learning network is available at https://github.com/EESI/sequence_attention (a python package) and https://github.com/EESI/seq2att (a command line tool).  相似文献   

20.
Histone acetylation is a key regulatory feature for chromatin that is established by opposing enzymatic activities of lysine acetyltransferases (KATs/HATs) and deacetylases (KDACs/HDACs). Esa1, like its human homolog Tip60, is an essential MYST family enzyme that acetylates histones H4 and H2A and other nonhistone substrates. Here we report that the essential requirement for ESA1 in Saccharomyces cerevisiae can be bypassed upon loss of Sds3, a noncatalytic subunit of the Rpd3L deacetylase complex. By studying the esa1sds3 strain, we conclude that the essential function of Esa1 is in promoting the cellular balance of acetylation. We demonstrate this by fine-tuning acetylation through modulation of HDACs and the histone tails themselves. Functional interactions between Esa1 and HDACs of class I, class II, and the Sirtuin family define specific roles of these opposing activities in cellular viability, fitness, and response to stress. The fact that both increased and decreased expression of the ESA1 homolog TIP60 has cancer associations in humans underscores just how important the balance of its activity is likely to be for human well-being.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号