期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

ALLMAPS: robust scaffold ordering based on multiple maps

Haibao Tang Xingtan Zhang Chenyong Miao Jisen Zhang Ray Ming James C Schnable Patrick S Schnable Eric Lyons Jianguo Lu 《Genome biology》2015,16(1)

The ordering and orientation of genomic scaffolds to reconstruct chromosomes is an essential step during de novo genome assembly. Because this process utilizes various mapping techniques that each provides an independent line of evidence, a combination of multiple maps can improve the accuracy of the resulting chromosomal assemblies. We present ALLMAPS, a method capable of computing a scaffold ordering that maximizes colinearity across a collection of maps. ALLMAPS is robust against common mapping errors, and generates sequences that are maximally concordant with the input maps. ALLMAPS is a useful tool in building high-quality genome assemblies. ALLMAPS is available at: https://github.com/tanghaibao/jcvi/wiki/ALLMAPS. 相似文献

2.

Fast Association Tests for Genes with FAST

Pritam Chanda Hailiang Huang Dan E. Arking Joel S. Bader 《PloS one》2013,8(7)

Gene-based tests of association can increase the power of a genome-wide association study by aggregating multiple independent effects across a gene or locus into a single stronger signal. Recent gene-based tests have distinct approaches to selecting which variants to aggregate within a locus, modeling the effects of linkage disequilibrium, representing fractional allele counts from imputation, and managing permutation tests for p-values. Implementing these tests in a single, efficient framework has great practical value. Fast ASsociation Tests (Fast) addresses this need by implementing leading gene-based association tests together with conventional SNP-based univariate tests and providing a consolidated, easily interpreted report. Fast scales readily to genome-wide SNP data with millions of SNPs and tens of thousands of individuals, provides implementations that are orders of magnitude faster than original literature reports, and provides a unified framework for performing several gene based association tests concurrently and efficiently on the same data. Availability: https://bitbucket.org/baderlab/fast/downloads/FAST.tar.gz, with documentation at https://bitbucket.org/baderlab/fast/wiki/Home 相似文献

3.

Irf2bp2a regulates terminal granulopoiesis through proteasomal degradation of Gfi1aa in zebrafish

Shuo Gao Zixuan Wang Luxiang Wang Haihong Wang Hao Yuan Xiaohui Liu Saijuan Chen Zhu Chen Hugues de Th Wenqing Zhang Yiyue Zhang Jun Zhu Jun Zhou 《PLoS genetics》2021,17(8)

相似文献

4.

Chemical graph generators

Mehmet Aziz Yirik Christoph Steinbeck 《PLoS computational biology》2021,17(1)

Chemical graph generators are software packages to generate computer representations of chemical structures adhering to certain boundary conditions. Their development is a research topic of cheminformatics. Chemical graph generators are used in areas such as virtual library generation in drug design, in molecular design with specified properties, called inverse QSAR/QSPR, as well as in organic synthesis design, retrosynthesis or in systems for computer-assisted structure elucidation (CASE). CASE systems again have regained interest for the structure elucidation of unknowns in computational metabolomics, a current area of computational biology. 相似文献

5.

Discovering differential genome sequence activity with interpretable and efficient deep learning

Jennifer Hammelman David K. Gifford 《PLoS computational biology》2021,17(8)

相似文献

6.

dbVOR: a database system for importing pedigree,phenotype and genotype data and exporting selected subsets

Robert V Baron Yvette P Conley Michael B Gorin Daniel E Weeks 《BMC bioinformatics》2015,16(1)

Background

When studying the genetics of a human trait, we typically have to manage both genome-wide and targeted genotype data. There can be overlap of both people and markers from different genotyping experiments; the overlap can introduce several kinds of problems. Most times the overlapping genotypes are the same, but sometimes they are different. Occasionally, the lab will return genotypes using a different allele labeling scheme (for example 1/2 vs A/C). Sometimes, the genotype for a person/marker index is unreliable or missing. Further, over time some markers are merged and bad samples are re-run under a different sample name. We need a consistent picture of the subset of data we have chosen to work with even though there might possibly be conflicting measurements from multiple data sources.

Results

We have developed the dbVOR database, which is designed to hold data efficiently for both genome-wide and targeted experiments. The data are indexed for fast retrieval by person and marker. In addition, we store pedigree and phenotype data for our subjects. The dbVOR database allows us to select subsets of the data by several different criteria and to merge their results into a coherent and consistent whole. Data may be filtered by: family, person, trait value, markers, chromosomes, and chromosome ranges. The results can be presented in columnar, Mega2, or PLINK format.

Conclusions

dbVOR serves our needs well. It is freely available from https://watson.hgen.pitt.edu/register. Documentation for dbVOR can be found at https://watson.hgen.pitt.edu/register/docs/dbvor.html.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0505-4) contains supplementary material, which is available to authorized users. 相似文献

7.

Wham: Identifying Structural Variants of Biological Consequence

Zev N. Kronenberg Edward J. Osborne Kelsey R. Cone Brett J. Kennedy Eric T. Domyan Michael D. Shapiro Nels C. Elde Mark Yandell 《PLoS computational biology》2015,11(12)

Existing methods for identifying structural variants (SVs) from short read datasets are inaccurate. This complicates disease-gene identification and efforts to understand the consequences of genetic variation. In response, we have created Wham (Whole-genome Alignment Metrics) to provide a single, integrated framework for both structural variant calling and association testing, thereby bypassing many of the difficulties that currently frustrate attempts to employ SVs in association testing. Here we describe Wham, benchmark it against three other widely used SV identification tools–Lumpy, Delly and SoftSearch–and demonstrate Wham’s ability to identify and associate SVs with phenotypes using data from humans, domestic pigeons, and vaccinia virus. Wham and all associated software are covered under the MIT License and can be freely downloaded from github (https://github.com/zeeev/wham), with documentation on a wiki (http://zeeev.github.io/wham/). For community support please post questions to https://www.biostars.org/.

This is PLOS Computational Biology software paper.

相似文献

8.

A Daily-Updated Database and Tools for Comprehensive SARS-CoV-2 Mutation-Annotated Trees

Jakob McBroome Bryan Thornlow Angie S Hinrichs Alexander Kramer Nicola De Maio Nick Goldman David Haussler Russell Corbett-Detig Yatish Turakhia 《Molecular biology and evolution》2021,38(12):5819

The vast scale of SARS-CoV-2 sequencing data has made it increasingly challenging to comprehensively analyze all available data using existing tools and file formats. To address this, we present a database of SARS-CoV-2 phylogenetic trees inferred with unrestricted public sequences, which we update daily to incorporate new sequences. Our database uses the recently proposed mutation-annotated tree (MAT) format to efficiently encode the tree with branches labeled with parsimony-inferred mutations, as well as Nextstrain clade and Pango lineage labels at clade roots. As of June 9, 2021, our SARS-CoV-2 MAT consists of 834,521 sequences and provides a comprehensive view of the virus’ evolutionary history using public data. We also present matUtils—a command-line utility for rapidly querying, interpreting, and manipulating the MATs. Our daily-updated SARS-CoV-2 MAT database and matUtils software are available at http://hgdownload.soe.ucsc.edu/goldenPath/wuhCor1/UShER_SARS-CoV-2/ and https://github.com/yatisht/usher, respectively. 相似文献

9.

Identification and classification of reverse transcriptases in bacterial genomes and metagenomes

Fatemeh Sharifi Yuzhen Ye 《Nucleic acids research》2022,50(5):e29

相似文献

10.

Image Alignment for Tomography Reconstruction from Synchrotron X-Ray Microscopic Images

Chang-Chieh Cheng Chia-Chi Chien Hsiang-Hsin Chen Yeukuang Hwu Yu-Tai Ching 《PloS one》2014,9(1)

A synchrotron X-ray microscope is a powerful imaging apparatus for taking high-resolution and high-contrast X-ray images of nanoscale objects. A sufficient number of X-ray projection images from different angles is required for constructing 3D volume images of an object. Because a synchrotron light source is immobile, a rotational object holder is required for tomography. At a resolution of 10 nm per pixel, the vibration of the holder caused by rotating the object cannot be disregarded if tomographic images are to be reconstructed accurately. This paper presents a computer method to compensate for the vibration of the rotational holder by aligning neighboring X-ray images. This alignment process involves two steps. The first step is to match the “projected feature points” in the sequence of images. The matched projected feature points in the - plane should form a set of sine-shaped loci. The second step is to fit the loci to a set of sine waves to compute the parameters required for alignment. The experimental results show that the proposed method outperforms two previously proposed methods, Xradia and SPIDER. The developed software system can be downloaded from the URL, http://www.cs.nctu.edu.tw/~chengchc/SCTA or http://goo.gl/s4AMx. 相似文献

11.

Tn-Seq Explorer: A Tool for Analysis of High-Throughput Sequencing Data of Transposon Mutant Libraries

Sina Solaimanpour Felipe Sarmiento Jan Mrázek 《PloS one》2015,10(5)

Tn-seq is a high throughput technique for analysis of transposon mutant libraries. Tn-seq Explorer was developed as a convenient and easy-to-use package of tools for exploration of the Tn-seq data. In a typical application, the user will have obtained a collection of sequence reads adjacent to transposon insertions in a reference genome. The reads are first aligned to the reference genome using one of the tools available for this task. Tn-seq Explorer reads the alignment and the gene annotation, and provides the user with a set of tools to investigate the data and identify possibly essential or advantageous genes as those that contain significantly low counts of transposon insertions. Emphasis is placed on providing flexibility in selecting parameters and methodology most appropriate for each particular dataset. Tn-seq Explorer is written in Java as a menu-driven, stand-alone application. It was tested on Windows, Mac OS, and Linux operating systems. The source code is distributed under the terms of GNU General Public License. The program and the source code are available for download at http://www.cmbl.uga.edu/downloads/programs/Tn_seq_Explorer/ and https://github.com/sina-cb/Tn-seqExplorer. 相似文献

12.

FIGfams: yet another set of protein families

Folker Meyer Ross Overbeek Alex Rodriguez 《Nucleic acids research》2009,37(20):6643-6654

We present FIGfams, a new collection of over 100 000 protein families that are the product of manual curation and close strain comparison. Using the Subsystem approach the manual curation is carried out, ensuring a previously unattained degree of throughput and consistency. FIGfams are based on over 950 000 manually annotated proteins and across many hundred Bacteria and Archaea. Associated with each FIGfam is a two-tiered, rapid, accurate decision procedure to determine family membership for new proteins. FIGfams are freely available under an open source license. These can be downloaded at ftp://ftp.theseed.org/FIGfams/. The web site for FIGfams is http://www.theseed.org/wiki/FIGfams/ 相似文献

13.

Boosting the prediction and understanding of DNA-binding domains from sequence

Robert E. Langlois Hui Lu 《Nucleic acids research》2010,38(10):3149-3158

相似文献

14.

UFold: fast and accurate RNA secondary structure prediction with deep learning

Laiyi Fu Yingxin Cao Jie Wu Qinke Peng Qing Nie Xiaohui Xie 《Nucleic acids research》2022,50(3):e14

For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but the prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models through free energy minimization, which imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold proposes a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving a similar performance as the traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast with an inference time of about 160 ms per sequence up to 1500 bp in length. An online web server running UFold is available at https://ufold.ics.uci.edu. Code is available at https://github.com/uci-cbcl/UFold. 相似文献

15.

RNA editing regulates lncRNA splicing in human early embryo development

Jiajun Qiu Xiao Ma Fanyi Zeng Jingbin Yan 《PLoS computational biology》2021,17(12)

相似文献

16.

Point-of-Care Autofluorescence Imaging for Real-Time Sampling and Treatment Guidance of Bioburden in Chronic Wounds: First-in-Human Results

Ralph S. DaCosta Iris Kulbatski Liis Lindvere-Teene Danielle Starr Kristina Blackmore Jason I. Silver Julie Opoku Yichao Charlie Wu Philip J. Medeiros Wei Xu Lizhen Xu Brian C. Wilson Cheryl Rosen Ron Linden 《PloS one》2015,10(3)

Background

Traditionally, chronic wound infection is diagnosed by visual inspection under white light and microbiological sampling, which are subjective and suboptimal, respectively, thereby delaying diagnosis and treatment. To address this, we developed a novel handheld, fluorescence imaging device (PRODIGI) that enables non-contact, real-time, high-resolution visualization and differentiation of key pathogenic bacteria through their endogenous autofluorescence, as well as connective tissues in wounds.

Methods and Findings

This was a two-part Phase I, single center, non-randomized trial of chronic wound patients (male and female, ≥18 years; UHN REB #09-0015-A for part 1; UHN REB #12-5003 for part 2; clinicaltrials.gov Identifier: NCT01378728 for part 1 and NCT01651845 for part 2). Part 1 (28 patients; 54% diabetic foot ulcers, 46% non-diabetic wounds) established the feasibility of autofluorescence imaging to accurately guide wound sampling, validated against blinded, gold standard swab-based microbiology. Part 2 (12 patients; 83.3% diabetic foot ulcers, 16.7% non-diabetic wounds) established the feasibility of autofluorescence imaging to guide wound treatment and quantitatively assess treatment response. We showed that PRODIGI can be used to guide and improve microbiological sampling and debridement of wounds in situ, enabling diagnosis, treatment guidance and response assessment in patients with chronic wounds. PRODIGI is safe, easy to use and integrates into the clinical workflow. Clinically significant bacterial burden can be detected in seconds, quantitatively tracked over days-to-months and their biodistribution mapped within the wound bed, periphery, and other remote areas.

Conclusions

PRODIGI represents a technological advancement in wound sampling and treatment guidance for clinical wound care at the point-of-care.

Trial Registration

ClinicalTrials.gov NCT01651845; ClinicalTrials.gov NCT01378728 相似文献

17.

Using Synthetic Mouse Spike-In Transcripts to Evaluate RNA-Seq Analysis Tools

Dena Leshkowitz Ester Feldmesser Gilgi Friedlander Ghil Jona Elena Ainbinder Yisrael Parmet Shirley Horn-Saban 《PloS one》2016,11(4)

相似文献

18.

Comparative GO: A Web Application for Comparative Gene Ontology and Gene Ontology-Based Gene Selection in Bacteria

Mario Fruzangohar Esmaeil Ebrahimie Abiodun D. Ogunniyi Layla K. Mahdi James C. Paton David L. Adelson 《PloS one》2013,8(3)

Availabilityhttp://turing.ersa.edu.au/BacteriaGO. 相似文献

19.

An Online Database for Informing Ecological Network Models: http://kelpforest.ucsc.edu

Rodrigo Beas-Luna Mark Novak Mark H. Carr Martin T. Tinker August Black Jennifer E. Caselle Michael Hoban Dan Malone Alison Iles 《PloS one》2014,9(10)

Ecological network models and analyses are recognized as valuable tools for understanding the dynamics and resiliency of ecosystems, and for informing ecosystem-based approaches to management. However, few databases exist that can provide the life history, demographic and species interaction information necessary to parameterize ecological network models. Faced with the difficulty of synthesizing the information required to construct models for kelp forest ecosystems along the West Coast of North America, we developed an online database (http://kelpforest.ucsc.edu/) to facilitate the collation and dissemination of such information. Many of the database''s attributes are novel yet the structure is applicable and adaptable to other ecosystem modeling efforts. Information for each taxonomic unit includes stage-specific life history, demography, and body-size allometries. Species interactions include trophic, competitive, facilitative, and parasitic forms. Each data entry is temporally and spatially explicit. The online data entry interface allows researchers anywhere to contribute and access information. Quality control is facilitated by attributing each entry to unique contributor identities and source citations. The database has proven useful as an archive of species and ecosystem-specific information in the development of several ecological network models, for informing management actions, and for education purposes (e.g., undergraduate and graduate training). To facilitate adaptation of the database by other researches for other ecosystems, the code and technical details on how to customize this database and apply it to other ecosystems are freely available and located at the following link (https://github.com/kelpforest-cameo/databaseui). 相似文献

20.

Significant sparse polygenic risk scores across 813 traits in UK Biobank

Yosuke Tanigawa Junyang Qian Guhan Venkataraman Johanne Marie Justesen Ruilin Li Robert Tibshirani Trevor Hastie Manuel A. Rivas 《PLoS genetics》2022,18(3)

We present a systematic assessment of polygenic risk score (PRS) prediction across more than 1,500 traits using genetic and phenotype data in the UK Biobank. We report 813 sparse PRS models with significant (p < 2.5 x 10⁻⁵) incremental predictive performance when compared against the covariate-only model that considers age, sex, types of genotyping arrays, and the principal component loadings of genotypes. We report a significant correlation between the number of genetic variants selected in the sparse PRS model and the incremental predictive performance (Spearman’s ⍴ = 0.61, p = 2.2 x 10⁻⁵⁹ for quantitative traits, ⍴ = 0.21, p = 9.6 x 10⁻⁴ for binary traits). The sparse PRS model trained on European individuals showed limited transferability when evaluated on non-European individuals in the UK Biobank. We provide the PRS model weights on the Global Biobank Engine (https://biobankengine.stanford.edu/prs). 相似文献