期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

PyPedia: using the wiki paradigm as crowd sourcing environment for bioinformatics protocols

Alexandros?Kanterakis Email author Jo?l?Kuiper George?Potamias Morris?A.?Swertz 《Source code for biology and medicine》2015,10(1):14

Background

Today researchers can choose from many bioinformatics protocols for all types of life sciences research, computational environments and coding languages. Although the majority of these are open source, few of them possess all virtues to maximize reuse and promote reproducible science. Wikipedia has proven a great tool to disseminate information and enhance collaboration between users with varying expertise and background to author qualitative content via crowdsourcing. However, it remains an open question whether the wiki paradigm can be applied to bioinformatics protocols.

Results

We piloted PyPedia, a wiki where each article is both implementation and documentation of a bioinformatics computational protocol in the python language. Hyperlinks within the wiki can be used to compose complex workflows and induce reuse. A RESTful API enables code execution outside the wiki. Initial content of PyPedia contains articles for population statistics, bioinformatics format conversions and genotype imputation. Use of the easy to learn wiki syntax effectively lowers the barriers to bring expert programmers and less computer savvy researchers on the same page.

Conclusions

PyPedia demonstrates how wiki can provide a collaborative development, sharing and even execution environment for biologists and bioinformaticians that complement existing resources, useful for local and multi-center research teams.

Availability

PyPedia is available online at: http://www.pypedia.com. The source code and installation instructions are available at: https://github.com/kantale/PyPedia_server. The PyPedia python library is available at: https://github.com/kantale/pypedia. PyPedia is open-source, available under the BSD 2-Clause License.

相似文献

2.

GEMBASSY: an EMBOSS associated software package for comprehensive genome analyses

Hidetoshi?Itaya Kazuki?Oshita Kazuharu?Arakawa Email author Masaru?Tomita 《Source code for biology and medicine》2013,8(1):17

相似文献

3.

TROM: A Testing-Based Method for Finding Transcriptomic Similarity of Biological Samples

Wei Vivian Li Yiling Chen Jingyi Jessica Li 《Statistics in biosciences》2017,9(1):105-136

相似文献

4.

PureCLIP: capturing target-specific protein–RNA interaction footprints from single-nucleotide CLIP-seq data

Sabrina Krakau Hugues Richard Annalisa Marsico 《Genome biology》2017,18(1):240

The iCLIP and eCLIP techniques facilitate the detection of protein–RNA interaction sites at high resolution, based on diagnostic events at crosslink sites. However, previous methods do not explicitly model the specifics of iCLIP and eCLIP truncation patterns and possible biases. We developed PureCLIP (https://github.com/skrakau/PureCLIP), a hidden Markov model based approach, which simultaneously performs peak-calling and individual crosslink site detection. It explicitly incorporates a non-specific background signal and, for the first time, non-specific sequence biases. On both simulated and real data, PureCLIP is more accurate in calling crosslink sites than other state-of-the-art methods and has a higher agreement across replicates. 相似文献

5.

NormalizeMets: assessing,selecting and implementing statistical methods for normalizing metabolomics data

Alysha M. De Livera Gavriel Olshansky Julie A. Simpson Darren J. Creek 《Metabolomics : Official journal of the Metabolomic Society》2018,14(5):54

Introduction

In metabolomics studies, unwanted variation inevitably arises from various sources. Normalization, that is the removal of unwanted variation, is an essential step in the statistical analysis of metabolomics data. However, metabolomics normalization is often considered an imprecise science due to the diverse sources of variation and the availability of a number of alternative strategies that may be implemented.

Objectives

We highlight the need for comparative evaluation of different normalization methods and present software strategies to help ease this task for both data-oriented and biological researchers.

Methods

We present NormalizeMets—a joint graphical user interface within the familiar Microsoft Excel and freely-available R software for comparative evaluation of different normalization methods. The NormalizeMets R package along with the vignette describing the workflow can be downloaded from https://cran.r-project.org/web/packages/NormalizeMets/. The Excel Interface and the Excel user guide are available on https://metabolomicstats.github.io/ExNormalizeMets.

Results

NormalizeMets allows for comparative evaluation of normalization methods using criteria that depend on the given dataset and the ultimate research question. Hence it guides researchers to assess, select and implement a suitable normalization method using either the familiar Microsoft Excel and/or freely-available R software. In addition, the package can be used for visualisation of metabolomics data using interactive graphical displays and to obtain end statistical results for clustering, classification, biomarker identification adjusting for confounding variables, and correlation analysis.

Conclusion

NormalizeMets is designed for comparative evaluation of normalization methods, and can also be used to obtain end statistical results. The use of freely-available R software offers an attractive proposition for programming-oriented researchers, and the Excel interface offers a familiar alternative to most biological researchers. The package handles the data locally in the user’s own computer allowing for reproducible code to be stored locally.

相似文献

6.

Larval host records of butterflies in Japan

Masayuki U. Saito Utsugi Jinbo Masaya Yago Osamu Kurashima Motomi Ito 《Ecological Research》2016,31(4):491-491

Using Japanese literature, we created a consolidated list of host records of butterflies in Japan. The list used the host records described in eight major illustrated reference books, two checklists, and 14 other pieces of literature. The presence of larvae on plants, the observation of larvae eating plants or insects in the field were considered as host records. We collected all species recorded in Japan. Scientific, family, and Japanese names of butterflies were consolidated using the BINRAN database (http://binran.lepimages.jp/). Scientific and Japanese names of host plants were based on the YList database (http://ylist.info/). If scientific names of host plants were not found in YList, we used scientific names based on The Plant List (http://www.theplantlist.org/). Family names of host plants were based on the Catalogue of Life database (http://www.catalogueoflife.org/). Scientific, family, and Japanese names of host insects were based on the MOKUROKU database (http://konchudb.agr.agr.kyushu-u.ac.jp/mokuroku/) for Hymenoptera and the catalogue of the Paraneoptera of Japan published by the Entomological Society of Japan for Hemiptera. We also provided the references of each host record and the original names described in the referred literature. Two datasets, HostDB and ReferenceDB, were created to include 3600 records of butterfly larval hosts in Japan, along with scientific and Japanese names of each species and a literature list. These datasets will be useful for basic and applied biological studies of butterflies. Data files are stored in the Ecological Research Data Archives (http://db.cger.nies.go.jp/JaLTER/ER_DataPapers/) and available from http://hostbj.lepumus.net/. These datasets are published under the Creative Commons License Attribution-ShareAlike 4.0 (CC BY-SA, https://creativecommons.org/licenses/by-sa/4.0/). 相似文献

7.

Nasal gene expression differentiates COPD from controls and overlaps bronchial gene expression

Ilse M. Boudewijn Alen Faiz Katrina Steiling Erica van der Wiel Eef D. Telenga Susan J. M. Hoonhorst Nick H. T. ten Hacken Corry-Anke Brandsma Huib A. M. Kerstjens Wim Timens Irene H. Heijink Marnix R. Jonker Harold G. de Bruin J. Sebastiaan Vroegop Henk R. Pasma Wim G. Boersma Pascal Wielders Frank van den Elshout Khaled Mansour Avrum Spira Marc E. Lenburg Victor Guryev Dirkje S. Postma Maarten van den Berge 《Respiratory research》2017,18(1):213

Background

Nasal gene expression profiling is a promising method to characterize COPD non-invasively. We aimed to identify a nasal gene expression profile to distinguish COPD patients from healthy controls. We investigated whether this COPD-associated gene expression profile in nasal epithelium is comparable with the profile observed in bronchial epithelium.

Methods

Genome wide gene expression analysis was performed on nasal epithelial brushes of 31 severe COPD patients and 22 controls, all current smokers, using Affymetrix Human Gene 1.0 ST Arrays. We repeated the gene expression analysis on bronchial epithelial brushes in 2 independent cohorts of mild-to-moderate COPD patients and controls.

Results

In nasal epithelium, 135 genes were significantly differentially expressed between severe COPD patients and controls, 21 being up- and 114 downregulated in COPD (false discovery rate?<?0.01). Gene Set Enrichment Analysis (GSEA) showed significant concordant enrichment of COPD-associated nasal and bronchial gene expression in both independent cohorts (FDR_GSEA <?0.001).

Conclusion

We identified a nasal gene expression profile that differentiates severe COPD patients from controls. Of interest, part of the nasal gene expression changes in COPD mimics differentially expressed genes in the bronchus. These findings indicate that nasal gene expression profiling is potentially useful as a non-invasive biomarker in COPD.

Trial registration

ClinicalTrials.gov registration number NCT01351792 (registration date May 10, 2011), ClinicalTrials.gov registration number NCT00848406 (registration date February 19, 2009), ClinicalTrials.gov registration number NCT00807469 (registration date December 11, 2008).

相似文献

8.

miRTrace reveals the organismal origins of microRNA sequencing data

Wenjing Kang Yrin Eldfjell Bastian Fromm Xavier Estivill Inna Biryukova Marc R. Friedländer 《Genome biology》2018,19(1):213

We present here miRTrace, the first algorithm to trace microRNA sequencing data back to their taxonomic origins. This is a challenge with profound implications for forensics, parasitology, food control, and research settings where cross-contamination can compromise results. miRTrace accurately (>?99%) assigns real and simulated data to 14 important animal and plant groups, sensitively detects parasitic infection in mammals, and discovers the primate origin of single cells. Applying our algorithm to over 700 public datasets, we find evidence that over 7% are cross-contaminated and present a novel solution to clean these computationally, even after sequencing has occurred. miRTrace is freely available at https://github.com/friedlanderlab/mirtrace. 相似文献

9.

UBCG: Up-to-date bacterial core gene set and pipeline for phylogenomic tree reconstruction

Seong-In Na Yeong Ouk Kim Seok-Hwan Yoon Sung-min Ha Inwoo Baek Jongsik Chun

《Journal of microbiology (Seoul, Korea)》

Genome-based phylogeny plays a central role in the future taxonomy and phylogenetics of Bacteria and Archaea by replacing 16S rRNA gene phylogeny. The concatenated core gene alignments are frequently used for such a purpose. The bacterial core genes are defined as single-copy, homologous genes that are present in most of the known bacterial species. There have been several studies describing such a gene set, but the number of species considered was rather small. Here we present the up-to-date bacterial core gene set, named UBCG, and software suites to accommodate necessary steps to generate and evaluate phylogenetic trees. The method was successfully used to infer phylogenomic relationship of Escherichia and related taxa and can be used for the set of genomes at any taxonomic ranks of Bacteria. The UBCG pipeline and file viewer are freely available at https://www.ezbiocloud.net/tools/ubcg and https://www.ezbiocloud.net/tools/ubcg_viewer, respectively. 相似文献

10.

The Year of the Wisent

Johannes?A.?Lenstra Email author Jianquan?Liu 《BMC biology》2016,14(1):100

Delving into European prehistory, two recent studies analyze ancient DNA from bison species depicted by our ancestors on the walls of their caves. The DNA tells a story of migrations driven by climate change but leaves some mystery clouding the genetic descent and climate preference of the still-extant wisent, otherwise known as the European bison.See research articles: https://bmcbiol.biomedcentral.com/articles/10.1186/s12915-016-0317-7 http://www.nature.com/articles/ncomms13158 相似文献

11.

The FOCUS,AFFINITY and EFFECTS trials studying the effect(s) of fluoxetine in patients with a recent stroke: statistical and health economic analysis plan for the trials and for the individual patient data meta-analysis

Catriona Graham Steff Lewis John Forbes Gillian Mead Maree L. Hackett Graeme J. Hankey John Gommans Huy Thang Nguyen Erik Lundström Eva Isaksson Per Näsman Ann-Sofie Rudberg Martin Dennis 《Trials》2017,18(1):627

Background

Small trials have suggested that fluoxetine may improve neurological recovery from stroke. FOCUS, AFFINITY and EFFECTS are a family of investigator-led, multicentre, parallel group, randomised, placebo-controlled trials which aim to determine whether the routine administration of fluoxetine (20 mg daily) for six months after an acute stroke improves patients’ functional outcome.

Methods/Design

The core protocol for the three trials has been published (Mead et al., Trials 20:369, 2015). The trials include patients aged 18 years and older with a clinical diagnosis of stroke and persisting focal neurological deficits at randomisation 2–15 days after stroke onset. Patients are randomised centrally via each trials’ web-based randomisation system using a common minimisation algorithm. Patients are allocated fluoxetine 20 mg once daily or matching placebo capsules for six months. The primary outcome measure is the modified Rankin scale (mRS) at six months. Secondary outcomes include: living circumstances; the Stroke Impact Scale; EuroQol (EQ5D-5 L); the vitality subscale of the 36-Item Short Form Health Survey (SF36); diagnosis of depression; adherence to medication; serious adverse events including death and recurrent stroke; and resource use at six and 12 months and the mRS at 12 months.

Discussion

Minor variations have been tailored to the national setting in the UK (FOCUS), Australia, New Zealand and Vietnam (AFFINITY) and Sweden (EFFECTS). Each trial is run and funded independently and will report its own results. A prospectively planned individual patient data meta-analysis of all three trials will provide the most precise estimate of the overall effect and establish whether any effects differ between trials or subgroups. This statistical analysis plan describes the core analyses for all three trials and that for the individual patient data meta-analysis. Recruitment and follow-up in the FOCUS trial is expected to be completed by the end of 2018. AFFINITY and EFFECTS are likely to complete follow-up in 2020.

Trial registration

FOCUS: ISRCTN, ISRCTN83290762. Registered on 23 May 2012. EudraCT, 2011-005616-29. Registered on 3 February 2012.AFFINITY: Australian New Zealand Clinical Trials Registry, ACTRN12611000774921. Registered on 22 July 2011.EFFECTS: ISRCTN, ISRCTN13020412. Registered on 19 December 2014. Clinicaltrials.gov, NCT02683213. Registered on 2 February 2016. EudraCT, 2011-006130-16. Registered on 8 August 2014.

相似文献

12.

Speed and accuracy improvement of higher-order epistasis detection on CUDA-enabled GPUs

Daniel Jünger Christian Hundt Jorge González Domínguez Bertil Schmidt 《Cluster computing》2017,20(3):1899-1908

The discovery of higher-order epistatic interactions is an important task in the field of genome wide association studies which allows for the identification of complex interaction patterns between multiple genetic markers. Some existing bruteforce approaches explore the whole space of k-interactions in an exhaustive manner resulting in almost intractable execution times. Computational cost can be reduced drastically by restricting the search space with suitable preprocessing filters which prune unpromising candidates. Other approaches mitigate the execution time by employing massively parallel accelerators in order to benefit from the vast computational resources of these architectures. In this paper, we combine a novel preprocessing filter, namely SingleMI, with massively parallel computation on modern GPUs to further accelerate epistasis discovery. Our implementation improves both the runtime and accuracy when compared to a previous GPU counterpart that employs mutual information clustering for prefiltering. SingleMI is open source software and publicly available at: https://github.com/sleeepyjack/singlemi/. 相似文献

13.

Kudi: A free open-source python library for the analysis of properties along reaction paths

Stefan Vogt-Geisse 《Journal of molecular modeling》2016,22(5):110

With increasing computational capabilities, an ever growing amount of data is generated in computational chemistry that contains a vast amount of chemically relevant information. It is therefore imperative to create new computational tools in order to process and extract this data in a sensible way. Kudi is an open source library that aids in the extraction of chemical properties from reaction paths. The straightforward structure of Kudi makes it easy to use for users and allows for effortless implementation of new capabilities, and extension to any quantum chemistry package. A use case for Kudi is shown for the tautomerization reaction of formic acid. Kudi is available free of charge at www.github.com/stvogt/kudi 相似文献

14.

Metastasis as supra-cellular selection? A reply to Lean and Plutynski

Pierre-Luc Germain Lucie Laplane 《Biology & philosophy》2017,32(2):281-287

In response to Germain (Biol Philos 27:785–810, 2012. doi: 10.1007/s10539-012-9334-2) argument that evolution by natural selection has a limited explanatory power in cancer, Lean and Plutynski (Biol Philos 31:39–57, 2016. doi: 10.1007/s10539-015-9511-1) have recently argued that many adaptations in cancer only make sense at the tumor level, and that cancer progression mirrors the major evolutionary transitions. While we agree that selection could potentially act at various levels of organization in cancers, we argue that tumor-level selection (MLS2) is unlikely to actually play a relevant role in our understanding of the somatic evolution of human cancers. 相似文献

15.

<Emphasis Type="Italic">NIPTeR</Emphasis>: an R package for fast and accurate trisomy prediction in non-invasive prenatal testing

Lennart F. Johansson Hendrik A. de Weerd Eddy N. de Boer Freerk van Dijk Gerard J. te Meerman Rolf H. Sijmons Birgit Sikkema-Raddatz Morris A. Swertz 《BMC bioinformatics》2018,19(1):531

Background

Various algorithms have been developed to predict fetal trisomies using cell-free DNA in non-invasive prenatal testing (NIPT). As basis for prediction, a control group of non-trisomy samples is needed. Prediction accuracy is dependent on the characteristics of this group and can be improved by reducing variability between samples and by ensuring the control group is representative for the sample analyzed.

Results

NIPTeR is an open-source R Package that enables fast NIPT analysis and simple but flexible workflow creation, including variation reduction, trisomy prediction algorithms and quality control. This broad range of functions allows users to account for variability in NIPT data, calculate control group statistics and predict the presence of trisomies.

Conclusion

NIPTeR supports laboratories processing next-generation sequencing data for NIPT in assessing data quality and determining whether a fetal trisomy is present. NIPTeR is available under the GNU LGPL v3 license and can be freely downloaded from https://github.com/molgenis/NIPTeR or CRAN.

相似文献

16.

Reliability of signal transmission in stochastic nerve axon equations

Martin Sauer Wilhelm Stannat 《Journal of computational neuroscience》2016,40(1):103-111

We introduce a method for computing probabilities for spontaneous activity and propagation failure of the action potential in spatially extended, conductance-based neuronal models subject to noise, based on statistical properties of the membrane potential. We compare different estimators with respect to the quality of detection, computational costs and robustness and propose the integral of the membrane potential along the axon as an appropriate estimator to detect both spontaneous activity and propagation failure. Performing a model reduction we achieve a simplified analytical expression based on the linearization at the resting potential (resp. the traveling action potential). This allows to approximate the probabilities for spontaneous activity and propagation failure in terms of (classical) hitting probabilities of one-dimensional linear stochastic differential equations. The quality of the approximation with respect to the noise amplitude is discussed and illustrated with numerical results for the spatially extended Hodgkin-Huxley equations. Python simulation code is supplied on GitHub under the link https://github.com/deristnochda/Hodgkin-Huxley-SPDE. 相似文献

17.

ProSave: an application for restoring quantitative data to manipulated subsets of protein lists

Daniel A. Machlab Gabriel Velez Alexander G. Bassuk Vinit B. Mahajan 《Source code for biology and medicine》2018,13(1):3

相似文献

18.

A Cladist is a systematist who seeks a natural classification: some comments on Quinn (2017)

David M. Williams Malte C. Ebach 《Biology & philosophy》2018,33(1-2):10

In response to Quinn (Biol Philos, 2017. https://doi.org/10.1007/s10539-017-9577-z) we identify cladistics to be about natural classifications and their discovery and thereby propose to add an eighth cladistic definition to Quinn’s list, namely the systematist who seeks to discover natural classifications, regardless of their affiliation, theoretical or methodological justifications. 相似文献

19.

Locus-aware decomposition of gene trees with respect to polytomous species trees

Michał Aleksander Ciach Anna Muszewska Paweł Górecki 《Algorithms for molecular biology : AMB》2018,13(1):11

Background

Horizontal gene transfer (HGT), a process of acquisition and fixation of foreign genetic material, is an important biological phenomenon. Several approaches to HGT inference have been proposed. However, most of them either rely on approximate, non-phylogenetic methods or on the tree reconciliation, which is computationally intensive and sensitive to parameter values.

Results

We investigate the locus tree inference problem as a possible alternative that combines the advantages of both approaches. We present several algorithms to solve the problem in the parsimony framework. We introduce a novel tree mapping, which allows us to obtain a heuristic solution to the problems of locus tree inference and duplication classification.

Conclusions

Our approach allows for faster comparisons of gene and species trees and improves known algorithms for duplication inference in the presence of polytomies in the species trees. We have implemented our algorithms in a software tool available at https://github.com/mciach/LocusTreeInference.

相似文献

20.

Escher-FBA: a web application for interactive flux balance analysis

Elliot Rowe Bernhard O. Palsson 《BMC systems biology》2018,12(1):84

Background

Flux balance analysis (FBA) is a widely-used method for analyzing metabolic networks. However, most existing tools that implement FBA require downloading software and writing code. Furthermore, FBA generates predictions for metabolic networks with thousands of components, so meaningful changes in FBA solutions can be difficult to identify. These challenges make it difficult for beginners to learn how FBA works.

Results

To meet this need, we present Escher-FBA, a web application for interactive FBA simulations within a pathway visualization. Escher-FBA allows users to set flux bounds, knock out reactions, change objective functions, upload metabolic models, and generate high-quality figures without downloading software or writing code. We provide detailed instructions on how to use Escher-FBA to replicate several FBA simulations that generate real scientific hypotheses.

Conclusions

We designed Escher-FBA to be as intuitive as possible so that users can quickly and easily understand the core concepts of FBA. The web application can be accessed at https://sbrg.github.io/escher-fba.

相似文献