
1.

ScreenMill: A freely available software suite for growth measurement, analysis and visualization of high-throughput screen data

John C Dittmar Robert JD Reid Rodney Rothstein 《BMC bioinformatics》2010,11(1):353

Background

Many high-throughput genomic experiments, such as Synthetic Genetic Array and yeast two-hybrid, use colony growth on solid media as a screen metric. These experiments routinely generate over 100,000 data points, making data analysis a time-consuming and painstaking process. Here we describe ScreenMill, a new software suite that automates image analysis and simplifies data review and analysis for high-throughput biological experiments.

2.

An eScience-Bayes strategy for analyzing omics data

Martin Eklund Ola Spjuth Jarl ES Wikberg 《BMC bioinformatics》2010,11(1):282

Background

The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge of analyzing the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems.

3.

BisoGenet: a new tool for gene network building, visualization and analysis (total citations: 1; self-citations: 0; citations by others: 1)

Alexander Martin Maria E Ochagavia Laya C Rabasa Jamilet Miranda Jorge Fernandez-de-Cossio Ricardo Bringas 《BMC bioinformatics》2010,11(1):91

Background

The increasing availability and diversity of omics data in the post-genomic era offers new perspectives in most areas of biomedical research. Graph-based biological network models capture the topology of the functional relationships between molecular entities such as genes, proteins and small compounds, and provide a suitable framework for integrating and analyzing omics data. The development of software tools capable of integrating data from different sources and of providing flexible methods to reconstruct, represent and analyze topological networks is an active field of research in bioinformatics.

4.

High-throughput sequence alignment using Graphics Processing Units (total citations: 1; self-citations: 0; citations by others: 1)

Michael C Schatz Cole Trapnell Arthur L Delcher Amitabh Varshney 《BMC bioinformatics》2007,8(1):474

Background

The recent availability of new, less expensive high-throughput DNA sequencing technologies has yielded a dramatic increase in the volume of sequence data that must be analyzed. These data are being generated for several purposes, including genotyping, genome resequencing, metagenomics, and de novo genome assembly projects. Sequence alignment programs such as MUMmer have proven essential for analysis of these data, but researchers will need ever faster, high-throughput alignment tools running on inexpensive hardware to keep up with new sequence technologies.

5.

mSpecs: a software tool for the administration and editing of mass spectral libraries in the field of metabolomics

Bernhard Thielen Stephanie Heinen Dietmar Schomburg 《BMC bioinformatics》2009,10(1):229

Background

Metabolome analysis with GC/MS has become established as one of the "omics" techniques. Compound identification is done by comparing the MS data with compound libraries. Mass spectral libraries in the field of metabolomics ought to connect the relevant mass traces of the metabolites to other relevant data, e.g. formulas, chemical structures, and identification numbers in other databases. Since existing solutions are either commercial and therefore only available for certain instruments, or not capable of storing such information, there is a need for a software tool to manage such data.

6.

TableButler – a Windows based tool for processing large data tables generated with high-throughput methods

Christian Schwager Ute Wirkner Amir Abdollahi Peter E Huber 《BMC bioinformatics》2009,10(1):235-9

Background

High-throughput "omics"-based data analysis plays an emerging role in life sciences and molecular diagnostics. This emphasizes the urgent need for user-friendly Windows-based software interfaces that can process the diversity of large tab-delimited raw data files generated by these methods. Depending on the study, dozens to hundreds of these data tables are generated. Before the actual statistical or cluster analysis, these data tables have to be combined and merged into expression matrices (e.g., in the case of gene expression analysis). Gene annotations as well as information concerning the samples analyzed may be appended, renewed or extended. Often, additional data values must be computed or certain features must be filtered out.
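
The merging step described above, combining per-sample tab-delimited tables into one expression matrix, can be sketched as follows. This is a minimal Python illustration and not TableButler's implementation; the function name `merge_tables`, the column names `gene_id` and `signal`, and the sample data are all hypothetical.

```python
import csv
import io


def merge_tables(tables, id_col="gene_id", value_col="signal", missing=""):
    """Merge per-sample tab-delimited tables into one expression matrix.

    `tables` maps a sample name to a file-like object. Column names are
    assumptions made for this sketch, not a TableButler convention.
    """
    samples = list(tables)
    matrix = {}  # feature id -> {sample name: value}
    for sample, handle in tables.items():
        for row in csv.DictReader(handle, delimiter="\t"):
            matrix.setdefault(row[id_col], {})[sample] = row[value_col]
    header = [id_col] + samples
    # One row per feature; features absent from a sample get `missing`.
    rows = [
        [feature] + [values.get(sample, missing) for sample in samples]
        for feature, values in sorted(matrix.items())
    ]
    return header, rows


# Hypothetical input: two small tables that share only one feature (G2).
sample_a = io.StringIO("gene_id\tsignal\nG1\t10.5\nG2\t3.2\n")
sample_b = io.StringIO("gene_id\tsignal\nG2\t4.0\nG3\t7.7\n")
header, rows = merge_tables({"sample_a": sample_a, "sample_b": sample_b})
```

Values are kept as strings so that the merge itself never alters the raw data; conversion to numbers, annotation, and filtering would be separate steps.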

7.

An ontology for Xenopus anatomy and development

Erik Segerdell Jeff B Bowes Nicolas Pollet Peter D Vize 《BMC developmental biology》2008,8(1):92

Background

The frogs Xenopus laevis and Xenopus (Silurana) tropicalis are model systems that have produced a wealth of genetic, genomic, and developmental information. Xenbase is a model organism database that provides centralized access to this information, including gene function data from high-throughput screens and the scientific literature. A controlled, structured vocabulary for Xenopus anatomy and development is essential for organizing these data.

8.

integRATE: a desirability-based data integration framework for the prioritization of candidate genes across heterogeneous omics and its application to preterm birth

Haley R. Eidem Jacob L. Steenwyk Jennifer H. Wisecaver John A. Capra Patrick Abbot Antonis Rokas 《BMC medical genomics》2018,11(1):107

Background

The integration of high-quality, genome-wide analyses offers a robust approach to elucidating genetic factors involved in complex human diseases. Even though several methods exist to integrate heterogeneous omics data, most biologists still manually select candidate genes by examining the intersection of lists of candidates stemming from analyses of different types of omics data that have been generated by imposing hard (strict) thresholds on quantitative variables, such as P-values and fold changes, increasing the chance of missing potentially important candidates.

Methods

To better facilitate the unbiased integration of heterogeneous omics data collected from diverse platforms and samples, we propose a desirability function framework for identifying candidate genes with strong evidence across data types as targets for follow-up functional analysis. Our approach is targeted towards disease systems with sparse, heterogeneous omics data, so we tested it on one such pathology: spontaneous preterm birth (sPTB).

Results

We developed the software integRATE, which uses desirability functions to rank genes both within and across studies, identifying well-supported candidate genes according to the cumulative weight of biological evidence rather than based on imposition of hard thresholds of key variables. Integrating 10 sPTB omics studies identified both genes in pathways previously suspected to be involved in sPTB as well as novel genes never before linked to this syndrome. integRATE is available as an R package on GitHub (https://github.com/haleyeidem/integRATE).

Conclusions

Desirability-based data integration is a solution most applicable in biological research areas where omics data is especially heterogeneous and sparse, allowing for the prioritization of candidate genes that can be used to inform more targeted downstream functional analyses.
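
The contrast between desirability-based ranking and hard thresholds can be illustrated with a small Python sketch. It is not the integRATE implementation (integRATE is an R package). The piecewise-linear desirability functions, the cut points (0.1 for P-values, 2.0 for absolute log2 fold change), the unweighted arithmetic means, and the gene data are all assumptions chosen for illustration.

```python
def desirability_low(x, lo, hi, scale=1.0):
    """Desirability in [0, 1] when smaller values are better (e.g. P-values)."""
    if x <= lo:
        return 1.0
    if x >= hi:
        return 0.0
    return ((hi - x) / (hi - lo)) ** scale


def desirability_high(x, lo, hi, scale=1.0):
    """Desirability in [0, 1] when larger values are better (e.g. |log2 FC|)."""
    if x >= hi:
        return 1.0
    if x <= lo:
        return 0.0
    return ((x - lo) / (hi - lo)) ** scale


def study_score(p_value, abs_log2_fc):
    """Within-study score: mean desirability over the two variables."""
    return (desirability_low(p_value, 0.0, 0.1)
            + desirability_high(abs_log2_fc, 0.0, 2.0)) / 2


def rank_genes(studies):
    """Across-study score: mean of the within-study scores, highest first."""
    scores = {
        gene: sum(study_score(p, fc) for p, fc in obs) / len(obs)
        for gene, obs in studies.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical data: gene -> [(P-value, |log2 fold change|) per study].
studies = {
    "gene_A": [(0.01, 2.0), (0.06, 1.5)],
    "gene_B": [(0.04, 0.5), (0.04, 0.6)],
    "gene_C": [(0.50, 0.1), (0.30, 0.2)],
}
ranked = rank_genes(studies)

# Hard-threshold alternative: keep genes with P < 0.05 in every study.
hard_threshold_hits = [
    gene for gene, obs in studies.items() if all(p < 0.05 for p, _ in obs)
]
```

In this toy data, the hard threshold drops gene_A because one of its P-values is 0.06, while the desirability ranking still places it first on the strength of its combined evidence.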

9.

An effective approach for identification of in vivo protein-DNA binding sites from paired-end ChIP-Seq data

Congmao Wang Jie Xu Dasheng Zhang Zoe A Wilson Dabing Zhang 《BMC bioinformatics》2010,11(1):81

Background

ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used to identify protein-DNA interactions in vivo across the genome. However, maximizing the effectiveness of the analysis of such sequence data requires the development of new algorithms that can accurately predict DNA-protein binding sites.

10.

Predicting clinical outcome of neuroblastoma patients using an integrative network-based approach

Léon-Charles Tranchevent Petr V. Nazarov Tony Kaoma Georges P. Schmartz Arnaud Muller Sang-Yoon Kim Jagath C. Rajapakse Francisco Azuaje 《Biology direct》2018,13(1):12

11.

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI

Yongchao Liu Bertil Schmidt Douglas L Maskell 《BMC bioinformatics》2011,12(1):85

Background

Next-generation sequencing technologies have led to the high-throughput production of sequence data (reads) at low cost. However, these reads are significantly shorter and more error-prone than conventional Sanger shotgun reads. This poses a challenge for de novo assembly in terms of assembly quality and scalability for large-scale short-read datasets.

12.

Proteomic and network analysis characterize stage-specific metabolism in Trypanosoma cruzi

Seth B Roberts Jennifer L Robichaux Arvind K Chavali Patricio A Manque Vladimir Lee Ana M Lara Jason A Papin Gregory A Buck 《BMC systems biology》2009,3(1):52

Background

Trypanosoma cruzi is a Kinetoplastid parasite of humans and is the cause of Chagas disease, a potentially lethal condition affecting the cardiovascular, gastrointestinal, and nervous systems of the human host. Constraint-based modeling has emerged in the last decade as a useful approach to integrating genomic and other high-throughput data sets with more traditional, experimental data acquired through decades of research and published in the literature.

13.

flowClust: a Bioconductor package for automated gating of flow cytometry data

Kenneth Lo Florian Hahne Ryan R Brinkman Raphael Gottardo 《BMC bioinformatics》2009,10(1):145-8

Background

As a high-throughput technology that offers rapid quantification of multidimensional characteristics for millions of cells, flow cytometry (FCM) is widely used in health research, medical diagnosis and treatment, and vaccine development. Nevertheless, there is an increasing concern about the lack of appropriate software tools to provide an automated analysis platform to parallelize the high-throughput data-generation platform. Currently, to a large extent, FCM data analysis relies on the manual selection of sequential regions in 2-D graphical projections to extract the cell populations of interest. This is a time-consuming task that ignores the high-dimensionality of FCM data.

14.

AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number

Aaron M Newman James B Cooper 《BMC bioinformatics》2010,11(1):117

Background

Clustering the information content of large high-dimensional gene expression datasets has widespread application in "omics" biology. Unfortunately, the underlying structure of these natural datasets is often fuzzy, and the computational identification of data clusters generally requires knowledge about cluster number and geometry.

15.

POINeT: protein interactome with sub-network analysis and hub prioritization

Sheng-An Lee Chen-Hsiung Chan Tzu-Chi Chen Chia-Ying Yang Kuo-Chuan Huang Chi-Hung Tsai Jin-Mei Lai Feng-Sheng Wang Cheng-Yan Kao Chi-Ying F Huang 《BMC bioinformatics》2009,10(1):114-11

Background

Protein-protein interactions (PPIs) are critical to every aspect of biological processes. Expansion of all PPIs from a set of given queries often results in a complex PPI network lacking spatiotemporal consideration. Moreover, the reliability of available PPI resources, which consist of low- and high-throughput data, for network construction remains a significant challenge. Even though a number of software tools are available to facilitate PPI network analysis, an integrated tool is crucial to alleviate the burden of querying across multiple web servers and software tools.

16.

Integrated cellular network of transcription regulations and protein-protein interactions

Yu-Chao Wang Bor-Sen Chen 《BMC systems biology》2010,4(1):20

Background

With the growing accumulation of omics data, a key goal of systems biology is to construct networks at different cellular levels to investigate the cellular machinery. However, there is currently no satisfactory method to construct an integrated cellular network that combines the gene regulatory network and the signaling regulatory pathway.

17.

GMFilter and SXTestPlate: software tools for improving the SNPlex™ genotyping system

Markus Teuber Michael H Wenz Stefan Schreiber Andre Franke 《BMC bioinformatics》2009,10(1):81

Background

Genotyping of single-nucleotide polymorphisms (SNPs) is a fundamental technology in modern genetics. The SNPlex™ mid-throughput genotyping system (Applied Biosystems, Foster City, CA, USA) enables the multiplexed genotyping of up to 48 SNPs simultaneously in a single DNA sample. The high level of automation and the large amount of data produced in a high-throughput laboratory require advanced software tools for quality control and workflow management.

18.

QSRA – a quality-value guided de novo short read assembler

Douglas W Bryant Weng-Keen Wong Todd C Mockler 《BMC bioinformatics》2009,10(1):69

Background

New rapid high-throughput sequencing technologies have sparked the creation of a new class of assembler. Since all high-throughput sequencing platforms incorporate errors in their output, short-read assemblers must be designed to account for this error while utilizing all available data.

19.

In silico discovery of human natural antisense transcripts

Yuan-Yuan Li Lei Qin Zong-Ming Guo Lei Liu Hao Xu Pei Hao Jiong Su Yixiang Shi Wei-Zhong He Yi-Xue Li 《BMC bioinformatics》2006,7(1):18-8

20.

Identifying and quantifying metabolites by scoring peaks of GC-MS data

Raphael BM Aggio Arno Mayor Sophie Reade Chris SJ Probert Katya Ruggiero 《BMC bioinformatics》2014,15(1)

Background

Metabolomics is one of the most recent omics technologies. It has been applied in fields such as food science, nutrition, drug discovery and systems biology. For this purpose, gas chromatography-mass spectrometry (GC-MS) has been widely applied, and many computational tools have been developed to support the analysis of metabolomics data. Among them, AMDIS is perhaps the most used tool for identifying and quantifying metabolites. However, AMDIS generates a high number of false positives and does not have an interface amenable to high-throughput data analysis. Although additional computational tools have been developed to process AMDIS results and to perform normalisations and statistical analysis of metabolomics data, there is not yet a single free software tool or package able to reliably identify and quantify metabolites analysed by GC-MS.

Results

Here we introduce a new algorithm, PScore, able to score peaks according to their likelihood of representing metabolites defined in a mass spectral library. We implemented PScore in an R package called MetaBox and evaluated the applicability and potential of MetaBox by comparing its performance against AMDIS results when analysing volatile organic compounds (VOC) from standard mixtures of metabolites and from female and male mice faecal samples. MetaBox reported lower percentages of false positives and false negatives, and was able to report a higher number of potential biomarkers associated with the metabolism of female and male mice.

Conclusions

Identification and quantification of metabolites is among the most critical and time-consuming steps in GC-MS metabolome analysis. Here we present an algorithm implemented in an R package, which allows users to construct flexible pipelines and analyse metabolomics data in a high-throughput manner.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0374-2) contains supplementary material, which is available to authorized users.