期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Fusarium head blight (FHB) and Septoria tritici blotch (STB) severely impair wheat production. With the aim to further elucidate the genetic architecture underlying FHB and STB resistance, we phenotyped 1604 European wheat hybrids and their 135 parental lines for FHB and STB disease severities and determined genotypes at 17,372 single-nucleotide polymorphic loci.

Results

Cross-validated association mapping revealed the absence of large effect QTL for both traits. Genomic selection showed a three times higher prediction accuracy for FHB than STB disease severity for test sets largely unrelated to the training sets.

Conclusions

Our findings suggest that the genetic architecture is less complex and, hence, can be more properly tackled to perform accurate prediction for FHB than STB disease severity. Consequently, FHB disease severity is an interesting model trait to fine-tune genomic selection models exploiting beyond relatedness also knowledge of the genetic architecture.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1628-8) contains supplementary material, which is available to authorized users. 相似文献

6.

XSAnno: a framework for building ortholog models in cross-species transcriptome comparisons

Ying Zhu Mingfeng Li André MM Sousa Nenad ?estan 《BMC genomics》2014,15(1)

相似文献

7.

CAP-miRSeq: a comprehensive analysis pipeline for microRNA sequencing data

Zhifu Sun Jared Evans Aditya Bhagwate Sumit Middha Matthew Bockol Huihuang Yan Jean-Pierre Kocher 《BMC genomics》2014,15(1)

Background

miRNAs play a key role in normal physiology and various diseases. miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant amount of computational resources and bioinformatics expertise. Several web based analytical tools have been developed but they are limited to processing one or a pair of samples at time and are not suitable for a large scale study. Lack of flexibility and reliability of these web applications are also common issues.

Results

We developed a Comprehensive Analysis Pipeline for microRNA Sequencing data (CAP-miRSeq) that integrates read pre-processing, alignment, mature/precursor/novel miRNA detection and quantification, data visualization, variant detection in miRNA coding region, and more flexible differential expression analysis between experimental conditions. According to computational infrastructure, users can install the package locally or deploy it in Amazon Cloud to run samples sequentially or in parallel for a large number of samples for speedy analyses. In either case, summary and expression reports for all samples are generated for easier quality assessment and downstream analyses. Using well characterized data, we demonstrated the pipeline’s superior performances, flexibility, and practical use in research and biomarker discovery.

Conclusions

CAP-miRSeq is a powerful and flexible tool for users to process and analyze miRNA-seq data scalable from a few to hundreds of samples. The results are presented in the convenient way for investigators or analysts to conduct further investigation and discovery.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-423) contains supplementary material, which is available to authorized users. 相似文献

8.

GRACOMICS: software for graphical comparison of multiple results with omics data

Minseok Seo Joon Yoon Taesung Park 《BMC genomics》2015,16(1)

相似文献

9.

Distance-based assessment of the localization of functional annotations in 3D genome reconstructions

Daniel Capurso Mark R Segal 《BMC genomics》2014,15(1)

Background

Recent studies used the contact data or three-dimensional (3D) genome reconstructions from Hi-C (chromosome conformation capture with next-generation sequencing) to assess the co-localization of functional genomic annotations in the nucleus. These analyses dichotomized data point pairs belonging to a functional annotation as “close” or “far” based on some threshold and then tested for enrichment of “close” pairs. We propose an alternative approach that avoids dichotomization of the data and instead directly estimates the significance of distances within the 3D reconstruction.

Results

We applied this approach to 3D genome reconstructions for Plasmodium falciparum, the causative agent of malaria, and Saccharomyces cerevisiae and compared the results to previous approaches. We found significant 3D co-localization of centromeres, telomeres, virulence genes, and several sets of genes with developmentally regulated expression in P. falciparum; and significant 3D co-localization of centromeres and long terminal repeats in S. cerevisiae. Additionally, we tested the experimental observation that telomeres form three to seven clusters in P. falciparum and S. cerevisiae. Applying affinity propagation clustering to telomere coordinates in the 3D reconstructions yielded six telomere clusters for both organisms.

Conclusions

Distance-based assessment replicated key findings, while avoiding dichotomization of the data (which previously yielded threshold-sensitive results).

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-992) contains supplementary material, which is available to authorized users. 相似文献

10.

Global copy number analyses by next generation sequencing provide insight into pig genome variation

Jicai Jiang Jiying Wang Haifei Wang Yan Zhang Huimin Kang Xiaotian Feng Jiafu Wang Zongjun Yin Wenbin Bao Qin Zhang Jian-Feng Liu 《BMC genomics》2014,15(1)

Background

Copy number variations (CNVs) confer significant effects on genetic innovation and phenotypic variation. Previous CNV studies in swine seldom focused on in-depth characterization of global CNVs.

Results

Using whole-genome assembly comparison (WGAC) and whole-genome shotgun sequence detection (WSSD) approaches by next generation sequencing (NGS), we probed formation signatures of both segmental duplications (SDs) and individualized CNVs in an integrated fashion, building the finest resolution CNV and SD maps of pigs so far. We obtained copy number estimates of all protein-coding genes with copy number variation carried by individuals, and further confirmed two genes with high copy numbers in Meishan pigs through an enlarged population. We determined genome-wide CNV hotspots, which were significantly enriched in SD regions, suggesting evolution of CNV hotspots may be affected by ancestral SDs. Through systematically enrichment analyses based on simulations and bioinformatics analyses, we revealed CNV-related genes undergo a different selective constraint from those CNV-unrelated regions, and CNVs may be associated with or affect pig health and production performance under recent selection.

Conclusions

Our studies lay out one way for characterization of CNVs in the pig genome, provide insight into the pig genome variation and prompt CNV mechanisms studies when using pigs as biomedical models for human diseases.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-593) contains supplementary material, which is available to authorized users. 相似文献

11.

Pathomx: an interactive workflow-based tool for the analysis of metabolomic data

Martin A Fitzpatrick Catherine M McGrath Stephen P Young 《BMC bioinformatics》2014,15(1)

相似文献

12.

SFMetrics: an analysis tool for scanning force microscopy images of biomolecules

Humberto Sánchez Claire Wyman 《BMC bioinformatics》2015,16(1)

Background

Scanning force microscopy (SFM) allows direct, rapid and high-resolution visualization of single molecular complexes; irregular shapes and differences in sizes are immediately revealed by the scanning tip in three-dimensional images. However, high-throughput analysis of SFM data is limited by the lack of versatile software tools accessible to SFM users. Most existing SFM software tools are aimed at broad general use: from material-surface analysis to visualization of biomolecules.

Results

We present SFMetrics as a metrology toolbox for SFM, specifically aimed at biomolecules like DNA and proteins, which features (a) semi-automatic high-throughput analysis of individual molecules; (b) ease of use working within MATLAB environment or as a stand-alone application; (c) compatibility with MultiMode (Bruker), NanoWizard (JPK instruments), Asylum (Asylum research), ASCII, and TIFF files, that can be adjusted with minor modifications to other formats.

Conclusion

Assembled in a single user interface, SFMetrics serves as a semi-automatic analysis tool capable of measuring several geometrical properties (length, volume and angles) from DNA and protein complexes, but is also applicable to other samples with irregular shapes.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0457-8) contains supplementary material, which is available to authorized users. 相似文献

13.

APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools

Erik K Malm Vaibhav Srivastava Gustav Sundqvist Vincent Bulone 《BMC bioinformatics》2014,15(1)

Background

Mass spectrometry analyses of complex protein samples yield large amounts of data and specific expertise is needed for data analysis, in addition to a dedicated computer infrastructure. Furthermore, the identification of proteins and their specific properties require the use of multiple independent bioinformatics tools and several database search algorithms to process the same datasets. In order to facilitate and increase the speed of data analysis, there is a need for an integrated platform that would allow a comprehensive profiling of thousands of peptides and proteins in a single process through the simultaneous exploitation of multiple complementary algorithms.

Results

We have established a new proteomics pipeline designated as APP that fulfills these objectives using a complete series of tools freely available from open sources. APP automates the processing of proteomics tasks such as peptide identification, validation and quantitation from LC-MS/MS data and allows easy integration of many separate proteomics tools. Distributed processing is at the core of APP, allowing the processing of very large datasets using any combination of Windows/Linux physical or virtual computing resources.

Conclusions

APP provides distributed computing nodes that are simple to set up, greatly relieving the need for separate IT competence when handling large datasets. The modular nature of APP allows complex workflows to be managed and distributed, speeding up throughput and setup. Additionally, APP logs execution information on all executed tasks and generated results, simplifying information management and validation.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-014-0441-8) contains supplementary material, which is available to authorized users. 相似文献

14.

A MATLAB tool for pathway enrichment using a topology-based pathway regulation score

Maysson Ibrahim Sabah Jassim Michael Anthony Cawthorne Kenneth Langlands 《BMC bioinformatics》2014,15(1)

相似文献

15.

RIEMS: a software pipeline for sensitive and comprehensive taxonomic classification of reads from metagenomics datasets

Matthias Scheuch Dirk H?per Martin Beer 《BMC bioinformatics》2015,16(1)

Background

Fuelled by the advent and subsequent development of next generation sequencing technologies, metagenomics became a powerful tool for the analysis of microbial communities both scientifically and diagnostically. The biggest challenge is the extraction of relevant information from the huge sequence datasets generated for metagenomics studies. Although a plethora of tools are available, data analysis is still a bottleneck.

Results

To overcome the bottleneck of data analysis, we developed an automated computational workflow called RIEMS – Reliable Information Extraction from Metagenomic Sequence datasets. RIEMS assigns every individual read sequence within a dataset taxonomically by cascading different sequence analyses with decreasing stringency of the assignments using various software applications. After completion of the analyses, the results are summarised in a clearly structured result protocol organised taxonomically. The high accuracy and performance of RIEMS analyses were proven in comparison with other tools for metagenomics data analysis using simulated sequencing read datasets.

Conclusions

RIEMS has the potential to fill the gap that still exists with regard to data analysis for metagenomics studies. The usefulness and power of RIEMS for the analysis of genuine sequencing datasets was demonstrated with an early version of RIEMS in 2011 when it was used to detect the orthobunyavirus sequences leading to the discovery of Schmallenberg virus.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0503-6) contains supplementary material, which is available to authorized users. 相似文献

16.

SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences

Federico Agostini Davide Cirillo Riccardo Delli Ponti Gian Gaetano Tartaglia 《BMC genomics》2014,15(1)

相似文献

17.

SummonChimera infers integrated viral genomes with nucleotide precision from NGS data

Joshua P Katz James M Pipas 《BMC bioinformatics》2014,15(1)

相似文献

18.

A new scoring function for top-down spectral deconvolution

Qiang Kou Si Wu Xiaowen Liu 《BMC genomics》2014,15(1)

Background

Top-down mass spectrometry plays an important role in intact protein identification and characterization. Top-down mass spectra are more complex than bottom-up mass spectra because they often contain many isotopomer envelopes from highly charged ions, which may overlap with one another. As a result, spectral deconvolution, which converts a complex top-down mass spectrum into a monoisotopic mass list, is a key step in top-down spectral interpretation.

Results

In this paper, we propose a new scoring function, L-score, for evaluating isotopomer envelopes. By combining L-score with MS-Deconv, a new software tool, MS-Deconv+, was developed for top-down spectral deconvolution. Experimental results showed that MS-Deconv+ outperformed existing software tools in top-down spectral deconvolution.

Conclusions

L-score shows high discriminative ability in identification of isotopomer envelopes. Using L-score, MS-Deconv+ reports many correct monoisotopic masses missed by other software tools, which are valuable for proteoform identification and characterization.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-1140) contains supplementary material, which is available to authorized users. 相似文献

19.

QuickNGS elevates Next-Generation Sequencing data analysis to a new level of automation

Prerana Wagle Milo? Nikoli? Peter Frommolt 《BMC genomics》2015,16(1)

Background

Next-Generation Sequencing (NGS) has emerged as a widely used tool in molecular biology. While time and cost for the sequencing itself are decreasing, the analysis of the massive amounts of data remains challenging. Since multiple algorithmic approaches for the basic data analysis have been developed, there is now an increasing need to efficiently use these tools to obtain results in reasonable time.

Results

We have developed QuickNGS, a new workflow system for laboratories with the need to analyze data from multiple NGS projects at a time. QuickNGS takes advantage of parallel computing resources, a comprehensive back-end database, and a careful selection of previously published algorithmic approaches to build fully automated data analysis workflows. We demonstrate the efficiency of our new software by a comprehensive analysis of 10 RNA-Seq samples which we can finish in only a few minutes of hands-on time. The approach we have taken is suitable to process even much larger numbers of samples and multiple projects at a time.

Conclusion

Our approach considerably reduces the barriers that still limit the usability of the powerful NGS technology and finally decreases the time to be spent before proceeding to further downstream analysis and interpretation of the data.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1695-x) contains supplementary material, which is available to authorized users. 相似文献

20.

AllelicImbalance: an R/ bioconductor package for detecting,managing, and visualizing allele expression imbalance data from RNA sequencing

Jesper R. G?din Ferdinand M. van’t Hooft Per Eriksson Lasse Folkersen 《BMC bioinformatics》2015,16(1)

Background

One aspect in which RNA sequencing is more valuable than microarray-based methods is the ability to examine the allelic imbalance of the expression of a gene. This process is often a complex task that entails quality control, alignment, and the counting of reads over heterozygous single-nucleotide polymorphisms. Allelic imbalance analysis is subject to technical biases, due to differences in the sequences of the measured alleles. Flexible bioinformatics tools are needed to ease the workflow while retaining as much RNA sequencing information as possible throughout the analysis to detect and address the possible biases.

Results

We present AllelicImblance, a software program that is designed to detect, manage, and visualize allelic imbalances comprehensively. The purpose of this software is to allow users to pose genetic questions in any RNA sequencing experiment quickly, enhancing the general utility of RNA sequencing. The visualization features can reveal notable, non-trivial allelic imbalance behavior over specific regions, such as exons.

Conclusions

The software provides a complete framework to perform allelic imbalance analyses of aligned RNA sequencing data, from detection to visualization, within the robust and versatile management class, ASEset.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0620-2) contains supplementary material, which is available to authorized users. 相似文献