期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

XSAnno: a framework for building ortholog models in cross-species transcriptome comparisons

Ying Zhu Mingfeng Li André MM Sousa Nenad ?estan 《BMC genomics》2014,15(1)

相似文献

2.

An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data

Yejun Wang Keith D MacKenzie Aaron P White 《BMC genomics》2015,16(1)

相似文献

3.

Comparison of stranded and non-stranded RNA-seq transcriptome profiling and investigation of gene overlap

Shanrong Zhao Ying Zhang William Gordon Jie Quan Hualin Xi Sarah Du David von Schack Baohong Zhang 《BMC genomics》2015,16(1)

相似文献

4.

Experimental validation of methods for differential gene expression analysis and sample pooling in RNA-seq

Anto P. Rajkumar Per Qvist Ross Lazarus Francesco Lescai Jia Ju Mette Nyegaard Ole Mors Anders D. B?rglum Qibin Li Jane H. Christensen 《BMC genomics》2015,16(1)

Background

Massively parallel cDNA sequencing (RNA-seq) experiments are gradually superseding microarrays in quantitative gene expression profiling. However, many biologists are uncertain about the choice of differentially expressed gene (DEG) analysis methods and the validity of cost-saving sample pooling strategies for their RNA-seq experiments. Hence, we performed experimental validation of DEGs identified by Cuffdiff2, edgeR, DESeq2 and Two-stage Poisson Model (TSPM) in a RNA-seq experiment involving mice amygdalae micro-punches, using high-throughput qPCR on independent biological replicate samples. Moreover, we sequenced RNA-pools and compared their results with sequencing corresponding individual RNA samples.

Results

False-positivity rate of Cuffdiff2 and false-negativity rates of DESeq2 and TSPM were high. Among the four investigated DEG analysis methods, sensitivity and specificity of edgeR was relatively high. We documented the pooling bias and that the DEGs identified in pooled samples suffered low positive predictive values.

Conclusions

Our results highlighted the need for combined use of more sensitive DEG analysis methods and high-throughput validation of identified DEGs in future RNA-seq experiments. They indicated limited utility of sample pooling strategies for RNA-seq in similar setups and supported increasing the number of biological replicate samples.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1767-y) contains supplementary material, which is available to authorized users. 相似文献

5.

Skeletal muscle alterations and exercise performance decrease in erythropoietin-deficient mice: a comparative study

Laurence Mille-Hamard Veronique L Billat Elodie Henry Blandine Bonnamy Florence Joly Philippe Benech Eric Barrey 《BMC medical genomics》2012,5(1):1-20

相似文献

6.

Discerning novel splice junctions derived from RNA-seq alignment: a deep learning approach

Yi Zhang Xinan Liu James MacLeod Jinze Liu 《BMC genomics》2018,19(1):971

相似文献

7.

T-REx: Transcriptome analysis webserver for RNA-seq Expression data

Anne de Jong Sjoerd van der Meulen Oscar P. Kuipers Jan Kok 《BMC genomics》2015,16(1)

相似文献

8.

An investigation of biomarkers derived from legacy microarray data for their utility in the RNA-seq era

Zhenqiang Su Hong Fang Huixiao Hong Leming Shi Wenqian Zhang Wenwei Zhang Yanyan Zhang Zirui Dong Lee J Lancashire Marina Bessarabova Xi Yang Baitang Ning Binsheng Gong Joe Meehan Joshua Xu Weigong Ge Roger Perkins Matthias Fischer Weida Tong 《Genome biology》2014,15(12)

Background

Gene expression microarray has been the primary biomarker platform ubiquitously applied in biomedical research, resulting in enormous data, predictive models, and biomarkers accrued. Recently, RNA-seq has looked likely to replace microarrays, but there will be a period where both technologies co-exist. This raises two important questions: Can microarray-based models and biomarkers be directly applied to RNA-seq data? Can future RNA-seq-based predictive models and biomarkers be applied to microarray data to leverage past investment?

Results

We systematically evaluated the transferability of predictive models and signature genes between microarray and RNA-seq using two large clinical data sets. The complexity of cross-platform sequence correspondence was considered in the analysis and examined using three human and two rat data sets, and three levels of mapping complexity were revealed. Three algorithms representing different modeling complexity were applied to the three levels of mappings for each of the eight binary endpoints and Cox regression was used to model survival times with expression data. In total, 240,096 predictive models were examined.

Conclusions

Signature genes of predictive models are reciprocally transferable between microarray and RNA-seq data for model development, and microarray-based models can accurately predict RNA-seq-profiled samples; while RNA-seq-based models are less accurate in predicting microarray-profiled samples and are affected both by the choice of modeling algorithm and the gene mapping complexity. The results suggest continued usefulness of legacy microarray data and established microarray biomarkers and predictive models in the forthcoming RNA-seq era.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0523-y) contains supplementary material, which is available to authorized users. 相似文献

9.

Allelic mapping bias in RNA-sequencing is not a major confounder in eQTL studies

Nikolaos I Panousis Maria Gutierrez-Arcelus Emmanouil T Dermitzakis Tuuli Lappalainen 《Genome biology》2014,15(9)

Background

RNA sequencing (RNA-seq) is the current gold-standard method to quantify gene expression for expression quantitative trait locus (eQTL) studies. However, a potential caveat in these studies is that RNA-seq reads carrying the non-reference allele of variant loci can have lower probability to map correctly to the reference genome, which could bias gene quantifications and cause false positive eQTL associations. In this study, we analyze the effect of this allelic mapping bias in eQTL discovery.

Results

We simulate RNA-seq read mapping over 9.5 M common SNPs and indels, with 15.6% of variants showing biased mapping rate for reference versus non-reference reads. However, removing potentially biased RNA-seq reads from an eQTL dataset of 185 individuals has a very small effect on gene and exon quantifications and eQTL discovery. We detect only a handful of likely false positive eQTLs, and overall eQTL SNPs show no significant enrichment for high mapping bias.

Conclusion

Our results suggest that RNA-seq quantifications are generally robust against allelic mapping bias, and that this does not have a severe effect on eQTL discovery. Nevertheless, we provide our catalog of putatively biased loci to allow better controlling for mapping bias to obtain more accurate results in future RNA-seq studies.

Electronic supplementary material

The online version of this article (doi:10.1186/s13059-014-0467-2) contains supplementary material, which is available to authorized users. 相似文献

10.

A normalization strategy for comparing tag count data

Kadota K Nishiyama T Shimizu K 《Algorithms for molecular biology : AMB》2012,7(1):5-13

Background

High-throughput sequencing, such as ribonucleic acid sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq) analyses, enables various features of organisms to be compared through tag counts. Recent studies have demonstrated that the normalization step for RNA-seq data is critical for a more accurate subsequent analysis of differential gene expression. Development of a more robust normalization method is desirable for identifying the true difference in tag count data.

Results

We describe a strategy for normalizing tag count data, focusing on RNA-seq. The key concept is to remove data assigned as potential differentially expressed genes (DEGs) before calculating the normalization factor. Several R packages for identifying DEGs are currently available, and each package uses its own normalization method and gene ranking algorithm. We compared a total of eight package combinations: four R packages (edgeR, DESeq, baySeq, and NBPSeq) with their default normalization settings and with our normalization strategy. Many synthetic datasets under various scenarios were evaluated on the basis of the area under the curve (AUC) as a measure for both sensitivity and specificity. We found that packages using our strategy in the data normalization step overall performed well. This result was also observed for a real experimental dataset.

Conclusion

Our results showed that the elimination of potential DEGs is essential for more accurate normalization of RNA-seq data. The concept of this normalization strategy can widely be applied to other types of tag count data and to microarray data. 相似文献

11.

Indel sensitive and comprehensive variant/mutation detection from RNA sequencing data for precision medicine

Naresh?Prodduturi Aditya?Bhagwate Jean-Pierre?A.?Kocher Zhifu?Sun Email author 《BMC medical genomics》2018,11(3):67

相似文献

12.

A skellam model to identify differential patterns of gene expression induced by environmental signals

Libo Jiang Ke Mao Rongling Wu 《BMC genomics》2014,15(1)

相似文献

13.

CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts

Alison C Testa James K Hane Simon R Ellwood Richard P Oliver 《BMC genomics》2015,16(1)

相似文献

14.

Whole-Transcriptome profiling of formalin-fixed,paraffin-embedded renal cell carcinoma by RNA-seq

Ping Li Andrew Conley Hao Zhang Hyung L Kim 《BMC genomics》2014,15(1)

相似文献

15.

A machine learning approach for the identification of key markers involved in brain development from single-cell transcriptomic data

Hu Yongli Hase Takeshi Li Hui Peng Prabhakar Shyam Kitano Hiroaki Ng See Kiong Ghosh Samik Wee Lawrence Jin Kiat 《BMC genomics》2016,17(13):1025-29

相似文献

16.

Transcriptome profiling of fruit development and maturation in Chinese white pear (Pyrus bretschneideri Rehd)

Min Xie Ying Huang Yanping Zhang Xin Wang Hua Yang Oliver Yu Wenhao Dai Congbing Fang 《BMC genomics》2013,14(1)

相似文献

17.

Inferring the kinetics of stochastic gene expression from single-cell RNA-sequencing data

Jong Kyoung Kim John C Marioni 《Genome biology》2013,14(1):R7

相似文献

18.

2DDB – a bioinformatics solution for analysis of quantitative proteomics data

Lars Malmström György Marko-Varga Gunilla Westergren-Thorsson Thomas Laurell Johan Malmström 《BMC bioinformatics》2006,7(1):158-9

Background

We present 2DDB, a bioinformatics solution for storage, integration and analysis of quantitative proteomics data. As the data complexity and the rate with which it is produced increases in the proteomics field, the need for flexible analysis software increases. 相似文献

19.

Identification of gene signatures from RNA-seq data using Pareto-optimal cluster algorithm

Saurav Mallik Zhongming Zhao 《BMC systems biology》2018,12(8):126

Background

Gene signatures are important to represent the molecular changes in the disease genomes or the cells in specific conditions, and have been often used to separate samples into different groups for better research or clinical treatment. While many methods and applications have been available in literature, there still lack powerful ones that can take account of the complex data and detect the most informative signatures.

Methods

In this article, we present a new framework for identifying gene signatures using Pareto-optimal cluster size identification for RNA-seq data. We first performed pre-filtering steps and normalization, then utilized the empirical Bayes test in Limma package to identify the differentially expressed genes (DEGs). Next, we used a multi-objective optimization technique, “Multi-objective optimization for collecting cluster alternatives” (MOCCA in R package) on these DEGs to find Pareto-optimal cluster size, and then applied k-means clustering to the RNA-seq data based on the optimal cluster size. The best cluster was obtained through computing the average Spearman’s Correlation Score among all the genes in pair-wise manner belonging to the module. The best cluster is treated as the signature for the respective disease or cellular condition.

Results

We applied our framework to a cervical cancer RNA-seq dataset, which included 253 squamous cell carcinoma (SCC) samples and 22 adenocarcinoma (ADENO) samples. We identified a total of 582 DEGs by Limma analysis of SCC versus ADENO samples. Among them, 260 are up-regulated genes and 322 are down-regulated genes. Using MOCCA, we obtained seven Pareto-optimal clusters. The best cluster has a total of 35 DEGs consisting of all-upregulated genes. For validation, we ran PAMR (prediction analysis for microarrays) classifier on the selected best cluster, and assessed the classification performance. Our evaluation, measured by sensitivity, specificity, precision, and accuracy, showed high confidence.

Conclusions

Our framework identified a multi-objective based cluster that is treated as a signature that can classify the disease and control group of samples with higher classification performance (accuracy 0.935) for the corresponding disease. Our method is useful to find signature for any RNA-seq or microarray data.

相似文献

20.

De novo assembly and transcriptome analysis of Atlantic salmon macrophage/dendritic-like TO cells following type I IFN treatment and Salmonid alphavirus subtype-3 infection

Cheng Xu ?ystein Evensen Hetron Mweemba Munang’andu 《BMC genomics》2015,16(1)

相似文献