
A large number of computational methods have been developed for analyzing differential gene expression in RNA-seq data. We describe a comprehensive evaluation of common methods using the SEQC benchmark dataset and ENCODE data. We consider a number of key features, including normalization, accuracy of differential expression detection and differential expression analysis when one condition has no detectable expression. We find significant differences among the methods, but note that array-based methods adapted to RNA-seq data perform comparably to methods designed for RNA-seq. Our results demonstrate that increasing the number of replicate samples significantly improves detection power over increased sequencing depth.

7.

Probe Region Expression Estimation for RNA-Seq Data for Improved Microarray Comparability

Karolis Uziela Antti Honkela 《PloS one》2015,10(5)

Rapidly growing public gene expression databases contain a wealth of data for building an unprecedentedly detailed picture of human biology and disease. This data comes from many diverse measurement platforms that make integrating it all difficult. Although RNA-sequencing (RNA-seq) is attracting the most attention, at present, the rate of new microarray studies submitted to public databases far exceeds the rate of new RNA-seq studies. There is clearly a need for methods that make it easier to combine data from different technologies. In this paper, we propose a new method for processing RNA-seq data that yields gene expression estimates that are much more similar to corresponding estimates from microarray data, hence greatly improving cross-platform comparability. The method we call PREBS is based on estimating the expression from RNA-seq reads overlapping the microarray probe regions, and processing these estimates with standard microarray summarisation algorithms. Using paired microarray and RNA-seq samples from TCGA LAML data set we show that PREBS expression estimates derived from RNA-seq are more similar to microarray-based expression estimates than those from other RNA-seq processing methods. In an experiment to retrieve paired microarray samples from a database using an RNA-seq query sample, gene signatures defined based on PREBS expression estimates were found to be much more accurate than those from other methods. PREBS also allows new ways of using RNA-seq data, such as expression estimation for microarray probe sets. An implementation of the proposed method is available in the Bioconductor package “prebs.”
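
The first step described in the abstract above, counting RNA-seq reads that overlap microarray probe regions, can be illustrated with a minimal Python sketch. This is not the API of the "prebs" package: the function names, the half-open interval convention and the toy coordinates are all assumptions made for illustration, and the subsequent microarray summarisation step is not shown.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: half-open [start, end) on one chromosome.
Interval = Tuple[int, int]


def overlaps(a: Interval, b: Interval) -> bool:
    """Return True when two half-open intervals share at least one base."""
    return a[0] < b[1] and b[0] < a[1]


def count_reads_per_probe(reads: List[Interval],
                          probes: Dict[str, Interval]) -> Dict[str, int]:
    """Count, for each probe region, how many reads overlap it."""
    return {
        probe_id: sum(overlaps(read, region) for read in reads)
        for probe_id, region in probes.items()
    }


# Toy data (invented): three aligned reads and two probe regions.
reads = [(100, 150), (120, 170), (300, 350)]
probes = {"probe_a": (140, 165), "probe_b": (400, 425)}
counts = count_reads_per_probe(reads, probes)
```

A real pipeline would read alignments from a BAM file and use an interval index rather than this quadratic scan; the sketch only shows what "reads overlapping a probe region" means.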

8.

RNA-Seq vs Dual- and Single-Channel Microarray Data: Sensitivity Analysis for Differential Expression and Clustering

Alina Sîrbu Gráinne Kerr Martin Crane Heather J. Ruskin 《PloS one》2012,7(12)

With the fast development of high-throughput sequencing technologies, a new generation of genome-wide gene expression measurements is under way. This is based on mRNA sequencing (RNA-seq), which complements the already mature technology of microarrays, and is expected to overcome some of the latter’s disadvantages. These RNA-seq data pose new challenges, however, as strengths and weaknesses have yet to be fully identified. Ideally, Next (or Second) Generation Sequencing measures can be integrated for more comprehensive gene expression investigation to facilitate analysis of whole regulatory networks. At present, however, the nature of these data is not very well understood. In this paper we study three alternative gene expression time series datasets for the Drosophila melanogaster embryo development, in order to compare three measurement techniques: RNA-seq, single-channel and dual-channel microarrays. The aim is to study the state of the art for the three technologies, with a view of assessing overlapping features, data compatibility and integration potential, in the context of time series measurements. This involves using established tools for each of the three different technologies, and technical and biological replicates (for RNA-seq and microarrays, respectively), due to the limited availability of biological RNA-seq replicates for time series data. The approach consists of a sensitivity analysis for differential expression and clustering. In general, the RNA-seq dataset displayed highest sensitivity to differential expression. The single-channel data performed similarly for the differentially expressed genes common to gene sets considered. Cluster analysis was used to identify different features of the gene space for the three datasets, with higher similarities found for the RNA-seq and single-channel microarray dataset.

9.

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks (total citations: 25; self-citations: 0; citations by others: 25)

Trapnell C Roberts A Goff L Pertea G Kim D Kelley DR Pimentel H Salzberg SL Rinn JL Pachter L 《Nature protocols》2012,7(3):562-578


10.

RNASEQR--a streamlined and accurate RNA-seq sequence analysis program

Chen LY Wei KC Huang AC Wang K Huang CY Yi D Tang CY Galas DJ Hood LE 《Nucleic acids research》2012,40(6):e42


11.

A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae

Intawat Nookaew Marta Papini Natapol Pornputtapong Gionata Scalcinati Linn Fagerberg Matthias Uhlén Jens Nielsen 《Nucleic acids research》2012,40(20):10084-10097


12.

Designing a transcriptome next-generation sequencing project for a nonmodel plant species

Strickler SR Bombarely A Mueller LA 《American journal of botany》2012,99(2):257-266


13.

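Comparative analysis of lincRNA expression profiles in the hypothalamic-pituitary-ovarian axis of anestrous and estrous primiparous sows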

任巧玲 张家庆 陆东锋 王璟 陈俊峰 马强 白献晓 郭红霞 高彬文 邢宝松 《遗传》2020,(4):388-402

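Whether primiparous sows return to estrus normally after weaning has a major impact on pig production, and failure to do so is the main reason primiparous sows are culled. In this study, anestrous and estrous primiparous sows were used as subjects, and RNA-seq was applied for the first time to screen and compare long intergenic noncoding RNAs (lincRNAs) in their hypothalamic-pituitary-ovarian axis, yielding lincRNA expression profiles whose characteristics and functions were then given a preliminary analysis. A total of 3519 lincRNAs were identified in the hypothalamic-pituitary-ovarian axis of anestrous and estrous primiparous sows. With the estrous group as the control, 17 lincRNAs were differentially expressed, of which 12 were upregulated and 5 were downregulated (FC≥2, P<0.05). Four differentially expressed lincRNAs were selected for qRT-PCR validation, and their expression levels were broadly consistent with the sequencing results. GO analysis, KEGG pathway analysis and lincRNA-mRNA co-expression network analysis of these 17 differentially expressed lincRNAs showed that they are mainly associated with reproductive processes such as meiotic maturation of porcine oocytes, ovarian cell differentiation and granulosa cell apoptosis. These results enrich the lincRNA data resources for pigs and provide a theoretical basis for further in-depth study of reproductive function in primiparous sows.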
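
The thresholds quoted in the abstract above (FC≥2, P<0.05) can be applied as a simple filter. The Python sketch below is an illustration only, not the authors' pipeline: it assumes that fold change is a ratio relative to the control group and that "FC≥2" covers both a ratio of at least 2 (upregulated) and a ratio of at most 0.5 (downregulated). The function name and the toy records are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def classify(results: Dict[str, Tuple[float, float]],
             min_fold: float = 2.0,
             max_p: float = 0.05) -> Tuple[List[str], List[str]]:
    """Split (fold_change, p_value) records into up- and down-regulated IDs."""
    up: List[str] = []
    down: List[str] = []
    threshold = math.log2(min_fold)
    for gene_id, (fold_change, p_value) in results.items():
        # Skip records that are not significant or have an unusable ratio.
        if p_value >= max_p or fold_change <= 0:
            continue
        log_fc = math.log2(fold_change)
        if log_fc >= threshold:
            up.append(gene_id)
        elif log_fc <= -threshold:
            down.append(gene_id)
    return up, down


# Toy records (invented): ID -> (fold change vs. control, p-value).
toy = {
    "linc_1": (4.0, 0.01),   # passes both cut-offs, upregulated
    "linc_2": (0.25, 0.03),  # passes both cut-offs, downregulated
    "linc_3": (3.0, 0.20),   # large change but not significant
    "linc_4": (1.2, 0.001),  # significant but change too small
}
up, down = classify(toy)
```

Working on the log2 scale makes the up and down cut-offs symmetric, which is why the ratio is converted before comparison.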

14.

Expression of lincRNA AC027700.1 during decidualization in mice

谭丽萍 高茹菲 尹鑫 陈雪梅 李方方 袁柳 何俊琳 《遗传》2022,(2):168-177

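Long non-coding RNAs (lncRNAs) are a class of RNA molecules longer than 200 nt that have no protein-coding potential. They play key regulatory roles in cell growth and development, in metabolism, and in the onset and progression of disease, but have rarely been reported in research on decidualization. To investigate the expression pattern of lincRNA AC027700.1 in the endometrium of mice in early pregnancy, and to make a preliminary exploration of AC0...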

15.

Comparative Transcriptome Profiling of the Early Response to Magnaporthe oryzae in Durable Resistant vs Susceptible Rice (Oryza sativa L.) Genotypes

Paolo Bagnaresi Chiara Biselli Luigi Orrù Simona Urso Laura Crispino Pamela Abbruscato Pietro Piffanelli Elisabetta Lupotto Luigi Cattivelli Giampiero Valè 《PloS one》2012,7(12)


16.

MarVis: a tool for clustering and visualization of metabolic biomarkers

Alexander Kaever Thomas Lingner Kirstin Feussner Cornelia Göbel Ivo Feussner Peter Meinicke 《BMC bioinformatics》2009,10(1):1-8

Background

Gene set analysis based on Gene Ontology (GO) can be a promising method for the analysis of differential expression patterns. However, current studies that focus on individual GO terms have limited analytical power, because the complex structure of GO introduces strong dependencies among the terms, and some genes that are annotated to a GO term cannot be found by statistically significant enrichment.

Results

We proposed a method for enriching clustered GO terms based on semantic similarity, namely cluster enrichment analysis based on GO (CeaGO), to extend the individual term analysis method. Using an Affymetrix HGU95aV2 chip dataset with simulated gene sets, we illustrated that CeaGO was sensitive enough to detect moderate expression changes. When compared to parent-based individual term analysis methods, the results showed that CeaGO may provide more accurate differentiation of gene expression results. When used with two acute leukemia (ALL and ALL/AML) microarray expression datasets, CeaGO correctly identified specifically enriched GO groups that were overlooked by other individual test methods.

Conclusion

By applying CeaGO to both simulated and real microarray data, we showed that this approach could enhance the interpretation of microarray experiments. CeaGO is currently available at http://chgc.sh.cn/en/software/CeaGO/.

17.

Bias and Correction in RNA-seq Data for Marine Species

Kai Song Li Li Guofan Zhang 《Marine biotechnology (New York, N.Y.)》2017,19(5):541-550


18.

A low-cost library construction protocol and data analysis pipeline for Illumina-based strand-specific multiplex RNA-seq (total citations: 1; self-citations: 0; citations by others: 1)

Wang L Si Y Dedow LK Shao Y Liu P Brutnell TP 《PloS one》2011,6(10):e26426


19.

PrimerSeq: Design and Visualization of RT-PCR Primers for Alternative Splicing Using RNA-seq Data

Collin Tokheim Juw Won Park Yi Xing 《基因组蛋白质组与生物信息学报(英文版)》2014,(2):105-109


20.

Missing value imputation for microRNA expression data by using a GO-based similarity measure

Yang Yang Xu Zhuangdi Song Dandan 《BMC bioinformatics》2016,17(1):109-116

Missing values are commonly present in microarray data profiles. Rather than discarding genes or samples with incomplete expression levels, one needs to impute missing values properly for accurate data analysis. Imputation methods can be roughly categorized as expression level-based and domain knowledge-based. Methods of the first type rely only on expression data without the help of external data sources, while those of the second type incorporate available domain knowledge into expression data to improve imputation accuracy. In recent years, microRNA (miRNA) microarrays have been widely developed and used for identifying miRNA biomarkers in studies of complex human diseases. Like mRNA profiles, miRNA expression profiles with missing values can be treated with existing imputation methods. However, domain knowledge-based methods are hard to apply because miRNAs lack direct functional annotation. With the rapid accumulation of miRNA microarray data, there is a growing need for domain knowledge-based imputation algorithms specific to miRNA expression profiles to improve the quality of miRNA data analysis. We connect miRNAs with domain knowledge of Gene Ontology (GO) via their target genes, and define miRNA functional similarity based on the semantic similarity of GO terms in GO graphs. A new measure combining miRNA functional similarity and expression similarity is used in the imputation of missing values. The new measure is tested on two miRNA microarray datasets from breast cancer research and achieves improved performance compared with the expression-based method on both datasets. The experimental results demonstrate that biological domain knowledge can benefit the estimation of missing values in miRNA profiles as well as mRNA profiles. In particular, functional similarity defined by GO terms annotated for the target genes of miRNAs can be useful complementary information for the expression-based method to improve the imputation accuracy of miRNA array data. Our method and data are available to the public upon request.
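
The idea of imputing with a measure that combines functional and expression similarity can be sketched in Python as a weighted nearest-neighbour fill. The abstract does not say how the two similarities are combined, so the linear weight `alpha`, the inverse-distance expression similarity, the function names and the toy matrix are all assumptions. The functional similarity matrix is taken as a given input here; in the paper it is derived from GO semantic similarity of target genes.

```python
from typing import List, Optional, Sequence

Row = List[Optional[float]]  # None marks a missing value


def expression_similarity(a: Row, b: Row) -> float:
    """Inverse-distance similarity over positions observed in both rows."""
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return 0.0
    dist = (sum((x - y) ** 2 for x, y in shared) / len(shared)) ** 0.5
    return 1.0 / (1.0 + dist)


def impute(rows: List[Row], functional_sim: Sequence[Sequence[float]],
           alpha: float = 0.5, k: int = 2) -> List[Row]:
    """Fill each missing entry with a similarity-weighted mean of k neighbours."""
    filled = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            candidates = []
            for n, other in enumerate(rows):
                if n == i or other[j] is None:
                    continue
                # Assumed combination: linear blend of the two similarities.
                sim = (alpha * functional_sim[i][n]
                       + (1 - alpha) * expression_similarity(row, other))
                candidates.append((sim, other[j]))
            candidates.sort(key=lambda c: c[0], reverse=True)
            top = candidates[:k]
            total = sum(s for s, _ in top)
            # Fall back to 0.0 when no usable neighbour exists.
            filled[i][j] = sum(s * v for s, v in top) / total if total > 0 else 0.0
    return filled


# Toy data (invented): three miRNA rows, one missing value in the first row.
rows = [[1.0, 2.0, None], [1.0, 2.0, 3.0], [5.0, 6.0, 9.0]]
functional_sim = [[1.0, 0.9, 0.1], [0.9, 1.0, 0.2], [0.1, 0.2, 1.0]]
nearest_only = impute(rows, functional_sim, k=1)
two_neighbours = impute(rows, functional_sim, k=2)
```

With `k=1` the missing entry is copied from the single most similar row; with `k=2` the less similar row contributes with a smaller weight, pulling the estimate only slightly toward its value.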