共查询到20条相似文献,搜索用时 15 毫秒
1.
Nicholas J. Schurch Pietá Schofield Marek Gierliński Christian Cole Alexander Sherstnev Vijender Singh Nicola Wrobel Karim Gharbi Gordon G. Simpson Tom Owen-Hughes Mark Blaxter Geoffrey J. Barton 《RNA (New York, N.Y.)》2016,22(6):839-851
RNA-seq is now the technology of choice for genome-wide differential gene expression experiments, but it is not clear how many biological replicates are needed to ensure valid biological interpretation of the results or which statistical tools are best for analyzing the data. An RNA-seq experiment with 48 biological replicates in each of two conditions was performed to answer these questions and provide guidelines for experimental design. With three biological replicates, nine of the 11 tools evaluated found only 20%–40% of the significantly differentially expressed (SDE) genes identified with the full set of 42 clean replicates. This rises to >85% for the subset of SDE genes changing in expression by more than fourfold. To achieve >85% for all SDE genes regardless of fold change requires more than 20 biological replicates. The same nine tools successfully control their false discovery rate at ≲5% for all numbers of replicates, while the remaining two tools fail to control their FDR adequately, particularly for low numbers of replicates. For future RNA-seq experiments, these results suggest that at least six biological replicates should be used, rising to at least 12 when it is important to identify SDE genes for all fold changes. If fewer than 12 replicates are used, a superior combination of true positive and false positive performances makes edgeR and DESeq2 the leading tools. For higher replicate numbers, minimizing false positives is more important and DESeq marginally outperforms the other tools. 相似文献
2.
Microarray technology is a powerful tool for animal functional genomics studies, with applications spanning from gene identification and mapping, to function and control of gene expression. Microarray assays, however, are complex and costly, and hence generally performed with relatively small number of animals. Nevertheless, they generate data sets of unprecedented complexity and dimensionality. Therefore, such trials require careful planning and experimental design, in addition to tailored statistical and computational tools for their appropriate data mining. In this review, we discuss experimental design and data analysis strategies, which incorporate prior genomic and biological knowledge, such as genotypes and gene function and pathway membership. We focus the discussion on the design of genetical genomics studies, and on significance testing for detection of differential expression. It is shown that the use of prior biological information can improve the efficiency of microarray experiments. 相似文献
3.
Emel Rothzerg Jiake Xu David Wood Sulev Kks 《Experimental biology and medicine (Maywood, N.J.)》2021,246(19):2072
The Therapeutically Applicable Research to Generate Effective Treatments (TARGET) project aims to determine molecular changes that drive childhood cancers, including osteosarcoma. The main purpose of the program is to use the open-source database to develop novel, effective, and less toxic therapies. We downloaded TARGET-OS RNA-Sequencing data through R studio and merged the mRNA expression of genes with clinical information (vital status, survival time and gender). Further, we analyzed differential gene expressions between dead and alive patients based on TARGET-OS project. By this study, we found 5758 differentially expressed genes between deceased and alive patients with a false discovery rate below 0.05; 4469 genes were upregulated in deceased patients compared to alive, whereas 1289 genes were downregulated. The survival-related genes were obtained using Kaplan–Meier survival analysis and Cox univariate regression (KM < 0.05 and Cox P-value < 0.05). Out of 5758 differentially expressed genes, only 217 have been associated with overall survival. Eight survival-related downregulated genes (ERCC4, CLUAP1, CTNNBIP1, GCA, RAB40C, SIRPA, USP11, and TCN2) and four survival-related upregulated genes (MUC1, COL13A1, JAG2 and KAZALD1) were selected for further analysis as potential independent prognostic candidate genes. This study may help to discover novel prognostic markers and potential therapeutic targets for osteosarcoma. 相似文献
4.
5.
Gene amplification and expression by RNA viruses and potential for further application to plant gene transfer 总被引:1,自引:0,他引:1
The great majority of plant viruses encapsidate messenger-sense ssRNA and have no natural DNA phase in their life cycle. Despite their RNA nature, essentially any desired change can be introduced into such genomes by using recombinant DNA techniques with suitably constructed, expressible viral cDNA clones. For some viruses such as brome mosaic virus, these methods have been used to define the sequences controlling RNA-directed genomic RNA replication and the expression of internal genes via subgenomic mRNAs. The results suggest a surprising degree of genetic flexibility, which appears to be reflected in the varied gene complements and genetic organizations of presumably related plant and animal RNA viruses sharing conserved replication genes. Foreign genes inserted in such RNA virus genomes can be amplified and expressed to a high level in transfected plant cells. In addition to the potential use of such viruses as episomal expression vectors, it should be possible to couple the viral pathways of RNA-dependent RNA synthesis to amplify and to further regulate the expression of genes transformed into plant chromosomes. 相似文献
6.
7.
8.
This article presents the logical reasoning underlying the optimal design of an experiment. We used Free‐Air Carbon dioxide Enrichment (FACE) experiments to illustrate this trade‐off as such experiments are particularly costly. On a theoretical basis, two‐way nested designs and split‐plot designs have similar power in testing carbon dioxide (CO2) main effects. If researchers have the choice of adding two replicate rings or two control plots to their experiment, our results show that both options provide a substantial gain in statistical power, with a slightly greater gain in the former case and at reduced financial cost in the latter. The former option, however, provides an insurance against possible ring failure. On an empirical basis, we analysed a preliminary FACE photosynthesis dataset collected at Duke University. The experiment was designed as a split‐plot design to test the effects of growth environment (GROWTH) and measurement CO2 concentration (MEAS) on photosynthetic rates of loblolly pine. Although a significant effect of MEAS was observed, we failed to detect a significant main effect of GROWTH. Power analysis was used to understand why the GROWTH main effect was not significant. The minimum detectable difference between treatment means that we calculated for GROWTH in this experiment was 4.04 μmol CO2 m?2 s?1 for a statistical power of 0.90, whereas the observed difference was 0.16 μmol CO2 m?2 s?1. Our recommendations for the design of FACE experiments are: (i) consider a second treatment factor with many levels within each ring in order to obtain a split‐plot design that provides a powerful test of interaction between treatment factors; (ii) add control plots, unless financial constrictions disallow for necessary personnel; (3) pool the data of FACE experiments conducted in comparable ecosystems (e.g. forests or grasslands), with two rings per treatment level at each site. 相似文献
9.
10.
11.
Advances in sequencing technology have led to a sharp decrease in the cost of ''data generation''. But is this sufficient to ensure cost-effective and efficient ''knowledge generation''? 相似文献
12.
Single-cell sequencing has emerged as a revolutionary method that reveals biological processes with unprecedented resolution and scale, and has already greatly impacted biology and medicine. To investigate processes such as alternative splicing, novel exon detection and allele-specific expression (ASE), full-length based single-cell RNA-seq methods are required for broad sequence coverage and single nucleotide polymorphism (SNP) identification. In this review, we revisit recent achievements from studies that used single-cell RNA-seq to advance our understanding of ASE in the context of both autosomal and X-chromosome genes. We also recapitulate useful bioinformatic tools developed to identify haplotype phase. 相似文献
13.
Quantifying variation in gene expression 总被引:2,自引:1,他引:2
14.
为鉴定鸡下丘脑发育相关特异性表达miRNA,基于固始鸡1日龄和36周龄下丘脑小RNA的Solexa测序数据,共鉴定到266种2个发育阶段共表达的miRNA,其中157种miRNA的表达水平被显著下调,22种被显著上调.聚类分析显示,鸡下丘脑高丰度差异性miRNA主要集中于let-7、mir-181、mir-30、mir-99、mir-1和mir-17等基因家族.另外,预测了10种高丰度差异性miRNA的靶基因,并进行了相应的GO分析和KEGG通路分析.结果显示,预测靶基因在发育过程、代谢过程、细胞过程和生物学过程调节等4个生物学过程以及细胞周期、粘着斑、TGF-beta信号通路和MAPK信号通路等通路中显著富集.研究结果为进一步揭示miRNA调控鸡下丘脑发育的分子机制提供了有益线索. 相似文献
15.
随着新一代高通量DNA测序技术的快速发展,RNA测序(RNA-seq)已成为基因表达和转录组分析新的重要手段.RNA-seq技术产生的海量数据为生物信息学带来了新的机遇和挑战.有效地对测序数据进行针对性的生物信息学处理和分析,成为RNA-seq技术能否在科学探索中发挥重大作用的关键.以新一代Illumina/Solexa测序平台所产生的数据为例,在扼要介绍高通量RNA-seq测序流程的基础上,对RNA-seq数据处理和分析的方法和现有软件做一个较为全面的综述,并对其中有待进一步研究的问题进行展望. 相似文献
16.
Jesper R. G?din Ferdinand M. van’t Hooft Per Eriksson Lasse Folkersen 《BMC bioinformatics》2015,16(1)
Background
One aspect in which RNA sequencing is more valuable than microarray-based methods is the ability to examine the allelic imbalance of the expression of a gene. This process is often a complex task that entails quality control, alignment, and the counting of reads over heterozygous single-nucleotide polymorphisms. Allelic imbalance analysis is subject to technical biases, due to differences in the sequences of the measured alleles. Flexible bioinformatics tools are needed to ease the workflow while retaining as much RNA sequencing information as possible throughout the analysis to detect and address the possible biases.Results
We present AllelicImblance, a software program that is designed to detect, manage, and visualize allelic imbalances comprehensively. The purpose of this software is to allow users to pose genetic questions in any RNA sequencing experiment quickly, enhancing the general utility of RNA sequencing. The visualization features can reveal notable, non-trivial allelic imbalance behavior over specific regions, such as exons.Conclusions
The software provides a complete framework to perform allelic imbalance analyses of aligned RNA sequencing data, from detection to visualization, within the robust and versatile management class, ASEset.Electronic supplementary material
The online version of this article (doi:10.1186/s12859-015-0620-2) contains supplementary material, which is available to authorized users. 相似文献17.
18.
19.
20.