Computational and analytical framework for small RNA profiling by high-throughput sequencing |
| |
Authors: | Noah Fahlgren Christopher M. Sullivan Kristin D. Kasschau Elisabeth J. Chapman Jason S. Cumbie Taiowa A. Montgomery Sunny D. Gilbert Mark Dasenko Tyler W.H. Backman Scott A. Givan James C. Carrington |
| |
Affiliation: | 1.Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon 97331, USA;2.Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon 97331, USA |
| |
Abstract: | The advent of high-throughput sequencing (HTS) methods has enabled direct approaches to quantitatively profile small RNA populations. However, these methods have been limited by several factors, including representational artifacts and lack of established statistical methods of analysis. Furthermore, massive HTS data sets present new problems related to data processing and mapping to a reference genome. Here, we show that cluster-based sequencing-by-synthesis technology is highly reproducible as a quantitative profiling tool for several classes of small RNA from Arabidopsis thaliana. We introduce the use of synthetic RNA oligoribonucleotide standards to facilitate objective normalization between HTS data sets, and adapt microarray-type methods for statistical analysis of multiple samples. These methods were tested successfully using mutants with small RNA biogenesis (miRNA-defective dcl1 mutant and siRNA-defective dcl2 dcl3 dcl4 triple mutant) or effector protein (ago1 mutant) deficiencies. Computational methods were also developed to rapidly and accurately parse, quantify, and map small RNA data. |
| |
Keywords: | small RNA sequencing-by-synthesis CASHX SAM-seq oligoribonucleotide standards statistical methods |
|
|