Dynamic linear model for the identification of miRNAs in next-generation sequencing data
Authors:W. Evan Johnson, Noah C. Welker, Brenda L. Bass
Institution:Brigham Young University, Provo, Utah 84602, USA. evan@stat.byu.edu
Abstract:Next-generation sequencing technologies are poised to revolutionize the field of biomedical research. The increased resolution of these data promises to provide a greater understanding of the molecular processes that control the morphology and behavior of a cell. However, the increased amounts of data require innovative statistical procedures that are powerful while still being computationally feasible. In this article, we present a method for identifying small RNA molecules, called miRNAs, which regulate genes by targeting their mRNAs for degradation or translational repression. In the first step of our modeling procedure, we apply an innovative dynamic linear model that identifies candidate miRNA genes in high-throughput sequencing data. The model is flexible and can accurately identify interesting biological features while accounting for read count, read spacing, and sequencing depth. In the second step, miRNA candidates are processed using a modified Smith–Waterman sequence alignment that scores the regions for potential RNA hairpins, one of the defining features of miRNAs. We illustrate our method on simulated datasets as well as on a small RNA Caenorhabditis elegans dataset from the Illumina sequencing platform. These examples show that our method is highly sensitive for identifying known and novel miRNA genes.
Keywords:Bayesian methods  Dynamic linear model  Markov chain Monte Carlo  miRNA prediction  Smith–Waterman algorithm  Solexa/Illumina sequencing
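
Illustrative note: the abstract does not describe how the Smith–Waterman alignment was modified. The sketch below is therefore not the authors' method. It uses the standard, unmodified Smith–Waterman local alignment and scores a region by aligning it against its own reverse complement, so that a stem followed by its complementary strand produces a high local score. The function names, the RNA alphabet (A, C, G, U) and the scoring parameters (match=2, mismatch=-1, gap=-2) are assumptions chosen for illustration. G-U wobble pairs and any constraint that keeps the two arms from overlapping are ignored.

```python
# Hypothetical sketch: plain Smith-Waterman used as a rough hairpin score.
# Not the modified alignment from the article; all names and parameters
# here are illustrative assumptions.

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA string over A, C, G, U."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)  # previous row of the score matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                    # start a new local alignment
                prev[j - 1] + sub,    # align a[i-1] with b[j-1]
                prev[j] + gap,        # gap in b
                curr[j - 1] + gap,    # gap in a
            )
            best = max(best, curr[j])
        prev = curr
    return best


def hairpin_score(rna):
    """Score a region by aligning it against its own reverse complement."""
    return smith_waterman_score(rna, reverse_complement(rna))


if __name__ == "__main__":
    stem = "GGGGCC"
    candidate = stem + "AAAA" + reverse_complement(stem)  # stem-loop-stem
    print(hairpin_score(candidate))           # high: the two arms pair up
    print(hairpin_score("AAAAAAAAAAAAAAAA"))  # no complementary arm
```

Under these assumptions, a stem-loop candidate scores well above a sequence with no self-complementary arms. A real pipeline would also need a threshold calibrated against background sequence.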
This article is indexed in PubMed and other databases.