首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
Authors:Ben CShirley  Eliseos JMucaki  Tyson Whitehead  Paul ICostea  Pelin Akan  Peter KRogan
Institution:1. Department of Computer Science, Middlesex College, The University of Western Ontario, London, ON N6A 5B7, Canada
2. Department of Biochemistry, Schulich School of Medicine and Dentistry, The University of Western Ontario, London,ON N6A 5C1, Canada
3. SHARCNET, London, ON N6A 5B7, Canada
4. Royal Institute of Technology, Science for Life Laboratory, Solna 17165, Sweden
5. Department of Computer Science, Middlesex College, The University of Western Ontario, London, ON N6A 5B7, Canada;Department of Biochemistry, Schulich School of Medicine and Dentistry, The University of Western Ontario, London,ON N6A 5C1, Canada;Cytognomix Inc., London, ON N6G 4X8, Canada
Abstract:Information theory-based methods have been shown to be sensitive and specific for predicting and quantifying the effects of non-coding mutations in Mendelian diseases. We present the Shannon pipeline software for genome-scale mutation analysis and provide evidence that the software predicts variants affecting mRNA splicing. Individual information contents (in bits) of reference and variant splice sites are compared and significant differences are annotated and prioritized. The software has been implemented for CLC-Bio Genomics platform. Annotation indicates the context of novel mutations as well as common and rare SNPs with splicing effects. Potential natural and cryptic mRNA splicing variants are identified, and null mutations are distinguished from leaky mutations. Mutations and rare SNPs were predicted in genomes of three cancer cell lines (U2OS, U251 and A431), which were supported by expression analyses. After filtering, tractable numbers of potentially deleterious variants are predicted by the software, suitable for further laboratory investigation. In these cell lines, novel functional variants comprised 6–17 inactivating mutations, 1–5 leaky mutations and 6–13 cryptic splicing mutations. Predicted effects were validated by RNA-seq analysis of the three aforementioned cancer cell lines, and expression microarray analysis of SNPs in HapMap cell lines.
Keywords:Mutation  mRNA splicing  Information theory  Next-generation sequencing  Genome interpretation
本文献已被 CNKI 万方数据 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号