首页 | 本学科首页   官方微博 | 高级检索  
   检索      

基于混合泊松分布的新生突变识别算法
引用本文:高迎心,温佳威,徐尔,艾冬梅.基于混合泊松分布的新生突变识别算法[J].中国生物化学与分子生物学报,2017,33(11):1168-1174.
作者姓名:高迎心  温佳威  徐尔  艾冬梅
作者单位:北京科技大学 数理学院 信息与计算科学系,北京100083;2)河南偃师高级中学, 河南 洛阳471900
基金项目:国家自然科学基金(No.61370131)资助
摘    要:对个体而言,不经父母遗传而后天获得的突变称为新生突变,绝大多数癌症都起自新生突变。构建快速精确的变异识别算法将有助于对癌症的研究。然而,针对前期新生突变识别算法准确率不高,且耗时多等问题,本文引入了基于变异位点的先验概率分布模型,运用基于混合泊松分布的期望最大化(EM)算法对新生突变识别算法进行改进与优化,研究了有亲缘关系的新生突变的识别,并在识别精度与运算速度方面与已有算法进行对比。结果表明,基于混合泊松分布的 期望最大化算法在提高运算速度的同时降低了假阳性比率,具有良好的识别效果。

关 键 词:人类基因组    新生突变    混合泊松分布    遗传疾病  
收稿时间:2017-08-06

Recognition of de Novo Mutations Based on Hybrid Poisson Distribution
GAO Ying-Xin,WEN Jia-Wei,XU Er,AI Dong-Mei.Recognition of de Novo Mutations Based on Hybrid Poisson Distribution[J].Chinese Journal of Biochemistry and Molecular Biology,2017,33(11):1168-1174.
Authors:GAO Ying-Xin  WEN Jia-Wei  XU Er  AI Dong-Mei
Abstract:For the individual, gene mutations that are acquired without parental inheritance are the origins of vast majority of cancers. Application of fast and accurate recognition algorithms will be a great help to the study of cancer. Aiming at the problem of poor accuracy and time consumption, a prior probability model of mutation sites was introduced. To modify and optimize the recognition algorithm, the Expectation Maximum (EM) algorithm based on mixed Poisson distribution was used to identify the de novo mutation involving kinship data and compare with the existing algorithms in recognition accuracy and computing speed. The results show that the EM algorithm based on mixed Poisson distribution can improve the speed of operation and reduce the false positive ratio, which is of great significance for the recognition of cancer.
Keywords:human genome  de novo mutation     hybrid Poisson distribution  genetic disease  
本文献已被 CNKI 等数据库收录!
点击此处可从《中国生物化学与分子生物学报》浏览原始摘要信息
点击此处可从《中国生物化学与分子生物学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号