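Screening of Glioma-Related Genes and Establishment of a Prediction Model Based on the CFS-mRMR Feature Selection Method and the AdaBoost Algorithm
Cite this article: QIU Chun, MA Qiao-rong, ZHAO Man-man, SU Qiang, ZHONG Mei-zuo. Screening of Glioma-Related Genes and Establishment of a Prediction Model Based on the CFS-mRMR Feature Selection Method and the AdaBoost Algorithm[J]. Progress in Modern Biomedicine (现代生物医学进展), 2019, 19(1): 26-30.
Authors: QIU Chun  MA Qiao-rong  ZHAO Man-man  SU Qiang  ZHONG Mei-zuo
Affiliations: Department of Oncology, Xiangya Hospital, Central South University; Department of Oncology, Hainan Provincial People's Hospital; Clinical Laboratory, Minzu Hospital of Guangxi Zhuang Autonomous Region; College of Life Sciences, Shanghai University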
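Abstract: Objective: To identify genes associated with the mechanism of glioma development and, on that basis, to build a model that predicts the occurrence of glioma. Methods: Glioma microarray data were collected from GEO. Differentially expressed genes were selected with the Correlation-based Feature Subset (CFS) and Minimum Redundancy Maximum Relevance (mRMR) feature selection methods, and their functions were analyzed. A glioma prediction model was then built with the AdaBoost algorithm, and its predictive performance was evaluated. Results: Feature selection yielded 19 genes related to glioma. Using these 19 genes as the feature subset, a glioma prediction model was built with the AdaBoost algorithm; on validation, the model reached a prediction accuracy of 95.59%. GO and KEGG analysis of the 19 differentially expressed genes indicated that these genes play a role in tumor occurrence and progression. Conclusion: The CFS-mRMR feature selection method can effectively identify genes associated with glioma. The 19 selected differentially expressed genes are biologically meaningful, and the prediction model built on them can effectively predict the occurrence of glioma.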

Keywords: Glioma  Feature selection  Differentially expressed genes  AdaBoost
Received: 2018/5/14
Revised: 2018/6/12
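
The mRMR step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simplified, correlation-based variant in which relevance is the absolute Pearson correlation between a gene and the class label, and redundancy is the mean absolute correlation with genes already chosen. The function names `pearson` and `mrmr_select` are hypothetical, and the data are synthetic rather than taken from GEO.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def mrmr_select(columns, labels, k):
    """Greedily pick k feature indices, maximising relevance minus redundancy.

    columns: one list of expression values per gene (assumed layout).
    labels:  one 0/1 class label per sample.
    """
    relevance = [abs(pearson(col, labels)) for col in columns]
    selected = []
    remaining = set(range(len(columns)))

    def score(j):
        # The first pick uses relevance only; later picks are penalised by
        # their mean absolute correlation with the genes chosen so far.
        if not selected:
            return relevance[j]
        redundancy = sum(abs(pearson(columns[j], columns[s]))
                         for s in selected) / len(selected)
        return relevance[j] - redundancy

    while remaining and len(selected) < k:
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Synthetic toy data, NOT the GEO data used in the paper.
rng = random.Random(0)
labels = [i % 2 for i in range(200)]
g0 = [y + rng.gauss(0, 0.3) for y in labels]        # informative gene
g1 = [v + rng.gauss(0, 0.01) for v in g0]           # near-duplicate of g0
g2 = [(1 - y) + rng.gauss(0, 0.6) for y in labels]  # informative, independent noise
g3 = [rng.gauss(0, 1) for _ in labels]              # pure noise
selected = mrmr_select([g0, g1, g2, g3], labels, k=2)
print(selected)
```

On this toy data the redundancy penalty keeps the near-duplicate pair `g0`/`g1` from being selected together, which is the behaviour that motivates mRMR over ranking by relevance alone.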

Study of Classification of Gliomas Prediction Based on Machine Learning Method
QIU Chun, MA Qiao-rong, ZHAO Man-man, SU Qiang and ZHONG Mei-zuo. Study of Classification of Gliomas Prediction Based on Machine Learning Method[J]. Progress in Modern Biomedicine, 2019, 19(1): 26-30.
Authors:QIU Chun  MA Qiao-rong  ZHAO Man-man  SU Qiang and ZHONG Mei-zuo
Institution: Xiangya Hospital of Central South University, Changsha, Hunan, 410008, China; Hainan Provincial People's Hospital, Haikou, Hainan, 570311, China; Clinical Laboratory, Affiliated Minzu Hospital of Guangxi Medical University, Nanning, Guangxi, 530001, China; Shanghai Key Laboratory of Bio-Energy Crops, College of Life Science, Shanghai University, Shanghai, 200444, China
Abstract: Objective: This study aims to identify genes related to the mechanisms of glioma occurrence and to build a prediction model for glioma. Methods: Data were collected from the GEO database, and a prediction model for glioma was built by combining mRMR and correlation-based feature subset (CfsSubset) selection with the AdaBoost method. Results: After feature selection, 19 genes related to the mechanisms of glioma occurrence were obtained. Based on these 19 genes, an AdaBoost prediction model was built, which could be applied to predict the occurrence of glioma. The model yielded an accuracy of 95.59% in 10-fold cross-validation. EGFR and MAD2L1 were found to be related to gliomas based on GO and KEGG analysis. Conclusion: CFS-mRMR is an efficient feature selection method for finding key genes correlated with gliomas, and the selected genes can also be used to build a prediction model.
Keywords: Gliomas  Feature selection  AdaBoost  Differentially expressed genes
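
The classification and validation steps named in the abstract (AdaBoost evaluated by 10-fold cross-validation) can be sketched as follows. This is an illustrative sketch, not the authors' pipeline: it assumes decision stumps as the weak learner and labels coded as -1/+1, all function names are hypothetical, and the data are synthetic, so the printed accuracy has no relation to the reported 95.59%.

```python
import math
import random


def stump_predict(row, j, t, pol):
    """Decision stump: predict pol when feature j exceeds threshold t, else -pol."""
    return pol if row[j] > t else -pol


def fit_stump(X, y, w):
    """Exhaustively search for the stump with the lowest weighted error."""
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            for pol in (1, -1):
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(row, j, t, pol) != yi)
                if best is None or err < best[0]:
                    best = (err, j, t, pol)
    return best


def adaboost_fit(X, y, rounds=10):
    """Discrete AdaBoost with labels in {-1, +1}; returns a list of weighted stumps."""
    n = len(X)
    w = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, j, t, pol = fit_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, j, t, pol))
        # Increase the weight of misclassified samples, then renormalise.
        w = [wi * math.exp(-alpha * yi * stump_predict(row, j, t, pol))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model


def adaboost_predict(model, row):
    """Sign of the alpha-weighted vote of all stumps."""
    s = sum(alpha * stump_predict(row, j, t, pol) for alpha, j, t, pol in model)
    return 1 if s > 0 else -1


def cross_val_accuracy(X, y, k=10, rounds=10, seed=0):
    """k-fold cross-validation: each sample is predicted by a model that never saw it."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    correct = 0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        model = adaboost_fit([X[i] for i in train], [y[i] for i in train], rounds)
        correct += sum(adaboost_predict(model, X[i]) == y[i] for i in fold)
    return correct / len(X)


# Synthetic toy data, NOT the GEO data used in the paper.
rng = random.Random(1)
y = [1 if i % 2 else -1 for i in range(80)]
X = [[yi + rng.gauss(0, 0.4), rng.gauss(0, 1)] for yi in y]
acc = cross_val_accuracy(X, y, k=10, rounds=5)
print(f"10-fold CV accuracy on toy data: {acc:.3f}")
```

In a real study the feature selection would also need to be repeated inside each training fold; selecting genes on the full data set before cross-validation tends to inflate the accuracy estimate.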
This article is indexed in CNKI and other databases.