首页 | 本学科首页   官方微博 | 高级检索  
   检索      

两种过滤特征基因选择算法的有效性研究
引用本文:李丽,李霞,郭政,汪强虎,王海芸.两种过滤特征基因选择算法的有效性研究[J].生命科学研究,2003,7(4):369-373,376.
作者姓名:李丽  李霞  郭政  汪强虎  王海芸
作者单位:哈尔滨医科大学,生物医学工程学教研室与生物信息学研究室,中国黑龙江,哈尔滨,150086
基金项目:国家自然科学基金(39970397,30170515),国家863计划(2002AA222052),黑龙江科技攻关,黑龙江自然科学基金(F0177),211工程"十五"建设项目.
摘    要:对基因表达谱进行特征基因选择不仅能改善疾病分类方法的效能,而且为寻找与疾病相关的特征基因提供新的途径.通过比较用调整p值的t检验、非参数评分两种特征基因选择算法后和未进行选择时支持向量机(SVM)分类器的分类性能、支持向量(SV)的吻合度、错分样本ID的吻合度和对样本均匀翻倍后的稳定性.结果发现:特征选择后线性、核函数为二阶多项式和径向基的SVM分类性能明显提高;特征选择前后的SV及错分样本ID的吻合度均较高;SVM的稳定性较好.由此得出结论:这两种特征选择算法具有一定的有效性.

关 键 词:特征基因  选择  算法  有效性  支持向量机  DNA芯片技术
文章编号:1007-7847(2003)04-0369-05

Efficiency of Two Filter Feature Gene Selection Algorithms
LI Li,LI Xia,GUO Zheng,WANG Qiang-hu,WANG Hai-yun.Efficiency of Two Filter Feature Gene Selection Algorithms[J].Life Science Research,2003,7(4):369-373,376.
Authors:LI Li  LI Xia  GUO Zheng  WANG Qiang-hu  WANG Hai-yun
Abstract:Application of feature gene selection in gene expression profile not only improve the efficiency of disease pattern classification methods but also provide a new way to find the genes correlation with disease. The performance of Support Vector Machine classifiers, the consistency of support vectors, the consistency of error samples and equably fold train samples between after using adjust p value t-test, nonparametric scoring and using all genes were compared. The results showed that t-test, linear SVM, square polynomial SVM, Radial Basis Function SVM have excellent performance and the stability of SVM is better after feature selection. The consistency of support vectors and the consistency of error samples were high between pre-feature-selection and post-feature-selection. It could be concluded that the two feature gene selection algorithms have strong validity.
Keywords:feature gene selection  feature gene  support vector machine
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号