首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 203 毫秒
1.
改进的遗传算法(GA)自动优化支持向量机(SVM)参数,同步决策最优特征子集。新颖的分组多基因交叉技术保留了基因小组中的信息,而且允许后代继承更多的来自染色体的遗传信息。该算法促进可行解集中的高质量染色体信息交换,提高了解空间的搜索能力。实验结果说明:改进GA-SVM不仅可决策出与疾病相关的重要特征变量、优化SVM参数,而且可提升分类性能。与前馈BP神经网络及自适应模糊推理系统两种学习算法的比较表明,改进GA-SVM具有更好地表现。  相似文献   

2.
将粒子群优化算法应用于序列联配,提出了一种改进的粒子群优化算法,该算法在粒子群的进化过程中根据粒子的适应值动态地调整粒子群的惯性权重与粒子群飞行速度范围,提高了算法的收敛速度和收敛精度;针对PSO算法可能出现的早熟现象,引入重新初始化机制,增强了算法的搜索能力,实验表明该算法是有效的。  相似文献   

3.
基于粒子群优化算法的脑磁图源定位   总被引:1,自引:0,他引:1  
脑磁图作为一种新型的脑探测技术,具有较高定位精度和毫秒级时间分辨率的特点。快速准确地利用脑磁图技术对三维空间中的脑神经活动源进行定位,对于脑功能研究和医学临床应用都具有重要的应用价值。可是,目前的脑磁图源定位广泛采用了多信号分类方法,它要求对三维大脑空间进行全局扫描,需要大量的计算,存在速度慢的缺点。针对这一问题,提出了一种基于粒子群优化算法的脑磁图源定位方法。先利用粒子群优化算法全局搜索能力强的特点寻找出目标函数的全局最优值,进行初步的脑磁图源定位;然后,再在小范围内进行小网格的搜索,进一步实现精确的定位。实验结果表明,基于粒子群优化算法的脑磁图源定位能够很好地解决上述问题,具有计算速度快、定位精度高的特点。  相似文献   

4.
蛋白质亚细胞定位预测对蛋白质的功能、相互作用及调控机制的研究具有重要意义。本文基于物化性质和结构性质对氨基酸的约化,描述序列局部和全局信息的"组成"、"转换"和"分布"特征,并利用氨基酸亲疏水性的数值统计特征,提出了一种新的蛋白质特征表示方法(NSBH)。分别使用三种分类器KNN、SVM及BP神经网络进行蛋白质亚细胞定位预测,比较了几种方法和特征融合方法的预测结果,显示融合特征表示及结合SVM分类器时能够达到更好的预测准确率。同时,还详细讨论了不同参数对实验结果的影响,具体的实验及比较结果显示了该方法的有效性。  相似文献   

5.
为探讨人工神经网络(ANN)在昆虫分类上的可行性,本文提出利用主成分分析和数学建模等方法相结合改进ANN,并以鳞翅目夜蛾科6种蛾类昆虫为样本进行了验证.首先利用Bugshape1.0特征提取软件获取6种蛾180个右前翅样本的13项数学形态特征数据,再运用主成分分析对蛾翅数学形态特征变量重新组合生成新的综合变量,然后结合主成分分析建立BP神经网络分类器.主成分分析结果表明,前5个主成分的累积贡献率为85.52%,已基本包含了全部特征变量具有的信息.在主成分分析的基础上,建立具有5个输入层节点,12个隐含层节点和1个输出层节点的三层BP神经网络分类器.每种蛾20个样本共120组特征数据对分类器进行训练和仿真,其余60组特征数据对分类器进行验证,仿真输出值与目标值的相关系数R=0.997,分类正确率达到了93.33%.较之未经过主成分分析而单独使用BP神经网络建立的分类器,基于主成分分析的BP神经网络分类器具有更优的性能和更准确的分类能力.研究结果表明本文提出的方法具有很好的分类和鉴别作用,为蛾种类的鉴别提供了一种可行的方法.  相似文献   

6.
基于SVM和平均影响值的人肿瘤信息基因提取   总被引:1,自引:0,他引:1       下载免费PDF全文
基于基因表达谱的肿瘤分类信息基因选取是发现肿瘤特异表达基因、探索肿瘤基因表达模式的重要手段。借助由基因表达谱获得的分类信息进行肿瘤诊断是当今生物信息学领域中的一个重要研究方向,有望成为临床医学上一种快速而有效的肿瘤分子诊断方法。鉴于肿瘤基因表达谱样本数据维数高、样本量小以及噪音大等特点,提出一种结合支持向量机应用平均影响值来寻找肿瘤信息基因的算法,其优点是能够搜索到基因数量尽可能少而分类能力尽可能强的多个信息基因子集。采用二分类肿瘤数据集验证算法的可行性和有效性,对于结肠癌样本集,只需3个基因就能获得100%的留一法交叉验证识别准确率。为避免样本集的不同划分对分类性能的影响,进一步采用全折交叉验证方法来评估各信息基因子集的分类性能,优选出更可靠的信息基因子集。与基它肿瘤分类方法相比,实验结果在信息基因数量以及分类性能方面具有明显的优势。  相似文献   

7.
建立了基于小波降噪和支持向量机的结肠癌基因表达数据肿瘤识别模型.对试验数据进行小波分解,并利用交叉验证的方法计算试验样本的平均分类准确率,确定小波函数与小波分解层数;引入能量阈值方法对小波分解系数进行阈值处理,达到降噪的目的;提出了基因分类贡献率与主成分分析结合的方法,提取结肠癌样本数据特征;利用支持向量机强大的非线性映射能力,实现对结肠癌样本数据的非线性分类.为了减弱样本集的划分对分类准确率的影响,本文采取Jackknife检验方法对支持向量分类器的分类器检验,其分类准确率为96.77%.试验结果证明了该方法的有效性,该方法对结肠癌的识别具有一定的参考价值.  相似文献   

8.
针对目前多分类运动想象脑电识别存在特征提取单一、分类准确率低等问题,提出一种多特征融合的四分类运动想象脑电识别方法来提高识别率。对预处理后的脑电信号分别使用希尔伯特-黄变换、一对多共空间模式、近似熵、模糊熵、样本熵提取结合时频—空域—非线性动力学的初始特征向量,用主成分分析降维,最后使用粒子群优化支持向量机分类。该算法通过对国际标准数据集BCI2005 Data set IIIa中的k3b受试者数据经MATLAB仿真处理后获得93.30%的识别率,均高于单一特征和其它组合特征下的识别率。分别对四名实验者实验采集运动想象脑电数据,使用本研究提出的方法处理获得了72.96%的平均识别率。结果表明多特征融合的特征提取方法能更好的表征运动想象脑电信号,使用粒子群支持向量机可取得较高的识别准确率,为人脑的认知活动提供了一种新的识别方法。  相似文献   

9.
采用Boosting机制的决策树集成分类器对嗜热和常温蛋白进行模式识别。通过自一致性检验、交叉验证和独立样本测试三种方法检测,其中作为Boosting算法中新的Logitboost算法表现更好,其识别的精度分别为100%、88.4%和89.5%,优于神经网络的识别效果。同时探讨了蛋白质分子大小对识别效果的影响。结果表明,将Boosting算法与其它单一分类器有效结合,有望提高研究者对生物分子相关特性的识别能力。  相似文献   

10.
鸟鸣识别是生态监测的重要手段,为进一步提升鸟鸣识别的准确性和鲁棒性,本文提出了1种新的基于深度特征融合的鸟鸣识别方法。该方法首先利用深度特征提取网络对鸟鸣的对数梅尔谱图和补充特征集的深度特征进行提取,再将两种深度特征进行融合,最后使用轻量级梯度提升机(light gradient boosting machine,light GBM)分类器进行分类。本文充分利用深度神经网络的特征提取能力以及light GBM的分类性能,将特征提取和特征分类过程进行分离,从而实现了高准确率的鸟鸣识别。实验结果显示,本文提出的方法在北京百鸟数据集中取得了目前已知的最佳结果,模型的平均准确率达到了98.70%,平均F1分数达到了98.84%。相比传统方法,深度融合特征在鸟鸣识别任务上准确率提升了5.62%以上。同时,引入的light GBM分类器使分类准确率提升了3.02%。此外,在CLO-43SD和Bird CLEF2022比赛的数据集中,本文方法也展现出卓越的性能,分别取得了98.32%和91.12%的平均准确率。本文还引入了类激活图对不同类型鸟鸣的识别结果进行可解释性分析,揭示了...  相似文献   

11.
目的:比较反向传播算法(BP)神经网络和径向基函数(RBF)神经网络预测老年痴呆症疾病进展的效果。方法:以老年痴呆症随访数据为研究对象,以性别、年龄、受教育程度、有无高血压、有无高胆固醇、有无心脏病、有无中风史、有无家族史8个指标作为输入变量,以五年随访的MMSE差值为输出变量,构建基于BP神经网络和RBF神经网络的老年痴呆症疾病进展预测模型。结果:与BP神经网络模型相比,RBF神经网络预测的结果更好,能够有效地预测老年痴呆症疾病进展。结论:神经网络模型将老年痴呆症疾病进展预测问题转化为随访数据中相关测量指标与MMSE差值的非线性问题,为复杂的老年痴呆症疾病进展预测提供了新思路。  相似文献   

12.
ObjectiveStudying the diagnostic value of CT imaging in non-small cell lung cancer (NSCLC), and establishing a prognosis model combined with clinical characteristics is the objective, so as to provide a reference for the survival prediction of NSCLC patients.MethodCT scan data of NSCLC 200 patients were taken as the research object. Through image segmentation, the radiology features of CT images were extracted. The reliability and performance of the prognosis model based on the optimal feature number of specific algorithm and the prognosis model based on the global optimal feature number were compared.Results30-RELF-NB (30 optimal features, RELF feature selection algorithm and NB classifier) has the highest accuracy and AUC (area under the subject characteristic curve) in the prognosis model based on the optimal features of specific algorithm. Among the prognosis models based on global optimal features, 25-NB (25 global optimal features, naive Bayes classification algorithm classifier) has the highest accuracy and AUC. Compared with the prediction model based on feature training of specific feature selection algorithm, the overall performance and stability of the prediction model based on global optimal feature are higher.ConclusionThe prognosis model based on the global optimal feature established in this paper has good reliability and performance, and can be applied to the CT radiology of NSCLC.  相似文献   

13.
BP人工神经网络模拟杨树林冠蒸腾   总被引:4,自引:0,他引:4  
利用2008和2010年的气温、饱和差、总辐射和叶面积指数作为模型输入,液流法观测的蒸腾速率作为模型输出,建立了用于杨树林冠蒸腾模拟的BP人工神经网络模型,利用2009年的观测数据对模型的模拟能力进行了检验,并应用连接权值计算得到的输入变量对输出变量的相对贡献进行了敏感性分析。结果表明:建立的BP人工神经网络蒸腾模型可以很好的模拟林冠蒸腾大小和季节变化,模拟的绝对误差和绝对相对误差的平均值分别为0.11 mm/d和9.5%,纳什效率系数为0.83;输入变量对蒸腾的相对贡献以及蒸腾与输入变量之间的相关性大小顺序相同,均为总辐射叶面积指数饱和差气温。  相似文献   

14.
Feature selection from DNA microarray data is a major challenge due to high dimensionality in expression data. The number of samples in the microarray data set is much smaller compared to the number of genes. Hence the data is improper to be used as the training set of a classifier. Therefore it is important to select features prior to training the classifier. It should be noted that only a small subset of genes from the data set exhibits a strong correlation with the class. This is because finding the relevant genes from the data set is often non-trivial. Thus there is a need to develop robust yet reliable methods for gene finding in expression data. We describe the use of several hybrid feature selection approaches for gene finding in expression data. These approaches include filtering (filter out the best genes from the data set) and wrapper (best subset of genes from the data set) phases. The methods use information gain (IG) and Pearson Product Moment Correlation (PPMC) as the filtering parameters and biogeography based optimization (BBO) as the wrapper approach. K nearest neighbour algorithm (KNN) and back propagation neural network are used for evaluating the fitness of gene subsets during feature selection. Our analysis shows that an impressive performance is provided by the IG-BBO-KNN combination in different data sets with high accuracy (>90%) and low error rate.  相似文献   

15.
《IRBM》2023,44(3):100748
ObjectivesEsophageal cancer is a high occult malignant tumor. Even with good diagnosis and treatment, the 5-year survival rate of esophageal cancer patients is still less than 30%. Considering the influence of clinical characteristics on postoperative esophageal cancer patients, the construction of a neural network model will help improve the poor prognosis of patients in the five years.Material and methodsIn this study, genetic algorithm optimized deep neural network is exploited to the clinical dataset of esophageal cancer. The independent prognostic factors are screened by Relief algorithm and Cox proportional risk regression. FTD prognostic staging system is established to assess the risk level of esophageal cancer patients.ResultsFTD staging system and independent prognostic factors are integrated into the genetic algorithm optimized deep neural network. The Area Under Curve (AUC) of FTD staging system is 0.802. FTD staging system is verified by the Kaplan-Meier survival curve, and the median survival time is divided for different risk grades. The FTD staging system is superior to the TNM stages in the prognosis effect. The AUC of deep neural network optimized by genetic algorithm is 0.91.ConclusionThe deep neural network optimized by genetic algorithm has good performance in predicting the 5-year survival status of esophageal cancer patients. The FTD staging system has a significant prognostic effect. The FTD staging system and genetic algorithm optimized deep neural network can be successfully availed in clinical diagnosis and treatment.  相似文献   

16.

Background  

Supervised classification is fundamental in bioinformatics. Machine learning models, such as neural networks, have been applied to discover genes and expression patterns. This process is achieved by implementing training and test phases. In the training phase, a set of cases and their respective labels are used to build a classifier. During testing, the classifier is used to predict new cases. One approach to assessing its predictive quality is to estimate its accuracy during the test phase. Key limitations appear when dealing with small-data samples. This paper investigates the effect of data sampling techniques on the assessment of neural network classifiers.  相似文献   

17.
AimThe aim was to design and develop a decision support system with a graphical user interface for the prediction of the case of peripheral nerve disorder and to build a classifier using artificial neural networks that can distinguish between carpal tunnel syndrome, neuropathy and normal peripheral nerve conduction.Materials and methodsThe data used were the Nerve Conduction Study data obtained from Kannur Medical College, India. A recurrent neural network and a two-layer feed forward network trained with scaled conjugate gradient back-propagation algorithm were implemented and results were compared.ResultsBoth the networks provided fast convergence and good performance, accuracy being 98.6% and 97.4% for the recurrent neural network and the feed forward networks respectively, the confusion matrix in each case indicated only a few misclassifications. The developed decision support system also gave accurate results in agreement with the specialist's diagnosis and was also useful in storing and viewing the results.DiscussionsIn the field of medicine, programs are being developed that aids in diagnostic decision making by emulating human intelligence such as logical thinking, decision making, learning, etc. The system developed proves useful in combination with other systems in providing diagnostic and predictive medical opinions. It was not meant to replace the specialist, yet it can be used to assist a general practitioner or specialist in diagnosing and predicting patient's condition.ConclusionsThe study proves that artificial neural networks are indeed of value in combination with other systems in providing diagnostic and predictive medical opinions. But the major drawback of these studies, which makes use of the nerve conduction study data are the inherent shortcomings of the interpretation of the results, which include lack of standardization and absence of population-based reference intervals.  相似文献   

18.
We investigate the general problem of signal classification and, in particular, that of assigning stimulus labels to neural spike trains recorded from single cortical neurons. Finding efficient ways of classifying neural responses is especially important in experiments involving rapid presentation of stimuli. We introduce a fast, exact alternative to Bayesian classification. Instead of estimating the class-conditional densities p(x|y) (where x is a scalar function of the feature[s], y the class label) and converting them to P(y|x) via Bayes’ theorem, this probability is evaluated directly and without the need for approximations. This is achieved by integrating over all possible binnings of x with an upper limit on the number of bins. Computational time is quadratic in both the number of observed data points and the number of bins. The algorithm also allows for the computation of feedback signals, which can be used as input to subsequent stages of inference, e.g. neural network training. Responses of single neurons from high-level visual cortex (area STSa) to rapid sequences of complex visual stimuli are analysed. Information latency and response duration increase nonlinearly with presentation duration, suggesting that neural processing speeds adapt to presentation speeds. Action Editor: Alexander Borst  相似文献   

19.
目的:探究将统计学习方法应用于心理测验所得的大量数据进行学习分析的可行性,并基于探究结果对飞行职业的人格特征进行进一步探索,为飞行人员的选拔及评估提供新的思路。方法:从某航空公司随机抽取1020名男性被试,其中飞行人员510名,非飞行人员510名,采用卡特尔16项人格测试对其进行测验,施测后对得到的16项因子分采用支持向量机就随机划分的训练组和测试组进行学习,分析学习结果。结果:挑选出4项因子作为分类的特征因子,基于线性支持向量机构建的分类器在交叉验证下的平均正确率为64%。结论:采用SVM构建的分类器具有一定的可靠性和有效性。  相似文献   

20.
《IRBM》2022,43(6):678-686
ObjectivesFeature selection in data sets is an important task allowing to alleviate various machine learning and data mining issues. The main objectives of a feature selection method consist on building simpler and more understandable classifier models in order to improve the data mining and processing performances. Therefore, a comparative evaluation of the Chi-square method, recursive feature elimination method, and tree-based method (using Random Forest) used on the three common machine learning methods (K-Nearest Neighbor, naïve Bayesian classifier and decision tree classifier) are performed to select the most relevant primitives from a large set of attributes. Furthermore, determining the most suitable couple (i.e., feature selection method-machine learning method) that provides the best performance is performed.Materials and methodsIn this paper, an overview of the most common feature selection techniques is first provided: the Chi-Square method, the Recursive Feature Elimination method (RFE) and the tree-based method (using Random Forest). A comparative evaluation of the improvement (brought by such feature selection methods) to the three common machine learning methods (K- Nearest Neighbor, naïve Bayesian classifier and decision tree classifier) are performed. For evaluation purposes, the following measures: micro-F1, accuracy and root mean square error are used on the stroke disease data set.ResultsThe obtained results show that the proposed approach (i.e., Tree Based Method using Random Forest, TBM-RF, decision tree classifier, DTC) provides accuracy higher than 85%, F1-score higher than 88%, thus, better than the KNN and NB using the Chi-Square, RFE and TBM-RF methods.ConclusionThis study shows that the couple - Tree Based Method using Random Forest (TBM-RF) decision tree classifier successfully and efficiently contributes to find the most relevant features and to predict and classify patient suffering of stroke disease.”  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号