首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到19条相似文献,搜索用时 125 毫秒
1.
结合小波分析理论与支持向量机理论,构造分类器模型,将前列腺癌基因芯片数据分成癌症和正常两种。本文着重研究小波高频系数基因芯片数据的特征提取,并通过实验对比小波高频系数和低频系数特征提取对分类器性能的影响。其中haar小波3层分解提取高频系数,送入分类器分类后,得到的正确分类率为93.31%。db1小波4层分解提取低频系数,送入分类器分类后,得到的正确分类率为93.53%。小波低频系数特征提取分类效果总体上好于高频系数,分类器性能稳定。  相似文献   

2.
建立了基于小波降噪和支持向量机的结肠癌基因表达数据肿瘤识别模型.对试验数据进行小波分解,并利用交叉验证的方法计算试验样本的平均分类准确率,确定小波函数与小波分解层数;引入能量阈值方法对小波分解系数进行阈值处理,达到降噪的目的;提出了基因分类贡献率与主成分分析结合的方法,提取结肠癌样本数据特征;利用支持向量机强大的非线性映射能力,实现对结肠癌样本数据的非线性分类.为了减弱样本集的划分对分类准确率的影响,本文采取Jackknife检验方法对支持向量分类器的分类器检验,其分类准确率为96.77%.试验结果证明了该方法的有效性,该方法对结肠癌的识别具有一定的参考价值.  相似文献   

3.
高维蛋白质波谱癌症数据分析,一直面临着高维数据的困扰。针对高维蛋白质波谱癌症数据在降维过程中的问题,提出基于小波分析技术和主成分分析技术的高维蛋白质波谱癌症数据特征提取的方法,并在特征提取之后,使用支持向量机进行分类。对8-7-02数据集进行2层小波分解时,分别使用db1、db3、db4、db6、db8、db10、haar小波基,并使用支持向量机进行分类,正确率分别达到98.18%、98.35%、98.04%、98.36%、97.89%、97.96%、98.20%。在进一步提高分类识别正确率的同时,提高了时间率。  相似文献   

4.
目的:本文利用表面肌电(sEMG)信号来研究多种手指组合动作的识别问题。方法:在对采集的四个通道sEMG信号进行降噪预处理的基础上,采用移动加窗处理方法来提取关于手指运动状态的信号活动段,再分析各个信号活动段的小波系数统计特征,进而利用多类支持向量机(SVM)分类算法来实现手指组合动作的识别。结果:动作识别率最高达到100%。结论:所采用方法能够有效地识别多种手势动作,并为后续基于肌电信号的实时人机接口系统的研究奠定了理论基础。  相似文献   

5.
本文首次以平均ERP和单次ERP小波变换系数相关性为基础,设计了小波时频滤波器,可以将单次事件关联电位的P3波从眼动、自发脑电等干扰中提取出来  相似文献   

6.
高维蛋白质波谱数据分析过程中,对于数据的特征提取一直是许多学者专注解决的问题。本文提出了一种基于高频系数的小波分析和主成份分析技术(Principal component analysis,PCA)的特征提取方法,首先采用小波分析技术对数据进行降噪,提取高频系数作为特征,之后用主成份分析技术进行降维。实验显示:本论文中提出的方法在8-7-02、4/3/02数据集上的实验识别率分别可以达到100%和99.45%,可以有效提高分类识别率。  相似文献   

7.
李博  李强 《生物磁学》2011,(20):3942-3945
目的:本文利用表面肌电(sEMG)信号来研究多种手指组合动作的识别问题。方法:在对采集的四个通道sEMG信号进行降噪预处理的基础上,采用移动加窗处理方法来提取关于手指运动状态的信号活动段,再分析各个信号活动段的小波系数统计特征,进而利用多类支持向量机(SVM分类算法来实现手指组合动作的识别。结果:动作识别率最高达到100%。结论:所采用方法能够有效地识别多种手势动作,并为后续基于肌电信号的实时人机接口系统的研究奠定了理论基础。  相似文献   

8.
目的:针对老人易跌倒和跌倒过后可能产生严重后果这一现实问题,通过将表面肌电信号和加速度融合,进一步优化采用支持向量机分类器下的包含跌倒在内的几种不同动作的分类效果。方法:提出基于表面肌电和加速度信号融合的跌倒识别算法,首先采集股直肌,股内侧肌,胫骨前肌和腓肠肌的表面肌电信号以及位于腰部的三轴加速度信号作为实验数据,然后利用滑动窗口法提取表面肌电和加速度信号的均方根值,最后针对人体日常活动和跌倒的运动特征,构建了支持向量机的分类器。结果:实验数据共计320组数据,包括3种日常活动和向前跌倒,其中160组数据作为训练集,另外160组数据作为测试集。对4种动作进行识别实验,算法的准确度为93.23%、灵敏度为92.4%、特异度为100%,达到了良好的分类效果。结论:基于支持向量机的表面肌电信号和加速度融合的跌倒识别算法分类效果良好,对于老人跌倒防护具有现实意义。  相似文献   

9.
基于支持向量机的~(31)P磁共振波谱肝细胞癌诊断   总被引:1,自引:1,他引:0  
支持向量机是在统计学习理论基础上发展起来的一种新的机器学习方法,在模式识别领域有着广泛的应用。利用基于支持向量机模型的31P磁共振波谱数据对肝脏进行分类,区别肝细胞癌,肝硬化和正常的肝组织。通过对基于多项式核函数和径向基核函数的支持向量机分类器进行比较,并且得到三种肝脏分类的识别率。实验表明基于31P磁共振波谱数据的支持向量机分类模型能够对活体肝脏进行诊断性的预测。  相似文献   

10.
一种滤除医学影像噪声的混合滤波算法   总被引:1,自引:0,他引:1       下载免费PDF全文
目的:医学影像在获取、存储、传输过程中会不同程度地受到噪声污染,这极大影像了其在临床诊疗中的应用。为了有效地滤除医学影像噪声,提出了一种混合滤波算法。方法:该算法首先将含有高斯和椒盐噪声的图像进行形态学开运算,然后对开运算后的图像进行二维小波分解,得到高频和低频小波分解系数。保留低频系数不变,将高频系数经过维纳滤波器进行滤波,最后进行小波系数重构。结果:采用该混合滤波算法、小波阈值去噪、中值滤波、维纳滤波分别对含有混合噪声的医学影像分别进行滤除噪声处理,该滤波算法去噪后影像的PSNR值明显高于其他三种方法。结论:该混合滤波算法是一种较为有效的医学影像噪声滤除方法。  相似文献   

11.
We present a method of data reduction using a wavelet transform in discriminant analysis when the number of variables is much greater than the number of observations. The method is illustrated with a prostate cancer study, where the sample size is 248, and the number of variables is 48,538 (generated using the ProteinChip technology). Using a discrete wavelet transform, the 48,538 data points are represented by 1271 wavelet coefficients. Information criteria identified 11 of the 1271 wavelet coefficients with the highest discriminatory power. The linear classifier with the 11 wavelet coefficients detected prostate cancer in a separate test set with a sensitivity of 97% and specificity of 100%.  相似文献   

12.
MOTIVATION: DNA microarray data analysis has been used previously to identify marker genes which discriminate cancer from normal samples. However, due to the limited sample size of each study, there are few common markers among different studies of the same cancer. With the rapid accumulation of microarray data, it is of great interest to integrate inter-study microarray data to increase sample size, which could lead to the discovery of more reliable markers. RESULTS: We present a novel, simple method of integrating different microarray datasets to identify marker genes and apply the method to prostate cancer datasets. In this study, by applying a new statistical method, referred to as the top-scoring pair (TSP) classifier, we have identified a pair of robust marker genes (HPN and STAT6) by integrating microarray datasets from three different prostate cancer studies. Cross-platform validation shows that the TSP classifier built from the marker gene pair, which simply compares relative expression values, achieves high accuracy, sensitivity and specificity on independent datasets generated using various array platforms. Our findings suggest a new model for the discovery of marker genes from accumulated microarray data and demonstrate how the great wealth of microarray data can be exploited to increase the power of statistical analysis. CONTACT: leixu@jhu.edu.  相似文献   

13.
In the past, microarray studies have been criticized due to noise and the limited overlap between gene signatures. Prior biological knowledge should therefore be incorporated as side information in models based on gene expression data to improve the accuracy of diagnosis and prognosis in cancer. As prior knowledge, we investigated interaction and pathway information from the human interactome on different aspects of biological systems. By exploiting the properties of kernel methods, relations between genes with similar functions but active in alternative pathways could be incorporated in a support vector machine classifier based on spectral graph theory. Using 10 microarray data sets, we first reduced the number of data sources relevant for multiple cancer types and outcomes. Three sources on metabolic pathway information (KEGG), protein-protein interactions (OPHID) and miRNA-gene targeting (microRNA.org) outperformed the other sources with regard to the considered class of models. Both fixed and adaptive approaches were subsequently considered to combine the three corresponding classifiers. Averaging the predictions of these classifiers performed best and was significantly better than the model based on microarray data only. These results were confirmed on 6 validation microarray sets, with a significantly improved performance in 4 of them. Integrating interactome data thus improves classification of cancer outcome for the investigated microarray technologies and cancer types. Moreover, this strategy can be incorporated in any kernel method or non-linear version of a non-kernel method.  相似文献   

14.
Biomarkers are needed to address overtreatment that occurs for the majority of prostate cancer patients that would not die of the disease but receive radical treatment. A possible barrier to biomarker discovery may be the polyclonal/multifocal nature of prostate tumors as well as cell-type heterogeneity between patient samples. Tumor-adjacent stroma (tumor microenvironment) is less affected by genetic alteration and might therefore yield more consistent biomarkers in response to tumor aggressiveness. To this end we compared Affymetrix gene expression profiles in stroma near tumor and identified a set of 115 probe sets for which the expression levels were significantly correlated with time-to-relapse. We also compared patients that chemically relapsed shortly after prostatectomy (<1 year), and patients that did not relapse in the first four years after prostatectomy. We identified 131 differentially expressed microarray probe sets between these two categories. 19 probe sets (15 genes overlapped between the two gene lists with p<0.0001). We developed a PAM-based classifier by training on samples containing stroma near tumor: 9 rapid relapse patient samples and 9 indolent patient samples. We then tested the classifier on 47 different samples, containing 90% or more stroma. The classifier predicted the risk status of patients with an average accuracy of 87%. This is the first general tumor microenvironment-based prognostic classifier. These results indicate that the prostate cancer microenvironment exhibits reproducible changes useful for predicting outcomes for patients.  相似文献   

15.
MOTIVATION: Various studies have shown that cancer tissue samples can be successfully detected and classified by their gene expression patterns using machine learning approaches. One of the challenges in applying these techniques for classifying gene expression data is to extract accurate, readily interpretable rules providing biological insight as to how classification is performed. Current methods generate classifiers that are accurate but difficult to interpret. This is the trade-off between credibility and comprehensibility of the classifiers. Here, we introduce a new classifier in order to address these problems. It is referred to as k-TSP (k-Top Scoring Pairs) and is based on the concept of 'relative expression reversals'. This method generates simple and accurate decision rules that only involve a small number of gene-to-gene expression comparisons, thereby facilitating follow-up studies. RESULTS: In this study, we have compared our approach to other machine learning techniques for class prediction in 19 binary and multi-class gene expression datasets involving human cancers. The k-TSP classifier performs as efficiently as Prediction Analysis of Microarray and support vector machine, and outperforms other learning methods (decision trees, k-nearest neighbour and na?ve Bayes). Our approach is easy to interpret as the classifier involves only a small number of informative genes. For these reasons, we consider the k-TSP method to be a useful tool for cancer classification from microarray gene expression data. AVAILABILITY: The software and datasets are available at http://www.ccbm.jhu.edu CONTACT: actan@jhu.edu.  相似文献   

16.
17.
An optimal and individualized treatment protocol based on accurate diagnosis is urgently required for the adequate treatment of patients. For this purpose, it is important to develop a sophisticated algorithm that can manage large amount of data, such as gene expression data from DNA microarray, for optimal and individualized diagnosis. Especially, marker gene selection is essential in the analysis of gene expression data.In the present study, we developed the combination method of projective adaptive resonance theory and boosted fuzzy classifier with SWEEP operator method for model construction and marker selection. And we applied this method to microarray data of acute leukemia and brain tumor. The method enabled the selection of 14 important genes related to the prognosis of the tumor. In addition, we proposed improved reliability index for cancer diagnostic prediction of blinded subjects. Based on the index, the discriminated group with over 90% prediction accuracy was separated from the others.PART-BFCS with improved RIBFCS method does not only show high performance, but also has the feature of reliable prediction further. This result suggests that PART-BFCS with improved RIBFCS method has the potential to function as a new method of class prediction for diagnosis of patients.  相似文献   

18.
Advances in DNA microarray technologies have made gene expression profiles a significant candidate in identifying different types of cancers. Traditional learning-based cancer identification methods utilize labeled samples to train a classifier, but they are inconvenient for practical application because labels are quite expensive in the clinical cancer research community. This paper proposes a semi-supervised projective non-negative matrix factorization method (Semi-PNMF) to learn an effective classifier from both labeled and unlabeled samples, thus boosting subsequent cancer classification performance. In particular, Semi-PNMF jointly learns a non-negative subspace from concatenated labeled and unlabeled samples and indicates classes by the positions of the maximum entries of their coefficients. Because Semi-PNMF incorporates statistical information from the large volume of unlabeled samples in the learned subspace, it can learn more representative subspaces and boost classification performance. We developed a multiplicative update rule (MUR) to optimize Semi-PNMF and proved its convergence. The experimental results of cancer classification for two multiclass cancer gene expression profile datasets show that Semi-PNMF outperforms the representative methods.  相似文献   

19.
Summary .  Time course microarray data consist of mRNA expression from a common set of genes collected at different time points. Such data are thought to reflect underlying biological processes developing over time. In this article, we propose a model that allows us to examine differential expression and gene network relationships using time course microarray data. We model each gene-expression profile as a random functional transformation of the scale, amplitude, and phase of a common curve. Inferences about the gene-specific amplitude parameters allow us to examine differential gene expression. Inferences about measures of functional similarity based on estimated time-transformation functions allow us to examine gene networks while accounting for features of the gene-expression profiles. We discuss applications to simulated data as well as to microarray data on prostate cancer progression.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号