首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
结合小波分析理论与支持向量机理论,构造分类器模型,将前列腺癌基因芯片数据分成癌症和正常两种。本文着重研究小波高频系数基因芯片数据的特征提取,并通过实验对比小波高频系数和低频系数特征提取对分类器性能的影响。其中haar小波3层分解提取高频系数,送入分类器分类后,得到的正确分类率为93.31%。db1小波4层分解提取低频系数,送入分类器分类后,得到的正确分类率为93.53%。小波低频系数特征提取分类效果总体上好于高频系数,分类器性能稳定。  相似文献   

2.
刘玉杰  刘毅慧 《生物信息学》2011,9(3):255-258,262
特征提取和分类是模式识别中的关键问题。结合小波分析理论和支持向量机理论,构造分类器模型,将前列腺癌基因芯片数据分成癌症和正常两种。提取小波低频系数表征原始数据并送入支持向量机分类器分类,实验证明:提取db1小波4层分解下的低频系数,送入分类器分类后正确分类率达到93.53%。Haar小波的正确率是92.94%。可见提取不同小波低频系数,得到的分类效果相差不大。  相似文献   

3.
建立了基于小波降噪和支持向量机的结肠癌基因表达数据肿瘤识别模型.对试验数据进行小波分解,并利用交叉验证的方法计算试验样本的平均分类准确率,确定小波函数与小波分解层数;引入能量阈值方法对小波分解系数进行阈值处理,达到降噪的目的;提出了基因分类贡献率与主成分分析结合的方法,提取结肠癌样本数据特征;利用支持向量机强大的非线性映射能力,实现对结肠癌样本数据的非线性分类.为了减弱样本集的划分对分类准确率的影响,本文采取Jackknife检验方法对支持向量分类器的分类器检验,其分类准确率为96.77%.试验结果证明了该方法的有效性,该方法对结肠癌的识别具有一定的参考价值.  相似文献   

4.
《IRBM》2022,43(5):325-332
ObjectiveIn cardiac patient-care, compression of long-term ECG data is essential to minimize the data storage requirement and transmission cost. Hence, this paper presents a novel electrocardiogram data compression technique which utilizes modified run-length encoding of wavelet coefficients.MethodFirst, wavelet transform is applied to the ECG data which decomposes it and packs maximum energy to less number of transform coefficients. The wavelet transform coefficients are quantized using dead-zone quantization. It discards small valued coefficients lying in the dead-zone interval while other coefficients are kept at the formulated quantized output interval. Among all the quantized coefficients, an average value is assigned to those coefficients for which energy packing efficiency is less than 99.99%. The obtained coefficients are encoded using modified run-length coding. It offers higher compression ratio than conventional run-length coding without any loss of information.ResultsCompression performance of the proposed technique is evaluated using different ECG records taken from the MIT-BIH arrhythmia database. The average compression performance in terms of compression ratio, percent root mean square difference, normalized percent mean square difference, and signal to noise ratio are 17.18, 3.92, 6.36, and 28.27 dB respectively for 48 ECG records.ConclusionThe compression results obtained by the proposed technique is better than techniques recently introduced by others. The proposed technique can be utilized for compression of ECG records of Holter monitoring.  相似文献   

5.
基于DWT-GA-PLS的土壤碱解氮含量高光谱估测方法   总被引:1,自引:0,他引:1  
以山东齐河县为研究区,实地采集土壤样本,在土样高光谱测试并进行一阶导数变换的基础上,先运用离散小波变换(DWT)对土壤光谱去噪降维,然后采用遗传算法(GA)筛选土壤碱解氮定量估测模型的参与变量,最后应用偏最小二乘(PLS)回归构建土壤碱解氮含量的估测模型.结果表明: 离散小波变换结合遗传算法和偏最小二乘法(DWT-GA-PLS)用于土壤碱解氮含量定量估测,不仅可压缩光谱变量、减少模型参与变量,而且可改善模型估测准确度;较之于采用土壤全谱,小波离散分解1~2层低频系数构建的模型在参与变量大幅减少的情况下,取得更准确或与之相当的预测结果,其中,基于第2层小波低频系数采用GA筛选变量构建的PLS模型的预测效果表现最好,预测R2达到0.85,RMSE为8.11 mg·kg-1,RPD为2.53.说明DWT-GA-PLS用于土壤碱解氮含量高光谱定量估测的有效性.  相似文献   

6.

Background

Array-based comparative genomic hybridization (array CGH) is a highly efficient technique, allowing the simultaneous measurement of genomic DNA copy number at hundreds or thousands of loci and the reliable detection of local one-copy-level variations. Characterization of these DNA copy number changes is important for both the basic understanding of cancer and its diagnosis. In order to develop effective methods to identify aberration regions from array CGH data, many recent research work focus on both smoothing-based and segmentation-based data processing. In this paper, we propose stationary packet wavelet transform based approach to smooth array CGH data. Our purpose is to remove CGH noise in whole frequency while keeping true signal by using bivariate model.

Results

In both synthetic and real CGH data, Stationary Wavelet Packet Transform (SWPT) is the best wavelet transform to analyze CGH signal in whole frequency. We also introduce a new bivariate shrinkage model which shows the relationship of CGH noisy coefficients of two scales in SWPT. Before smoothing, the symmetric extension is considered as a preprocessing step to save information at the border.

Conclusion

We have designed the SWTP and the SWPT-Bi which are using the stationary wavelet packet transform with the hard thresholding and the new bivariate shrinkage estimator respectively to smooth the array CGH data. We demonstrate the effectiveness of our approach through theoretical and experimental exploration of a set of array CGH data, including both synthetic data and real data. The comparison results show that our method outperforms the previous approaches.
  相似文献   

7.
高维蛋白质波谱数据分析过程中,对于数据的特征提取一直是许多学者专注解决的问题。本文提出了一种基于高频系数的小波分析和主成份分析技术(Principal component analysis,PCA)的特征提取方法,首先采用小波分析技术对数据进行降噪,提取高频系数作为特征,之后用主成份分析技术进行降维。实验显示:本论文中提出的方法在8-7-02、4/3/02数据集上的实验识别率分别可以达到100%和99.45%,可以有效提高分类识别率。  相似文献   

8.
BACKGROUND: The purpose of this study was to prove the feasibility of a longmer oligonucleotide microarray platform to profile gene copy number alterations in prostate cancer cell lines and to quickly indicate novel candidate genes, which may play a role in carcinogenesis. METHODS/RESULTS AND FINDINGS: Genome-wide screening for regions of genetic gains and losses on nine prostate cancer cell lines (PC3, DU145, LNCaP, CWR22, and derived sublines) was carried out using comparative genomic hybridization on a 35,000 feature oligonucleotide microarray (arrayCGH). Compared to conventional chromosomal CGH, more deletions and small regions of gains, particularly in pericentromeric regions and regions next to the telomeres, were detected. As validation of the high-resolution of arrayCGH we further analyzed a small amplicon of 1.7 MB at 9p13.3, which was found in CWR22 and CWR22-Rv1. Increased copy number was confirmed by fluorescence in situ hybridization using the BAC clone RP11-165H19 from the amplified region comprising the two genes interleukin 11 receptor alpha (IL11-RA) and dynactin 3 (DCTN3). Using quantitative real time PCR (qPCR) we could demonstrate that IL11-RA is the gene with the highest copy number gain in the cell lines compared to DCTN3 suggesting IL11-RA to be the amplification target. Screening of 20 primary prostate carcinomas by qPCR revealed an IL11-RA copy number gain in 75% of the tumors analyzed. Gain of DCTN3 was only found in two cases together with a gain of IL11-RA. CONCLUSIONS/SIGNIFICANCE: ArrayCGH using longmer oligonucleotide microarrays is feasible for high-resolution analysis of chomosomal imbalances. Characterization of a small gained region at 9p13.3 in prostate cancer cell lines and primary prostate cancer samples by fluorescence in situ hybridization and quantitative PCR has revealed interleukin 11 receptor alpha gene as a candidate target of amplification with an amplification frequency of 75% in prostate carcinomas. Frequent amplification of IL11-RA in prostate cancer is a potential mechanism of IL11-RA overexpression in this tumor type.  相似文献   

9.
基于小波变换的NDVI与地形因子多尺度空间相关分析   总被引:13,自引:0,他引:13  
李双成  高伟明  周巧富  刘逢媛 《生态学报》2006,26(12):4198-4203
以西藏高原生态系统的NDVI(Normalized Difference Vegetation Index)及其地形影响因子为分析对象,使用小波变换揭示了其多尺度空间格局。通过小波方差尺度图可以辨识出,在研究区域内NDVI及其地形因子存在着4、12km和25km等多尺度变异格局。小波多尺度相关分析是对普通相关的一种拓展,使用小波系数分尺度计算了NDVI及其影响因素的相关系数,并与普通相关进行了比较。结果表明,4种地形因子(海拔高度、坡度、坡向和CTI)不论是正相关还是负相关,在较小尺度上与NDVI的相关系数都比较小,一般情况是尺度增大,相关性增大。这反映了地形因子作为大的宏观制约因素对NDVI起作用。实践证明,小波分析对于揭示自然要素的多尺度空间结构和各向异性是一种强有力的工具。  相似文献   

10.
C.K. Jha  M.H. Kolekar 《IRBM》2021,42(1):65-72
ObjectiveIn health-care systems, compression is an essential tool to solve the storage and transmission problems. In this regard, this paper reports a new electrocardiogram (ECG) data compression scheme which employs sifting function based empirical mode decomposition (EMD) and discrete wavelet transform.MethodEMD based on sifting function is utilized to get the first intrinsic mode function (IMF). After EMD, the first IMF and four significant sifting functions are combined together. This combination is free from many irrelevant components of the signal. Discrete wavelet transform (DWT) with mother wavelet ‘bior4.4’ is applied to this combination. The transform coefficients obtained after DWT are passed through dead-zone quantization. It discards small transform coefficients lying around zero. Further, integer conversion of coefficients and run-length encoding are utilized to achieve a compressed form of ECG data.ResultsCompression performance of the proposed scheme is evaluated using 48 ECG records of the MIT-BIH arrhythmia database. In the comparison of compression results, it is observed that the proposed method exhibits better performance than many recent ECG compressors. A mean opinion score test is also conducted to evaluate the true quality of the reconstructed ECG signals.ConclusionThe proposed scheme offers better compression performance with preserving the key features of the signal very well.  相似文献   

11.
文章提出了一种用小波变换来检测生物荧光图像中囊泡的方法。作者用à trous小波对图像进行小波变换,然后求出每层系数的中值绝对偏差σ,并用t=kσ/0.67作为阈值对每层系数进行门限滤波,然后通过提取小波变换系数来重构图像。通过设计实验与常用的“rolling ball”算法对比,发现小波变换算法在低信噪比的情况下,具有更好的灵敏度;对于形状大小不同的信号,具有更好的稳定性;而且对于信号的细节信息具有更好的保真性。  相似文献   

12.
高维蛋白质波谱癌症数据分析,一直面临着高维数据的困扰。针对高维蛋白质波谱癌症数据在降维过程中的问题,提出基于小波分析技术和主成分分析技术的高维蛋白质波谱癌症数据特征提取的方法,并在特征提取之后,使用支持向量机进行分类。对8-7-02数据集进行2层小波分解时,分别使用db1、db3、db4、db6、db8、db10、haar小波基,并使用支持向量机进行分类,正确率分别达到98.18%、98.35%、98.04%、98.36%、97.89%、97.96%、98.20%。在进一步提高分类识别正确率的同时,提高了时间率。  相似文献   

13.
基于小波变换的混合二维ECG数据压缩方法   总被引:5,自引:0,他引:5  
提出了一种新的基于小波变换的混合二维心电(electrocardiogram,ECG)数据压缩方法。基于ECG数据的两种相关性,该方法首先将一维ECG信号转化为二维信号序列。然后对二维序列进行了小波变换,并利用改进的编码方法对变换后的系数进行了压缩编码:即先根据不同系数子带的各自特点和系数子带之间的相似性,改进了等级树集合分裂(setpartitioninghierarchicaltrees,SPIHT)算法和矢量量化(vectorquantization,VQ)算法;再利用改进后的SPIHT与VQ相混合的算法对小波变换后的系数进行了编码。利用所提算法与已有具有代表性的基于小波变换的压缩算法和其他二维ECG信号的压缩算法,对MIT/BIH数据库中的心律不齐数据进行了对比压缩实验。结果表明:所提算法适用于各种波形特征的ECG信号,并且在保证压缩质量的前提下,可以获得较大的压缩比。  相似文献   

14.
Wavelet transform has been widely applied in extracting characteristic information in spike sorting. As the wavelet coefficients used to distinguish various spike shapes are often disorganized, they still lack in effective unsupervised methods still lacks to select the most discriminative features. In this paper, we propose an unsupervised feature selection method, employing kernel density estimation to select those wavelet coefficients with bimodal or multimodal distributions. This method is tested on a simulated spike data set, and the average misclassification rate after fuzzy C-means clustering has been greatly reduced, which proves this kernel density estimation-based feature selection approach is effective.  相似文献   

15.
Summary This article introduces new methods for performing classification of complex, high‐dimensional functional data using the functional mixed model (FMM) framework. The FMM relates a functional response to a set of predictors through functional fixed and random effects, which allows it to account for various factors and between‐function correlations. The methods include training and prediction steps. In the training steps we train the FMM model by treating class designation as one of the fixed effects, and in the prediction steps we classify the new objects using posterior predictive probabilities of class. Through a Bayesian scheme, we are able to adjust for factors affecting both the functions and the class designations. While the methods can be used in any FMM framework, we provide details for two specific Bayesian approaches: the Gaussian, wavelet‐based FMM (G‐WFMM) and the robust, wavelet‐based FMM (R‐WFMM). Both methods perform modeling in the wavelet space, which yields parsimonious representations for the functions, and can naturally adapt to local features and complex nonstationarities in the functions. The R‐WFMM allows potentially heavier tails for features of the functions indexed by particular wavelet coefficients, leading to a down‐weighting of outliers that makes the method robust to outlying functions or regions of functions. The models are applied to a pancreatic cancer mass spectroscopy data set and compared with other recently developed functional classification methods.  相似文献   

16.
The continuous wavelet transform was applied to the human EEG signals recorded in different states of brain activity. The dynamics of local maxima chains in the matrices of the continuous wavelet transform coefficients was studied. The typologization method was developed for local maxima chains to separate by their drift in the frequency space as well as by dynamics of their signal “energy.” The method proved to be highly informative. It was shown that it was highly sensitive to a selection of one of two responses to the test question. It is determined that local maxima chains in most cases are gradually increasing and decreasing in the frequency space and by changes in the values of their continuous wavelet transform coefficients. The functional asymmetry in local maxima chains types’ distribution is determined. The results obtained allow us to consider the types of the local maxima chains dynamics as a new phenomenon of EEG activity.  相似文献   

17.
The study proposes a method for supervised classification of multi-channel surface electromyographic signals with the aim of controlling myoelectric prostheses. The representation space is based on the discrete wavelet transform (DWT) of each recorded EMG signal using unconstrained parameterization of the mother wavelet. The classification is performed with a support vector machine (SVM) approach in a multi-channel representation space. The mother wavelet is optimized with the criterion of minimum classification error, as estimated from the learning signal set. The method was applied to the classification of six hand movements with recording of the surface EMG from eight locations over the forearm. Misclassification rate in six subjects using the eight channels was (mean ± S.D.) 4.7 ± 3.7% with the proposed approach while it was 11.1 ± 10.0% without wavelet optimization (Daubechies wavelet). The DWT and SVM can be implemented with fast algorithms, thus, the method is suitable for real-time implementation.  相似文献   

18.
BackgroundThe purpose of this study is to estimate and project the number of years of potential life lost (YPLL) among males who die of prostate cancer in the United States from 2004 through 2050 and compare the projections by race/ethnicity and age, accounting for demographic changes and population growth.MethodsWe applied the life expectancy method to estimate YPLL caused by deaths of prostate cancer and all cancers in men by using 1999–2004 national mortality data, 2008 census population demographic projections, and 2004 U.S. life tables. We performed sensitivity analyses by varying death rate and population projections, and examined increase in YPLL from population growth, changes in demographics, and death rates.ResultsThe number of YPLL caused by prostate cancer deaths was projected to increase by 226.1%, from 291,853 in 2004 to 951,753 in 2050. Hispanics were projected to have the fastest growth in YPLL (977.1% from 2004 to 2050) caused by prostate cancer, followed by non-Hispanic blacks (543.1%), and non-Hispanic others (269.7%). People aged 75 or older was projected to account for 62.0% of YPLL from prostate cancer in 2050 compared with 50.8% in 2004. Of the projected increase in YPLL caused by prostate cancer deaths by 2050, 9.8% were due to changes in demographic composition, 26.8% because of mortality change, and 63.4% because of population growth.ConclusionsYPLL due to prostate cancer deaths are projected to increase dramatically, and become a greater burden in the future. The projections highlight the importance of comprehensive cancer control and research on cancers including prostate cancer and racial/ethnic-specific estimates.  相似文献   

19.
Prostate cancer is the most common non-skin cancer and the second leading cause of cancer related mortality for men in the United States. There is strong empirical and epidemiological evidence supporting a stronger role of genetics in early-onset prostate cancer. We performed a genome-wide association scan for early-onset prostate cancer. Novel aspects of this study include the focus on early-onset disease (defined as men with prostate cancer diagnosed before age 56 years) and use of publically available control genotype data from previous genome-wide association studies. We found genome-wide significant (p<5×10−8) evidence for variants at 8q24 and 11p15 and strong supportive evidence for a number of previously reported loci. We found little evidence for individual or systematic inflated association findings resulting from using public controls, demonstrating the utility of using public control data in large-scale genetic association studies of common variants. Taken together, these results demonstrate the importance of established common genetic variants for early-onset prostate cancer and the power of including early-onset prostate cancer cases in genetic association studies.  相似文献   

20.
The prostatic membrane antigen (PSMA) is a protein that is expressed in the prostatic epithelium. We studied the expression of PSMA in a series of 55 patients with different stages of prostate cancer and we compared the PSMA staining in prostate cancer cells, in high-grade prostatic intraepithelial neoplasia (PIN) and in histologically benign prostatic epithelium for the same specimen. For this purpose archival paraffin-embedded specimens were studied by immunohistochemistry with a monoclonal antibody 7E11-C5.3 against PSMA using the streptavidin-biotin method. The mean percentage of PSMA immunoreactivity was 56.67% in prostate cancer (CaP) cells, and 48.6% in PIN cells, which was significantly higher than benign-appearing prostatic epithelium (5.72%) (for each pair, p<0.001). PSMA expression was greater in CaP with a higher Gleason score (p=0.01), but no relationship was found with serum PSA value. We conclude that PSMA overexpression is detected in high-grade PIN and is associated with a higher Gleason score of prostate cancer. It is a potential marker for studying carcinogenesis and progression of prostate cancer.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号