首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 663 毫秒
1.
In this paper, we present a novel cascaded classification framework for automatic detection of individual and clusters of microcalcifications (μC). Our framework comprises three classification stages: i) a random forest (RF) classifier for simple features capturing the second order local structure of individual μCs, where non-μC pixels in the target mammogram are efficiently eliminated; ii) a more complex discriminative restricted Boltzmann machine (DRBM) classifier for μC candidates determined in the RF stage, which automatically learns the detailed morphology of μC appearances for improved discriminative power; and iii) a detector to detect clusters of μCs from the individual μC detection results, using two different criteria. From the two-stage RF-DRBM classifier, we are able to distinguish μCs using explicitly computed features, as well as learn implicit features that are able to further discriminate between confusing cases. Experimental evaluation is conducted on the original Mammographic Image Analysis Society (MIAS) and mini-MIAS databases, as well as our own Seoul National University Bundang Hospital digital mammographic database. It is shown that the proposed method outperforms comparable methods in terms of receiver operating characteristic (ROC) and precision-recall curves for detection of individual μCs and free-response receiver operating characteristic (FROC) curve for detection of clustered μCs.  相似文献   

2.
Heart rate variability (HRV) analysis has quantified the functioning of the autonomic regulation of the heart and heart''s ability to respond. However, majority of studies on HRV report several differences between patients with congestive heart failure (CHF) and healthy subjects, such as time-domain, frequency domain and nonlinear HRV measures. In the paper, we mainly presented a new approach to detect congestive heart failure (CHF) based on combination support vector machine (SVM) and three nonstandard heart rate variability (HRV) measures (e.g. SUM_TD, SUM_FD and SUM_IE). The CHF classification model was presented by using SVM classifier with the combination SUM_TD and SUM_FD. In the analysis performed, we found that the CHF classification algorithm could obtain the best performance with the CHF classification accuracy, sensitivity and specificity of 100%, 100%, 100%, respectively.  相似文献   

3.

Background

Nowadays, sleep quality is one of the most important measures of healthy life, especially considering the huge number of sleep-related disorders. Identifying sleep stages using polysomnographic (PSG) signals is the traditional way of assessing sleep quality. However, the manual process of sleep stage classification is time-consuming, subjective and costly. Therefore, in order to improve the accuracy and efficiency of the sleep stage classification, researchers have been trying to develop automatic classification algorithms. Automatic sleep stage classification mainly consists of three steps: pre-processing, feature extraction and classification. Since classification accuracy is deeply affected by the extracted features, a poor feature vector will adversely affect the classifier and eventually lead to low classification accuracy. Therefore, special attention should be given to the feature extraction and selection process.

Methods

In this paper the performance of seven feature selection methods, as well as two feature rank aggregation methods, were compared. Pz-Oz EEG, horizontal EOG and submental chin EMG recordings of 22 healthy males and females were used. A comprehensive feature set including 49 features was extracted from these recordings. The extracted features are among the most common and effective features used in sleep stage classification from temporal, spectral, entropy-based and nonlinear categories. The feature selection methods were evaluated and compared using three criteria: classification accuracy, stability, and similarity.

Results

Simulation results show that MRMR-MID achieves the highest classification performance while Fisher method provides the most stable ranking. In our simulations, the performance of the aggregation methods was in the average level, although they are known to generate more stable results and better accuracy.

Conclusions

The Borda and RRA rank aggregation methods could not outperform significantly the conventional feature ranking methods. Among conventional methods, some of them slightly performed better than others, although the choice of a suitable technique is dependent on the computational complexity and accuracy requirements of the user.
  相似文献   

4.
《Genomics》2020,112(5):3089-3096
Automatic classification of glaucoma from fundus images is a vital diagnostic tool for Computer-Aided Diagnosis System (CAD). In this work, a novel fused feature extraction technique and ensemble classifier fusion is proposed for diagnosis of glaucoma. The proposed method comprises of three stages. Initially, the fundus images are subjected to preprocessing followed by feature extraction and feature fusion by Intra-Class and Extra-Class Discriminative Correlation Analysis (IEDCA). The feature fusion approach eliminates between-class correlation while retaining sufficient Feature Dimension (FD) for Correlation Analysis (CA). The fused features are then fed to the classifiers namely Support Vector Machine (SVM), Random Forest (RF) and K-Nearest Neighbor (KNN) for classification individually. Finally, Classifier fusion is also designed which combines the decision of the ensemble of classifiers based on Consensus-based Combining Method (CCM). CCM based Classifier fusion adjusts the weights iteratively after comparing the outputs of all the classifiers. The proposed fusion classifier provides a better improvement in accuracy and convergence when compared to the individual algorithms. A classification accuracy of 99.2% is accomplished by the two-level hybrid fusion approach. The method is evaluated on the public datasets High Resolution Fundus (HRF) and DRIVE datasets with cross dataset validation.  相似文献   

5.
目的:研究互信息和自相关函数在睡眠各阶段心率变异性(heart rate variability,HRV)分析中的应用。方法:采用网络公开数据库SleepHeart Rateand Stroke VolumeDataBarhk,将RR序列分为30S一段,并以每30S数据为中心截取5minRR序列作为待分析对象。提取5minRR序列的互信息特征BDM、PDM和自相关函数特征BDC、PDC,然后用统计方法分析各特征在觉醒瓜EM/浅睡/深睡四种睡眠状态下的差异。结果:浅睡和深睡期间,BDM、PDM显著高于觉醒和REM睡眠(P〈0.001)。而BDC显著低于觉醒和REM睡眠(P〈0.001),PDC无显著差异(P〉0.05)。结论:①BDM、PDM和BDC从不同角度反映了不同睡眠阶段下RR序列的特征,它们或许与HRV不同的调节机制有关。@BDM、PDM和BDC可作为辅助HRV睡眠分期的新指标。  相似文献   

6.
Pattern recognition and classification are two of the key topics in computer science. In this paper a novel method for the task of pattern classification is presented. The proposed method combines a hybrid associative classifier (Clasificador Híbrido Asociativo con Traslación, CHAT, in Spanish), a coding technique for output patterns called one-hot vector and majority voting during the classification step. The method is termed as CHAT One-Hot Majority (CHAT-OHM). The performance of the method is validated by comparing the accuracy of CHAT-OHM with other well-known classification algorithms. During the experimental phase, the classifier was applied to four datasets related to the medical field. The results also show that the proposed method outperforms the original CHAT classification accuracy.  相似文献   

7.
PurposeTo address high false-positive results of FFDM issue, we make the first effort to develop a computer-aided diagnosis (CAD) scheme to analyze and distinguish breast lesions.MethodThe breast lesion regions were first segmented and depicted on FFDM images from 106 patients. In this work, 11 gray-level gap-length matrix texture features and 12 shape features were extracted form craniocaudal view and mediolateral oblique view, and then Student’s t-test, Fisher-score and Relief-F were introduced to select features. We also investigated the effect of three factors, i.e., discretisation, selection methods and classifier methods, of the classification performance via analysis of variance. Finally, a classification model was constructed. Spearman’s correlation coefficient analysis was conducted to assess the internal relevance of features.ResultsThe proposed scheme using Student’s t-test achieved an area under the receiver operating characteristic curve (AUC) value of 0.923 at 512 bins. The AUC values are 0.884, 0.867, 0.874 and 0.901 for the low gray-level gaps emphasis (LGGE), solidity, extent, and the combined set, respectively. Solidity and extent depicts the correlation coefficient of 0.86 (P < 0.05).ConclusionsWe present a new CAD scheme based on the contribution of the significant factors. The experimental results demonstrate that the presented scheme can be used to successfully distinguish breast carcinoma lesions and benign fibroadenoma lesions in our FFDM dataset and the MIAS dataset, which may provide a CAD method to assist radiologists in diagnosing and interpreting screening mammograms. Moreover, we found that LGGE, solidity and extent features show great potential for breast lesion classification.  相似文献   

8.
Electroencephalogram (EEG) signals are widely used to study the activity of the brain, such as to determine sleep stages. These EEG signals are nonlinear and non-stationary in nature. It is difficult to perform sleep staging by visual interpretation and linear techniques. Thus, we use a nonlinear technique, higher order spectra (HOS), to extract hidden information in the sleep EEG signal. In this study, unique bispectrum and bicoherence plots for various sleep stages were proposed. These can be used as visual aid for various diagnostics application. A number of HOS based features were extracted from these plots during the various sleep stages (Wakefulness, Rapid Eye Movement (REM), Stage 1-4 Non-REM) and they were found to be statistically significant with p-value lower than 0.001 using ANOVA test. These features were fed to a Gaussian mixture model (GMM) classifier for automatic identification. Our results indicate that the proposed system is able to identify sleep stages with an accuracy of 88.7%.  相似文献   

9.
Atrial fibrillation (AF) and atrial flutter (AFL) are the two common atrial arrhythmia encountered in the clinical practice. In order to diagnose these abnormalities the electrocardiogram (ECG) is widely used. The conventional linear time and frequency domain methods cannot decipher the hidden complexity present in these signals. The ECG is inherently a non-linear, non-stationary and non-Gaussian signal. The non-linear models can provide improved results and capture minute variations present in the time series. Higher order spectra (HOS) is a non-linear dynamical method which is highly rugged to noise. In the present study, the performances of two methods are compared: (i) 3rd order HOS cumulants and (ii) HOS bispectrum. The 3rd order cumulant and bispectrum coefficients are subjected to dimensionality reduction using independent component analysis (ICA) and classified using classification and regression tree (CART), random forest (RF), artificial neural network (ANN) and k-nearest neighbor (KNN) classifiers to select the best classifier. The ICA components of cumulant coefficients have provided the average accuracy, sensitivity, specificity and positive predictive value of 99.50%, 100%, 99.22% and 99.72% respectively using KNN classifier. Similarly, the ICA components of HOS bispectrum coefficients have yielded the average accuracy, sensitivity, specificity and PPV of 97.65%, 98.16%, 98.75% and 99.53% respectively using KNN. So, the ICA performed on the 3rd order HOS cumulants coupled with KNN classifier performed better than the HOS bispectrum method. The proposed methodology is robust and can be used in mass screening of cardiac patients.  相似文献   

10.
To identify non-coding RNA (ncRNA) signals within genomic regions, a classification tool was developed based on a hybrid random forest (RF) with a logistic regression model to efficiently discriminate short ncRNA sequences as well as long complex ncRNA sequences. This RF-based classifier was trained on a well-balanced dataset with a discriminative set of features and achieved an accuracy, sensitivity and specificity of 92.11%, 90.7% and 93.5%, respectively. The selected feature set includes a new proposed feature, SCORE. This feature is generated based on a logistic regression function that combines five significant features—structure, sequence, modularity, structural robustness and coding potential—to enable improved characterization of long ncRNA (lncRNA) elements. The use of SCORE improved the performance of the RF-based classifier in the identification of Rfam lncRNA families. A genome-wide ncRNA classification framework was applied to a wide variety of organisms, with an emphasis on those of economic, social, public health, environmental and agricultural significance, such as various bacteria genomes, the Arthrospira (Spirulina) genome, and rice and human genomic regions. Our framework was able to identify known ncRNAs with sensitivities of greater than 90% and 77.7% for prokaryotic and eukaryotic sequences, respectively. Our classifier is available at http://ncrna-pred.com/HLRF.htm.  相似文献   

11.
Sleep apnoea is a very common sleep disorder which is able to cause symptoms such as daytime sleepiness, irritability and poor concentration. This paper presents a combinational feature extraction approach based on some nonlinear features extracted from Electro Cardio Graph (ECG) Reconstructed Phase Space (RPS) and usually used frequency domain features for detection of sleep apnoea. Here 6 nonlinear features extracted from ECG RPS are combined with 3 frequency based features to reconstruct final feature set. The nonlinear features consist of Detrended Fluctuation Analysis (DFA), Correlation Dimensions (CD), 3 Large Lyapunov Exponents (LLEs) and Spectral Entropy (SE). The final proposed feature set show about 94.8% accuracy over the Physionet sleep apnoea dataset using a kernel based SVM classifier. This research also proves that using non-linear analysis to detect sleep apnoea can potentially improve the classification accuracy of apnoea detection system.  相似文献   

12.

Background

The goal of this work is to develop a non-invasive method in order to help detecting Alzheimer's disease in its early stages, by implementing voice analysis techniques based on machine learning algorithms.

Methods

We extract temporal and acoustical voice features (e.g. Jitter and Harmonics-to-Noise Ratio) from read speech of patients in Early Stage of Alzheimer's Disease (ES-AD), with Mild Cognitive Impairment (MCI), and from a Healthy Control (HC) group. Three classification methods are used to evaluate the efficiency of these features, namely kNN, SVM and decision Tree. To assess the effectiveness of this set of features, we compare them with two sets of feature parameters that are widely used in speech and speaker recognition applications. A two-stage feature selection process is conducted to optimize classification performance. For these experiments, the data samples of HC, ES-AD and MCI groups were collected at AP-HP Broca Hospital, in Paris.

Results

First, a wrapper feature selection method for each feature set is evaluated and the relevant features for each classifier are selected. By combining, for each classifier, the features selected from each initial set, we improve the classification accuracy by a relative gain of more than 30% for all classifiers. Then the same feature selection procedure is performed anew on the combination of selected feature sets, resulting in an additional significant improvement of classification accuracy.

Conclusion

The proposed method improved the classification accuracy for ES-AD, MCI and HC groups and promises the effectiveness of speech analysis and machine learning techniques to help detect pathological diseases.  相似文献   

13.
Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.  相似文献   

14.

Background  

Polysomnography (PSG) is used to define physiological sleep and different physiological sleep stages, to assess sleep quality and diagnose many types of sleep disorders such as obstructive sleep apnea. However, PSG requires not only the connection of various sensors and electrodes to the subject but also spending the night in a bed that is different from the subject's own bed. This study is designed to investigate the feasibility of automatic classification of sleep stages and obstructive apneaic epochs using only the features derived from a single-lead electrocardiography (ECG) signal.  相似文献   

15.
Automatic classification of cardiac arrhythmias using heart rate variability (HRV) analysis has been an important research topic in recent years. Explorations reveal that various HRV feature combinations can provide highly accurate models for some rhythm disorders. However, the proposed feature combinations lack a direct and carefully designed comparison. The goal of this work is to assess the various HRV feature combinations in classification of cardiac arrhythmias. In this setting, a total of 56 known HRV features are grouped in eight feature combinations. We evaluate and compare the combinations on a difficult problem of automatic classification between nine types of cardiac rhythms using three classification algorithms: support vector machines, AdaBoosted C4.5, and random forest. The effect of analyzed segment length on classification accuracy is also examined. The results demonstrate that there are three combinations that stand out the most, with total classification accuracy of roughly 85% on time segments of 20 s duration. A simple combination of time domain features is shown to be comparable to the more informed combinations, with only 1–4% worse results on average than the three best ones. Random forest and AdaBoosted C4.5 are shown to be comparably accurate, while support vector machines was less accurate (4–5%) on this problem. We conclude that the nonlinear features exhibit only a minor influence on the overall accuracy in discerning different arrhythmias. The analysis also shows that reasonably accurate arrhythmia classification lies in the range of 10–40 s, with a peak at 20 s, and a significant drop after 40 s.  相似文献   

16.

Background

This study proposed an effective method based on the wavelet multi-scale α-entropy features of heart rate variability (HRV) for the recognition of paroxysmal atrial fibrillation (PAF). This new algorithm combines wavelet decomposition and non-linear analysis methods. The PAF signal, the signal distant from PAF, and the normal sinus signals can be identified and distinguished by extracting the characteristic parameters from HRV signals and analyzing their quantification indexes. The original ECG signals for QRS detection and HRV signal extraction are first processed. The features from the HRV signals are extracted as feature vectors using the wavelet multi-scale entropy. A support vector machine-based classifier is used for PAF prediction.

Results

The performance of the proposed method in predicting PAF episodes is evaluated with 100 signals from the MIT-BIT PAF prediction database. With regard to the dynamics and uncertainty of PAF signals, our proposed method obtains the values of 92.18, 94.88, and 89.48% for the evaluation criteria of correct rate, sensitivity, and specificity, respectively.

Conclusions

Our proposed method presents better results than the existing studies based on time domain, frequency domain, and non-linear methods. Thus, our method shows considerable potential for clinical monitoring and treatment.
  相似文献   

17.
This paper proposes a new power spectral-based hybrid genetic algorithm-support vector machines (SVMGA) technique to classify five types of electrocardiogram (ECG) beats, namely normal beats and four manifestations of heart arrhythmia. This method employs three modules: a feature extraction module, a classification module and an optimization module. Feature extraction module extracts electrocardiogram's spectral and three timing interval features. Non-parametric power spectral density (PSD) estimation methods are used to extract spectral features. Support vector machine (SVM) is employed as a classifier to recognize the ECG beats. We investigate and compare two such classification approaches. First they are specified experimentally by the trial and error method. In the second technique the approach optimizes the relevant parameters through an intelligent algorithm. These parameters are: Gaussian radial basis function (GRBF) kernel parameter σ and C penalty parameter of SVM classifier. Then their performances in classification of ECG signals are evaluated for eight files obtained from the MIT–BIH arrhythmia database. Classification accuracy of the SVMGA approach proves superior to that of the SVM which has constant and manually extracted parameter.  相似文献   

18.
In this study, a novel spatial filter design method is introduced. Spatial filtering is an important processing step for feature extraction in motor imagery-based brain-computer interfaces. This paper introduces a new motor imagery signal classification method combined with spatial filter optimization. We simultaneously train the spatial filter and the classifier using a neural network approach. The proposed spatial filter network (SFN) is composed of two layers: a spatial filtering layer and a classifier layer. These two layers are linked to each other with non-linear mapping functions. The proposed method addresses two shortcomings of the common spatial patterns (CSP) algorithm. First, CSP aims to maximize the between-classes variance while ignoring the minimization of within-classes variances. Consequently, the features obtained using the CSP method may have large within-classes variances. Second, the maximizing optimization function of CSP increases the classification accuracy indirectly because an independent classifier is used after the CSP method. With SFN, we aimed to maximize the between-classes variance while minimizing within-classes variances and simultaneously optimizing the spatial filter and the classifier. To classify motor imagery EEG signals, we modified the well-known feed-forward structure and derived forward and backward equations that correspond to the proposed structure. We tested our algorithm on simple toy data. Then, we compared the SFN with conventional CSP and its multi-class version, called one-versus-rest CSP, on two data sets from BCI competition III. The evaluation results demonstrate that SFN is a good alternative for classifying motor imagery EEG signals with increased classification accuracy.  相似文献   

19.
Background Lysine succinylation is one of the reversible protein post-translational modifications (PTMs), which regulate the structure and function of proteins. It plays a significant role in various cellular physiologies including some diseases of human as well as many other organisms. The accurate identification of succinylation site is essential to understand the various biological functions and drug development.Methods In this study, we developed an improved method to predict lysine succinylation sites mapping on Homo sapiens by the fusion of three encoding schemes such as binary, the composition of k-spaced amino acid pairs (CKSAAP) and amino acid composition (AAC) with the random forest (RF) classifier. The prediction performance of the proposed random forest (RF) based on the fusion model in a comparison of other candidates was investigated by using 20-fold cross-validation (CV) and two independent test datasets were collected from two different sources.Results The CV results showed that the proposed predictor achieves the highest scores of sensitivity (SN) as 0.800, specificity (SP) as 0.902, accuracy (ACC) as 0.919, Mathew correlation coefficient (MCC) as 0.766 and partial AUC (pAUC) as 0.163 at a false-positive rate (FPR) = 0.10 and area under the ROC curve (AUC) as 0.958. It achieved the highest performance scores of SN as 0.811, SP as 0.902, ACC as 0.891, MCC as 0.629 and pAUC as 0.139 and AUC as 0.921 for the independent test protein set-1 and SN as 0.772, SP as 0.901, ACC as 0.836, MCC as 0.677 and pAUC as 0.141 at FPR = 0.10 and AUC as 0.923 for the independent test protein set-2. It also outperformed all the other existing prediction models.Conclusion The prediction performances as discussed in this article recommend that the proposed method might be a useful and encouraging computational resource for lysine succinylation site prediction in the case of human population.  相似文献   

20.

Background

The purpose of this study is to explore the potential of phase contrast imaging to detect fibrotic progress in its early stage; to investigate the feasibility of texture features for quantified diagnosis of liver fibrosis; and to evaluate the performance of back propagation (BP) neural net classifier for characterization and classification of liver fibrosis.

Methods

Fibrous mouse liver samples were imaged by X-ray phase contrast imaging, nine texture measures based on gray-level co-occurrence matrix were calculated and the feasibility of texture features in the characterization and discrimination of liver fibrosis at early stages was investigated. Furthermore, 36 or 18 features were applied to the input of BP classifier; the classification performance was evaluated using receiver operating characteristic curve.

Results

The phase contrast images displayed a vary degree of texture pattern from normal to severe fibrosis stages. The BP classifier could distinguish liver fibrosis among normal, mild, moderate and severe stages; the average accuracy was 95.1% for 36 features, and 91.1% for 18 features.

Conclusion

The study shows that early stages of liver fibrosis can be discriminated by the morphological features on the phase contrast images. BP network model based on combination of texture features is demonstrated effective for staging liver fibrosis.
  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号