首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Seabird plays an important role in the marine ecosystem and is an indispensable part of the food chain. However, the seabird population has been experiencing a rapid decline due to various factors including climate change, fisheries, and invasive non-native species. To better protect seabirds, the first step is to accurately monitor them. Automatic classification of seabirds would significantly speed up the monitoring process. In this paper, we propose a dual transfer learning framework for improved seabird image classification based on spatial pyramid pooling. Specifically, a dual transfer learning framework is used to capture various patterns to improve the discriminability of the proposed model. Both InceptionV3 and DenseNet201 are used as the backbones, whose outputs are concatenated using a spatial pyramid pooling (SPP) layer. Here, SPP is used to address images of different sizes. Next, two types of pooling, global average-pooling (GAP) and global max-pooling (GMP) are applied to the output of the SPP layer, where the results of GAP and GMP are linearly added up. Our method takes both InceptionV3 and DenseNet201 as feature extractors and is trained offline in an end-to-end style. The proposed dual transfer learning framework-based seabird image classification method reached the accuracy, precision, recall, F1-score of 95.11%, 95.33%, 95.11%, 95.13% on the 10 classes seabird image dataset.  相似文献   

2.
目的 单细胞生长检测可以更加科学地揭示微生物代谢变化的规律,为后期微生物工程应用提供指导。针对微生物生长应用于食品安全期和最佳食用期的精准检测问题,本文提出一种基于拉曼技术的单细胞生长检测方法。方法 首先,通过同步培养实验采集了枯草芽孢杆菌两个批次共900个单细胞拉曼光谱(SCRS)数据,其中600个用于训练和测试,另一批次300个用于模型验证。其次,基于主成分分析的特征关系矩阵,提出CP-SP特征评估方法以筛选SCRS特征用于模型检测。再基于XGBoost构建检测模型,并应用网格搜索和交叉验证对检测模型进行调优。最后,应用混淆矩阵、ROC曲线评估模型对细胞滞后期、对数期和稳定期的检测准确率、敏感性和特异性。结果 选用CP-SP筛选的第一、第二和第四主成分较特征贡献率前3个主成分的分类性能提高了3.1%,调优后的细胞生长检测模型测试准确率为96.0%,验证准确率为92.3%。结论 基于拉曼技术的单细胞生长检测方法能准确识别单细胞生长状态且具有较高的泛化能力,可为食品安全和保鲜制定精准调控机制提供科学指导。  相似文献   

3.
《IRBM》2022,43(5):434-446
ObjectiveThe initial principal task of a Brain-Computer Interfacing (BCI) research is to extract the best feature set from a raw EEG (Electroencephalogram) signal so that it can be used for the classification of two or multiple different events. The main goal of the paper is to develop a comparative analysis among different feature extraction techniques and classification algorithms.Materials and methodsIn this present investigation, four different methodologies have been adopted to classify the recorded MI (motor imagery) EEG signal, and their comparative study has been reported. Haar Wavelet Energy (HWE), Band Power, Cross-correlation, and Spectral Entropy (SE) based Cross-correlation feature extraction techniques have been considered to obtain the necessary features set from the raw EEG signals. Four different machine learning algorithms, viz. LDA (Linear Discriminant Analysis), QDA (Quadratic Discriminant Analysis), Naïve Bayes, and Decision Tree, have been used to classify the features.ResultsThe best average classification accuracies are 92.50%, 93.12%, 72.26%, and 98.71% using the four methods. Further, these results have been compared with some recent existing methods.ConclusionThe comparative results indicate a significant accuracy level performance improvement of the proposed methods with respect to the existing one. Hence, this presented work can guide to select the best feature extraction method and the classifier algorithm for MI-based EEG signals.  相似文献   

4.
ObjectiveStudying the diagnostic value of CT imaging in non-small cell lung cancer (NSCLC), and establishing a prognosis model combined with clinical characteristics is the objective, so as to provide a reference for the survival prediction of NSCLC patients.MethodCT scan data of NSCLC 200 patients were taken as the research object. Through image segmentation, the radiology features of CT images were extracted. The reliability and performance of the prognosis model based on the optimal feature number of specific algorithm and the prognosis model based on the global optimal feature number were compared.Results30-RELF-NB (30 optimal features, RELF feature selection algorithm and NB classifier) has the highest accuracy and AUC (area under the subject characteristic curve) in the prognosis model based on the optimal features of specific algorithm. Among the prognosis models based on global optimal features, 25-NB (25 global optimal features, naive Bayes classification algorithm classifier) has the highest accuracy and AUC. Compared with the prediction model based on feature training of specific feature selection algorithm, the overall performance and stability of the prediction model based on global optimal feature are higher.ConclusionThe prognosis model based on the global optimal feature established in this paper has good reliability and performance, and can be applied to the CT radiology of NSCLC.  相似文献   

5.
《IRBM》2022,43(4):251-258
ObjectivesEsophageal Cancer is the sixth most common cancer with a high fatality rate. Early prognosis of esophageal abnormalities can improve the survival rate of the patients. The sequence of the progress of the esophageal cancer is from esophagitis to non-dysplasia Barrett's esophagus to dysplasia Barrett's esophagus to esophageal adenocarcinoma (EAC). Many studies revealed a 5-fold increase in EAC patients diagnosed with esophagitis, and those diagnosed with Barrett's esophagus have a greater risk of EAC.Material and methodsConvolutional Neural Network (CNN) with efficient feature extractors enable better prognosis of the pre cancerous stage, Barrett's esophagus and esophagitis. The transfer learning techniques with CNN can extract more relevant features for the automated classification of Barrett's esophagus and esophagitis. This paper presents a study on the classification of the esophagitis and Barrett's esophagus (BE) using Deep Convolution Neural Networks (DCNN).ResultsIn the first experiment, the DCNN models perform as a feature extractor, and standard classifiers do the classification. The performance analysis shows that the CNN model ResNet50 with Support Vector Machine (SVM) has an accuracy of 93.5%, recall 93.5%, precision 93.4%, f score 93.5%, AUC 89.8%. In the second experiment, the DCNN classification models perform the classification with Transfer Learning and fine-tuning. The ResNet50 model has improved accuracy of 94.46%, precision 94.46%, f score 94.46%, AUC 96.20%.ConclusionThe ResNet50 model with transfer learning and fine-tuning gives a better performance than the ResNet50 model with SVM classifier. Our experiments show that the DCNN is effective for diagnosing EAC, both as feature extractors and classification models with transfer learning and fine-tuning.  相似文献   

6.
《IRBM》2022,43(4):272-278
PurposeVulnerable plaque of carotid atherosclerosis is prone to rupture, which can easily lead to acute cardiovascular and cerebrovascular accidents. Accurate identification of the vulnerable plaque is a challenging task, especially on limited datasets.MethodsThis paper proposes a multi-feature fusion method to identify high-risk plaque, in which three types of features are combined, i.e. global features of carotid ultrasound images, echo features of regions of interests (ROI) and expert knowledge from ultrasound reports. Due to the fusion of three types of features, more critical features for identifying high-risk plaque are included in the feature set. Therefore, better performance can be achieved even on limited datasets.ResultsFrom testing all combinations of three types of features, the results showed that the accuracy of using all three types of features is the highest. The experiments also showed that the performance of the proposed method is better than other plaque classification methods and classical Convolutional Neural Networks (CNNs) on the Plaque dataset.ConclusionThe proposed method helped to build a more complete feature set so that the machine learning models could identify vulnerable plaque more accurately even on datasets with poor quality and small scale.  相似文献   

7.

Background

Imbalanced data classification is an inevitable problem in medical intelligent diagnosis. Most of real-world biomedical datasets are usually along with limited samples and high-dimensional feature. This seriously affects the classification performance of the model and causes erroneous guidance for the diagnosis of diseases. Exploring an effective classification method for imbalanced and limited biomedical dataset is a challenging task.

Methods

In this paper, we propose a novel multilayer extreme learning machine (ELM) classification model combined with dynamic generative adversarial net (GAN) to tackle limited and imbalanced biomedical data. Firstly, principal component analysis is utilized to remove irrelevant and redundant features. Meanwhile, more meaningful pathological features are extracted. After that, dynamic GAN is designed to generate the realistic-looking minority class samples, thereby balancing the class distribution and avoiding overfitting effectively. Finally, a self-adaptive multilayer ELM is proposed to classify the balanced dataset. The analytic expression for the numbers of hidden layer and node is determined by quantitatively establishing the relationship between the change of imbalance ratio and the hyper-parameters of the model. Reducing interactive parameters adjustment makes the classification model more robust.

Results

To evaluate the classification performance of the proposed method, numerical experiments are conducted on four real-world biomedical datasets. The proposed method can generate authentic minority class samples and self-adaptively select the optimal parameters of learning model. By comparing with W-ELM, SMOTE-ELM, and H-ELM methods, the quantitative experimental results demonstrate that our method can achieve better classification performance and higher computational efficiency in terms of ROC, AUC, G-mean, and F-measure metrics.

Conclusions

Our study provides an effective solution for imbalanced biomedical data classification under the condition of limited samples and high-dimensional feature. The proposed method could offer a theoretical basis for computer-aided diagnosis. It has the potential to be applied in biomedical clinical practice.
  相似文献   

8.
R.R. Janghel  Y.K. Rathore 《IRBM》2021,42(4):258-267
ObjectivesAlzheimer's Disease (AD) is the most general type of dementia. In all leading countries, it is one of the primary reasons of death in senior citizens. Currently, it is diagnosed by calculating the MSME score and by the manual study of MRI Scan. Also, different machine learning methods are utilized for automatic diagnosis but existing has some limitations in terms of accuracy. So, main objective of this paper to include a preprocessing method before CNN model to increase the accuracy of classification.Materials and methodIn this paper, we present a deep learning-based approach for detection of Alzheimer's Disease from ADNI database of Alzheimer's disease patients, the dataset contains fMRI and PET images of Alzheimer's patients along with normal person's image. We have applied 3D to 2D conversion and resizing of images before applying VGG-16 architecture of Convolution neural network for feature extraction. Finally, for classification SVM, Linear Discriminate, K means clustering, and Decision tree classifiers are used.ResultsThe experimental result shows that the average accuracy of 99.95% is achieved for the classification of the fMRI dataset, while the average accuracy of 73.46% is achieved with the PET dataset. On comparing results on the basis of accuracy, specificity, sensitivity and on some other parameters we found that these results are better than existing methods.Conclusionsthis paper, suggested a unique way to increase the performance of CNN models by applying some preprocessing on image dataset before sending to CNN architecture for feature extraction. We applied this method on ADNI database and on comparing the accuracies with other similar approaches it shows better results.  相似文献   

9.
《IRBM》2019,40(5):297-305
BackgroundBrain Computer Interface (BCI) systems have been widely used to develop sustainable assistive technology for people suffering from neurological impairments. A major limitation of current BCI systems is that they are based on Subject-dependent (SD) concept. The SD based BCI system is time consuming and inconvenient for physical or mental disables people and also not suitable for limited computer resources. In order to overcome these problems, recently subject-independent (SI) based BCI concept has been introduced to identify mental states of motor disabled people but the expected outcome of the SI based BCI has not been achieved yet. Hence this paper intends to present an efficient scheme for SI based BCI system. The goal of this research is to develop a method for classifying mental states which can be used by any user. For attaining this target, this study employs a supervised spatial filtering method with four types of feature extraction methods including Katz Fractal Dimension, Sub band Energy, Log Variance and Root Mean Square (RMS) and finally the obtained features are used as input to Linear Discriminant Analysis (LDA) classification model for identifying mental states for SI BCI system.ResultsThe performance of the proposed design is evaluated in several ways such as considering different time window length; different frequency bands; different number of channels. The mean classification accuracy using Katz feature is 84.35% which is the maximum output compare to other features that outperforms the existing methods.ConclusionsOur proposed design will help to make a new technology for development of real-time SI based BCI systems that can be more supportive for the motor disabled patients.  相似文献   

10.
《IRBM》2022,43(2):107-113
Background and objectiveAn important task of the brain-computer interface (BCI) of motor imagery is to extract effective time-domain features, frequency-domain features or time-frequency domain features from the raw electroencephalogram (EEG) signals for classification of motor imagery. However, choosing an appropriate method to combine time domain and frequency domain features to improve the performance of motor imagery recognition is still a research hotspot.MethodsIn order to fully extract and utilize the time-domain and frequency-domain features of EEG in classification tasks, this paper proposed a novel dual-stream convolutional neural network (DCNN), which can use time domain signal and frequency domain signal as the inputs, and the extracted time-domain features and frequency-domain features are fused by linear weighting for classification training. Furthermore, the weight can be learned by the DCNN automatically.ResultsThe experiments based on BCI competition II dataset III and BCI competition IV dataset 2a showed that the model proposed by this study has better performance than other conventional methods. The model used time-frequency signal as the inputs had better performance than the model only used time-domain signals or frequency-domain signals. The accuracy of classification was improved for each subject compared with the models only used one signals as the inputs.ConclusionsFurther analysis shown that the fusion weight of different subject is specifically, adjusting the weight coefficient automatically is helpful to improve the classification accuracy.  相似文献   

11.
12.
BackgroundT-cell epitopes play the important role in T-cell immune response, and they are critical components in the epitope-based vaccine design. Immunogenicity is the ability to trigger an immune response. The accurate prediction of immunogenic T-cell epitopes is significant for designing useful vaccines and understanding the immune system.MethodsIn this paper, we attempt to differentiate immunogenic epitopes from non-immunogenic epitopes based on their primary structures. First of all, we explore a variety of sequence-derived features, and analyze their relationship with epitope immunogenicity. To effectively utilize various features, a genetic algorithm (GA)-based ensemble method is proposed to determine the optimal feature subset and develop the high-accuracy ensemble model. In the GA optimization, a chromosome is to represent a feature subset in the search space. For each feature subset, the selected features are utilized to construct the base predictors, and an ensemble model is developed by taking the average of outputs from base predictors. The objective of GA is to search for the optimal feature subset, which leads to the ensemble model with the best cross validation AUC (area under ROC curve) on the training set.ResultsTwo datasets named ‘IMMA2’ and ‘PAAQD’ are adopted as the benchmark datasets. Compared with the state-of-the-art methods POPI, POPISK, PAAQD and our previous method, the GA-based ensemble method produces much better performances, achieving the AUC score of 0.846 on IMMA2 dataset and the AUC score of 0.829 on PAAQD dataset. The statistical analysis demonstrates the performance improvements of GA-based ensemble method are statistically significant.ConclusionsThe proposed method is a promising tool for predicting the immunogenic epitopes. The source codes and datasets are available in S1 File.  相似文献   

13.
《IRBM》2020,41(4):212-222
Pneumonia is one of the diseases that people may encounter in any period of their lives. Approximately 18% of infectious diseases are caused by pneumonia. This disease may result in death in the following stages. In order to diagnose pneumonia as a medical condition, lung X-ray images are routinely examined by the field experts in the clinical practice. In this study, lung X-ray images that are available for the diagnosis of pneumonia were used. The convolutional neural network was employed as feature extractor, and some of existing convolutional neural network models that are AlexNet, VGG-16 and VGG-19 were utilized so as to realize this specific task. Then, the number of deep features was reduced from 1000 to 100 by using the minimum redundancy maximum relevance algorithm for each deep model. Accordingly, we achieved 100 deep features from each deep model, and we combined these features so as to provide an efficient feature set consisting of totally 300 deep features. In this step of the experiment, this feature set was given as an input to the decision tree, k-nearest neighbors, linear discriminant analysis, linear regression, and support vector machine learning models. Finally, all models ensured promising results, especially linear discriminant analysis yielded the most efficient results with an accuracy of 99.41%. Consequently, the results point out that the deep features provided robust and consistent features for pneumonia detection, and minimum redundancy maximum relevance method was found a beneficial tool to reduce the dimension of the feature set.  相似文献   

14.
BackgroundThe prognosis of chemotherapy is important in clinical decision-making for non-small cell lung cancer (NSCLC) patients.ObjectivesTo develop a model for predicting treatment response to chemotherapy in NSCLC patients from pre-chemotherapy CT images.Materials and MethodsThis retrospective multicenter study enrolled 485 patients with NSCLC who received chemotherapy alone as a first-line treatment. Two integrated models were developed using radiomic and deep-learning-based features. First, we partitioned pre-chemotherapy CT images into spheres and shells with different radii around the tumor (0–3, 3–6, 6–9, 9–12, 12–15 mm) containing intratumoral and peritumoral regions. Second, we extracted radiomic and deep-learning-based features from each partition. Third, using radiomic features, five sphere–shell models, one feature fusion model, and one image fusion model were developed. Finally, the model with the best performance was validated in two cohorts.ResultsAmong the five partitions, the model of 9–12 mm achieved the highest area under the curve (AUC) of 0.87 (95% confidence interval: 0.77–0.94). The AUC was 0.94 (0.85–0.98) for the feature fusion model and 0.91 (0.82–0.97) for the image fusion model. For the model integrating radiomic and deep-learning-based features, the AUC was 0.96 (0.88–0.99) for the feature fusion method and 0.94 (0.85–0.98) for the image fusion method. The best-performing model had an AUC of 0.91 (0.81–0.97) and 0.89 (0.79–0.93) in two validation sets, respectively.ConclusionsThis integrated model can predict the response to chemotherapy in NSCLC patients and assist physicians in clinical decision-making.  相似文献   

15.
目的 长链非编码RNA在遗传、代谢和基因表达调控等方面发挥着重要作用。然而,传统的实验方法解析RNA的三级结构耗时长、费用高且操作要求高。此外,通过计算方法来预测RNA的三级结构在近十年来无突破性进展。因此,需要提出新的预测算法来准确的预测RNA的三级结构。所以,本文发展可以用于提高RNA三级结构预测准确性的碱基关联图预测方法。方法 为了利用RNA理化特征信息,本文应用多层全卷积神经网络和循环神经网络的深度学习算法来预测RNA碱基间的接触概率,并通过注意力机制处理RNA序列中碱基间相互依赖的特征。结果 通过多层神经网络与注意力机制结合,本文方法能够有效得到RNA特征值中局部和全局的信息,提高了模型的鲁棒性和泛化能力。检验计算表明,所提出模型对序列长度L的4种标准(L/10、L/5、L/2、L)碱基关联图的预测准确率分别达到0.84、0.82、0.82和0.75。结论 基于注意力机制的深度学习预测算法能够提高RNA碱基关联图预测的准确率,从而帮助RNA三级结构的预测。  相似文献   

16.
Using surface electromyography (sEMG) signal for efficient recognition of hand gestures has attracted increasing attention during the last decade, with most previous work being focused on recognition of upper arm and gross hand movements and some work on the classification of individual finger movements such as finger typing tasks. However, relatively few investigations can be found in the literature for automatic classification of multiple finger movements such as finger number gestures. This paper focuses on the recognition of number gestures based on a 4-channel wireless sEMG system. We investigate the effects of three popular feature types (i.e. Hudgins’ time–domain features (TD), autocorrelation and cross-correlation coefficients (ACCC) and spectral power magnitudes (SPM)) and four popular classification algorithms (i.e. k-nearest neighbor (k-NN), linear discriminant analysis (LDA), quadratic discriminant analysis (QDA) and support vector machine (SVM)) in offline recognition. Motivated by the good performance of SVM, we further propose combining the three features and employing a new classification method, multiple kernel learning SVM (MKL-SVM). Real sEMG results from six subjects show that all combinations, except k-NN or LDA using ACCC features, can achieve above 91% average recognition accuracy, and the highest accuracy is 97.93% achieved by the proposed MKL-SVM method using the three feature combination (3F). Referring to the offline recognition results, we also implement a real-time recognition system. Our results show that all six subjects can achieve a real-time recognition accuracy higher than 90%. The number gestures are therefore promising for practical applications such as human–computer interaction (HCI).  相似文献   

17.
《IRBM》2022,43(1):62-74
BackgroundThe prediction of breast cancer subtypes plays a key role in the diagnosis and prognosis of breast cancer. In recent years, deep learning (DL) has shown good performance in the intelligent prediction of breast cancer subtypes. However, most of the traditional DL models use single modality data, which can just extract a few features, so it cannot establish a stable relationship between patient characteristics and breast cancer subtypes.DatasetWe used the TCGA-BRCA dataset as a sample set for molecular subtype prediction of breast cancer. It is a public dataset that can be obtained through the following link: https://portal.gdc.cancer.gov/projects/TCGA-BRCAMethodsIn this paper, a Hybrid DL model based on the multimodal data is proposed. We combine the patient's gene modality data with image modality data to construct a multimodal fusion framework. According to the different forms and states, we set up feature extraction networks respectively, and then we fuse the output of the two feature networks based on the idea of weighted linear aggregation. Finally, the fused features are used to predict breast cancer subtypes. In particular, we use the principal component analysis to reduce the dimensionality of high-dimensional data of gene modality and filter the data of image modality. Besides, we also improve the traditional feature extraction network to make it show better performance.ResultsThe results show that compared with the traditional DL model, the Hybrid DL model proposed in this paper is more accurate and efficient in predicting breast cancer subtypes. Our model achieved a prediction accuracy of 88.07% in 10 times of 10-fold cross-validation. We did a separate AUC test for each subtype, and the average AUC value obtained was 0.9427. In terms of subtype prediction accuracy, our model is about 7.45% higher than the previous average.  相似文献   

18.
《IRBM》2022,43(3):198-209
BackgroundFrequency band optimization improves the performance of common spatial pattern (CSP) in motor imagery (MI) tasks classification because MI-related electroencephalograms (EEGs) are highly frequency specific. Many variants of CSP algorithm divided the EEG into various sub bands and then applied CSP. However, the feature dimension of MI-EEG data increases with addition of frequency sub bands and requires efficient feature selection algorithms. The performance of CSP also depends on filtering techniques.MethodIn this study, we designed a dual tree complex wavelet transform based filter bank to filter the EEG into sub bands, instead of traditional filtering methods, which improved the spatial feature extraction efficiency. Further, after filtering EEG into different sub bands, we extracted spatial features from each sub band using CSP and optimized them by a proposed supervised learning framework based on neighbourhood component analysis (NCA). Subsequently, a support vector machine (SVM) is trained to perform classification.ResultsAn experimental study, conducted on two datasets (BCI Competition IV (Dataset 2b), and BCI competition III (Dataset IIIa)), validated the MI classification effectiveness of the proposed method in comparison with standard algorithms such as CSP, Filter bank CSP (CSP), and Discriminative FBCSP (DFBCSP). The average classification accuracy obtained by the proposed method for BCI Competition IV (Dataset 2b), and BCI Competition III (Dataset IIIa) are 84.02 ± 12.2 and 89.1 ± 7.50, respectively and found significant than that achieved by standard methods.ConclusionAchieved superior results suggest that the proposed algorithm can improve the performance of MI-based Brain-computer interface devices.  相似文献   

19.
Diptera insects have the characteristics of spreading diseases and destroying forests. There are similarities among different species, which makes it difficult to identify a Diptera insect. Most traditional convolutional neural networks have large parameters and high recognition latency. Therefore, they are not suitable for deploying models on embedded devices for classification and recognition. This paper proposes an improved neural architecture based on differentiable search method. First, we designed a network search cell by adding the feature output of the previous layer to each search cell. Second, we added the attention module to the search space to expand the searchable range. At the same time, we used methods such as model quantization and limiting the ReLU function to the ReLU6 function to reduce computer resource consumption. Finally, the network model was transplanted to the NVIDIA Jetson Xavier NX embedded development platform to verify the network performance so that the neural architecture search could be organically combined with the embedded development platform. The experimental results show that the designed neural architecture achieves 98.9% accuracy on the Diptera insect dataset with a latency of 8.4 ms. It has important practical significance for the recognition of Diptera insects in embedded devices.  相似文献   

20.
Moon  Myungjin  Nakai  Kenta 《BMC genomics》2016,17(13):65-74
Background

Lately, biomarker discovery has become one of the most significant research issues in the biomedical field. Owing to the presence of high-throughput technologies, genomic data, such as microarray data and RNA-seq, have become widely available. Many kinds of feature selection techniques have been applied to retrieve significant biomarkers from these kinds of data. However, they tend to be noisy with high-dimensional features and consist of a small number of samples; thus, conventional feature selection approaches might be problematic in terms of reproducibility.

Results

In this article, we propose a stable feature selection method for high-dimensional datasets. We apply an ensemble L 1 -norm support vector machine to efficiently reduce irrelevant features, considering the stability of features. We define the stability score for each feature by aggregating the ensemble results, and utilize backward feature elimination on a purified feature set based on this score; therefore, it is possible to acquire an optimal set of features for performance without the need to set a specific threshold. The proposed methodology is evaluated by classifying the binary stage of renal clear cell carcinoma with RNA-seq data.

Conclusion

A comparison with established algorithms, i.e., a fast correlation-based filter, random forest, and an ensemble version of an L 2 -norm support vector machine-based recursive feature elimination, enabled us to prove the superior performance of our method in terms of classification as well as stability in general. It is also shown that the proposed approach performs moderately on high-dimensional datasets consisting of a very large number of features and a smaller number of samples. The proposed approach is expected to be applicable to many other researches aimed at biomarker discovery.

  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号