首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
物种分类与识别是生物多样性监测的基础, 明确物种的类别及其分布是解决几乎所有生态学问题的前提。为深入了解基于多源遥感数据的植物物种分类与识别相关研究的发展现状和存在的问题, 本文对2000年以来该领域的研究进行了总结分析, 发现: 当前大多数研究集中在欧洲和北美地区的温带或北方森林以及南非的热带稀树草原; 使用最多的遥感数据是机载高光谱数据, 而激光雷达作为补充数据, 通过单木分割及提供单木的三维垂直结构信息, 显著提高了分类精度; 支持向量机和随机森林作为应用最广的非参数分类算法, 平均分类精度达80%; 随着计算机技术及机器学习领域的不断成熟, 人工神经网络在物种识别领域得以迅速发展。基于此, 本文对目前基于遥感数据的植物物种分类与识别中在分类对象复杂性、多源遥感数据整合、植物物候与纹理特征整合和分类算法技术等方面面临的挑战进行了总结, 并建议通过整合多时相监测数据、高光谱和激光雷达数据、短波红外等特定波谱信息、采用深度学习等方法来提高分类精度。  相似文献   

2.
树种多样性是生态学研究的重要内容,树木的种类和空间分布信息可有效服务于可持续森林管理。但在复杂林分条件下,获取高精度分类结果的难度大。而无人机遥感可获取局域超精细数据,为树种分类精度的提高提供了可能。基于可见光、高光谱、激光雷达等多源无人机遥感数据,探究其在亚热带林分条件下的树种分类潜力。研究发现:(1)随机森林分类器总体精度和各树种的F1分数最高,适合亚热带多树种的分类制图,其区分13种类别(8乔木,4草本)的总体精度为95.63%,Kappa系数为0.948;(2)多源数据的使用可以显著提高分类精度,全特征模型精度最高,且高光谱和激光雷达数据显著影响全特征模型分类精度,可见光纹理数据作用较小;(3)分类特征重要性从大到小排序为结构信息,植被指数,纹理信息,最小噪声变换分量。  相似文献   

3.
A new machine learning method referred to as F-score_ELM was proposed to classify the lying and truth-telling using the electroencephalogram (EEG) signals from 28 guilty and innocent subjects. Thirty-one features were extracted from the probe responses from these subjects. Then, a recently-developed classifier called extreme learning machine (ELM) was combined with F-score, a simple but effective feature selection method, to jointly optimize the number of the hidden nodes of ELM and the feature subset by a grid-searching training procedure. The method was compared to two classification models combining principal component analysis with back-propagation network and support vector machine classifiers. We thoroughly assessed the performance of these classification models including the training and testing time, sensitivity and specificity from the training and testing sets, as well as network size. The experimental results showed that the number of the hidden nodes can be effectively optimized by the proposed method. Also, F-score_ELM obtained the best classification accuracy and required the shortest training and testing time.  相似文献   

4.
Tandem mass spectrometry (MS/MS) combined with protein database searching has been widely used in protein identification. A validation procedure is generally required to reduce the number of false positives. Advanced tools using statistical and machine learning approaches may provide faster and more accurate validation than manual inspection and empirical filtering criteria. In this study, we use two feature selection algorithms based on random forest and support vector machine to identify peptide properties that can be used to improve validation models. We demonstrate that an improved model based on an optimized set of features reduces the number of false positives by 58% relative to the model which used only search engine scores, at the same sensitivity score of 0.8. In addition, we develop classification models based on the physicochemical properties and protein sequence environment of these peptides without using search engine scores. The performance of the best model based on the support vector machine algorithm is at 0.8 AUC, 0.78 accuracy, and 0.7 specificity, suggesting a reasonably accurate classification. The identified properties important to fragmentation and ionization can be either used in independent validation tools or incorporated into peptide sequencing and database search algorithms to improve existing software programs.  相似文献   

5.
《植物生态学报》2015,39(12):1125
Aims Using remote sensing data for tree species classification plays a key role in forestry resource monitoring, sustainable forest management and biodiversity research.Methods This study used integrated sensor LiCHy (LiDAR, CCD and Hyperspectral) to obtain both the high resolution imagery and the hyperspectral data at the same time for the natural secondary forest in south Jiangsu hilly region. The data were used to identify the crown and to classify tree species at multiple levels. Firstly, tree crowns were selected by segmenting high-resolution imagery at multiple scales based on edge detection; secondly, characteristic variables of hyperspectral images were extracted, then optimization variables were selected based on the theory of information entropy. Tree species and forest types were classified using either all characteristic variables or optimization variables only. Finally, tree species and forest types were reclassified along with the tree crowns information, and the accuracy of classification was discussed. Important findings Based on all available characteristic variables, the overall accuracy for four typical tree species classification was 64.6%, and the Kappa coefficient was 0.493. The overall accuracy for forest types classification was 81.1%, and the Kappa coefficient was 0.584. Based on optimization variables only, the overall accuracy for four typical tree species classification dropped to 62.9%, and the Kappa coefficient was 0.459. The overall accuracy for forest types classification was 77.7%, and the Kappa coefficient was 0.525. Obtaining both high resolution image and hyperspectral data at the same time by integrated sensor can increase overall accuracy in classifying forest types and tree species in northern subtropical forest.  相似文献   

6.
申鑫  曹林  徐婷  佘光辉 《植物生态学报》2015,39(12):1125-1135
利用遥感数据开展森林资源树种的分类对森林资源的监测、森林可持续经营及生物多样性研究都有重要意义。该文以江苏南部丘陵地区的北亚热带天然次生林为研究对象, 利用LiCHy (LiDAR、CCD、Hyperspectral)集成传感器同期获取的高分辨率和高光谱数据, 进行冠幅识别和多个层次的树种分类: 首先, 对高分辨率影像进行基于边缘检测的多尺度分割, 提取出单木冠幅; 其次, 对高光谱影像进行特征变量提取, 并对提取出的特征变量利用信息熵原理选取优化特征变量; 然后, 分别利用全部特征变量和经优化的重要特征变量对森林树种及森林类型进行预分类; 最后, 在预分类结果中加入单木冠幅信息对森林树种及森林类型进行重分类, 并分析分类结果的精度。研究表明: 1)利用全部特征变量进行4个典型树种分类时, 总体精度为64.6%, Kappa系数为0.493; 而针对森林类型的分类精度为81.1%, Kappa系数为0.584。2)利用选取的优化特征变量分类精度略低于利用全部特征变量的分类精度, 其中对4个典型树种分类时, 总体精度为62.9%, Kappa系数为0.459; 而针对森林类型的分类精度为77.7%, Kappa系数为0.525。通过集成传感器同期获取的高分辨率和高光谱数据可以有效地进行北亚热带森林的树种分类及森林类型的划分。  相似文献   

7.
冠层树种多样性是自然森林生态系统功能和服务的重要基础。及时掌握冠层多样性的现状及变化趋势, 是探讨诸多重要生态学问题的前提, 更是制定合理生物多样性保护策略的基础。但受制于传统的多样性信息采集方法, 区域尺度的高精度冠层多样性监测发展较为缓慢; 许多在气候变化和人类干扰下的生物多样性分布信息得不到及时更新。近年来基于无人机的冠层高光谱影像收集与分析技术的发展, 使得冠层多样性监测迎来了新的发展契机。本文从森林冠层高光谱影像出发, 介绍了与多样性监测相关的无人机航拍和基于深度学习的图像处理技术, 并结合已有文献, 探讨了无人机高光谱应用于森林冠层树种多样性监测的研究现状、可行性、优势及缺陷等。我们认为冠层高光谱影像为多样性监测提供了不可或缺且丰富的原始信息; 而无人机与高光谱相机的结合, 使得区域化高频率(如每周)、高精度(如分米乃至厘米级)的冠层多样性信息自动化收集成为可能。然而高光谱影像数据量大、数据维度高与数据结构非线性的特点为影像处理带来了挑战, 而深度学习技术的飞跃, 使得从冠层高光谱影像中提取个体及物种信息达到了极高精度。恰当地使用这些技术将大大提升冠层树种多样性的自动化监测水平, 由此也将帮助我们在当前剧变环境下及时掌握森林冠层多样性的现状与变化, 为生物多样性研究与保护提供可靠的数据支撑。  相似文献   

8.
掌握森林内树木种类及其分布情况对研究森林生态系统具有重要意义.为推广国产高分数据在森林树种分类方面的应用,同时探究不同时相、分类特征及分类器的组合对树种分类结果的影响,本研究利用3景高分二号影像构建了3种单时相和4种多时相,通过多尺度分割、C5.0特征优选及支持向量机(SVM)和随机森林(RF)两种分类器分别实现了不同时相及特征维度下面向对象的8个树种的分类,最终取得了总体精度在63.5~83.5%、Kappa系数在0.57~0.81的良好结果.结果表明: 时相的选择会对分类结果产生较大的影响,其中,基于多时相的结果往往优于单时相,多时相下不同影像组合间以及单时相间亦存在明显的精度差异;特征优选会对分类精度的提升起到积极作用,应予以足够重视;SVM在不同时相及特征维度下的表现均较为稳定,在单时相及分类特征难以直接区分树种的情况下应优先使用SVM,但使用SVM时应注意其易发生过拟合;RF不易发生明显的过拟合,但其对分类特征的质量依赖较大,并倾向于在良好的影像组合下取得较为优异的结果.  相似文献   

9.
The globe's population is increasing day by day, which causes the severe problem of organic food for everyone. Farmers are becoming progressively conscious of the need to control numerous essential factors such as crop health, water or fertilizer use, and harmful diseases in the field. However, it is challenging to monitor agricultural activities. Therefore, precision agriculture is an important decision support system for food production and decision-making. Several methods and approaches have been used to support precision agricultural practices. The present study performs a systematic literature review on hyperspectral imaging technology and the most advanced deep learning and machine learning algorithm used in agriculture applications to extract and synthesize the significant datasets and algorithms. We reviewed legal studies carefully, highlighted hyperspectral datasets, focused on the most methods used for hyperspectral applications in agricultural sectors, and gained insight into the critical problems and challenges in the hyperspectral data processing. According to our study, it has been found that the Hyperion hyperspectral, Landsat-8, and Sentinel 2 multispectral datasets were mainly used for agricultural applications. The most applied machine learning method was support vector machine and random forest. In addition, the deep learning-based Convolutional Neural Networks (CNN) model is mainly used for crop classification due to its high performance with hyperspectral datasets. The present review will be helpful to the new researchers working in the field of hyperspectral remote sensing for agricultural applications with a machine and deep learning methods.  相似文献   

10.
探讨原发性肝癌患者精确放疗后乙型肝炎病毒(hepatitis b virus,HBV)再激活的危险特征和分类预测模型。提出基于遗传算法的特征选择方法,从原发性肝癌数据的初始特征集中选择HBV再激活的最优特征子集。建立贝叶斯和支持向量机的HBV再激活分类预测模型,并预测最优特征子集和初始特征集的分类性能。实验结果表明,基于遗传算法的特征选择提高了HBV再激活分类性能,最优特征子集的分类性能明显优于初始特征子集的分类性能。影响HBV再激活的最优特征子集包括:HBV DNA水平,肿瘤分期TNM,Child-Pugh,外放边界和全肝最大剂量。贝叶斯的分类准确性最高可达82.89%,支持向量机的分类准确性最高可达83.34%。  相似文献   

11.
12.
Microarray data analysis has been shown to provide an effective tool for studying cancer and genetic diseases. Although classical machine learning techniques have successfully been applied to find informative genes and to predict class labels for new samples, common restrictions of microarray analysis such as small sample sizes, a large attribute space and high noise levels still limit its scientific and clinical applications. Increasing the interpretability of prediction models while retaining a high accuracy would help to exploit the information content in microarray data more effectively. For this purpose, we evaluate our rule-based evolutionary machine learning systems, BioHEL and GAssist, on three public microarray cancer datasets, obtaining simple rule-based models for sample classification. A comparison with other benchmark microarray sample classifiers based on three diverse feature selection algorithms suggests that these evolutionary learning techniques can compete with state-of-the-art methods like support vector machines. The obtained models reach accuracies above 90% in two-level external cross-validation, with the added value of facilitating interpretation by using only combinations of simple if-then-else rules. As a further benefit, a literature mining analysis reveals that prioritizations of informative genes extracted from BioHEL's classification rule sets can outperform gene rankings obtained from a conventional ensemble feature selection in terms of the pointwise mutual information between relevant disease terms and the standardized names of top-ranked genes.  相似文献   

13.
Bayesian networks are knowledge representation tools that model the (in)dependency relationships among variables for probabilistic reasoning. Classification with Bayesian networks aims to compute the class with the highest probability given a case. This special kind is referred to as Bayesian network classifiers. Since learning the Bayesian network structure from a dataset can be viewed as an optimization problem, heuristic search algorithms may be applied to build high-quality networks in medium- or large-scale problems, as exhaustive search is often feasible only for small problems. In this paper, we present our new algorithm, ABC-Miner, and propose several extensions to it. ABC-Miner uses ant colony optimization for learning the structure of Bayesian network classifiers. We report extended computational results comparing the performance of our algorithm with eight other classification algorithms, namely six variations of well-known Bayesian network classifiers, cAnt-Miner for discovering classification rules and a support vector machine algorithm.  相似文献   

14.
As important members of the ecosystem, birds are good monitors of the ecological environment. Bird recognition, especially birdsong recognition, has attracted more and more attention in the field of artificial intelligence. At present, traditional machine learning and deep learning are widely used in birdsong recognition. Deep learning can not only classify and recognize the spectrums of birdsong, but also be used as a feature extractor. Machine learning is often used to classify and recognize the extracted birdsong handcrafted feature parameters. As the data samples of the classifier, the feature of birdsong directly determines the performance of the classifier. Multi-view features from different methods of feature extraction can obtain more perfect information of birdsong. Therefore, aiming at enriching the representational capacity of single feature and getting a better way to combine features, this paper proposes a birdsong classification model based multi-view features, which combines the deep features extracted by convolutional neural network (CNN) and handcrafted features. Firstly, four kinds of handcrafted features are extracted. Those are wavelet transform (WT) spectrum, Hilbert-Huang transform (HHT) spectrum, short-time Fourier transform (STFT) spectrum and Mel-frequency cepstral coefficients (MFCC). Then CNN is used to extract the deep features from WT, HHT and STFT spectrum, and the minimal-redundancy-maximal-relevance (mRMR) to select optimal features. Finally, three classification models (random forest, support vector machine and multi-layer perceptron) are built with the deep features and handcrafted features, and the probability of classification results of the two types of features are fused as the new features to recognize birdsong. Taking sixteen species of birds as research objects, the experimental results show that the three classifiers obtain the accuracy of 95.49%, 96.25% and 96.16% respectively for the features of the proposed method, which are better than the seven single features and three fused features involved in the experiment. This proposed method effectively combines the deep features and handcrafted features from the perspectives of signal. The fused features can more comprehensively express the information of the bird audio itself, and have higher classification accuracy and lower dimension, which can effectively improve the performance of bird audio classification.  相似文献   

15.
Introduction: The coronavirus disease 2019 (COVID-19) has become a significant public health problem worldwide. In this context, CT-scan automatic analysis has emerged as a COVID-19 complementary diagnosis tool allowing for radiological finding characterization, patient categorization, and disease follow-up. However, this analysis depends on the radiologist’s expertise, which may result in subjective evaluations.Objective: To explore deep learning representations, trained from thoracic CT-slices, to automatically distinguish COVID-19 disease from control samples.Materials and methods: Two datasets were used: SARS-CoV-2 CT Scan (Set-1) and FOSCAL clinic’s dataset (Set-2). The deep representations took advantage of supervised learning models previously trained on the natural image domain, which were adjusted following a transfer learning scheme. The deep classification was carried out: (a) via an end-to-end deep learning approach and (b) via random forest and support vector machine classifiers by feeding the deep representation embedding vectors into these classifiers.Results: The end-to-end classification achieved an average accuracy of 92.33% (89.70% precision) for Set-1 and 96.99% (96.62% precision) for Set-2. The deep feature embedding with a support vector machine achieved an average accuracy of 91.40% (95.77% precision) and 96.00% (94.74% precision) for Set-1 and Set-2, respectively.Conclusion: Deep representations have achieved outstanding performance in the identification of COVID-19 cases on CT scans demonstrating good characterization of the COVID-19 radiological patterns. These representations could potentially support the COVID-19 diagnosis in clinical settings.  相似文献   

16.
探究全球生态系统动力学调查(GEDI)多波束激光雷达数据估测区域森林郁闭度(FCC)的潜力,对于评估森林生态系统状态和林分环境具有重要作用。选取滇西北典型生态脆弱区香格里拉为研究区,以GEDI波形数据为信息源,提取46245个有林地光斑参数,使用经验贝叶斯克里金法(EBK)获取光斑参数在研究区未知空间的连续分布,结合54块实测样地数据,采用支持向量机的递归特征消除法(SVM-RFE)、随机森林(RF)和Pearson分析分别优选特征变量,基于贝叶斯优化(BO)随机森林回归模型(BO-RFR)、贝叶斯优化梯度回归模型(BO-GBRT)和偏最小二乘法(PLSR)研建森林郁闭度最佳估测模型。结果表明:(1)EBK法预测精度高,估测结果可靠,R2:0.20-0.92,RMSE:0.004-2812.912,MAE:0.003-1996.258,MRE:0.007-4.423;(2)基于不同特征优选方法筛选的特征变量和数量略有差异,SVM-RFE 法优选出6个参数(cover、pai、sensitivity、rv_a1、rv_a4、rg_a4)的平均交叉验证精度达0.84,RF法以贡献度5%为阈值筛选出5个参数(cover、pai、pgap_theta_error、modis_treecover、modis_nonvegetated),Pearson法以相关性大于0.3且在0.01水平显著优选出5个参数(cover、pai、rv_a5、rg_a5、pgap_theta_error);(3)不同特征变量优选方法筛选的建模参数研建估测模型精度差异性较大,以SVM-RFE和RF方法优选参数构建估测模型的精度更佳,SVM-RFE方法优选参数研建估测模型精度变化相对稳定,以 RF方法中的BO-GBRT模型为最佳FCC估测模型(R2=0.85、RMSE=0.069,P=86.5%);(4)采用BO-GBRT模型估测研究区森林郁闭度和空间制图,与GEDI pai参数预测的FCC具有较高空间相关性达0.53,FCC均值分别为0.58、0.61,主要分布在0.4-0.7,分别占比65.45%、51.79%。研究区森林郁闭度主要处于中度郁闭,北部区域主要为高度郁闭区,与研究区植被覆盖度的空间分布具有一致性,说明使用GEDI数据估测森林郁闭度的方法具有可行性、结果具有可靠性。研究为使用GEDI数据高效、及时、低成本估测大空间尺度的森林水平结构参数的相关研究奠定了基础。  相似文献   

17.
染色体易位重组位点的识别对很多染色体遗传性疾病的诊断有着重要的意义.本文基于实际诊断中采集到的24类染色体数据和9号正常与异常染色体数据,构建了一套自动识别染色体易位重组位点的模型和方法.首先,对染色体图像进行预处理,得到了方向梯度直方图特征(HOG)和局部二值模式特征(LBP),构建了基于纹理特征的染色体24分类多通...  相似文献   

18.
19.
原发性肝癌(PLC)患者在精确放疗后乙型肝炎病毒(HBV)再激活是一种常见并发症,及时的预测防护能降低发病率、死亡率。研究表明:多余的特征变量会影响HBV再激活的预测精度。通过提出基于近邻成分分析(NCA)的特征选择方法找出HBV再激活的危险因素及特征组合。之后分别建立经Bayes优化前后的支持向量机模型(SVM)对这些关键特征子集及初始特征集进行分类预测。实验结果表:明HBV DNA水平、KPS评分、分割方式、外放边界、V25、肿瘤分期TNM、ChildPugh等都是影响HBV再激活的危险因素。其中经NCA特征选择之后发现的V25是在乙型肝炎病毒再激活研究中首次提出的危险因素。10折交叉验证下特征组合HBV DNA水平、外放边界、V25的预测精度高达86.11%。支持向量机分类器可以很好的应用于乙型肝炎病毒再激活的研究,特征选择后的关键特征组合具有更优越的分类性能。  相似文献   

20.
Yao Y  Zhang T  Xiong Y  Li L  Huo J  Wei DQ 《Biotechnology journal》2011,6(11):1367-1376
The support vector machine (SVM), an effective statistical learning method, has been widely used in mutation prediction. Two factors, i.e., feature selection and parameter setting, have shown great influence on the efficiency and accuracy of SVM classification. In this study, according to the principles of a genetic algorithm (GA) and SVM, we developed a GA-SVM program and applied it to human cytochrome P450s (CYP450s), which are important monooxygenases in phase I drug metabolism. The program optimizes features and parameters simultaneously, and hence fewer features are used and the overall prediction accuracy is improved. We focus on the mutation of non-synonymous single nucleotide polymorphisms (nsSNPs) in protein sequences that appear to exhibit significant influences on drug metabolism. The final predictive model has a quite satisfactory performance, with the prediction accuracy of 61% and cross-validation accuracy of 73%. The results indicate that the GA-SVM program is a powerful tool in optimizing mutation predictive models of nsSNPs of human CYP450s.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号