首页 | 本学科首页   官方微博 | 高级检索  
   检索      


CNN and transformer framework for insect pest classification
Institution:1. Lushan Botanical Garden, Chinese Academy of Sciences, Jiangxi Province 332900, PR China;2. College of Forestry, Nanjing Forestry University, Nanjing 210037, PR China;3. Jiangsu Wiscom Technology Co. Ltd, Nanjing 211100, PR China;1. Department of Geographical Sciences, School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, China;2. Hubei Key Laboratory of Critical Zone Evolution, China University of Geosciences, Wuhan 430074, China;3. Guizhou Electric Power Design Research Institute, Power Construction Corporation of China, Guiyang 550002, Guizhou, China;1. Shaanxi Meteorological Service Center of Agricultural Remote Sensing and Economic Crops, Xi''an 710014, China;2. National Tibetan Plateau Data Center, State Key Laboratory of Tibetan Plateau Earth System Science, Institute of Tibetan Plateau Research, Chinese Academy of Sciences, Beijing 100101, China;3. Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China;1. National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, Hefei 230601, China;2. School of Information Science and Technology, Fudan University, Shanghai 200433, China;1. College of Mechanical and Electrical Engineering, Qingdao Agricultural University, Qingdao 266109, China;2. Department of Science and Technology, Qingdao Agricultural University, Qingdao 266109, China;3. College of Plant Health and Medicine, Qingdao Agricultural University, Qingdao 266109, China
Abstract:Insect pests pose a significant and increasing threat to agricultural production worldwide. However, most existing recognition methods are built upon well-known convolutional neural networks, which limits the possibility of improving pest recognition accuracies. This research attempts to overcome this challenge from a novel perspective, constructing a simplified but very useful network for effective insect pest recognition by combining transformer architecture and convolution blocks. First, the representative features are extracted from the input image using a backbone convolutional neural network. Second, a new transformer attention-based classification head is proposed to sufficiently utilize spatial data from the features. With that, we explore different combinations for each module in our model and abstract our model into a simple and scalable architecture; we introduce more effective training strategies, pretrained models and data augmentation methods. Our models performance was evaluated on the IP102 benchmark dataset and achieved classification accuracies of 74.897% and 75.583% with minimal implementation costs at image resolutions of 224 × 224 pixels and 480 × 480 pixels, respectively. Our model also attains accuracies of 99.472% and 97.935% on the D0 dataset and Li's dataset, respectively, with an image resolution of 224 × 224 pixels. The experimental results demonstrate that our method is superior to the state-of-the-art methods on these datasets. Accordingly, the proposed model can be deployed in practice and provides additional insights into the related research.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号