Affiliation: | 1. The College of Information Engineering, Shanghai Maritime University, Shanghai, China; 2. The College of Information Engineering, Shanghai Maritime University, Shanghai, China, and Telecom SudParis, IMT, Institut Polytechnique de Paris, Paris, France; 3. Shanghai Ship and Shipping Research Institute Co., Ltd., Shanghai, China, and COSCO Shipping Technology Co., Ltd., Shanghai, China; 4. Shanghai Ship and Shipping Research Institute Co., Ltd., Shanghai, China; 5. COSCO Shipping Technology Co., Ltd., Shanghai, China |
Abstract: | Pests are a major threat to crop growth, and precise pest classification helps in formulating effective prevention and control strategies. To address the low efficiency of existing pest classification methods and their poor adaptability to large-scale environments, this paper proposes a new pest classification method based on a convolutional neural network (CNN) and an improved Vision Transformer model. First, MMAlNet is designed to extract features of the object to be identified at different scales and at finer granularity. Then, a classification model called DenseNet Vision Transformer (DNVT), which combines a CNN with an improved Vision Transformer model, is proposed. DNVT models both long-distance dependencies and local features, which effectively improves pest classification accuracy. Finally, an ensemble learning algorithm combines the classification predictions of MMAlNet and DNVT by soft voting, further improving pest classification accuracy. Experimental results on the D0 and IP102 datasets show that the proposed method attains maximum classification accuracies of 99.89% and 74.20%, respectively, outperforming other state-of-the-art methods and indicating high practical application value. |
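The soft-voting step named in the abstract can be illustrated with a minimal sketch. It assumes each model outputs a class-probability vector per image and that the two vectors are averaged with equal weights; the abstract does not state the weighting, and the function name `soft_vote` and the example probabilities are hypothetical.

```python
def soft_vote(prob_a, prob_b, weight_a=0.5):
    """Fuse two class-probability vectors by weighted averaging.

    Returns the index of the winning class and the fused probabilities.
    Equal weighting (weight_a=0.5) is an assumption, not taken from the paper.
    """
    if len(prob_a) != len(prob_b):
        raise ValueError("probability vectors must have the same length")
    weight_b = 1.0 - weight_a
    fused = [weight_a * pa + weight_b * pb for pa, pb in zip(prob_a, prob_b)]
    # The predicted class is the one with the highest fused probability.
    predicted = max(range(len(fused)), key=fused.__getitem__)
    return predicted, fused


# Hypothetical softmax outputs for one image over three pest classes.
mmalnet_probs = [0.50, 0.40, 0.10]  # stands in for the MMAlNet output
dnvt_probs = [0.20, 0.70, 0.10]     # stands in for the DNVT output

label, fused = soft_vote(mmalnet_probs, dnvt_probs)
print(label, fused)
```

In this made-up case the first model alone would pick class 0 and the second class 1; averaging the probabilities lets the more confident prediction decide, which is the usual motivation for soft voting over hard (majority) voting.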