高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于动态聚焦与语义提示的细粒度害虫分类网络

柳长源 赵海健 吴海滨

柳长源, 赵海健, 吴海滨. 基于动态聚焦与语义提示的细粒度害虫分类网络[J]. 电子与信息学报. doi: 10.11999/JEIT260044
引用本文: 柳长源, 赵海健, 吴海滨. 基于动态聚焦与语义提示的细粒度害虫分类网络[J]. 电子与信息学报. doi: 10.11999/JEIT260044
LIU Changyuan, ZHAO Haijian, WU Haibin. Dynamic Focus and Semantic Prompt Network for Fine-Grained Pest Classification[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT260044
Citation: LIU Changyuan, ZHAO Haijian, WU Haibin. Dynamic Focus and Semantic Prompt Network for Fine-Grained Pest Classification[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT260044

基于动态聚焦与语义提示的细粒度害虫分类网络

doi: 10.11999/JEIT260044 cstr: 32379.14.JEIT260044
基金项目: 黑龙江省交通运输厅科技项目(HJK2024B002)
详细信息
    作者简介:

    柳长源:男,副教授,研究方向为图像处理与模式识别

    赵海健:男,硕士生,研究方向为图像分类与图像识别

    吴海滨:男,教授,研究方向为机器视觉与图像分类

    通讯作者:

    柳长源 liuchangyuan@hrbust.edu.cn

  • 中图分类号: TP391.41

Dynamic Focus and Semantic Prompt Network for Fine-Grained Pest Classification

Funds: Scientific and Technological Project of the Department of Transportation of Heilongjiang Province (HJK2024B002)
  • 摘要: 农业害虫图像普遍存在复杂背景干扰、不同虫态时期外观差异显著、拍摄角度多样和尺度变化大等问题,导致现有细粒度图像分类模型在特征提取和虫态变化适应性方面仍存在不足。针对上述问题,本文构建了一个涵盖多虫态时期、多角度和多尺度的农业害虫多维数据集(APMD),并提出一种基于动态聚焦与语义提示的细粒度害虫分类网络(DFS-PestNet)。本文构建主特征流与提示增强流相结合的解耦并行架构,通过空间形变建模模块(SDP)动态聚焦害虫斑点、翅脉等关键判别区域,以增强复杂背景下的局部细微特征提取能力;引入提示引导机制模块(AHVP),在浅中层特征中融合类别语义与空间位置信息,提升对不同虫态形态变化的适应性;同时采用显著性双分支采样(DSS),通过可学习原型部件和双分支显著性融合自适应聚合害虫关键部位特征,增强对微小害虫和早期幼虫等小目标的识别能力。实验结果表明,在IP102和APMD两个数据集上本文模型均取得了优于基线模型和现有主流方法的分类性能,验证了其在复杂场景下细粒度害虫分类任务中的有效性与应用潜力,为智慧农业中的虫害监测与精准防治提供技术参考。
  • 图  1  APMD数据集图像

    图  2  DFS-PestNet模型结构

    图  3  SDP模块结构

    图  4  AHVP模块结构

    图  5  DSS模块结构

    图  6  Grad-CAM可视化对比

    图  7  t-SNE可视化APMD数据集特征分布图

    图  8  模型超参数变化下的准确率大小

    表  1  APMD数据集层级化分类体系

    种类数虫态数图像数量
    训练集验证集测试集
    半翅目233(卵/幼/成虫)43471242621
    鳞翅目164(卵/幼/蛹/成虫)3024864432
    鞘翅目104(卵/幼/蛹/成虫)1890540270
    双翅目34(卵/幼/蛹/成虫)56716281
    膜翅目24(卵/幼/蛹/成虫)37810854
    蜱螨类24(卵/幼/蛹/成虫)37810854
    直翅目13(卵/幼/成虫)1895427
    等翅目13(卵/幼/成虫)1895427
    合计581096231321566
    下载: 导出CSV

    表  2  消融实验结果

    实验
    编号
    SDP
    模块
    AHVP
    模块
    DSS
    模块
    A/% P/% R/% F1-score/%
    1 75.03 67.85 64.60 66.18
    2 75.73 69.10 65.90 67.46
    3 75.67 68.95 65.85 67.36
    4 75.38 68.40 65.30 66.81
    5 76.52 70.40 67.45 68.89
    6 76.49 70.35 67.30 68.79
    7 76.50 70.38 67.40 68.86
    8 77.24 71.64 68.78 70.18
    下载: 导出CSV

    表  3  IP102数据集上对比试验结果

    网络模型BackboneA/%P/%R/%F1-score/%FPS
    VRFNet[18]EfficientNet68.3468.3768.3368.34-
    EfficientNet B7[19]EfficientNet70.01----
    ViT[20]ViT-B/1673.4068.1766.8966.5270
    IELT[21]ViT-B/1675.4069.5067.8268.6562
    FFEL-Net[22]ViT-B/1676.2168.4466.8667.64-
    FRCF[23]ViT-B/1674.69---91
    Gate-ViT[24]ViT-B/1676.1070.1169.1469.62109
    EST[25]Swin-B71.8465.8563.2264.0684
    本文模型Swin-B77.2471.6468.7870.18158
    下载: 导出CSV

    表  4  APMD数据集上对比试验结果

    网络模型BackboneA/%P/%R/%F1-score/%FPS
    ViT[20]ViT-B/1688.9289.8087.4588.61125
    IELT[21]ViT-B/1690.2490.9089.2190.05117
    Gate-ViT[24]ViT-B/1691.6092.1090.8791.48112
    CLCA[26]ViT-B/1693.5294.0092.7993.39110
    GLSim[27]ViT-B/1695.7896.1095.3295.71105
    EST[25]Swin-B93.4593.8293.1093.46138
    本文模型Swin-B98.0198.2098.0097.90164
    下载: 导出CSV
  • [1] 陆宴辉, 刘杨, 杨现明, 等. 中国农业害虫综合防治研究进展: 2018年-2022年[J]. 植物保护, 2023, 49(5): 145–166. doi: 10.16688/j.zwbh.2023207.

    LU Yanhui, LIU Yang, YANG Xianming, et al. Advances in integrated management of agricultural insect pests in China: 2018-2022[J]. Plant Protection, 2023, 49(5): 145–166. doi: 10.16688/j.zwbh.2023207.
    [2] 赵雪如, 李晖, 胡欣仪, 等. 基于深度学习的田间害虫自动识别技术综述[J]. 图像与信号处理, 2023, 12(2): 77–88. doi: 10.12677/JISP.2023.122008.

    ZHAO Xueru, LI Hui, HU Xinyi, et al. Survey of automatic identification of field pests based on deep learning[J]. Journal of Image and Signal Processing, 2023, 12(2): 77–88. doi: 10.12677/JISP.2023.122008.
    [3] WU Xiaoping, ZHAN Chi, LAI Yukun, et al. IP102: A large-scale benchmark dataset for insect pest recognition[C]. The 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 8779–8788. doi: 10.1109/CVPR.2019.00899.
    [4] 陈磊, 刘立波, 王晓丽. 2020年宁夏枸杞虫害图文跨模态检索数据集[J]. 中国科学数据, 2022, 7(3): 1–8. doi: 10.11922/11-6035.nasdc.2021.0058.zh.

    CHEN Lei, LIU Libo, and WANG Xiaoli. A dataset of image-text cross-modal retrieval of Lycium barbarum pests in Ningxia in 2020[J]. China Scientific Data, 2022, 7(3): 1–8. doi: 10.11922/11-6035.nasdc.2021.0058.zh.
    [5] LI Yanfen, WANG Hanxiang, DANG L M, et al. Crop pest recognition in natural scenes using convolutional neural networks[J]. Computers and Electronics in Agriculture, 2020, 169: 105174. doi: 10.1016/j.compag.2019.105174.
    [6] BOLLIS E, PEDRINI H, and AVILA S. Weakly supervised learning guided by activation mapping applied to a novel citrus pest benchmark[C]. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, USA, 2020: 310–319. doi: 10.1109/CVPRW50498.2020.00043.
    [7] FANG Mingwei, TAN Zhiping, TANG Yu, et al. Pest-ConFormer: A hybrid CNN-Transformer architecture for large-scale multi-class crop pest recognition[J]. Expert Systems with Applications, 2024, 255: 124833. doi: 10.1016/j.eswa.2024.124833.
    [8] CHENG Zekai and XIA Wan. Fine-grained image classification on agricultural pest larvae[J]. IOP Conference Series: Earth and Environmental Science, 2021, 792: 012037. doi: 10.1088/1755-1315/792/1/012037.
    [9] AMARATHUNGA D C, RATNAYAKE M N, GRUNDY J, et al. Fine-grained image classification of microscopic insect pest species: Western Flower thrips and Plague thrips[J]. Computers and Electronics in Agriculture, 2022, 203: 107462. doi: 10.1016/j.compag.2022.107462.
    [10] WANG Linfeng, LIU Yong, LI Jiayao, et al. Based on the multi-scale information sharing network of fine-grained attention for agricultural pest detection[J]. PLoS One, 2023, 18(10): e0286732. doi: 10.1371/journal.pone.0286732.
    [11] 赵凤, 耿苗苗, 刘汉强, 等. 卷积神经网络与视觉Transformer联合驱动的跨层多尺度融合网络高光谱图像分类方法[J]. 电子与信息学报, 2024, 46(5): 2237–2248. doi: 10.11999/JEIT231209.

    ZHAO Feng, GENG Miaomiao, LIU Hanqiang, et al. Convolutional neural network and vision Transformer-driven cross-layer multi-scale fusion network for hyperspectral image classification[J]. Journal of Electronics & Information Technology, 2024, 46(5): 2237–2248. doi: 10.11999/JEIT231209.
    [12] 文泓力, 胡庆浩, 黄立威, 等. 基于参数高效ViT与多模态导引的遥感图像小样本分类方法[J]. 电子与信息学报, 2025, 47(12): 4689–4703. doi: 10.11999/JEIT250996.

    WEN Hongli, HU Qinghao, HUANG Liwei, et al. Few-shot remote sensing image classification based on parameter-efficient vision transformer and multimodal guidance[J]. Journal of Electronics & Information Technology, 2025, 47(12): 4689–4703. doi: 10.11999/JEIT250996.
    [13] 宋婉莹, 刘毓琛, 王杰, 等. 面向高分辨遥感图像的熵驱动自适应融合网络构建与场景分类研究[J/OL]. 电子与信息学报, https://link.cnki.net/urlid/11.4494.TN.20260405.2112.010, 2025.

    SONG Wanying, LIU Yuchen, WANG Jie, et al. Entropy-driven adaptive fusion network for scene classification of high-resolution remote sensing images[J]. Journal of Electronics & Information Technology, https://link.cnki.net/urlid/11.4494.TN.20260405.2112.010, 2025.
    [14] HAN Yuantao, ZHANG Cong, ZHAN Xiaoyun, et al. Crossing multiple life stages: Fine-grained classification of agricultural pests[J]. Plant Methods, 2024, 20(1): 191. doi: 10.1186/s13007-024-01317-w.
    [15] WANG Jiahui, XU Qin, JIANG Bo, et al. Multi-granularity part sampling attention for fine-grained visual classification[J]. IEEE Transactions on Image Processing, 2024, 33: 4529–4542. doi: 10.1109/TIP.2024.3441813.
    [16] DAI Jifeng, QI Haozhi, XIONG Yuwen, et al. Deformable convolutional networks[C]. The 2017 IEEE International Conference on Computer Vision, Venice, Italy, 2017: 764–773. doi: 10.1109/ICCV.2017.89.
    [17] KIRKLAND E J. Advanced Computing in Electron Microscopy[M]. 2nd ed. New York: Springer, 2010: 261–263. doi: 10.1007/978-1-4419-6533-2.
    [18] NANDHINI C and BRINDHA M. Visual regenerative fusion network for pest recognition[J]. Neural Computing and Applications, 2024, 36(6): 2867–2882. doi: 10.1007/s00521-023-09173-w.
    [19] MALIK P and PARIDA M K. Classification of insect pest using transfer learning mechanism[C]. The 8th International Conference on Computer Vision and Image Processing, Jammu, India, 2023: 78–89. doi: 10.1007/978-3-031-58535-7_7.
    [20] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16x16 words: Transformers for image recognition at scale[C]. The 9th International Conference on Learning Representations, 2021.
    [21] XU Qin, WANG Jiahui, JIANG Bo, et al. Fine-grained visual classification via internal ensemble learning transformer[J]. IEEE Transactions on Multimedia, 2023, 25: 9015–9028. doi: 10.1109/TMM.2023.3244340.
    [22] 张文丽, 宋威. 基于特征融合与集成学习的细粒度图像分类[J]. 激光与光电子学进展, 2024, 61(22): 2237010. doi: 10.3788/LOP240759.

    ZHANG Wenli and SONG Wei. Fine-grained image classification based on feature fusion and ensemble learning[J]. Laser & Optoelectronics Progress, 2024, 61(22): 2237010. doi: 10.3788/LOP240759.
    [23] LIU Honglin, ZHAN Yongzhao, XIA Huifen, et al. Self-supervised transformer-based pre-training method using latent semantic masking auto-encoder for pest and disease classification[J]. Computers and Electronics in Agriculture, 2022, 203: 107448. doi: 10.1016/j.compag.2022.107448.
    [24] LU Xiaowei, WANG Kanqi, WANG Peiyu, et al. Gate-ViT: Gated vision transformer for fine-grained visual classification[C]. The 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Sydney, Australia, 2025: 468–479. doi: 10.1007/978-981-96-8180-8_37.
    [25] LIU Wei and ZHANG Ao. Plant disease detection algorithm based on efficient Swin transformer[J]. Computers, Materials & Continua, 2025, 82(2): 3045–3068. doi: 10.32604/cmc.2024.058640.
    [26] RIOS E A, YUANDA J C, GHANZ V L, et al. Cross-layer cache aggregation for token reduction in ultra-fine-grained image recognition[C]. 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hyderabad, India, 2025: 1–5. doi: 10.1109/icassp49660.2025.10890489.
    [27] RIOS E A, HU Minchun, and LAI Bocheng. Global-local similarity for efficient fine-grained image recognition with vision transformers[C]. 2025 IEEE International Symposium on Circuits and Systems (ISCAS), London, UK, 2025: 1–5. doi: 10.1109/ISCAS56072.2025.11043866.
  • 加载中
图(8) / 表(4)
计量
  • 文章访问数:  6
  • HTML全文浏览量:  3
  • PDF下载量:  0
  • 被引次数: 0
出版历程
  • 收稿日期:  2026-01-13
  • 修回日期:  2026-04-13
  • 录用日期:  2026-04-13
  • 网络出版日期:  2026-04-30

目录

    /

    返回文章
    返回