高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于特征通道和空间联合注意机制的遮挡行人检测方法

陈勇 刘曦 刘焕淋

陈勇, 刘曦, 刘焕淋. 基于特征通道和空间联合注意机制的遮挡行人检测方法[J]. 电子与信息学报, 2020, 42(6): 1486-1493. doi: 10.11999/JEIT190606
引用本文: 陈勇, 刘曦, 刘焕淋. 基于特征通道和空间联合注意机制的遮挡行人检测方法[J]. 电子与信息学报, 2020, 42(6): 1486-1493. doi: 10.11999/JEIT190606
Yong CHEN, Xi LIU, Huanlin LIU. Occluded Pedestrian Detection Based on Joint Attention Mechanism of Channel-wise and Spatial Information[J]. Journal of Electronics & Information Technology, 2020, 42(6): 1486-1493. doi: 10.11999/JEIT190606
Citation: Yong CHEN, Xi LIU, Huanlin LIU. Occluded Pedestrian Detection Based on Joint Attention Mechanism of Channel-wise and Spatial Information[J]. Journal of Electronics & Information Technology, 2020, 42(6): 1486-1493. doi: 10.11999/JEIT190606

基于特征通道和空间联合注意机制的遮挡行人检测方法

doi: 10.11999/JEIT190606
基金项目: 国家自然科学基金(51977021)
详细信息
    作者简介:

    陈勇:男,1963年生,博士,教授,研究方向为图像处理

    刘曦:男,1993年生,硕士生,研究方向为行人目标检测

    刘焕淋:女,1970年生,博士,教授,研究方向为信号处理等方面的研究

    通讯作者:

    陈勇 chenyong@cqupt.edu.cn

  • 中图分类号: TN911.73; TP391.41

Occluded Pedestrian Detection Based on Joint Attention Mechanism of Channel-wise and Spatial Information

Funds: The National Natural Science Foundation of China (51977021)
  • 摘要:

    遮挡是行人检测任务中导致漏检发生的主要原因之一,对检测器性能造成了不利影响。为了增强检测器对于遮挡行人目标的检测能力,该文提出一种基于特征引导注意机制的单级行人检测方法。首先,设计一种特征引导注意模块,在保持特征通道间的关联性的同时保留了特征图的空间信息,引导模型关注遮挡目标可视区域;然后,通过注意模块融合浅层和深层特征,从而提取到行人的高层语义特征;最后,将行人检测作为一种高层语义特征检测问题,通过激活图的形式预测得到行人位置和尺度,并生成最终的预测边界框,避免了基于先验框的预测方式所带来的额外参数设置。所提方法在CityPersons数据集上进行了测试,并在Caltech数据集上进行了跨数据集实验。结果表明该方法对于遮挡目标检测准确度优于其他对比算法。同时该方法实现了较快的检测速度,取得了检测准确度和速度的平衡。

  • 图  1  模型总体结构

    图  2  注意模块总体结构

    图  3  特征通道注意模块结构

    图  4  空间关注模块

    图  5  行人解析网络

    图  6  可视化位置预测热图

    图  7  Caltech跨数据库实验

    表  1  验证实验条件设置

    R (Reasonable)HO (Heavy Occlusion)R+HO (Reasonable+Heavy Occlusion)
    $v \in [0.65,\infty )$$v \in [0.20,0.65]$$v \in [0.20,\infty )$
    下载: 导出CSV

    表  2  注意网络验证结果MR–2(%)

    方法RHOR+HO
    文献[16]16.056.738.2
    Baseline12.141.138.1
    Baseline+CA11.839.237.8
    Baseline+CA+SA11.638.537.3
    下载: 导出CSV

    表  3  CityPersons数据集测试结果MR-2(%)

    方法主干网络ReasonableHeavyPartialBare测试时间(s)
    OR-CNN[11]VGG-1612.855.715.36.7
    FasterRCNN[21]VGG-1615.4
    ALFNet[8]ResNet-5012.051.911.48.40.27
    CSP[9]ResNet-5011.049.310.47.30.33
    CAFL[13]ResNet-5011.450.412.17.6
    PedJointNet[14]ResNet-5013.552.1
    TLL[20]ResNet-5015.553.617.210.0
    RepLoss[23]ResNet-5013.256.916.87.6
    本文方法ResNet-5011.647.69.87.50.22
    下载: 导出CSV
  • 张功国, 吴建, 易亿, 等. 基于集成卷积神经网络的交通标志识别[J]. 重庆邮电大学学报: 自然科学版, 2019, 31(4): 571–577. doi: 10.3979/j.issn.1673-825X.2019.04.019

    ZHANG Gongguo, WU Jian, YI Yi, et al. Traffic sign recognition based on ensemble convolutional neural network[J]. Journal of Chongqing University of Posts and Telecommunications:Natural Science Edition, 2019, 31(4): 571–577. doi: 10.3979/j.issn.1673-825X.2019.04.019
    种衍文, 匡湖林, 李清泉. 一种基于多特征和机器学习的分级行人检测方法[J]. 自动化学报, 2012, 38(3): 375–381. doi: 10.3724/SP.J.1004.2012.00375

    CHONG Yanwen, KUANG Hulin, and LI Qingquan. Two-stage pedestrian detection based on multiple features and machine learning[J]. Acta Automatica Sinica, 2012, 38(3): 375–381. doi: 10.3724/SP.J.1004.2012.00375
    刘威, 段成伟, 遇冰, 等. 基于后验HOG特征的多姿态行人检测[J]. 电子学报, 2015, 43(2): 217–224. doi: 10.3969/j.issn.0372-2112.2015.02.002

    LIU Wei, DUAN Chengwei, YU Bing, et al. Multi-pose pedestrian detection based on posterior HOG feature[J]. Acta Electronica Sinica, 2015, 43(2): 217–224. doi: 10.3969/j.issn.0372-2112.2015.02.002
    REN Shaoqing, HE Kaiming, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[C]. 2015 Advances in Neural Information Processing Systems, Montreal, Canada, 2015: 91–99.
    LI Jianan, LIANG Xiaodan, SHEN Shengmei, et al. Scale-aware fast R-CNN for pedestrian detection[J]. IEEE Transactions on Multimedia, 2018, 20(4): 985–996. doi: 10.1109/TMM.2017.2759508
    王进, 陈知良, 李航, 等. 一种基于增量式超网络的多标签分类方法[J]. 重庆邮电大学学报: 自然科学版, 2019, 31(4): 538–549. doi: 10.3979/j.issn.1673-825X.2019.04.015

    WANG Jin, CHEN Zhiliang, LI Hang, et al. Hierarchical multi-label classification using incremental hypernetwork[J]. Journal of Chongqing University of Posts and Telecommunications:Natural Science Edition, 2019, 31(4): 538–549. doi: 10.3979/j.issn.1673-825X.2019.04.015
    GIRSHICK R. Fast R-CNN[C]. 2015 IEEE International Conference on Computer Vision, Santiago, Chile, 2015: 1440–1448.
    LIU Wei, LIAO Shengcai, HU Weidong, et al. Learning efficient single-stage pedestrian detectors by asymptotic localization fitting[C]. The 15th European Conference on Computer Vision, Munich, Germany, 2018: 618–634.
    LIU Wei, LIAO Shengcai, REN Weiqiang, et al. High-level semantic feature detection: A new perspective for pedestrian detection[C]. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 5182–5191.
    ZHANG Shanshan, BENENSON R, OMRAN M, et al. How far are we from solving pedestrian detection?[C]. 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 1259–1267.
    ZHANG Shifeng, WEN Longyin, BIAN Xiao, et al. Occlusion-aware R-CNN: detecting pedestrians in a crowd[C]. The 15th European Conference on Computer Vision, Munich, Germany, 2018: 637–653.
    OUYANG Wanli, ZHOU Hui, LI Hongsheng, et al. Jointly learning deep features, deformable parts, occlusion and classification for pedestrian detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(8): 1874–1887. doi: 10.1109/TPAMI.2017.2738645
    FEI Chi, LIU Bin, CHEN Zhu, et al. Learning pixel-level and instance-level context-aware features for pedestrian detection in crowds[J]. IEEE Access, 2019, 7: 94944–94953. doi: 10.1109/ACCESS.2019.2928879
    LIN C Y, XIE Hongxia, and ZHENG Hua. PedJointNet: Joint head-shoulder and full body deep network for pedestrian detection[J]. IEEE Access, 2019, 7: 47687–47697. doi: 10.1109/ACCESS.2019.2910201
    ZHANG Shanshan, YANG Jian, and SCHIELE B. Occluded pedestrian detection through guided attention in CNNs[C]. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018: 6995–7003.
    ZHU Chenchen, HE Yihui, and SAVVIDES M. Feature selective anchor-free module for single-shot object detection[C]. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 840–849.
    LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 936–944.
    CHEN Long, ZHANG Hanwang, XIAO Jun, et al. SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning[C]. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 6298–6306.
    WOO S, PARK J, LEE J Y, et al. Cbam: Convolutional block attention module[C]. The 15th European Conference on Computer Vision, Munich, Germany, 2018: 3–19.
    SONG Tao, SUN Leiyu, XIE Di, et al. Small-scale pedestrian detection based on topological line localization and temporal feature aggregation[C]. The 15th European Conference on Computer Vision, Munich, Germany, 2018: 536–551.
    ZHANG Shanshan, BENENSON R, and SCHIELE B. Citypersons: A diverse dataset for pedestrian detection[C]. 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, USA, 2017: 4457–4465.
    DOLLAR P, WOJEK C, SCHIELE B, et al. Pedestrian detection: An evaluation of the state of the art[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4): 743–761. doi: 10.1109/TPAMI.2011.155
    WANG Xinlong, XIAO Tete, JIANG Yuning, et al. Repulsion loss: Detecting pedestrians in a crowd[C]. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018: 7774–7783.
  • 加载中
图(7) / 表(3)
计量
  • 文章访问数:  3523
  • HTML全文浏览量:  1397
  • PDF下载量:  165
  • 被引次数: 0
出版历程
  • 收稿日期:  2019-08-09
  • 修回日期:  2020-02-18
  • 网络出版日期:  2020-03-13
  • 刊出日期:  2020-06-22

目录

    /

    返回文章
    返回