高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

面向空地行人重识别的柯尔莫哥洛夫-阿诺德非线性增强方法

陈逸钧 曾宪贤 刘舜 王磊军

陈逸钧, 曾宪贤, 刘舜, 王磊军. 面向空地行人重识别的柯尔莫哥洛夫-阿诺德非线性增强方法[J]. 电子与信息学报. doi: 10.11999/JEIT260430
引用本文: 陈逸钧, 曾宪贤, 刘舜, 王磊军. 面向空地行人重识别的柯尔莫哥洛夫-阿诺德非线性增强方法[J]. 电子与信息学报. doi: 10.11999/JEIT260430
CHEN Yijun, ZENG Xianxian, LIU Shun, WANG Leijun. Kolmogorov-Arnold Nonlinear Enhancement Method for Aerial-Ground Person Re-Identification[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT260430
Citation: CHEN Yijun, ZENG Xianxian, LIU Shun, WANG Leijun. Kolmogorov-Arnold Nonlinear Enhancement Method for Aerial-Ground Person Re-Identification[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT260430

面向空地行人重识别的柯尔莫哥洛夫-阿诺德非线性增强方法

doi: 10.11999/JEIT260430 cstr: 32379.14.JEIT260430
基金项目: 国家自然科学基金(62401164),广东省基础与应用基础研究基金自然科学基金面上项目(2024A1515010219, 2026A1515011911),广东省教育厅重点领域项目(2025ZDZX3008),广东省高校人文社科重点实验室项目(2025WZJD005),广东省教育厅科研能力提升项目(2025ZDJS023)
详细信息
    作者简介:

    陈逸钧:男,硕士研究生,研究方向为行人重识别

    曾宪贤:男,副教授,研究方向为人工智能

    刘舜:男,讲师,研究方向为统计学习

    王磊军:男,讲师,研究方向为物联网

    通讯作者:

    曾宪贤 zengxianxian@gpnu.edu.cn

  • 中图分类号: TN911.73; TP391.41

Kolmogorov-Arnold Nonlinear Enhancement Method for Aerial-Ground Person Re-Identification

Funds: the National Natural Science Foundation of China (Grant No.62401164), the Guangdong Basic and Applied Basic Research Foundation (No. 2024A1515010219, 2026A1515011911), the Guangdong Provincial Department of Education Key Field Project on Artificial Intelligence (No. 2025ZDZX3008), the Guangdong Key Research Institute of Humanities and Social Sciences at Universities (No. 2025WZJD005), the Key Discipline Improvement Project of Guangdong Province under Grant (No. 2025ZDJS023)
  • 摘要: 空地行人重识别旨在实现无人机视角与地面摄像视角下同一身份行人的跨平台匹配。由于空地视角差异显著且跨域分布偏移严重,现有方法中依赖线性特征变换的分类监督分支难以充分建模复杂非线性判别关系。针对上述问题,该文提出一种柯尔莫哥洛夫-阿诺德非线性增强模块,置于主干网络输出特征与线性分类层之间,用于替代传统的线性特征变换过程。该模块借鉴柯尔莫哥洛夫-阿诺德表示思想,通过可学习的样条函数对特征进行自适应非线性重构与增强,从而强化监督映射过程,促进更具判别性的空地跨视角表征学习。在CARGO和AG-ReID数据集上的实验结果表明,所提方法优于现有同类前沿方法。尤其在最具挑战性的空地跨视角检索协议下,两个数据集上的Rank-1分别达到58.75%和84.41%,较基线方法分别提升10.63%和1.99%,表明该方法具有较强的跨视角检索能力。
  • 图  1  基于VDT骨干网络的KANEM空地行人重识别框架。

    图  2  在CARGO数据集的总体协议下对参数$ \lambda $,GP的分析结果,其中(b)-(d)为参数GP的网格搜索结果

    图  3  t-SNE可视化结果

    图  4  在CARGO数据集的四种检索协议下的检索可视化结果

    表  1  在CARGO数据集下与其他方法的对比实验结果(%)

    方法来源协议1:(ALL)协议2:(G$ \leftrightarrow $G)协议3:(A$ \leftrightarrow $A)协议4:(A$ \leftrightarrow $G)
    Rank-1mAPmINPRank-1mAPmINPRank-1mAPmINPRank-1mAPmINP
    PCB[11]TPAMI-2144.2338.1526.1472.3261.9245.7257.5042.3422.5021.2521.0214.22
    SBS[12]ACM MM-2350.3243.0929.7673.2162.9948.2467.5049.7329.3231.2529.0018.71
    BoT[13]CVPR-1954.8146.4932.4077.6866.4751.3465.0049.7929.8236.2532.5621.46
    MGN[14]ACMM-1854.4946.5833.5582.1469.3153.6065.0048.8627.4232.5030.4421.53
    APNet[15]TIP-2158.9750.2435.7677.6866.8351.8567.5054.5737.3544.3739.3526.76
    VV[16]IJCNN-1945.8338.8439.5772.3162.9948.2467.5049.7329.3231.2529.0018.71
    AGW[10]TPAMI-2160.2653.4440.2281.2571.6658.0967.5056.4840.4043.5740.9029.39
    ViT[17]ICLR-2161.5453.5439.6282.1471.3457.5580.0064.4747.9743.1340.1128.20
    IDA[18]自动化学报-2564.4258.1746.1783.0477.0467.5082.5069.6554.5848.7545.1333.92
    DTST[19]ICME-2564.4255.7341.9278.5772.4062.1080.0063.3144.6750.6343.3929.46
    VIF[20]ICCV-2565.7157.4644.1283.9374.1962.3082.5066.9851.4451.2544.5531.20
    VDT[6]CVPR-2464.1055.2041.1382.1471.5958.3982.5066.8350.2248.1242.7629.95
    Ours−−70.1963.1651.3483.9376.1766.0780.0072.0959.8658.7553.2741.11
    下载: 导出CSV

    表  2  在AG-ReID数据集下与其他方法的对比实验结果(%)

    方法来源协议1:(A→G)协议2:(G→A)
    Rank-1mAPmINPRank-1mAPmINP
    SBS[12]ACM MM-2373.5459.7773.7062.37
    BoT[13]CVPR-1970.0155.4771.2058.83
    VV[16]IJCNN-1977.2267.2341.4379.7369.8342.37
    ViT[17]ICLR-2181.2872.3882.6473.35
    Explain[4]ICME-2381.4772.6181.8573.35
    DTST[19]ICME-2583.4874.5149.8684.7276.0550.04
    SeCap[7]CVPR-2583.9175.1450.3185.7876.9650.52
    VDT[6]CVPR-2482.4274.2349.2884.2476.4849.50
    Ours−−84.4176.2153.0586.6977.9952.28
    下载: 导出CSV

    表  3  KANEM的不同的层数与维度在CARGO数据集下的实验结果(%)

    维度与层数配置协议1:(ALL)协议2:(G$ \leftrightarrow $G)协议3:(A$ \leftrightarrow $A)协议4:(A$ \leftrightarrow $G)
    Rank-1mAPmINPRank-1mAPmINPRank-1mAPmINPRank-1mAPmINP
    $ 768\rightarrow 768 $68.4562.5351.7084.8277.0867.0280.0071.0259.2657.3850.6738.72
    $ 768\rightarrow {N}_{\text{id}} $67.9562.5351.2683.0477.3167.0177.5071.1259.5756.2551.9939.72
    $ 768\rightarrow 768\rightarrow 768 $69.2363.0151.5982.1475.9867.6782.5071.2459.1657.6751.3340.24
    $ 768\rightarrow 1536\rightarrow 768 $68.9160.9348.2682.1476.0767.1282.5072.9359.7956.6849.8536.57
    $ 768\rightarrow 768\rightarrow {N}_{\text{id}} $67.9561.8049.8881.2575.4365.9677.5070.5459.0656.8852.6340.55
    $ 768\rightarrow 1536\rightarrow {N}_{\text{id}} $70.1963.1651.3483.9376.1766.0780.0072.0959.8658.7553.2741.11
    $ 768\rightarrow 1536\rightarrow 1536\rightarrow {N}_{\text{id}} $66.9961.5149.7981.2575.6465.8580.0071.3659.4655.0950.6337.68
    下载: 导出CSV

    表  4  KANEM的有效性分析实验结果(%)

    方法协议1:(ALL)协议2:(G$ \leftrightarrow $G)协议3:(A$ \leftrightarrow $A)协议4:(A$ \leftrightarrow $G)
    Rank-1mAPmINPRank-1mAPmINPRank-1mAPmINPRank-1mAPmINP
    Baseline64.1055.2041.1382.1471.5958.3982.5066.8350.2248.1242.7629.95
    Baseline+MLP(ReLU)68.5662.3450.3581.2574.3264.3780.0072.1259.6756.8852.4040.44
    Baseline+MLP(GELU)68.5961.0149.7683.0475.8165.0577.5070.9459.4456.2551.6838.42
    Baseline+KANEM70.1963.1651.3483.9376.1766.0780.0072.0959.8658.7553.2741.11
    下载: 导出CSV

    表  5  KANEM在不同框架和不同类型行人重识别任务上的通用性分析(%)

    任务类型模型名称数据集评估设置Baseline+KANEM变化量
    Rank-1mAPRank-1mAPRank-1mAP
    遮挡行人重识别PVPM[21]Occluded-REID普通66.859.567.960.2+1.1+0.7
    空地行人重识别SeCap[7]CARGO总体68.5960.1974.3568.24+5.76+8.05
    空中到地面69.4358.9475.6170.11+6.18+11.17
    换衣行人重识别CAL[22]CCVID不换衣82.681.387.284.3+4.6+3.0
    换衣81.779.686.183.0+4.4+3.45
    可见光-红外行人重识别DEEN[23]LLCM红外到可见光54.962.956.463.4+1.5+0.5
    可见光到红外62.565.868.365.2+5.8-0.6
    下载: 导出CSV
  • [1] 张红颖, 樊世钰, 罗谦, 等. 结合视觉文本匹配和图嵌入的可见光-红外行人重识别[J]. 电子与信息学报, 2024, 46(9): 3662–3671. doi: 10.11999/JEIT240318.

    ZHANG Hongying, FAN Shiyu, LUO Qian, et al. Visible-infrared person re-identification combining visual-textual matching and graph embedding[J]. Journal of Electronics & Information Technology, 2024, 46(9): 3662–3671. doi: 10.11999/JEIT240318.
    [2] 庄建军, 王楠. T3FRNet: 一种融合三重感知细粒度重构的换衣行人重识别方法[J]. 电子与信息学报, 2026, 48(1): 370–381. doi: 10.11999/JEIT250476.

    ZHUANG Jianjun and WANG Nan. T3FRNet: A cloth-changing person re-identification via texture-aware transformer tuning fine-grained reconstruction method[J]. Journal of Electronics & Information Technology, 2026, 48(1): 370–381. doi: 10.11999/JEIT250476.
    [3] 周玉, 赵小锋, 汪一, 等. 关键细粒度信息指导的多尺度遮挡行人重识别[J]. 电子与信息学报, 2024, 46(6): 2578–2586. doi: 10.11999/JEIT230686.

    ZHOU Yu, ZHAO Xiaofeng, WANG Yi, et al. Multi-scale occluded person re-identification guided by key fine-grained information[J]. Journal of Electronics & Information Technology, 2024, 46(6): 2578–2586. doi: 10.11999/JEIT230686.
    [4] NGUYEN K, FOOKES C, SRIDHARAN S, et al. AG-ReID 2023: Aerial-ground person re-identification challenge results[C]. 2023 IEEE International Joint Conference on Biometrics (IJCB), Ljubljana, Slovenia, 2023: 1–10. doi: 10.1109/IJCB57857.2023.10448780.
    [5] NGUYEN H, NGUYEN K, SRIDHARAN S, et al. AG-ReID. v2: Bridging aerial and ground views for person re-identification[J]. IEEE Transactions on Information Forensics and Security, 2024, 19: 2896–2908. doi: 10.1109/TIFS.2024.3353078.
    [6] ZHANG Quan, WANG Lei, PATEL V M, et al. View-decoupled transformer for person re-identification under aerial-ground camera network[C]. Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, 2024: 22000–22009. doi: 10.1109/CVPR52733.2024.02077.
    [7] WANG Shining, WANG Yunlong, WU Ruiqi, et al. SeCap: Self-calibrating and adaptive prompts for cross-view person re-identification in aerial-ground networks[C]. Proceedings of the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, USA, 2025: 22119–22128. doi: 10.1109/CVPR52734.2025.02060.
    [8] KOLMOGOROV A N. On the representation of continuous functions of several variables by superpositions of continuous functions of fewer variables[J]. American Mathematical Society Translations, 1961, 17(2): 369–373.
    [9] LIU Ziming, WANG Yixuan, VAIDYA S, et al. KAN: Kolmogorov-Arnold networks[C]. The Thirteenth International Conference on Learning Representations, Singapore, Singapore, 2025.
    [10] YE Mang, SHEN Jianbing, LIN Gaojie, et al. Deep learning for person re-identification: A survey and outlook[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(6): 2872–2893. doi: 10.1109/TPAMI.2021.3054775.
    [11] SUN Yifan, ZHENG Liang, LI Yali, et al. Learning part-based convolutional features for person re-identification[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(3): 902–917. doi: 10.1109/TPAMI.2019.2938523.
    [12] HE Lingxiao, LIAO Xingyu, LIU Wu, et al. FastReID: A Pytorch toolbox for general instance re-identification[C]. Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, Canada, 2023: 9664–9667. doi: 10.1145/3581783.3613460.
    [13] LUO Hao, GU Youzhi, LIAO Xingyu, et al. Bag of tricks and a strong baseline for deep person re-identification[C]. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Long Beach, USA, 2019: 1487–1495. doi: 10.1109/CVPRW.2019.00190.
    [14] WANG Guanshou, YUAN Yufeng, CHEN Xiong, et al. Learning discriminative features with multiple granularities for person re-identification[C]. Proceedings of the 26th ACM International Conference on Multimedia, Seoul, Republic of Korea, 2018: 274–282. doi: 10.1145/3240508.3240552.
    [15] CHEN Guangyi, GU Tianpei, LU Jiwen, et al. Person re-identification via attention pyramid[J]. IEEE Transactions on Image Processing, 2021, 30: 7663–7676. doi: 10.1109/TIP.2021.3107211.
    [16] KUMA R, WEILL E, AGHDASI F, et al. Vehicle re-identification: An efficient baseline using triplet embedding[C]. 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019: 1–9. doi: 10.1109/IJCNN.2019.8852059.
    [17] DOSOVITSKIY A, BEYER L, KOLESNIKOV A, et al. An image is worth 16x16 words: Transformers for image recognition at scale[C]. 9th International Conference on Learning Representations, Vienna, Austria, 2021: 611–631. (查阅网上资料, 未找到本条文献出版城市、页码信息, 请确认).
    [18] 贝俊仁, 张权, 赖剑煌. 基于隐式解码对齐的空地行人重识别方法[J]. 自动化学报, 2025, 51(9): 1988–2000. doi: 10.16383/j.aas.c240705.

    BEI Junren, ZHANG Quan, and LAI Jianhuang. Implicit decoder alignment for aerial-ground person re-identification[J]. Acta Automatica Sinica, 2025, 51(9): 1988–2000. doi: 10.16383/j.aas.c240705.
    [19] WANG Yuhai and PISHGAR M. Dynamic token selective transformer for aerial-ground person re-identification[C]. 2025 IEEE International Conference on Multimedia and Expo, Nantes, France, 2025: 1–6. doi: 10.1109/ICME59968.2025.11210054.
    [20] KHALID W, LIU Bin, LI Xulin, et al. Bridging the sky and ground: Towards view-invariant feature learning for aerial-ground person re-identification[C]. Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision, Honolulu, USA, 2025: 9749–9758. doi: 10.1109/ICCV51701.2025.00909.
    [21] GAO Shang, WANG Jingya, LU Huchuan, et al. Pose-guided visible part matching for occluded person ReID[C]. Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, 2020: 11741–11749. doi: 10.1109/CVPR42600.2020.01176.
    [22] GU Xinqian, CHANG Hong, MA Bingpeng, et al. Clothes-changing person re-identification with RGB modality only[C]. Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, USA, 2022: 1050–1059. doi: 10.1109/CVPR52688.2022.00113.
    [23] ZHANG Yukang and WANG Hanzi. Diverse embedding expansion network and low-light cross-modality benchmark for visible-infrared person re-identification[C]. Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, Canada, 2023: 2153–2162. doi: 10.1109/CVPR52729.2023.00214.
  • 加载中
图(4) / 表(5)
计量
  • 文章访问数:  35
  • HTML全文浏览量:  6
  • PDF下载量:  2
  • 被引次数: 0
出版历程
  • 修回日期:  2026-06-18
  • 录用日期:  2026-06-18
  • 网络出版日期:  2026-06-29

目录

    /

    返回文章
    返回