Scale Adaptive Fusion Network for Multimodal Remote Sensing Data Classification

LIU Xiaomin, YU Mengjun, QIAO Zhenzhuang, WANG Haoyu, XING Changda

Citation: LIU Xiaomin, YU Mengjun, QIAO Zhenzhuang, WANG Haoyu, XING Changda. Scale Adaptive Fusion Network for Multimodal Remote Sensing Data Classification[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT240178


doi: 10.11999/JEIT240178
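Funds: The National Natural Science Foundation of China (62303468, 62303469), The Natural Science Foundation of Jiangsu Province (BK20221116, BK20221112), China Postdoctoral Science Foundation (2023M733757), The Excellent Post Doctorate Program of Jiangsu Province (2022ZB530)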
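Details
    About the authors:

    LIU Xiaomin: female, master's supervisor. Research interests: reinforcement learning, multimodal fusion, image processing

    YU Mengjun: male, master's student. Research interests: deep reinforcement learning, pattern recognition

    QIAO Zhenzhuang: male, master's student. Research interests: hyperspectral image classification, deep learning

    WANG Haoyu: male, assistant researcher. Research interests: hyperspectral image classification, multimodal fusion, machine learning

    XING Changda: male, associate professor. Research interest: intelligent analysis of hyperspectral images

    Corresponding author:

    WANG Haoyu wanghaoyucumt@163.com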

  • CLC number: TN911.73; TP18

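  • Abstract: Multimodal fusion methods exploit the complementary characteristics of different modalities to improve land-cover classification accuracy, and have become a research focus across many fields in recent years. Existing multimodal fusion methods have been applied successfully to the joint classification of hyperspectral images (HSI) and light detection and ranging (LiDAR) data. However, existing studies still face several challenges, including the difficulty of capturing spatial dependencies among ground objects and of extracting discriminative information from multimodal data. To address these challenges, this paper integrates multimodal, multiscale, and multiview feature fusion into a unified framework and proposes a Scale Adaptive Fusion Network (SAFN). First, a dynamic multiscale graph module is proposed to capture the complex spatial dependencies of ground objects, improving the model's adaptability to irregular objects and to objects of widely differing scale. Second, based on the complementary characteristics of LiDAR and HSI, ground objects within the same spatial neighborhood are constrained to have similar feature representations, which yields discriminative remote sensing features. Third, a multimodal spatial-spectral fusion module is proposed to establish information interaction among multimodal, multiscale, and multiview features and to capture the class-discriminative information they share, providing discriminative fused features for land-cover classification. Finally, the fused features are fed into a classifier to obtain class probability scores, from which the land-cover class is predicted. To validate the method, experiments are conducted on three datasets (Houston, Trento, MUUFL). The results show that, compared with existing mainstream algorithms, SAFN achieves the best visual results and the highest overall accuracy in multisource remote sensing data classification (92.17%, 99.22%, and 82.88% on Houston, Trento, and MUUFL, respectively).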
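The abstract does not say how the dynamic multiscale graph module is built. Purely as an illustration of the general idea of looking at each sample through neighborhoods of several sizes, the sketch below builds k-nearest-neighbor graphs for several values of k, averages each node's feature with those of its neighbors, and concatenates the results across scales. The function names `knn_graph`, `aggregate`, and `multiscale_features`, the Euclidean distance, the mean aggregation, and the scale values are all assumptions made for this example and should not be read as the SAFN implementation.

```python
import math


def knn_graph(features, k):
    """For each node, return the indices of its k nearest neighbours (Euclidean distance)."""
    neighbours = []
    for i, fi in enumerate(features):
        dists = [(math.dist(fi, fj), j) for j, fj in enumerate(features) if j != i]
        dists.sort()  # ties are broken by node index
        neighbours.append([j for _, j in dists[:k]])
    return neighbours


def aggregate(features, neighbours):
    """Average each node's own feature vector with those of its neighbours."""
    out = []
    for i, nbrs in enumerate(neighbours):
        group = [features[i]] + [features[j] for j in nbrs]
        dim = len(features[i])
        out.append([sum(vec[d] for vec in group) / len(group) for d in range(dim)])
    return out


def multiscale_features(features, scales=(1, 2)):
    """Concatenate the aggregated features obtained at every scale k (assumed scales)."""
    per_scale = [aggregate(features, knn_graph(features, k)) for k in scales]
    return [sum((scale[i] for scale in per_scale), []) for i in range(len(features))]


# Toy one-dimensional features for three samples (hypothetical values).
toy = [[0.0], [1.0], [10.0]]
fused = multiscale_features(toy)
print(fused[0])  # one aggregated value per scale for the first sample
```

With a small k the third sample is averaged only with its single closest neighbor, while with a larger k it is averaged with both others, which is the kind of scale-dependent context a multiscale graph is meant to expose.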
  • Figure 1  Flowchart of SAFN

    Figure 2  t-SNE plots of the Trento data obtained with different algorithms

    Figure 3  t-SNE plots of the ablation models

    Table 1  Classification accuracy on the Houston dataset (%)

    Class            CNN       HRWN      EndNet    CCR-Net   FGCN      CNN-DF-S  MEDFN     SAFN
    Healthy grass    98.62     89.60     99.35     97.48     95.42     92.20     94.96     81.48
    Stressed grass   94.41     97.08     99.51     86.14     92.11     99.35     83.47     97.25
    Synthetic grass  94.39     99.85     99.85     97.49     98.92     98.23     97.49     99.71
    Trees            98.20     92.08     90.28     99.10     96.39     98.20     96.49     97.30
    Soil             99.02     99.51     97.71     99.67     99.74     99.76     98.53     97.71
    Water            82.62     90.49     95.74     96.72     94.18     93.44     98.69     85.57
    Residential      69.39     79.41     83.89     89.58     78.90     77.89     86.22     94.15
    Commercial       67.40     79.49     51.63     69.85     80.74     75.49     83.99     89.30
    Road             74.76     53.90     73.54     96.40     76.37     64.69     76.22     86.04
    Highway          79.79     86.00     87.57     88.57     87.94     90.89     94.12     87.99
    Railway          75.89     64.28     84.77     80.82     83.38     77.04     89.71     85.93
    Parking lot 1    62.41     77.41     73.54     81.62     78.87     85.24     94.23     95.30
    Parking lot 2    83.30     85.52     85.75     86.86     83.05     93.32     95.10     99.56
    Tennis court     98.78     99.02     100       99.27     98.41     99.76     99.51     99.76
    Running track    97.50     99.06     99.38     96.41     98.03     100       98.28     95.00
    OA (%)           83.75     84.22     86.30     87.79     88.32     87.98     91.11     92.17
    AA (%)           85.09     86.18     88.17     89.27     89.50     89.70     92.47     92.80
    Kappa (%)        82.45     82.93     86.21     86.71     87.36     87.00     90.39     91.54
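
    The OA, AA, and Kappa rows are assumed here to follow the standard definitions of overall accuracy, average (per-class) accuracy, and Cohen's kappa coefficient. The page does not give the formulas, so the sketch below shows how these three scores are commonly derived from a confusion matrix. The helper name `classification_scores` and the toy 2-class matrix are hypothetical and are not taken from the paper.

```python
def classification_scores(confusion):
    """Return (OA, AA, Kappa) as fractions.

    confusion[i][j] is the number of samples of true class i predicted as class j.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(n_classes))

    # Overall accuracy: share of all samples that are classified correctly.
    oa = correct / total

    # Average accuracy: unweighted mean of the per-class accuracies.
    per_class = [confusion[i][i] / sum(confusion[i]) for i in range(n_classes)]
    aa = sum(per_class) / n_classes

    # Cohen's kappa: agreement corrected for the agreement expected by chance.
    col_totals = [sum(confusion[i][j] for i in range(n_classes)) for j in range(n_classes)]
    expected = sum(sum(confusion[i]) * col_totals[i] for i in range(n_classes)) / total ** 2
    kappa = (oa - expected) / (1 - expected)
    return oa, aa, kappa


# Toy confusion matrix (hypothetical numbers, not results from the paper).
oa, aa, kappa = classification_scores([[8, 2], [1, 9]])
print(f"OA={oa:.2%}  AA={aa:.2%}  Kappa={kappa:.2%}")
```

    Because kappa discounts chance agreement, it is normally lower than OA for the same predictions, which matches the pattern in the table.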

    Table 2  Classification accuracy on the Trento dataset (%)

    Class            CNN       HRWN      EndNet    CCR-Net   FGCN      CNN-DF-S  MEDFN     SAFN
    Apple trees      95.04     96.01     95.37     96.11     97.09     98.96     97.78     99.53
    Buildings        77.77     81.93     94.03     96.53     92.02     92.08     97.82     97.85
    Ground           97.82     100       99.78     96.95     97.82     99.47     99.13     96.30
    Woods            99.78     99.50     99.53     99.96     99.98     99.57     99.97     99.90
    Vineyard         98.57     98.51     99.04     99.74     99.95     98.57     99.96     99.46
    Roads            79.04     86.21     81.20     81.26     87.83     90.05     94.07     97.78
    OA (%)           94.41     95.62     96.35     97.03     97.51     97.44     98.84     99.22
    AA (%)           91.34     93.69     94.82     95.09     95.78     96.45     98.12     98.47
    Kappa (%)        92.56     94.16     95.15     96.04     96.68     96.57     98.45     98.96
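
    If AA is read as the unweighted mean of the per-class accuracies, the AA row can be checked directly against the class rows of this table. The sketch below recomputes it for the CNN and SAFN columns. The per-class values are copied from the table, and `average_accuracy` is a hypothetical helper name.

```python
def average_accuracy(per_class):
    """Unweighted mean of per-class accuracies, rounded to two decimals."""
    return round(sum(per_class) / len(per_class), 2)


# Per-class accuracies (%) for the six Trento classes, in table order.
trento_cnn = [95.04, 77.77, 97.82, 99.78, 98.57, 79.04]
trento_safn = [99.53, 97.85, 96.30, 99.90, 99.46, 97.78]

print(average_accuracy(trento_cnn))   # 91.34
print(average_accuracy(trento_safn))  # 98.47
```

    Both results agree with the AA values listed for these two methods.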

    Table 3  Classification accuracy on the MUUFL dataset (%)

    Class                 CNN       HRWN      EndNet    CCR-Net   FGCN      CNN-DF-S  MEDFN     SAFN
    Trees                 81.82     81.85     80.01     83.16     82.39     83.63     80.94     82.63
    Mostly grass          65.47     61.94     80.90     71.52     76.71     76.19     76.33     84.99
    Mixed ground surface  55.96     59.40     57.51     58.18     67.20     50.25     71.31     70.72
    Dirt and sand         78.78     79.90     79.62     86.54     87.22     72.81     90.77     86.93
    Road                  75.76     71.21     79.49     72.79     76.96     81.87     85.63     86.85
    Water                 98.32     98.08     92.55     98.56     97.84     96.86     91.83     98.21
    Building shadow       82.50     87.36     84.52     78.24     77.74     79.49     82.68     92.23
    Building              72.71     77.16     71.65     80.24     76.70     84.57     83.93     87.25
    Sidewalk              45.62     63.52     61.65     67.34     64.49     59.41     77.90     73.33
    Yellow curb           57.14     66.92     77.44     77.44     81.20     64.42     88.72     96.93
    Cloth panels          98.17     96.35     95.43     99.09     99.54     90.76     99.54     93.57
    OA (%)                74.52     75.36     76.25     77.07     78.35     77.57     80.77     82.88
    AA (%)                73.81     76.70     78.43     79.37     80.73     76.39     84.51     86.70
    Kappa (%)             67.63     68.63     69.90     70.87     72.39     71.49     75.57     78.19

    Table 4  Effect of different components on overall accuracy (%)

    Dataset          SAFN-A    SAFN-B    SAFN-C    SAFN
    Houston 2013     87.89     89.23     90.35     92.17
    Trento           96.73     97.54     98.48     99.22
    MUUFL            78.13     80.15     81.76     82.88
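
    One way to read this ablation table is as the gain in overall accuracy of the full SAFN over each reduced variant. The sketch below computes those differences from the values in the table. This page does not state which component each of SAFN-A, SAFN-B, and SAFN-C removes, so the variants are treated only as labels, and `gains_over_variants` is a hypothetical helper name.

```python
# Overall accuracy (%) copied from the ablation table.
ablation_oa = {
    "Houston 2013": {"SAFN-A": 87.89, "SAFN-B": 89.23, "SAFN-C": 90.35, "SAFN": 92.17},
    "Trento": {"SAFN-A": 96.73, "SAFN-B": 97.54, "SAFN-C": 98.48, "SAFN": 99.22},
    "MUUFL": {"SAFN-A": 78.13, "SAFN-B": 80.15, "SAFN-C": 81.76, "SAFN": 82.88},
}


def gains_over_variants(row):
    """Percentage-point gain of the full model over each ablated variant."""
    full = row["SAFN"]
    return {name: round(full - oa, 2) for name, oa in row.items() if name != "SAFN"}


for dataset, row in ablation_oa.items():
    print(dataset, gains_over_variants(row))
```

    On all three datasets the gap shrinks monotonically from SAFN-A to SAFN-C, and the full model is ahead of every variant.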
Publication history
  • Received: 2024-03-15
  • Revised: 2024-07-01
  • Published online: 2024-07-06
