高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

面向人脸伪造防御的属性感知对抗样本生成方法

高帆 严伟丹 邵文泽 张登银

高帆, 严伟丹, 邵文泽, 张登银. 面向人脸伪造防御的属性感知对抗样本生成方法[J]. 电子与信息学报. doi: 10.11999/JEIT260043
引用本文: 高帆, 严伟丹, 邵文泽, 张登银. 面向人脸伪造防御的属性感知对抗样本生成方法[J]. 电子与信息学报. doi: 10.11999/JEIT260043
GAO Fan, YAN Weidan, SHAO Wenze, ZHANG Dengyin. Defending Deepfakes by Attribute-Aware Attack[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT260043
Citation: GAO Fan, YAN Weidan, SHAO Wenze, ZHANG Dengyin. Defending Deepfakes by Attribute-Aware Attack[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT260043

面向人脸伪造防御的属性感知对抗样本生成方法

doi: 10.11999/JEIT260043 cstr: 32379.14.JEIT260043
基金项目: 国家自然科学基金(62471241, 92470126)
详细信息
    作者简介:

    高帆:男,硕士生,研究方向为深度伪造防御

    严伟丹:男,博士生,研究方向为图像处理、深度伪造防御等

    邵文泽:男,博士,教授,研究方向为变分方法、计算统计、表示学习及其成像与视觉应用

    张登银:男,博士,研究员,研究方向为现代通信网络、信号与信息处理等

    通讯作者:

    张登银 zhangdy@njupt.edu.cn

  • 中图分类号: TN911.73; TP391

Defending Deepfakes by Attribute-Aware Attack

Funds: The National Natural Science Foundation of China(62471241, 92470126)
  • 摘要: 人脸深度伪造的非法滥用,会导致严重的人身财产损害。基于梯度攻击的传统防御方法,虽然在伪造模型参数已知的白盒场景下,通过多轮迭代能取得一定的防御效果,但是面对实际的黑盒攻击时,往往逊色于目前主流的基于生成对抗网络(GAN)跨模型集成训练的方法。即使GAN能够进行快速推理,但其扰动生成缺乏感知约束,隐蔽性较差,难以满足实际需求。此外,日新月异的人脸伪造模型则给对抗扰动的跨模型防御的可迁移性提出了更高要求。为此,该文提出一种属性感知的对抗样本生成方法,旨在改善扰动隐蔽性的同时,达到兼顾跨模型防御性能的目的。一方面,该文在仅考虑人脸图像前景的基础上,使用属性显著掩码划分出的面部与发型区域,通过自适应扰动生成器生成特异性的对抗扰动,有效平衡了对抗样本的隐蔽性与攻击性。另一方面,该文从数据增强角度出发,通过融合参考人脸图像的频域相位信息,生成更具多样性的输入特征,防范扰动过拟合的同时提升可迁移性能。实验结果表明,所提方法在跨模型防御的定量和定性测试中均取得较好的防御性能。
  • 图  1  人脸先验知识可视化对比

    图  2  本文方法的总体架构

    图  3  (a) 自适应对抗扰动生成器网络结构 图3(b) 自适应空频注意力模块

    图  4  不同方法的主动防御结果可视化对比示例

    图  5  本文方法的跨模型防御结果可视化示例

    图  6  对抗样本生成方法在不同测试集上迁移攻击的防御结果可视化对比

    图  7  显著分割掩码和属性分割掩码引导的主动防御结果可视化对比

    图  8  本文方法的跨模型防御消融实验结果可视化

    表  1  在CelebA-HQ测试集上的主动防御性能的定量结果对比(粗体表示最优,下划线表示次优)

    对抗
    样本
    隐蔽性能主动防御性能推理
    时间
    /s
    攻击成功率(%)扰动输出
    PSNR↑LPIPS↓FGANAttGANHiSDStarGANAGANPSNR↓LPIPS↑



    LAE[13][14]17.100.255178.081.085.0100.091.016.720.27003.020
    D-D[7]35.400.052599.80.00.0100.098.224.840.21631.040
    D-D(SA)[7][18]37.860.018898.40.00.0100.097.030.740.20721.220
    A-F[8]40.860.0036100.00.00.060.0100.026.150.19099.130



    集成
    SA[18][14]32.110.081498.027.085.093.099.018.370.22725.470
    CMUA[9][14]32.450.167887.097.023.091.099.020.530.3306-
    FOUND[11][14]33.230.144173.091.044.0100.0100.017.380.3688-
    AdvGAN[10]30.350.069296.20.080.0100.098.017.110.30220.116
    PD-DWT[14]32.910.035395.074.073.0100.099.017.750.28710.124
    本文32.620.043597.483.491.4100.0100.016.790.28490.167
    下载: 导出CSV

    表  2  对抗样本生成方法在不同来源测试集上迁移攻击的防御性能定量结果对比

    数据集对抗样本隐蔽性能主动防御性能
    攻击成功率(%)扰动输出
    PSNR↑SSIM↑FGANAttGANHiSDStarGANAGANPSNR↓SSIM↓
    FFHQAdvGAN[10]30.3180.7835100.00.077.2100.0100.018.6760.6495
    PD-DWT[14]32.5700.863799.087.678.299.6100.017.1420.6529
    本文32.3520.8578100.084.892.4100.0100.016.5270.6241
    LFWAdvGAN[10]30.4050.769899.60.079.2100.0100.017.5360.6081
    PD-DWT[14]33.5910.866699.689.678.099.6100.018.5270.7079
    本文33.4410.8923100.084.087.699.2100.017.5180.6655
    FF++AdvGAN[10]30.3050.7457100.00.093.899.7100.016.4390.5998
    PD-DWT[14]32.7360.8663100.0100.080.9100.0100.017.7100.6831
    本文32.6840.8655100.087.885.3100.0100.016.6980.6521
    下载: 导出CSV

    表  3  本文方法的跨模型集成防御消融实验结果

    对抗
    样本
    隐蔽性能主动防御性能
    攻击成功率(%)扰动输出
    PSNR↑LPIPS↓StarGANFGANAttGANHiSDAGANPSNR↓LPIPS↑
    无①②③25.550.2299100.0100.0100.080.0100.016.490.4741
    无①②26.830.1303100.099.895.687.499.815.620.4316
    无①③26.080.1500100.099.4100.086.2100.014.950.4471
    无②③32.310.068999.0100.043.063.299.817.920.3299
    无①26.100.1592100.097.6100.089.4100.014.540.4805
    无②32.560.045999.696.270.095.4100.017.030.3204
    无③32.160.047199.297.686.284.6100.017.700.3102
    完整方法32.620.0435100.0097.483.491.4100.016.790.2849
    下载: 导出CSV
  • [1] PENG Chunlei, LUO Xiaoyi, LIU Decheng, et al. Semantic token transformer for face forgery detection[J]. IEEE Transactions on Information Forensics and Security, 2025, 20: 4904–4914. doi: 10.1109/TIFS.2025.3567110.
    [2] 刘鹏宇, 郑添阳, 董敏. 一种伪造注意图驱动的多任务深伪视频检测模型[J]. 电子与信息学报, 2026, 48(1): 346–358. doi: 10.11999/JEIT250926.

    LIU Pengyu, ZHENG Tianyang, and DONG Min. A fake attention map-driven multi-task deepfake video detection model[J]. Journal of Electronics & Information Technology, 2026, 48(1): 346–358. doi: 10.11999/JEIT250926.
    [3] GOODFELLOW I J, SHLENS J, and SZEGEDY C. Explaining and harnessing adversarial examples[C]. 3rd International Conference on Learning Representations, San Diego, USA, 2015. doi: 10.48550/arXiv.1412.6572.
    [4] LIU Decheng, SU Qixuan, PENG Chunlei, et al. Imperceptible face forgery attack via adversarial semantic mask[EB/OL]. https://doi.org/10.48550/arXiv.2406.10887, 2024.
    [5] DEB D, ZHANG Jianbang, and JAIN A K. AdvFaces: Adversarial face synthesis[C]. Proceedings of 2020 IEEE International Joint Conference on Biometrics (IJCB), Houston, USA, 2020: 1–10. doi: 10.1109/IJCB48548.2020.9304898.
    [6] 瞿左珉, 殷琪林, 盛紫琦, 等. 人脸深度伪造主动防御技术综述[J]. 中国图象图形学报, 2024, 29(2): 318–342. doi: 10.11834/jig.230128.

    QU Zuomin, YIN Qilin, SHENG Ziqi, et al. Overview of deepfake proactive defense techniques[J]. Journal of Image and Graphics, 2024, 29(2): 318–342. doi: 10.11834/jig.230128.
    [7] RUIZ N, BARGAL S A, and SCLAROFF S. Disrupting deepfakes: Adversarial attacks against conditional image translation networks and facial manipulation systems[C]. 16th European Conference on Computer Vision, Glasgow, UK, 2020: 236–251. doi: 10.1007/978-3-030-66823-5_14.
    [8] WANG Run, HUANG Ziheng, CHEN Zhikai, et al. Anti-forgery: Towards a stealthy and robust DeepFake disruption attack via adversarial perceptual-aware perturbations[C]. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna, Austria, 2022: 761–767. doi: 10.24963/ijcai.2022/107.
    [9] HUANG Hao, WANG Yongtao, CHEN Zhaoyu, et al. CMUA-watermark: A cross-model universal adversarial watermark for combating deepfakes[C]. Proceedings of the 36th AAAI Conference on Artificial Intelligence, Palo Alto, USA, 2022: 989–997. doi: 10.1609/aaai.v36i1.19982. (查阅网上资料,未找到本条文献出版地信息,请确认).
    [10] XIAO Chaowei, LI Bo, ZHU Junyan, et al. Generating adversarial examples with adversarial networks[C]. Proceedings of the 27th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, 2018: 3905–3911. doi: 10.24963/ijcai.2018/543.
    [11] TANG Long, YE Dengpan, LU Zhenhao, et al. Feature extraction matters more: An effective and efficient universal deepfake disruptor[J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2025, 21(2): 46. doi: 10.1145/3653457.
    [12] 王金伟, 曾可慧, 张家伟, 等. 基于空频联合卷积神经网络的GAN生成人脸检测[J]. 计算机科学, 2023, 50(6): 216–224. doi: 10.11896/jsjkx.220400268.

    WANG Jinwei, ZENG Kehui, ZHANG Jiawei, et al. GAN-generated face detection based on space-frequency convolutional neural network[J]. Computer Science, 2023, 50(6): 216–224. doi: 10.11896/jsjkx.220400268.
    [13] HE Ziwen, WANG Wei, GUAN Weinan, et al. Defeating deepfakes via adversarial visual reconstruction[C]. Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, Portugal, 2022: 2464–2472. doi: 10.1145/3503161.3547923.
    [14] 洪钰婷, 陈北京. 抵抗第二次人脸属性编辑的不可感知主动防御算法[J]. 计算机辅助设计与图形学学报, 2025: 1–10. doi: 10.3724/SP.J.1089.2024-00316.

    HONG Yuting and CHEN Beijing. Imperceptible proactive defense against second facial attribute editing[J]. Journal of Computer-Aided Design & Computer Graphics, 2025: 1–10. doi: 10.3724/SP.J.1089.2024-00316.
    [15] FENG Yixiang and HUANG Fangjun. Compression-resistant adversarial perturbation for real-world proactive defense against deepfakes[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2026, 36(4): 5161–5172. doi: 10.1109/TCSVT.2025.3626505.
    [16] QU Zuomin, XI Zuping, LU Wei, et al. DF-RAP: A robust adversarial perturbation for defending against deepfakes in real-world social network scenarios[J]. IEEE Transactions on Information Forensics and Security, 2024, 19: 3943–3957. doi: 10.1109/TIFS.2024.3372803.
    [17] BYUN J, GO H, and KIM C. Geometrically adaptive dictionary attack on face recognition[C]. Proceedings of the 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, USA, 2022: 3809–3818. doi: 10.1109/WACV51458.2022.00386.
    [18] LI Qilei, GAO Mingliang, ZHANG Guisheng, et al. Defending deepfakes by saliency-aware attack[J]. IEEE Transactions on Computational Social Systems, 2024, 11(4): 5060–5067. doi: 10.1109/TCSS.2023.3271121.
    [19] YANG Yong, LI Changjiang, JIANG Yi, et al. Invisible-face: Rethinking facial attribute privacy in social media photo sharing[J]. IEEE Transactions on Information Forensics and Security, 2025, 20: 6101–6116. doi: 10.1109/TIFS.2025.3579592.
    [20] 吴涛, 纪琼辉, 先兴平, 等. 信息熵驱动的图神经网络黑盒迁移对抗攻击方法[J]. 电子与信息学报, 2025, 47(10): 3814–3825. doi: 10.11999/JEIT250303.

    WU Tao, JI Qionghui, XIAN Xingping, et al. Information entropy-driven black-box transferable adversarial attack method for graph neural networks[J]. Journal of Electronics & Information Technology, 2025, 47(10): 3814–3825. doi: 10.11999/JEIT250303.
    [21] TOKOZUME Y, USHIKU Y, and HARADA T. Between-class learning for image classification[C]. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018: 5486–5494. doi: 10.1109/CVPR.2018.00575.
    [22] HENDRYCKS D, ZOU A, MAZEIKA M, et al. PixMix: Dreamlike pictures comprehensively improve safety measures[C]. Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, USA, 2022: 16783–16792. doi: 10.1109/CVPR52688.2022.01628.
    [23] 钱亚冠, 孔亚鑫, 陈科成, 等. 利用频谱衰减增强深度神经网络对抗迁移攻击[J]. 电子与信息学报, 2025, 47(10): 3847–3857. doi: 10.11999/JEIT250157.

    QIAN Yaguan, KONG Yaxin, CHEN Kecheng, et al. Adversarial transferability attack on deep neural networks through spectral coefficient decay[J]. Journal of Electronics & Information Technology, 2025, 47(10): 3847–3857. doi: 10.11999/JEIT250157.
    [24] 凌海, 凌捷. 基于特征融合的对抗样本定向目标攻击可迁移性增强[J]. 计算机工程, 2025, 51(11): 162–170. doi: 10.19678/j.issn.1000-3428.0069983.

    LING Hai and LING Jie. Transferability enhancement of adversarial sample directed targeted attack based on feature fusion[J]. Computer Engineering, 2025, 51(11): 162–170. doi: 10.19678/j.issn.1000–3428.0069983. doi: 10.19678/j.issn.1000-3428.0069983.
    [25] YU Hu, ZHENG Naishan, ZHOU Man, et al. Frequency and spatial dual guidance for image dehazing[C]. 17th European Conference on Computer Vision, Tel Aviv, Israel, 2022: 181–198. doi: 10.1007/978-3-031-19800-7_11.
    [26] 沈瑜, 白珊, 魏子易, 等. 基于跨模态感知和空频交叉的医学图像融合[J]. 中国激光, 2025, 52(9): 0907106. doi: 10.3788/CJL241333.

    SHEN Yu, BAI Shan, WEI Ziyi, et al. Medical image fusion network for cross-modality perception and spatial-frequency interaction[J]. Chinese Journal of Lasers, 2025, 52(9): 0907106. doi: 10.3788/CJL241333.
    [27] TORBUNOV D, HUANG Yi, YU Haiwan, et al. UVCGAN: UNet vision transformer cycle-consistent GAN for unpaired image-to-image translation[C]. Proceedings of the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, USA, 2023: 702–712. doi: 10.1109/WACV56688.2023.00077.
    [28] XIONG Zihao, ZHOU Fei, WU Fengyi, et al. DRPCA-Net: Make robust PCA great again for infrared small target detection[J]. IEEE Transactions on Geoscience and Remote Sensing, 2025, 63: 5005516. doi: 10.1109/TGRS.2025.3588392.
    [29] ZHU Hao, WU W, ZHU Wentao, et al. CelebV-HQ: A large-scale video facial attributes dataset[C]. Proceedings of the 17th European Conference on Computer Vision, Tel Aviv, Israel, 2022: 650–667. doi: 10.1007/978-3-031-20071-7_38.
    [30] YU Changqian, GAO Changxin, WANG Jingbo, et al. BiSeNet V2: Bilateral network with guided aggregation for real-time semantic segmentation[J]. International Journal of Computer Vision, 2021, 129(11): 3051–3068. doi: 10.1007/s11263-021-01515-2.
    [31] SIDDIQUEE M M R, ZHOU Zongwei, TAJBAKHSH N, et al. Learning fixed points in generative adversarial networks: From image-to-image translation to disease detection and localization[C]. Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, Seoul, South Korea, 2019: 191–200. doi: 10.1109/ICCV.2019.00028.
    [32] HE Zhenliang, ZUO Wangmeng, KAN Meina, et al. AttGAN: Facial attribute editing by only changing what you want[J]. IEEE Transactions on Image Processing, 2019, 28(11): 5464–5478. doi: 10.1109/TIP.2019.2916751.
    [33] LI Xinyang, ZHANG Shengchuan, HU Jie, et al. Image-to-image translation via hierarchical style disentanglement[C]. Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA, 2021: 8635–8644. doi: 10.1109/CVPR46437.2021.00853.
    [34] CHOI Y, CHOI M, KIM M, et al. StarGAN: Unified generative adversarial networks for multi-domain image-to-image translation[C]. Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018: 8789–8797. doi: 10.1109/CVPR.2018.00916.
    [35] TANG Hao, LIU Hong, XU Dan, et al. AttentionGAN: Unpaired image-to-image translation using attention-guided generative adversarial networks[J]. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(4): 1972–1987. doi: 10.1109/TNNLS.2021.3105725.
    [36] KARRAS T, LAINE S, and AILA T. A style-based generator architecture for generative adversarial networks[C]. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 4396–4405. doi: 10.1109/CVPR.2019.00453.
    [37] RÖSSLER A, COZZOLINO D, VERDOLIVA L, et al. FaceForensics++: Learning to detect manipulated facial images[C]. Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, Seoul, Korea (South), 2019: 1–11. doi: 10.1109/ICCV.2019.00009.
  • 加载中
图(8) / 表(3)
计量
  • 文章访问数:  18
  • HTML全文浏览量:  9
  • PDF下载量:  3
  • 被引次数: 0
出版历程
  • 收稿日期:  2026-01-12
  • 修回日期:  2026-05-29
  • 录用日期:  2026-05-29
  • 网络出版日期:  2026-06-08

目录

    /

    返回文章
    返回