吉林大学学报(信息科学版) ›› 2022, Vol. 40 ›› Issue (5): 846-855.

• • 上一篇    下一篇

基于平行注意力机制的对抗样本防御方法

赵 杰, 郭 东   

  1. 吉林大学 计算机科学与技术学院, 长春 130012
  • 收稿日期:2021-12-22 出版日期:2022-10-10 发布日期:2022-10-10
  • 作者简介:赵杰(1997— ), 男, 山东日照人, 吉林大学硕士研究生, 主要从事机器学习安全防御研究, ( Tel) 86-15764328017 (E-mail)jiezhao19@ mails. jlu. edu. cn; 郭东(1975— ), 男, 吉林德惠人, 吉林大学副教授, 主要从事云计算与网络安全 研究, (Tel)86-13341581599(E-mail)guodong@ jlu. edu. cn。

Adversarial Examples Defense Method Base on Parallel Attention Mechanism

ZHAO Jie, GUO Dong   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2021-12-22 Online:2022-10-10 Published:2022-10-10

摘要: 为降低对抗样本的影响, 提高分类模型在遭受攻击威胁下的精度, 利用哺乳动物视觉系统工作原理, 结合注意力机制, 提出一种新型防御对抗样本模型 PSCAM-GAN(Parallel Spatial and Channel Attention Mechanism Adversarial Generative Network)。 该防御模型在通过编码器获得对抗样本的特征图后, 使用平行注意力机制提 取特征图中的个体和位置信息, 然后在保证这些特征不变的情况下, 重新调整特征图的权重, 通过解码器产生 净化结果。 该方法能在清除恶意扰动的同时保持净化结果与输入的一致性, 有效降低对抗样本对模型精度的 影响。 在 CIFAR-10 MNIST 数据集上, PSCAM-GAN 面对多种对抗样本攻击时的防御效果超越了其他基于预 处理的防御方法, 能有效提高模型的健壮性。

关键词: 深度学习, 对抗样本, 对抗生成网络, 图像分类

Abstract: We have the effect of adversarial examples is reduced and the accuracy of the classification model is improved under the threat. Inspired by the mammalian visual modality, we proposed a purification defense method using a novel parallel attention mechanism to mitigate the effect of adversarial examples, called PSCAM- GAN(Parallel Spatial and Channel Attention Mechanism Adversarial Generative Network). The defense model first generates the feature map through the encoder, the parallel attention module is used to extract the object and space information. Under the condition that these features remain unchanged, the weight of the feature map is readjusted generating purification results by decoder. This method can keep the consistency between the purification result and the input while removing malicious perturbation, and effectively reduce the influence of adversarial samples on the model accuracy. The robustness of the model is evaluated through various types of attacks on CIFAR-10 and MNIST dataset. The experiments show that PSCAM-GAN completely surpassed other pre-processing based defense methods. These mean the defense method can effectively improve the robustness of the original models.

Key words: deep learning, adversarial examples, generative adversarial networks, image classification

中图分类号: 

  • TP391