
摘要
对抗样本通常被视为对卷积神经网络(ConvNets)的一种威胁。然而,本文提出了一个相反的视角:若以恰当方式利用,对抗样本反而可以用于提升图像识别模型的性能。为此,我们提出了一种名为 AdvProp 的增强型对抗训练方法,将对抗样本视为额外的训练样本,以缓解过拟合问题。该方法的核心在于为对抗样本引入一个独立的辅助批量归一化(auxiliary batch normalization)层,因为对抗样本与正常样本具有不同的底层数据分布。实验表明,AdvProp 在多种图像识别任务中均显著提升了多个模型的性能,且在模型规模更大时效果更为突出。例如,在 ImageNet 数据集上对最新的 EfficientNet-B7 模型应用 AdvProp 后,我们取得了显著提升:ImageNet 准确率提高 0.7%,ImageNet-C 提升 6.5%,ImageNet-A 提升 7.0%,Stylized-ImageNet 提升 4.8%。进一步结合增强版 EfficientNet-B8 模型,我们的方法在不依赖额外数据的情况下,实现了 85.5% 的 ImageNet Top-1 准确率,达到当前最优水平。这一结果甚至超越了文献 [20] 中表现最佳的模型——后者使用了约 35 亿张 Instagram 图像(约为 ImageNet 数据量的 3000 倍)和约 9.4 倍的参数量进行训练。相关模型代码已开源,可访问 https://github.com/tensorflow/tpu/tree/master/models/official/efficientnet。
代码仓库
rwightman/pytorch-image-models
pytorch
GitHub 中提及
tensorflow/tpu
官方
tf
open-mmlab/mmclassification
pytorch
osmr/imgclsmob
mxnet
yaoshiang/MobileNetV2-CIFAR-Cleverhans
GitHub 中提及
基准测试
| 基准 | 方法 | 指标 |
|---|---|---|
| domain-generalization-on-vizwiz | EfficientNet-B4 (advprop+autoaug) | Accuracy - All Images: 48.1 Accuracy - Clean Images: 51.4 Accuracy - Corrupted Images: 42.5 |
| domain-generalization-on-vizwiz | EfficientNet-B5 (advprop+autoaug) | Accuracy - All Images: 49.1 Accuracy - Clean Images: 51.7 Accuracy - Corrupted Images: 44 |
| domain-generalization-on-vizwiz | EfficientNet-B6 (advprop+autoaug) | Accuracy - All Images: 49.6 Accuracy - Clean Images: 53.2 Accuracy - Corrupted Images: 44.7 |
| domain-generalization-on-vizwiz | EfficientNet-B1 (advprop+autoaug) | Accuracy - All Images: 42.4 Accuracy - Clean Images: 46.7 Accuracy - Corrupted Images: 36.2 |
| domain-generalization-on-vizwiz | EfficientNet-B8 (advprop+autoaug) | Accuracy - All Images: 50.5 Accuracy - Clean Images: 53.2 Accuracy - Corrupted Images: 45.8 |
| domain-generalization-on-vizwiz | EfficientNet-B2 (advprop+autoaug) | Accuracy - All Images: 44.3 Accuracy - Clean Images: 48 Accuracy - Corrupted Images: 38.2 |
| domain-generalization-on-vizwiz | EfficientNet-B7 (advprop+autoaug) | Accuracy - All Images: 49.7 Accuracy - Clean Images: 52 Accuracy - Corrupted Images: 45 |
| domain-generalization-on-vizwiz | EfficientNet-B0 (advprop+autoaug) | Accuracy - All Images: 40.5 Accuracy - Clean Images: 44.9 Accuracy - Corrupted Images: 34.2 |
| domain-generalization-on-vizwiz | EfficientNet-B3 (advprop+autoaug) | Accuracy - All Images: 45.5 Accuracy - Clean Images: 49.5 Accuracy - Corrupted Images: 39.8 |
| image-classification-on-imagenet | AdvProp (EfficientNet-B7) | Number of params: 66M Top 1 Accuracy: 85.2% |
| image-classification-on-imagenet | AdvProp (EfficientNet-B8) | Number of params: 88M Top 1 Accuracy: 85.5% |