3 个月前

大迁移(Big Transfer,BiT):通用视觉表征学习

大迁移(Big Transfer,BiT):通用视觉表征学习

摘要

预训练表征的迁移能够显著提升深度神经网络在视觉任务中的样本效率,并简化超参数调优过程。我们重新审视了在大规模监督数据集上进行预训练,随后在目标任务上微调模型的经典范式。通过扩大预训练规模,并提出一种简洁的训练方法,我们称之为大迁移(Big Transfer,简称BiT),在超过20个数据集上取得了优异的性能表现。BiT在极为广泛的数据规模范围内均表现出色——从每类仅1个样本到总计100万样本的场景均能有效工作。在ImageNet(ILSVRC-2012)数据集上,BiT达到87.5%的Top-1准确率;在CIFAR-10上达到99.4%;在包含19个任务的视觉任务适应基准(Visual Task Adaptation Benchmark, VTAB)上达到76.3%。在小样本场景下,BiT在每类仅10个样本的情况下,于ILSVRC-2012上仍取得76.8%的准确率,在CIFAR-10上达到97.0%。我们对影响迁移性能的关键组件进行了深入分析,揭示了其成功背后的机制。

基准测试

基准方法指标
fine-grained-image-classification-on-oxfordBiT-M (ResNet)
Accuracy: 99.30%
Top-1 Error Rate: 0.70
fine-grained-image-classification-on-oxfordBiT-L (ResNet)
Accuracy: 99.63%
Top-1 Error Rate: 0.37
fine-grained-image-classification-on-oxford-2BiT-L (ResNet)
Accuracy: 96.62
Top-1 Error Rate: 3.38%
fine-grained-image-classification-on-oxford-2BiT-M (ResNet)
Accuracy: 94.47
Top-1 Error Rate: 5.53%
image-classification-on-cifar-10BiT-L (ResNet)
Percentage correct: 99.37
image-classification-on-cifar-10BiT-M (ResNet)
Percentage correct: 98.91
image-classification-on-cifar-100BiT-M (ResNet)
Percentage correct: 92.17
image-classification-on-cifar-100BiT-L (ResNet)
Percentage correct: 93.51
image-classification-on-flowers-102BiT-L (ResNet)
Accuracy: 99.63
image-classification-on-flowers-102BiT-M (ResNet)
Accuracy: 99.30
image-classification-on-imagenetBiT-M (ResNet)
Number of params: 928M
Top 1 Accuracy: 85.39%
image-classification-on-imagenetBiT-L (ResNet)
Top 1 Accuracy: 87.54%
Top 5 Accuracy: 98.46
image-classification-on-imagenet-realBiT-L
Accuracy: 90.54%
Params: 928M
image-classification-on-imagenet-realBiT-M
Accuracy: 89.02%
image-classification-on-objectnetBiT-L (ResNet-152x4)
Top-1 Accuracy: 58.7
Top-5 Accuracy: 80
image-classification-on-objectnetBiT-M (ResNet-152x4)
Top-1 Accuracy: 47.0
Top-5 Accuracy: 69
image-classification-on-objectnetBiT-S (ResNet-152x4)
Top-1 Accuracy: 36.0
Top-5 Accuracy: 57
image-classification-on-objectnet-boundingBiT-S (ResNet)
Top 5 Accuracy: 64.4
image-classification-on-objectnet-boundingBiT-M (ResNet)
Top 5 Accuracy: 76.0
image-classification-on-objectnet-boundingBiT-L (ResNet)
Top 5 Accuracy: 85.1
image-classification-on-omnibenchmarkBiT-M
Average Top-1 Accuracy: 40.4
image-classification-on-vtab-1k-1BiT-S
Top-1 Accuracy: 66.9
image-classification-on-vtab-1k-1BiT-L
Top-1 Accuracy: 76.3
image-classification-on-vtab-1k-1BiT-L (50 hypers/task)
Top-1 Accuracy: 78.72
image-classification-on-vtab-1k-1BiT-M
Top-1 Accuracy: 70.6

用 AI 构建 AI

从想法到上线——通过免费 AI 协同编程、开箱即用的环境和市场最优价格的 GPU 加速您的 AI 开发

AI 协同编程
即用型 GPU
最优价格
立即开始

Hyper Newsletters

订阅我们的最新资讯
我们会在北京时间 每周一的上午九点 向您的邮箱投递本周内的最新更新
邮件发送服务由 MailChimp 提供