Command Palette
Search for a command to run...
Chen Liang-Chieh Papandreou George Schroff Florian Adam Hartwig

Abstract
In this work, we revisit atrous convolution, a powerful tool to explicitlyadjust filter's field-of-view as well as control the resolution of featureresponses computed by Deep Convolutional Neural Networks, in the application ofsemantic image segmentation. To handle the problem of segmenting objects atmultiple scales, we design modules which employ atrous convolution in cascadeor in parallel to capture multi-scale context by adopting multiple atrousrates. Furthermore, we propose to augment our previously proposed AtrousSpatial Pyramid Pooling module, which probes convolutional features at multiplescales, with image-level features encoding global context and further boostperformance. We also elaborate on implementation details and share ourexperience on training our system. The proposed `DeepLabv3' systemsignificantly improves over our previous DeepLab versions without DenseCRFpost-processing and attains comparable performance with other state-of-artmodels on the PASCAL VOC 2012 semantic image segmentation benchmark.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| dichotomous-image-segmentation-on-dis-te1 | DeeplabV3+ | E-measure: 0.772 HCE: 234 MAE: 0.102 S-Measure: 0.694 max F-Measure: 0.601 weighted F-measure: 0.506 |
| dichotomous-image-segmentation-on-dis-te2 | DeeplabV3+ | E-measure: 0.813 HCE: 516 MAE: 0.105 S-Measure: 0.729 max F-Measure: 0.681 weighted F-measure: 0.587 |
| dichotomous-image-segmentation-on-dis-te3 | DeeplabV3+ | E-measure: 0.833 HCE: 999 MAE: 0.102 S-Measure: 0.749 max F-Measure: 0.717 weighted F-measure: 0.623 |
| dichotomous-image-segmentation-on-dis-te4 | DeeplabV3+ | E-measure: 0.820 HCE: 3709 MAE: 0.111 S-Measure: 0.744 max F-Measure: 0.715 weighted F-measure: 0.621 |
| dichotomous-image-segmentation-on-dis-vd | DeeplabV3+ | E-measure: 0.796 HCE: 1520 MAE: 0.114 S-Measure: 0.716 max F-Measure: 0.660 weighted F-measure: 0.568 |
| semantic-segmentation-on-cityscapes | DeepLabv3 (ResNet-101, coarse) | Mean IoU (class): 81.3% |
| semantic-segmentation-on-cityscapes-val | DeepLabv3 (Dilated-ResNet-101) | mIoU: 78.5% |
| semantic-segmentation-on-pascal-voc-2012 | DeepLabv3-JFT | Mean IoU: 86.9% |
| semantic-segmentation-on-pascal-voc-2012-val | DeepLabv3-JFT | mIoU: 82.7% |
| semantic-segmentation-on-selma | DeepLabV3 | mIoU: 70.7 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.