Command Palette
Search for a command to run...
BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation
Yu Changqian ; Wang Jingbo ; Peng Chao ; Gao Changxin ; Yu Gang ; Sang Nong

Abstract
Semantic segmentation requires both rich spatial information and sizeablereceptive field. However, modern approaches usually compromise spatialresolution to achieve real-time inference speed, which leads to poorperformance. In this paper, we address this dilemma with a novel BilateralSegmentation Network (BiSeNet). We first design a Spatial Path with a smallstride to preserve the spatial information and generate high-resolutionfeatures. Meanwhile, a Context Path with a fast downsampling strategy isemployed to obtain sufficient receptive field. On top of the two paths, weintroduce a new Feature Fusion Module to combine features efficiently. Theproposed architecture makes a right balance between the speed and segmentationperformance on Cityscapes, CamVid, and COCO-Stuff datasets. Specifically, for a2048x1024 input, we achieve 68.4% Mean IOU on the Cityscapes test dataset withspeed of 105 FPS on one NVIDIA Titan XP card, which is significantly fasterthan the existing methods with comparable performance.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| dichotomous-image-segmentation-on-dis-te1 | BSV1 | E-measure: 0.741 HCE: 288 MAE: 0.108 S-Measure: 0.695 max F-Measure: 0.595 weighted F-measure: 0.474 |
| dichotomous-image-segmentation-on-dis-te2 | BSV1 | E-measure: 0.781 HCE: 621 MAE: 0.111 S-Measure: 0.740 max F-Measure: 0.680 weighted F-measure: 0.564 |
| dichotomous-image-segmentation-on-dis-te3 | BSV1 | E-measure: 0.801 HCE: 1146 MAE: 0.109 S-Measure: 0.757 max F-Measure: 0.710 weighted F-measure: 0.595 |
| dichotomous-image-segmentation-on-dis-te4 | BSV1 | E-measure: 0.788 HCE: 3999 MAE: 0.114 S-Measure: 0.755 max F-Measure: 0.710 weighted F-measure: 0.598 |
| dichotomous-image-segmentation-on-dis-vd | BSV1 | E-measure: 0.767 HCE: 1660 MAE: 0.116 S-Measure: 0.728 max F-Measure: 0.662 weighted F-measure: 0.548 |
| real-time-semantic-segmentation-on-camvid | BiSeNet | mIoU: 68.7% |
| real-time-semantic-segmentation-on-cityscapes | BiSeNet(ResNet-18) | Frame (fps): 65.5 Time (ms): 15.2 mIoU: 74.7% |
| real-time-semantic-segmentation-on-cityscapes | BiSeNet(Xception39) | Frame (fps): 105.8 Time (ms): 9.5 mIoU: 68.4% |
| real-time-semantic-segmentation-on-cityscapes | BiSeNet | Frame (fps): 65.5 mIoU: 74.7% |
| semantic-segmentation-on-bdd100k-val | BiSeNet-V1(ResNet-18) | mIoU: 53.8(45.1fps) |
| semantic-segmentation-on-camvid | BiSeNet | Mean IoU: 68.7% |
| semantic-segmentation-on-cityscapes | BiSeNet (ResNet-101) | Mean IoU (class): 78.9% |
| semantic-segmentation-on-skyscapes-dense-1 | BiSeNet (ResNet-50) | Mean IoU: 30.82 |
| semantic-segmentation-on-trans10k | BiSeNet | GFLOPs: 19.91 mIoU: 58.40% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.