Command Palette
Search for a command to run...
Rudra P K Poudel; Stephan Liwicki; Roberto Cipolla

Abstract
The encoder-decoder framework is state-of-the-art for offline semantic image segmentation. Since the rise in autonomous systems, real-time computation is increasingly desirable. In this paper, we introduce fast segmentation convolutional neural network (Fast-SCNN), an above real-time semantic segmentation model on high resolution image data (1024x2048px) suited to efficient computation on embedded devices with low memory. Building on existing two-branch methods for fast segmentation, we introduce our `learning to downsample' module which computes low-level features for multiple resolution branches simultaneously. Our network combines spatial detail at high resolution with deep features extracted at lower resolution, yielding an accuracy of 68.0% mean intersection over union at 123.5 frames per second on Cityscapes. We also show that large scale pre-training is unnecessary. We thoroughly validate our metric in experiments with ImageNet pre-training and the coarse labeled data of Cityscapes. Finally, we show even faster computation with competitive results on subsampled inputs, without any network modifications.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| semantic-segmentation-on-cityscapes | Fast-SCNN | Mean IoU (class): 68% |
| semantic-segmentation-on-cityscapes-val | Fast-SCNN + Coarse + ImageNet | mIoU: 69.19 |
| semantic-segmentation-on-dada-seg | Fast-SCNN | mIoU: 26.32 |
| semantic-segmentation-on-densepass | Fast-SCNN | mIoU: 24.6% |
| semantic-segmentation-on-eventscape | Fast-SCNN | mIoU: 44.27 |
| semantic-segmentation-on-synpass | Fast-SCNN | mIoU: 21.30% |
| thermal-image-segmentation-on-pst900 | Fast-SCNN | mIoU: 47.2 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.