Command Palette
Search for a command to run...
Nirkin Yuval ; Wolf Lior ; Hassner Tal

Abstract
We present a novel, real-time, semantic segmentation network in which theencoder both encodes and generates the parameters (weights) of the decoder.Furthermore, to allow maximal adaptivity, the weights at each decoder blockvary spatially. For this purpose, we design a new type of hypernetwork,composed of a nested U-Net for drawing higher level context features, amulti-headed weight generating module which generates the weights of each blockin the decoder immediately before they are consumed, for efficient memoryutilization, and a primary network that is composed of novel dynamic patch-wiseconvolutions. Despite the usage of less-conventional blocks, our architectureobtains real-time performance. In terms of the runtime vs. accuracy trade-off,we surpass state of the art (SotA) results on popular semantic segmentationbenchmarks: PASCAL VOC 2012 (val. set) and real-time semantic segmentation onCityscapes, and CamVid. The code is available: https://nirkin.com/hyperseg.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| dichotomous-image-segmentation-on-dis-te1 | HySM | E-measure: 0.803 HCE: 205 MAE: 0.082 S-Measure: 0.761 max F-Measure: 0.695 weighted F-measure: 0.597 |
| dichotomous-image-segmentation-on-dis-te2 | HySM | E-measure: 0.832 HCE: 451 MAE: 0.085 S-Measure: 0.794 max F-Measure: 0.759 weighted F-measure: 0.667 |
| dichotomous-image-segmentation-on-dis-te3 | HySM | E-measure: 0.857 HCE: 887 MAE: 0.079 S-Measure: 0.811 max F-Measure: 0.792 weighted F-measure: 0.701 |
| dichotomous-image-segmentation-on-dis-te4 | HySM | E-measure: 0.842 HCE: 3331 MAE: 0.091 S-Measure: 0.802 max F-Measure: 0.782 weighted F-measure: 0.693 |
| dichotomous-image-segmentation-on-dis-vd | HySM | E-measure: 0.814 HCE: 1324 MAE: 0.096 S-Measure: 0.773 max F-Measure: 0.734 weighted F-measure: 0.640 |
| real-time-semantic-segmentation-on-camvid | HyperSeg-S | Frame (fps): 38.0 Time (ms): 26.3 mIoU: 78.4 |
| real-time-semantic-segmentation-on-camvid | HyperSeg-L | Frame (fps): 16.6 Time (ms): 60.2 mIoU: 79.1 |
| real-time-semantic-segmentation-on-cityscapes | HyperSeg-M | Frame (fps): 36.9 Time (ms): 27.1 mIoU: 75.8% |
| semantic-segmentation-on-pascal-voc-2012-val | HyperSeg-L | mIoU: 80.61% |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.