HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers

Jiacong Xu Zixiang Xiong Shankar P. Bhattacharyya

PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers

Abstract

Two-branch network architecture has shown its efficiency and effectiveness in real-time semantic segmentation tasks. However, direct fusion of high-resolution details and low-frequency context has the drawback of detailed features being easily overwhelmed by surrounding contextual information. This overshoot phenomenon limits the improvement of the segmentation accuracy of existing two-branch models. In this paper, we make a connection between Convolutional Neural Networks (CNN) and Proportional-Integral-Derivative (PID) controllers and reveal that a two-branch network is equivalent to a Proportional-Integral (PI) controller, which inherently suffers from similar overshoot issues. To alleviate this problem, we propose a novel three-branch network architecture: PIDNet, which contains three branches to parse detailed, context and boundary information, respectively, and employs boundary attention to guide the fusion of detailed and context branches. Our family of PIDNets achieve the best trade-off between inference speed and accuracy and their accuracy surpasses all the existing models with similar inference speed on the Cityscapes and CamVid datasets. Specifically, PIDNet-S achieves 78.6% mIOU with inference speed of 93.2 FPS on Cityscapes and 80.1% mIOU with speed of 153.7 FPS on CamVid.

Code Repositories

hamidriasat/PIDNet
tf
Mentioned in GitHub
enot-autodl/lpcv-2023
pytorch
Mentioned in GitHub
XuJiacong/PIDNet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
real-time-semantic-segmentation-on-camvidPIDNet-S (Cityscapes-Pretrained)
Frame (fps): 153.7(3090)
Time (ms): 6.5
mIoU: 80.1
real-time-semantic-segmentation-on-camvidPIDNet-M (Cityscapes-Pretrained)
Frame (fps): 85.6(3090)
Time (ms): 11.7
mIoU: 82.0
real-time-semantic-segmentation-on-cityscapesPIDNet-L
Frame (fps): 31.1(3090)
Time (ms): 32.2
mIoU: 80.6%
real-time-semantic-segmentation-on-cityscapesPIDNet-M
Frame (fps): 42.2(3090)
Time (ms): 23.7
mIoU: 79.8%
real-time-semantic-segmentation-on-cityscapesPIDNet-S
Frame (fps): 93.2(3090)
Time (ms): 10.7
mIoU: 78.6%
real-time-semantic-segmentation-on-cityscapes-1PIDNet-M
Frame (fps): 42.2(3090)
Time (ms): 23.7
mIoU: 79.9%
real-time-semantic-segmentation-on-cityscapes-1PIDNet-S
Frame (fps): 93.2(3090)
Time (ms): 10.7
mIoU: 78.8%
real-time-semantic-segmentation-on-cityscapes-1PIDNet-L
Frame (fps): 31.1(3090)
Time (ms): 32.2
mIoU: 80.9%
semantic-segmentation-on-camvidPIDNet-Wider
Mean IoU: 82.0%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers | Papers | HyperAI