Command Palette
Search for a command to run...
Yingjie Zhai Deng-Ping Fan Jufeng Yang Ali Borji Ling Shao Junwei Han Liang Wang

Abstract
Multi-level feature fusion is a fundamental topic in computer vision. It has been exploited to detect, segment and classify objects at various scales. When multi-level features meet multi-modal cues, the optimal feature aggregation and multi-modal learning strategy become a hot potato. In this paper, we leverage the inherent multi-modal and multi-level nature of RGB-D salient object detection to devise a novel cascaded refinement network. In particular, first, we propose to regroup the multi-level features into teacher and student features using a bifurcated backbone strategy (BBS). Second, we introduce a depth-enhanced module (DEM) to excavate informative depth cues from the channel and spatial views. Then, RGB and depth modalities are fused in a complementary way. Our architecture, named Bifurcated Backbone Strategy Network (BBS-Net), is simple, efficient, and backbone-independent. Extensive experiments show that BBS-Net significantly outperforms eighteen SOTA models on eight challenging datasets under five evaluation measures, demonstrating the superiority of our approach ($\sim 4 \%$ improvement in S-measure $vs.$ the top-ranked model: DMRA-iccv2019). In addition, we provide a comprehensive analysis on the generalization ability of different RGB-D datasets and provide a powerful training set for future research.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| rgb-d-salient-object-detection-on-des | BBS-Net | Average MAE: 0.021 S-Measure: 93.3 max E-Measure: 96.6 max F-Measure: 92.7 |
| rgb-d-salient-object-detection-on-lfsd | BBS-Net | Average MAE: 0.072 S-Measure: 86.4 max E-Measure: 90.1 max F-Measure: 85.8 |
| rgb-d-salient-object-detection-on-nlpr | BBS-Net | Average MAE: 0.023 S-Measure: 93.0 max E-Measure: 96.1 max F-Measure: 91.8 |
| rgb-d-salient-object-detection-on-sip | BBS-Net | Average MAE: 0.055 S-Measure: 87.9 max E-Measure: 92.2 max F-Measure: 88.3 |
| rgb-d-salient-object-detection-on-ssd | BBS-Net | Average MAE: 0.044 S-Measure: 88.2 max E-Measure: 91.9 max F-Measure: 85.9 |
| rgb-d-salient-object-detection-on-stere | BBS-Net | Average MAE: 0.041 S-Measure: 90.8 max E-Measure: 94.2 max F-Measure: 90.3 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.