Command Palette
Search for a command to run...
Yu Qian ; Zhao Xiaoqi ; Pang Youwei ; Zhang Lihe ; Lu Huchuan

Abstract
Dichotomous Image Segmentation (DIS) has recently emerged towardshigh-precision object segmentation from high-resolution natural images. When designing an effective DIS model, the main challenge is how to balancethe semantic dispersion of high-resolution targets in the small receptive fieldand the loss of high-precision details in the large receptive field. Existingmethods rely on tedious multiple encoder-decoder streams and stages togradually complete the global localization and local refinement. Human visual system captures regions of interest by observing them frommultiple views. Inspired by it, we model DIS as a multi-view object perceptionproblem and provide a parsimonious multi-view aggregation network (MVANet),which unifies the feature fusion of the distant view and close-up view into asingle stream with one encoder-decoder structure. With the help of the proposedmulti-view complementary localization and refinement modules, our approachestablished long-range, profound visual interactions across multiple views,allowing the features of the detailed close-up view to focus on highly slenderstructures.Experiments on the popular DIS-5K dataset show that our MVANetsignificantly outperforms state-of-the-art methods in both accuracy and speed.The source code and datasets will be publicly available at\href{https://github.com/qianyu-dlut/MVANet}{MVANet}.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| dichotomous-image-segmentation-on-dis-te1 | MVANet | E-measure: 0.911 HCE: 104 MAE: 0.037 S-Measure: 0.879 max F-Measure: 0.873 weighted F-measure: 0.823 |
| dichotomous-image-segmentation-on-dis-te2 | MVANet | E-measure: 0.944 HCE: 251 MAE: 0.030 S-Measure: 0.915 max F-Measure: 0.916 weighted F-measure: 0.874 |
| dichotomous-image-segmentation-on-dis-te3 | MVANet | E-measure: 0.954 HCE: 525 MAE: 0.031 S-Measure: 0.920 max F-Measure: 0.929 weighted F-measure: 0.890 |
| dichotomous-image-segmentation-on-dis-te4 | MVANet | E-measure: 0.944 HCE: 2331 MAE: 0.041 S-Measure: 0.903 max F-Measure: 0.912 weighted F-measure: 0.857 |
| dichotomous-image-segmentation-on-dis-vd | MVANet | E-measure: 0.941 HCE: 893 MAE: 0.034 S-Measure: 0.905 max F-Measure: 0.904 weighted F-measure: 0.863 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.