HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Patch-Depth Fusion: Dichotomous Image Segmentation via Fine-Grained Patch Strategy and Depth Integrity-Prior

Liu Xianjie Fu Keren Zhao Qijun

Patch-Depth Fusion: Dichotomous Image Segmentation via Fine-Grained
  Patch Strategy and Depth Integrity-Prior

Abstract

Dichotomous Image Segmentation (DIS) is a high-precision object segmentationtask for high-resolution natural images. The current mainstream methods focuson the optimization of local details but overlook the fundamental challenge ofmodeling the integrity of objects. We have found that the depth integrity-priorimplicit in the the pseudo-depth maps generated by Depth Anything Model v2 andthe local detail features of image patches can jointly address the abovedilemmas. Based on the above findings, we have designed a novel Patch-DepthFusion Network (PDFNet) for high-precision dichotomous image segmentation. Thecore of PDFNet consists of three aspects. Firstly, the object perception isenhanced through multi-modal input fusion. By utilizing the patch fine-grainedstrategy, coupled with patch selection and enhancement, the sensitivity todetails is improved. Secondly, by leveraging the depth integrity-priordistributed in the depth maps, we propose an integrity-prior loss to enhancethe uniformity of the segmentation results in the depth maps. Finally, weutilize the features of the shared encoder and, through a simple depthrefinement decoder, improve the ability of the shared encoder to capture subtledepth-related information in the images. Experiments on the DIS-5K dataset showthat PDFNet significantly outperforms state-of-the-art non-diffusion methods.Due to the incorporation of the depth integrity-prior, PDFNet achieves or evensurpassing the performance of the latest diffusion-based methods while usingless than 11% of the parameters of diffusion-based methods. The source code athttps://github.com/Tennine2077/PDFNet

Code Repositories

tennine2077/pdfnet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
dichotomous-image-segmentation-on-dis-te1PDFNet
E-measure: 0.927
MAE: 0.031
S-Measure: 0.899
max F-Measure: 0.890
weighted F-measure: 0.846
dichotomous-image-segmentation-on-dis-te2PDFNet
E-measure: 0.947
MAE: 0.028
S-Measure: 0.924
max F-Measure: 0.921
weighted F-measure: 0.885
dichotomous-image-segmentation-on-dis-te3PDFNet
E-measure: 0.957
MAE: 0.027
S-Measure: 0.928
max F-Measure: 0.936
weighted F-measure: 0.900
dichotomous-image-segmentation-on-dis-te4PDFNet
E-measure: 0.941
MAE: 0.037
S-Measure: 0.910
max F-Measure: 0.911
weighted F-measure: 0.867
dichotomous-image-segmentation-on-dis-vdPDFNet
E-measure: 0.944
MAE: 0.030
S-Measure: 0.916
max F-Measure: 0.913
weighted F-measure: 0.873
rgb-salient-object-detection-on-hrsodPDFNet (HRSOD,UHRSD)
MAE: 0.012
S-Measure: 0.963
max F-Measure: 0.965
rgb-salient-object-detection-on-uhrsdPDFNet (HRSOD, UHRSD)
MAE: 0.019
S-Measure: 0.953
max F-Measure: 0.963

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Patch-Depth Fusion: Dichotomous Image Segmentation via Fine-Grained Patch Strategy and Depth Integrity-Prior | Papers | HyperAI