HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection

Pang Youwei ; Zhao Xiaoqi ; Xiang Tian-Zhu ; Zhang Lihe ; Lu Huchuan

ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object
  Detection

Abstract

Recent camouflaged object detection (COD) attempts to segment objectsvisually blended into their surroundings, which is extremely complex anddifficult in real-world scenarios. Apart from the high intrinsic similaritybetween camouflaged objects and their background, objects are usually diversein scale, fuzzy in appearance, and even severely occluded. To this end, wepropose an effective unified collaborative pyramid network that mimics humanbehavior when observing vague images and videos, \ie zooming in and out.Specifically, our approach employs the zooming strategy to learn discriminativemixed-scale semantics by the multi-head scale integration and rich granularityperception units, which are designed to fully explore imperceptible cluesbetween candidate objects and background surroundings. The former's intrinsicmulti-head aggregation provides more diverse visual patterns. The latter'srouting mechanism can effectively propagate inter-frame differences inspatiotemporal scenarios and be adaptively deactivated and output all-zeroresults for static representations. They provide a solid foundation forrealizing a unified architecture for static and dynamic COD. Moreover,considering the uncertainty and ambiguity derived from indistinguishabletextures, we construct a simple yet effective regularization, uncertaintyawareness loss, to encourage predictions with higher confidence in candidateregions. Our highly task-friendly framework consistently outperforms existingstate-of-the-art methods in image and video COD benchmarks. Our code can befound at {https://github.com/lartpang/ZoomNeXt}.

Code Repositories

lartpang/zoomnext
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
camouflaged-object-segmentation-onZoomNeXt-PVTv2-B5
MAE: 0.020
S-measure: 0.757
mDice: 0.599
mIoU: 0.510
weighted F-measure: 0.593
camouflaged-object-segmentation-on-camoZoomNeXt-PVTv2-B5
MAE: 0.041
S-Measure: 0.889
Weighted F-Measure: 0.857
camouflaged-object-segmentation-on-camoZoomNeXt-PVTv2-B4
MAE: 0.04
S-Measure: 0.888
Weighted F-Measure: 0.859
camouflaged-object-segmentation-on-camoZoomNeXt-ResNet-50
MAE: 0.065
S-Measure: 0.833
Weighted F-Measure: 0.774
camouflaged-object-segmentation-on-chameleonZoomNeXt-PVTv2-B4
MAE: 0.016
S-measure: 0.925
weighted F-measure: 0.897
camouflaged-object-segmentation-on-chameleonZoomNeXt-ResNet-50
MAE: 0.021
S-measure: 0.908
weighted F-measure: 0.858
camouflaged-object-segmentation-on-chameleonZoomNeXt-PVTv2-B5
MAE: 0.018
S-measure: 0.924
weighted F-measure: 0.885
camouflaged-object-segmentation-on-codZoomNeXt-ResNet-50
MAE: 0.026
S-Measure: 0.861
Weighted F-Measure: 0.768
camouflaged-object-segmentation-on-codZoomNeXt-PVTv2-B5
MAE: 0.018
S-Measure: 0.898
Weighted F-Measure: 0.827
camouflaged-object-segmentation-on-codZoomNeXt-PVTv2-B4
MAE: 0.017
S-Measure: 0.898
Weighted F-Measure: 0.838
camouflaged-object-segmentation-on-moca-maskZoomNeXt-PVTv2-B5
MAE: 0.010
S-measure: 0.734
mDice: 0.497
mIoU: 0.422
weighted F-measure: 0.476
camouflaged-object-segmentation-on-nc4kZoomNeXt-PVTv2-B5
MAE: 0.028
S-measure: 0.903
weighted F-measure: 0.863
camouflaged-object-segmentation-on-nc4kZoomNeXt-ResNet-50
MAE: 0.037
S-measure: 0.874
weighted F-measure: 0.816
camouflaged-object-segmentation-on-nc4kZoomNeXt-PVTv2-B4
MAE: 0.028
S-measure: 0.900
weighted F-measure: 0.865

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection | Papers | HyperAI