3 months ago

Masked Diffusion as Self-supervised Representation Learner

Zixuan Pan, Jianxu Chen, Yiyu Shi

Abstract

Denoising diffusion probabilistic models have recently demonstrated state-of-the-art generative performance and have been used as strong pixel-level representation learners. This paper decomposes the interrelation between the generative capability and representation learning ability inherent in diffusion models. We present the masked diffusion model (MDM), a scalable self-supervised representation learner for semantic segmentation, substituting the conventional additive Gaussian noise of traditional diffusion with a masking mechanism. Our proposed approach convincingly surpasses prior benchmarks, demonstrating remarkable advancements in both medical and natural image semantic segmentation tasks, particularly in few-shot scenarios.
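To make the contrast in the abstract concrete, the sketch below puts a standard Gaussian forward-noising step next to a patch-masking corruption. It is illustrative only and is not taken from the zx-pan/mdm repository: the function names, the patch size, the `mask_ratio` parameter, and the choice to zero out masked patches are all assumptions, and the abstract does not specify the masking schedule the authors use.

```python
import math
import random


def gaussian_corrupt(x0, alpha_bar, rng):
    """Standard DDPM forward step: x_t = sqrt(a) * x0 + sqrt(1 - a) * eps.

    x0 is a 2D list of floats (one grayscale image); alpha_bar is in [0, 1].
    """
    a, b = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    return [[a * v + b * rng.gauss(0.0, 1.0) for v in row] for row in x0]


def mask_corrupt(x0, mask_ratio, patch, rng):
    """Hypothetical masking corruption: zero out a random subset of patches.

    The image is split into non-overlapping patch x patch tiles and a
    fraction mask_ratio of them is set to 0. Height and width are assumed
    to be divisible by patch.
    """
    rows, cols = len(x0) // patch, len(x0[0]) // patch
    n_masked = round(mask_ratio * rows * cols)
    masked = set(rng.sample(range(rows * cols), n_masked))
    xt = [row[:] for row in x0]  # copy so the clean image is left untouched
    for idx in masked:
        r0, c0 = (idx // cols) * patch, (idx % cols) * patch
        for r in range(r0, r0 + patch):
            for c in range(c0, c0 + patch):
                xt[r][c] = 0.0
    return xt, masked


rng = random.Random(0)
image = [[1.0] * 8 for _ in range(8)]
noisy = gaussian_corrupt(image, alpha_bar=0.5, rng=rng)
masked_image, masked_ids = mask_corrupt(image, mask_ratio=0.5, patch=4, rng=rng)
```

In either case a denoising network would be trained to recover the clean image from the corrupted one. The difference is that masking removes whole regions while leaving the remaining pixels exact, whereas Gaussian noise perturbs every pixel.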

Code Repositories

zx-pan/mdm (official, PyTorch; mentioned in GitHub)

Benchmarks

Benchmark	Methodology	Metrics
medical-image-segmentation-on-glas	MDM	Dice: 91.95 F1: 91.95 IoU: 85.13
medical-image-segmentation-on-monuseg	MDM	F1: 81.01
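
For readers unfamiliar with the metrics in the table, the following is a minimal sketch of how Dice and IoU are commonly computed for one pair of binary masks. It is not the benchmark's evaluation code: the function name and the convention of returning 1.0 when both masks are empty are assumptions. For binary masks the pixel-wise Dice coefficient equals the F1 score, which is consistent with the identical Dice and F1 values reported for GlaS.

```python
def dice_and_iou(pred, target):
    """Return (dice, iou) for two binary masks given as 2D lists of 0/1."""
    p = [bool(v) for row in pred for v in row]
    t = [bool(v) for row in target for v in row]
    inter = sum(a and b for a, b in zip(p, t))
    p_sum, t_sum = sum(p), sum(t)
    union = p_sum + t_sum - inter
    if union == 0:
        # Both masks empty: treated as a perfect match (an assumed convention).
        return 1.0, 1.0
    dice = 2.0 * inter / (p_sum + t_sum)
    iou = inter / union
    return dice, iou


pred = [[1, 1], [0, 0]]
target = [[1, 0], [0, 0]]
dice, iou = dice_and_iou(pred, target)
```

For a single mask pair the two metrics are related by IoU = Dice / (2 - Dice). Scores averaged over a dataset, such as those in the table, need not satisfy this identity exactly.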
