Open Vocabulary Semantic Segmentation On 5

评估指标

mIoU

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
SILC97.6SILC: Improving Vision Language Pretraining with Self-Distillation-
SCAN97.2Open-Vocabulary Segmentation with Semantic-Assisted Calibration
CAT-Seg97.0CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
MaskCLIP++96.8High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation
MAFT+96.5Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation
EBSeg-L96.4Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
FC-CLIP95.4Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
OVSeg Swin-B94.5Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
MAFT-ViTL92.1Learning Mask-aware CLIP Representations for Zero-Shot Segmentation
HyperSeg92.1HyperSeg: Towards Universal Visual Segmentation with Large Language Model
MAFT-ViTL92.1--
POMP89.4Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
TagAlign(trained with image-text pairs)87.9TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
ODISE84.6Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
TCL83.2Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs
LaVG82.5In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
PACL72.3Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning
ZegFormer-Decoupling Zero-Shot Semantic Segmentation
ZSSeg-A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model
0 of 19 row(s) selected.
Open Vocabulary Semantic Segmentation On 5 | SOTA | HyperAI超神经