Referring Expression Segmentation On Refcocog

评估指标

Overall IoU

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
MLCD-Seg-7B79.9Multi-label Cluster Discrimination for Visual Representation Learning
HyperSeg79.4HyperSeg: Towards Universal Visual Segmentation with Large Language Model
UniLSeg-10079.27Universal Segmentation at Arbitrary Granularity with Language Instruction
UniLSeg-2078.41Universal Segmentation at Arbitrary Granularity with Language Instruction
EVF-SAM76.8EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
DETRIS74.6Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
C3VG74.43Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
GROUNDHOG74.1GROUNDHOG: Grounding Large Language Models to Holistic Segmentation-
GLEE-Pro72.9General Object Foundation Model for Images and Videos at Scale
SafaRi-B70.48SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation-
PolyFormer-L69.2PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
MaskRIS (Swin-B, combined DB)69.12MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
PolyFormer-B67.76PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
MaskRIS (Swin-B)65.55MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
MagNet65.36Mask Grounding for Referring Image Segmentation
X-Decoder (Davit-d5)64.6Generalized Decoding for Pixel, Image, and Language
VLT (Swin-B)63.49VLT: Vision-Language Transformer and Query Generation for Referring Segmentation
LAVT61.24LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
VLT (Darknet53)52.99Vision-Language Transformer and Query Generation for Referring Segmentation
SHNet49.90Comprehensive Multi-Modal Interactions for Referring Image Segmentation
0 of 21 row(s) selected.
Referring Expression Segmentation On Refcocog | SOTA | HyperAI超神经