| TBD | 86.2 | 86.8 | 87.5 | 72.2 | 69.4 | 66.6 | 82.3 | 80.0 | 77.6 | 50.1 | Tackling Background Distraction in Video Object Segmentation | |
| KMN | 88.1 | 87.6 | 87.1 | - | - | - | 77.8 | 76.0 | 74.2 | 8.33 | Kernelized Memory Network for Video Object Segmentation | |
| RMNet | 82.3 | 81.5 | 80.6 | - | - | - | 77.2 | 75.0 | 72.8 | 11.9 | Efficient Regional Memory Network for Video Object Segmentation | |
| TVOS | - | - | - | 67.4 | 63.1 | 58.8 | 74.7 | 72.3 | 69.9 | 37.0 | A Transductive Approach for Video Object Segmentation | |
| BMVOS | 81.4 | 82.2 | 82.9 | 64.7 | 62.7 | 60.7 | 74.7 | 72.7 | 70.7 | 45.9 | Pixel-Level Bijective Matching for Video Object Segmentation | |
| STM | 88.1 | 86.5 | 84.8 | - | - | - | 74.0 | 71.6 | 69.2 | 6.25 | Video Object Segmentation using Space-Time Memory Networks | |
| FEELVOS | 83.1 | 81.7 | 80.3 | 57.5 | 54.4 | 51.2 | 72.3 | 69.1 | 65.9 | 2.22 | FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation | |