Command Palette
Search for a command to run...
Cao Kaidi ; Brbic Maria ; Leskovec Jure

Abstract
A fundamental limitation of applying semi-supervised learning in real-worldsettings is the assumption that unlabeled test data contains only classespreviously encountered in the labeled training data. However, this assumptionrarely holds for data in-the-wild, where instances belonging to novel classesmay appear at testing time. Here, we introduce a novel open-worldsemi-supervised learning setting that formalizes the notion that novel classesmay appear in the unlabeled test data. In this novel setting, the goal is tosolve the class distribution mismatch between labeled and unlabeled data, whereat the test time every input instance either needs to be classified into one ofthe existing classes or a new unseen class needs to be initialized. To tacklethis challenging problem, we propose ORCA, an end-to-end deep learning approachthat introduces uncertainty adaptive margin mechanism to circumvent the biastowards seen classes caused by learning discriminative features for seenclasses faster than for the novel classes. In this way, ORCA reduces the gapbetween intra-class variance of seen with respect to novel classes. Experimentson image classification datasets and a single-cell annotation datasetdemonstrate that ORCA consistently outperforms alternative baselines, achieving25% improvement on seen and 96% improvement on novel classes of the ImageNetdataset.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| novel-object-detection-on-lvis-v1-0-val | ORCA Cao et al. (2022) | All mAP: 2.03 Known mAP: 20.57 Novel mAP: 0.49 |
| open-world-semi-supervised-learning-on-1 | ORCA (ResNet-50) | All accuracy (10% Labeled): 69.7 All accuracy (50% Labeled): 77.8 Novel accuracy (10% Labeled): 60.5 Novel accuracy (50% Labeled): 72.1 Seen accuracy (10% Labeled): 83.9 Seen accuracy (50% Labeled): 89.1 |
| open-world-semi-supervised-learning-on-cifar | ORCA (ResNet-18) | All accuracy (10% Labeled): 84.1 All accuracy (50% Labeled): 89.7 Novel accuracy (10% Labeled): 85.5 Novel accuracy (50% Labeled): 90.4 Seen accuracy (10% Labeled): 82.8 Seen accuracy (50% Labeled): 88.2 |
| open-world-semi-supervised-learning-on-cifar-1 | ORCA (ResNet-18) | All accuracy (10% Labeled): 38.6 All accuracy (50% Labeled): 48.1 Novel accuracy (10% Labeled): 31.8 Novel accuracy (50% Labeled): 43.0 Seen accuracy (10% Labeled): 52.5 Seen accuracy (50% Labeled): 66.9 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.