Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

Abstract

Semantic sparsity is a common challenge in structured visual classification problems; when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects play within the activity. For this problem, we find empirically that most object-role combinations are rare, and current state-of-the-art models significantly underperform in this sparse data regime. We avoid many such errors by (1) introducing a novel tensor composition function that learns to share examples across role-noun combinations and (2) semantically augmenting our training data with automatically gathered examples of rarely observed outputs using web data. When integrated within a complete CRF-based structured prediction model, the tensor-based approach outperforms existing state of the art by a relative improvement of 2.11% and 4.40% on top-5 verb and noun-role accuracy, respectively. Adding 5 million images with our semantic augmentation techniques gives further relative improvements of 6.23% and 9.57% on top-5 verb and noun-role accuracy.
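To make the idea of sharing examples across role-noun combinations more concrete, here is a minimal sketch of one generic way a tensor composition can do this. It is not the paper's exact formulation, which the abstract does not spell out. The assumption is that every noun and every (verb, role) pair has its own small embedding, and that a single shared core tensor combines them with image features into a score. All names, dimensions and vocabulary entries (`trilinear_score`, `DIM`, "dog", "carrying", and so on) are hypothetical, and the weights are random rather than learned.

```python
import random


def trilinear_score(core, noun_vec, role_vec, image_vec):
    """Return sum over i, j, k of core[i][j][k] * noun_vec[i] * role_vec[j] * image_vec[k]."""
    total = 0.0
    for i, n_i in enumerate(noun_vec):
        for j, r_j in enumerate(role_vec):
            for k, x_k in enumerate(image_vec):
                total += core[i][j][k] * n_i * r_j * x_k
    return total


def random_vector(rng, size):
    """Random stand-in for a learned embedding or an image feature vector."""
    return [rng.uniform(-1.0, 1.0) for _ in range(size)]


rng = random.Random(0)
DIM = 4  # hypothetical embedding size

# Hypothetical vocabulary: one embedding per noun and one per (verb, role) pair.
noun_embeddings = {noun: random_vector(rng, DIM) for noun in ("dog", "ball", "knife")}
role_embeddings = {
    pair: random_vector(rng, DIM)
    for pair in (("carrying", "agent"), ("carrying", "item"))
}

# One core tensor shared by every role-noun combination (assumption of this sketch).
core = [
    [[rng.uniform(-1.0, 1.0) for _ in range(DIM)] for _ in range(DIM)]
    for _ in range(DIM)
]

# Stand-in for features extracted from an image.
image_features = random_vector(rng, DIM)


def score(verb, role, noun):
    """Score one (verb, role, noun) combination, including ones never seen together."""
    return trilinear_score(
        core, noun_embeddings[noun], role_embeddings[(verb, role)], image_features
    )


# All 2 x 3 combinations get a score from 3 noun embeddings, 2 role embeddings
# and one shared core, instead of a separate weight vector per combination.
scores = {
    (verb, role, noun): score(verb, role, noun)
    for (verb, role) in role_embeddings
    for noun in noun_embeddings
}
```

In a factorization like this, a training example that updates the embedding for one noun also affects the score of that noun in every other role, which is the kind of sharing that can help rare combinations.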

Code Repositories

my89/imSitu (pytorch, mentioned in GitHub)
thilinicooray/my_imsitu (pytorch, mentioned in GitHub)

Benchmarks

| Benchmark | Methodology | Top-1 Verb | Top-1 Verb & Value | Top-5 Verbs | Top-5 Verbs & Value |
| --- | --- | --- | --- | --- | --- |
| grounded-situation-recognition-on-swig | CRF + Aug | 34.12 | 26.45 | 62.59 | 46.88 |
| situation-recognition-on-imsitu | CRF + Aug | 34.12 | 26.45 | 62.59 | 46.88 |
