HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation

{Hema S. Koppula Ashutosh Saxena}

Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation

Abstract

We consider the problem of detecting past activities as well as anticipating which activity will happen in the future and how. We start by modeling the rich spatio-temporal relations between human poses and objects (called affordances) using a conditional random field (CRF). However, because of the ambiguity in the temporal segmentation of the sub-activities that constitute an activity, in the past as well as in the future, multiple graph structures are possible. In this paper, we reason about these alternate possibilities by reasoning over multiple possible graph structures. We obtain them by approximating the graph with only additive features, which lends to efficient dynamic programming. Starting with this proposal graph structure, we then design moves to obtain several other likely graph structures. We then show that our approach improves the state-of-the-art significantly for detecting past activities as well as for anticipating future activities, on a dataset of 120 activity videos collected from four subjects.

Benchmarks

BenchmarkMethodologyMetrics
skeleton-based-action-recognition-on-cad-120All Features (w ground truth)
Accuracy: 89.3%
skeleton-based-action-recognition-on-cad-120Our DP seg. + moves + heuristic seg.
Accuracy: 70.3%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation | Papers | HyperAI