5 months ago

What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention

Furnari Antonino ; Farinella Giovanni Maria

Abstract

Egocentric action anticipation consists in understanding which objects the camera wearer will interact with in the near future and which actions they will perform. We tackle the problem by proposing an architecture able to anticipate actions at multiple temporal scales using two LSTMs to 1) summarize the past, and 2) formulate predictions about the future. The input video is processed considering three complementary modalities: appearance (RGB), motion (optical flow) and objects (object-based features). Modality-specific predictions are fused using a novel Modality ATTention (MATT) mechanism which learns to weigh modalities in an adaptive fashion. Extensive evaluations on two large-scale benchmark datasets show that our method outperforms prior art by up to +7% on the challenging EPIC-Kitchens dataset, which includes more than 2500 actions, and generalizes to EGTEA Gaze+. Our approach is also shown to generalize to the tasks of early action recognition and action recognition. Our method is ranked first in the public leaderboard of the EPIC-Kitchens egocentric action anticipation challenge 2019. Please see our web pages for code and examples: http://iplab.dmi.unict.it/rulstm - https://github.com/fpv-iplab/rulstm.
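The fusion step described in the abstract can be illustrated with a minimal sketch. It assumes that each modality branch has already produced a vector of class scores and that one unnormalized attention score per modality is available. In the paper those attention scores are learned; here they are passed in by hand, and the names `softmax` and `fuse_modalities` as well as all numeric values are hypothetical and not taken from the official code.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_modalities(scores_by_modality, attention_logits):
    """Fuse per-modality class scores with attention weights.

    scores_by_modality: dict mapping modality name -> list of class scores
    attention_logits:   dict mapping modality name -> unnormalized score
                        (assumed given here; learned in the actual method)
    Returns (weights, fused_scores).
    """
    names = list(scores_by_modality)
    # Normalize the attention scores so the modality weights sum to 1.
    weights = softmax([attention_logits[name] for name in names])
    num_classes = len(scores_by_modality[names[0]])
    fused = [0.0] * num_classes
    # Weighted sum of the modality-specific predictions.
    for name, weight in zip(names, weights):
        for i, score in enumerate(scores_by_modality[name]):
            fused[i] += weight * score
    return dict(zip(names, weights)), fused


# Illustrative values only: three modalities, three classes.
scores = {
    "rgb": [2.0, 0.0, 1.0],
    "flow": [0.0, 2.0, 1.0],
    "obj": [1.0, 1.0, 4.0],
}
equal_logits = {"rgb": 0.0, "flow": 0.0, "obj": 0.0}
weights, fused = fuse_modalities(scores, equal_logits)
print(weights, fused)
```

With equal attention scores the fusion reduces to a plain average of the three branches. Raising the score of one modality shifts the fused prediction toward that branch, which is the adaptive behaviour the abstract refers to.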

Code Repositories

antoninofurnari/rulstm (PyTorch, mentioned in GitHub)
fpv-iplab/rulstm (official, PyTorch, mentioned in GitHub)

Benchmarks

Benchmark: action-anticipation-on-epic-kitchens-55-1
Methodology: RULSTM [24, 23]
Metrics:
Top 1 Accuracy - Act.: 8.16
Top 1 Accuracy - Noun: 15.19
Top 1 Accuracy - Verb: 27.01
Top 5 Accuracy - Act.: 21.10
Top 5 Accuracy - Noun: 34.38
Top 5 Accuracy - Verb: 69.55

Benchmark: action-anticipation-on-epic-kitchens-55-seen
Methodology: RULSTM [24, 23]
Metrics:
Top 1 Accuracy - Act.: 14.39
Top 1 Accuracy - Noun: 22.78
Top 1 Accuracy - Verb: 33.04
Top 5 Accuracy - Act.: 33.73
Top 5 Accuracy - Noun: 50.95
Top 5 Accuracy - Verb: 79.55

Benchmark: egocentric-activity-recognition-on-epic-1
Methodology: RULSTM
Metrics:
Actions Top-1 (S1): 33.06
Actions Top-1 (S2): 19.49
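
The benchmark figures above are Top-1 and Top-5 accuracies. As a reminder of what that metric measures, the sketch below computes Top-k accuracy in the usual way: a sample counts as correct when its true label is among the k highest-scoring classes. The function name `top_k_accuracy` and the toy scores are illustrative assumptions and do not come from the benchmark's evaluation code.

```python
def top_k_accuracy(scores, labels, k):
    """Percentage of samples whose true label is among the k top-scoring classes.

    scores: list of per-sample score lists (one score per class)
    labels: list of true class indices, one per sample
    """
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score, truncated to the best k.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(labels)


# Toy example: three samples, three classes.
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
]
toy_labels = [1, 1, 0]
print(top_k_accuracy(toy_scores, toy_labels, k=1))
print(top_k_accuracy(toy_scores, toy_labels, k=2))
```

Top-k accuracy can only stay the same or grow as k increases, which is consistent with every Top-5 figure in the listing being higher than the corresponding Top-1 figure.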
