3 months ago

Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos

Ehsan Elhamifar, YuHan Shen

Abstract

We address the problem of online action segmentation for egocentric procedural task videos. While previous studies have mostly focused on offline action segmentation, where entire videos are available for both training and inference, the transition to online action segmentation is crucial for practical applications such as AR/VR task assistants. Notably, applying an offline-trained model directly to online inference results in a significant performance drop due to the inconsistency between training and inference. We propose an online action segmentation framework by first modifying existing architectures to make them causal. Second, we develop a novel action progress prediction module that dynamically estimates the progress of ongoing actions and uses these estimates to refine the predictions of causal action segmentation. Third, we propose to learn task graphs from training videos and leverage them to obtain smooth and procedure-consistent segmentations. By combining progress prediction and task graphs with causal action segmentation, our framework effectively addresses prediction uncertainty and oversegmentation in online action segmentation and achieves significant improvement on three egocentric datasets.
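
The three ingredients named in the abstract (causal processing, per-frame action progress, and a task graph learned from training videos) can be illustrated with a toy sketch. This is not the authors' implementation: the function names `causal_conv1d`, `progress_targets`, and `learn_transitions` are hypothetical, the linear progress ramp is an assumed definition of "progress", and the task graph is reduced here to the set of action transitions observed in training label sequences.

```python
def causal_conv1d(signal, kernel):
    """Causal temporal filter: output[t] depends only on signal[t-k+1 .. t].

    Assumption: a plain 1D filter stands in for a causal network layer.
    """
    k = len(kernel)
    # Pad on the left only, so no future frame can influence the output.
    padded = [0.0] * (k - 1) + list(signal)
    return [
        sum(kernel[j] * padded[t + j] for j in range(k))
        for t in range(len(signal))
    ]


def progress_targets(labels):
    """Per-frame progress of the ongoing action, as a value in (0, 1].

    Assumption: progress rises linearly within each contiguous segment
    and reaches 1.0 on the segment's last frame.
    """
    progress = []
    start = 0
    for t in range(1, len(labels) + 1):
        # Close the current segment at the end or when the label changes.
        if t == len(labels) or labels[t] != labels[start]:
            length = t - start
            progress.extend((i + 1) / length for i in range(length))
            start = t
    return progress


def learn_transitions(videos):
    """Collect action transitions seen in training label sequences.

    Assumption: the task graph is simplified to a set of
    (previous_action, next_action) pairs.
    """
    allowed = set()
    for labels in videos:
        for prev, cur in zip(labels, labels[1:]):
            if prev != cur:
                allowed.add((prev, cur))
    return allowed
```

For example, `progress_targets(list("aabbbb"))` yields `[0.5, 1.0, 0.25, 0.5, 0.75, 1.0]`. In an online setting, one plausible use of such signals is to discourage switching actions while the estimated progress of the current action is still low, or when the switch is not in the learned transition set; how the paper actually combines them is not specified in the abstract.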

Benchmarks

Benchmark: action-segmentation-on-assembly101
Methodology: ProTAS (Offline)
Metrics:
Edit: 29.2
F1@10%: 28.7
F1@25%: 24.4
F1@50%: 17.5
MoF: 34.8
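
To make two of the listed metrics concrete, here is a minimal sketch of how MoF (mean over frames, i.e. frame-wise accuracy) and the segmental Edit score are commonly computed in the action segmentation literature. The helper names `mof`, `segments`, and `edit_score` are hypothetical, and benchmark-specific conventions (for instance, how background frames are handled) are assumptions not covered here.

```python
def mof(pred, gt):
    """Mean over frames: percentage of frames whose label is correct."""
    correct = sum(p == g for p, g in zip(pred, gt))
    return 100.0 * correct / len(gt)


def segments(labels):
    """Collapse a frame-wise label sequence into its ordered segment labels."""
    out = []
    for lab in labels:
        if not out or out[-1] != lab:
            out.append(lab)
    return out


def edit_score(pred, gt):
    """Segmental edit score: 100 * (1 - normalized Levenshtein distance)."""
    p, g = segments(pred), segments(gt)
    # Standard Levenshtein dynamic program over segment labels.
    prev = list(range(len(g) + 1))
    for i in range(1, len(p) + 1):
        cur = [i] + [0] * len(g)
        for j in range(1, len(g) + 1):
            cost = 0 if p[i - 1] == g[j - 1] else 1
            cur[j] = min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + cost)
        prev = cur
    return 100.0 * (1 - prev[-1] / max(len(p), len(g)))
```

With ground truth `a a a b b b` and prediction `a a b a b b`, four of six frames match, so MoF is about 66.7, while the prediction has four segments against two in the ground truth, giving an Edit score of 50.0. The gap shows why oversegmentation is penalized by Edit (and by the F1@k scores) even when frame accuracy looks reasonable.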
