HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Action Segmentation with Mixed Temporal Domain Adaptation

Min-Hung Chen Baopu Li Yingze Bao Ghassan AlRegib

Action Segmentation with Mixed Temporal Domain Adaptation

Abstract

The main progress for action segmentation comes from densely-annotated data for fully-supervised learning. Since manual annotation for frame-level actions is time-consuming and challenging, we propose to exploit auxiliary unlabeled videos, which are much easier to obtain, by shaping this problem as a domain adaptation (DA) problem. Although various DA techniques have been proposed in recent years, most of them have been developed only for the spatial direction. Therefore, we propose Mixed Temporal Domain Adaptation (MTDA) to jointly align frame- and video-level embedded feature spaces across domains, and further integrate with the domain attention mechanism to focus on aligning the frame-level features with higher domain discrepancy, leading to more effective domain adaptation. Finally, we evaluate our proposed methods on three challenging datasets (GTEA, 50Salads, and Breakfast), and validate that MTDA outperforms the current state-of-the-art methods on all three datasets by large margins (e.g. 6.4% gain on F1@50 and 6.8% gain on the edit score for GTEA).

Benchmarks

BenchmarkMethodologyMetrics
action-segmentation-on-50-salads-1DA
Acc: 83.2
Edit: 75.2
F1@10%: 82.0
F1@25%: 80.1
F1@50%: 72.5
action-segmentation-on-breakfast-1DA
Acc: 71.0
Average F1: 66.4
Edit: 73.6
F1@10%: 74.2
F1@25%: 68.6
F1@50%: 56.5
action-segmentation-on-gtea-1DA
Acc: 80.0
Edit: 85.8
F1@10%: 90.5
F1@25%: 88.4
F1@50%: 76.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Action Segmentation with Mixed Temporal Domain Adaptation | Papers | HyperAI