HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Boundary-Aware Cascade Networks for Temporal Action Segmentation

{Li-Min Wang Gangshan Wu Ziteng Gao Zhifeng Li Zhenzhi Wang}

Boundary-Aware Cascade Networks for Temporal Action Segmentation

Abstract

Identifying human action segments in an untrimmed video is still challenging due to boundary ambiguity and over-segmentation issues. To address these problems, we present a new boundary-aware cascade network by introducing two novel components. First, we devise a new cascading paradigm, called Stage Cascade, to enable our model to have adaptive receptive fields and more confident predictions for ambiguous frames. Second, we design a general and principled smoothing operation, termed as local barrier pooling, to aggregate local predictions by leveraging semantic boundary information. Moreover, these two components can be jointly fine-tuned in an end-to-end manner. We perform experiments on three challenging datasets: 50Salads, GTEA and Breakfast dataset, demonstrating that our framework significantly out-performs the current state-of-the-art methods. The code is available at https://github.com/MCG-NJU/BCN.

Benchmarks

BenchmarkMethodologyMetrics
action-segmentation-on-50-salads-1BCN
Acc: 84.4
Edit: 74.3
F1@10%: 82.3
F1@25%: 81.3
F1@50%: 74
action-segmentation-on-breakfast-1BCN
Acc: 70.4
Average F1: 63.1
Edit: 66.2
F1@10%: 68.7
F1@25%: 65.5
F1@50%: 55.0
action-segmentation-on-gtea-1BCN
Acc: 79.8
Edit: 84.4
F1@10%: 88.5
F1@25%: 87.1
F1@50%: 77.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Boundary-Aware Cascade Networks for Temporal Action Segmentation | Papers | HyperAI