Command Palette
Search for a command to run...
Boundary-Aware Cascade Networks for Temporal Action Segmentation
{Li-Min Wang Gangshan Wu Ziteng Gao Zhifeng Li Zhenzhi Wang}

Abstract
Identifying human action segments in an untrimmed video is still challenging due to boundary ambiguity and over-segmentation issues. To address these problems, we present a new boundary-aware cascade network by introducing two novel components. First, we devise a new cascading paradigm, called Stage Cascade, to enable our model to have adaptive receptive fields and more confident predictions for ambiguous frames. Second, we design a general and principled smoothing operation, termed as local barrier pooling, to aggregate local predictions by leveraging semantic boundary information. Moreover, these two components can be jointly fine-tuned in an end-to-end manner. We perform experiments on three challenging datasets: 50Salads, GTEA and Breakfast dataset, demonstrating that our framework significantly out-performs the current state-of-the-art methods. The code is available at https://github.com/MCG-NJU/BCN.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| action-segmentation-on-50-salads-1 | BCN | Acc: 84.4 Edit: 74.3 F1@10%: 82.3 F1@25%: 81.3 F1@50%: 74 |
| action-segmentation-on-breakfast-1 | BCN | Acc: 70.4 Average F1: 63.1 Edit: 66.2 F1@10%: 68.7 F1@25%: 65.5 F1@50%: 55.0 |
| action-segmentation-on-gtea-1 | BCN | Acc: 79.8 Edit: 84.4 F1@10%: 88.5 F1@25%: 87.1 F1@50%: 77.3 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.