HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning

Jia-Run Du Jia-Chang Feng Kun-Yu Lin Fa-Ting Hong Xiao-Ming Wu Zhongang Qi Ying Shan Wei-Shi Zheng

Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning

Abstract

Weakly Supervised Temporal Action Localization (WSTAL) aims to localize and classify action instances in long untrimmed videos with only video-level category labels. Due to the lack of snippet-level supervision for indicating action boundaries, previous methods typically assign pseudo labels for unlabeled snippets. However, since some action instances of different categories are visually similar, it is non-trivial to exactly label the (usually) one action category for a snippet, and incorrect pseudo labels would impair the localization performance. To address this problem, we propose a novel method from a category exclusion perspective, named Progressive Complementary Learning (ProCL), which gradually enhances the snippet-level supervision. Our method is inspired by the fact that video-level labels precisely indicate the categories that all snippets surely do not belong to, which is ignored by previous works. Accordingly, we first exclude these surely non-existent categories by a complementary learning loss. And then, we introduce the background-aware pseudo complementary labeling in order to exclude more categories for snippets of less ambiguity. Furthermore, for the remaining ambiguous snippets, we attempt to reduce the ambiguity by distinguishing foreground actions from the background. Extensive experimental results show that our method achieves new state-of-the-art performance on two popular benchmarks, namely THUMOS14 and ActivityNet1.3.

Code Repositories

Run542968/ProCL
pytorch
Mentioned in GitHub
fjchange/him-net
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
weakly-supervised-action-localization-on-1ProCL
mAP@0.5:0.95: 26.1
weakly-supervised-action-localization-on-4ProCL
avg-mAP (0.1-0.5): 58.2
mAP@0.5: 40.5
weakly-supervised-action-localization-on-8ProCL
avg-mAP (0.1:0.7): 47.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning | Papers | HyperAI