Weakly Supervised Temporal Sentence Grounding With Gaussian-Based Contrastive Proposal Learning

Yang Liu, Yuxin Peng, Qingchao Chen, Yanjie Huang, Minghang Zheng

Abstract

Temporal sentence grounding aims to detect the moment in an untrimmed video that best corresponds to a natural language query. Because labeling temporal boundaries is labor-intensive and subjective, weakly supervised methods have recently received increasing attention. Most existing weakly supervised methods generate proposals with sliding windows, which are content-independent and of low quality. Moreover, they train the model to distinguish positive visual-language pairs from negative ones randomly collected from other videos, ignoring the highly confusing segments within the same video. In this paper, we propose Contrastive Proposal Learning (CPL) to overcome these limitations. Specifically, we use multiple learnable Gaussian functions to generate both positive and negative proposals within the same video, which can characterize the multiple events in a long video. We then propose a controllable easy-to-hard negative proposal mining strategy to collect negative samples within the same video, which eases model optimization and enables CPL to distinguish highly confusing scenes. Experiments show that our method achieves state-of-the-art performance on the Charades-STA and ActivityNet Captions datasets. The code and models are available at https://github.com/minghangz/cpl.
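To make the idea of a Gaussian-shaped proposal concrete, here is a minimal sketch of how a center and a width could be turned into per-frame weights over a video. It is not taken from the CPL repository: the function name `gaussian_mask`, the normalized time axis, and the `sigma_scale` parameter that maps width to standard deviation are all assumptions made for illustration.

```python
import math


def gaussian_mask(num_frames, center, width, sigma_scale=4.0):
    """Per-frame weights from a Gaussian over normalized time [0, 1].

    Hypothetical helper for illustration only. `center` and `width` are
    fractions of the video length; `sigma_scale` (an assumed parameter)
    sets the standard deviation as width / sigma_scale.
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    sigma = max(width / sigma_scale, 1e-6)  # guard against zero width
    weights = []
    for i in range(num_frames):
        # Place frames evenly on [0, 1]; a single frame sits at 0.5.
        t = i / (num_frames - 1) if num_frames > 1 else 0.5
        weights.append(math.exp(-((t - center) ** 2) / (2.0 * sigma ** 2)))
    return weights


# A proposal centered mid-video that covers roughly 40% of its length.
positive = gaussian_mask(num_frames=5, center=0.5, width=0.4)
# A second Gaussian elsewhere in the same video, as a stand-in for a
# negative proposal drawn from the same video rather than another one.
negative = gaussian_mask(num_frames=5, center=0.1, width=0.2)
```

Because the weights are a smooth function of `center` and `width`, a model that predicts these two values can in principle be trained end to end, which is the property that distinguishes such proposals from fixed sliding windows.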

Benchmarks

| Benchmark | Methodology | R1@0.5 | R1@0.7 | R5@0.5 | R5@0.7 |
| --- | --- | --- | --- | --- | --- |
| Temporal Sentence Grounding on Charades-STA | CPL (Weak, MViT-K400-Pretrain-feature, evaluated by AdaFocus) | 47.8 | 21.8 | 84.6 | 50.4 |
| Temporal Sentence Grounding on Charades-STA | CPL (Weak, I3D-K400-Pretrain-feature, evaluated by AdaFocus) | 39.6 | 18.6 | 81.4 | 49.2 |
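
The page does not define the metric names. In temporal grounding work, a label such as R1@0.5 is conventionally read as the percentage of queries for which at least one of the top 1 predicted segments has a temporal IoU of at least 0.5 with the ground truth. The sketch below follows that reading; the function names and the use of `>=` at the threshold are assumptions, and this is not the evaluation script behind the numbers above.

```python
def temporal_iou(pred, gt):
    """Intersection over union of two (start, end) segments in seconds."""
    pred_start, pred_end = pred
    gt_start, gt_end = gt
    inter = max(0.0, min(pred_end, gt_end) - max(pred_start, gt_start))
    union = (pred_end - pred_start) + (gt_end - gt_start) - inter
    return inter / union if union > 0 else 0.0


def recall_at_n(predictions, ground_truths, n, iou_threshold):
    """Percentage of queries with a hit among the top-n ranked segments.

    `predictions` holds one ranked list of (start, end) segments per query;
    `ground_truths` holds one (start, end) segment per query.
    """
    hits = 0
    for ranked, gt in zip(predictions, ground_truths):
        if any(temporal_iou(p, gt) >= iou_threshold for p in ranked[:n]):
            hits += 1
    return 100.0 * hits / len(ground_truths)


# Toy data: two queries, each with a ranked list of predicted segments.
preds = [[(0.0, 10.0), (4.0, 14.0)], [(0.0, 1.0)]]
gts = [(5.0, 15.0), (10.0, 20.0)]
r1 = recall_at_n(preds, gts, n=1, iou_threshold=0.5)
r5 = recall_at_n(preds, gts, n=5, iou_threshold=0.5)
```

Under this reading, R5 can never be lower than R1 at the same threshold, which matches the pattern in the table.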
