HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild

Debidatta Dwibedi; Yusuf Aytar; Jonathan Tompson; Pierre Sermanet; Andrew Zisserman

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild

Abstract

We present an approach for estimating the period with which an action is repeated in a video. The crux of the approach lies in constraining the period prediction module to use temporal self-similarity as an intermediate representation bottleneck that allows generalization to unseen repetitions in videos in the wild. We train this model, called Repnet, with a synthetic dataset that is generated from a large unlabeled video collection by sampling short clips of varying lengths and repeating them with different periods and counts. This combination of synthetic data and a powerful yet constrained model, allows us to predict periods in a class-agnostic fashion. Our model substantially exceeds the state of the art performance on existing periodicity (PERTUBE) and repetition counting (QUVA) benchmarks. We also collect a new challenging dataset called Countix (~90 times larger than existing datasets) which captures the challenges of repetition counting in real-world videos. Project webpage: https://sites.google.com/view/repnet .

Code Repositories

confifu/RepNet-Pytorch
pytorch
Mentioned in GitHub
materight/RepNet-pytorch
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
repetitive-action-counting-on-countixRepNet
MAE: 0.3641
OBO: 0.3034
repetitive-action-counting-on-repcountRepNet
OBO: 0.013

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Counting Out Time: Class Agnostic Video Repetition Counting in the Wild | Papers | HyperAI