HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Temporal Convolutional Networks: A Unified Approach to Action Segmentation

Colin Lea; Rene Vidal; Austin Reiter; Gregory D. Hager

Temporal Convolutional Networks: A Unified Approach to Action Segmentation

Abstract

The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally, and second, input these features into a classifier that captures high-level temporal relationships, such as a Recurrent Neural Network (RNN). While often effective, this decoupling requires specifying two separate models, each with their own complexities, and prevents capturing more nuanced long-range spatiotemporal relationships. We propose a unified approach, as demonstrated by our Temporal Convolutional Network (TCN), that hierarchically captures relationships at low-, intermediate-, and high-level time-scales. Our model achieves superior or competitive performance using video or sensor data on three public action segmentation datasets and can be trained in a fraction of the time it takes to train an RNN.

Code Repositories

Around-30/Kaggle
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
action-segmentation-on-jigsawsTCN
Accuracy: 81.4
Edit Distance: 83.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Temporal Convolutional Networks: A Unified Approach to Action Segmentation | Papers | HyperAI