HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

FAR: Fourier Aerial Video Recognition

Divya Kothandaraman Tianrui Guan Xijun Wang Sean Hu Ming Lin Dinesh Manocha

FAR: Fourier Aerial Video Recognition

Abstract

We present an algorithm, Fourier Activity Recognition (FAR), for UAV video activity recognition. Our formulation uses a novel Fourier object disentanglement method to innately separate out the human agent (which is typically small) from the background. Our disentanglement technique operates in the frequency domain to characterize the extent of temporal change of spatial pixels, and exploits convolution-multiplication properties of Fourier transform to map this representation to the corresponding object-background entangled features obtained from the network. To encapsulate contextual information and long-range space-time dependencies, we present a novel Fourier Attention algorithm, which emulates the benefits of self-attention by modeling the weighted outer product in the frequency domain. Our Fourier attention formulation uses much fewer computations than self-attention. We have evaluated our approach on multiple UAV datasets including UAV Human RGB, UAV Human Night, Drone Action, and NEC Drone. We demonstrate a relative improvement of 8.02% - 38.69% in top-1 accuracy and up to 3 times faster over prior works.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
action-recognition-on-drone-actionFAR
Top 1 Accuracy: 92.7
action-recognition-on-nec-droneFAR
Top 1 Accuracy: 71.46
action-recognition-on-uav-humanFAR
Top 1 Accuracy: 39.1
action-recognition-on-uav-human-1FAR
Top 1 Accuracy: 38.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
FAR: Fourier Aerial Video Recognition | Papers | HyperAI