Command Palette
Search for a command to run...
Hasan Mahmudul Choi Jonghyun Neumann Jan Roy-Chowdhury Amit K. Davis Larry S.

Abstract
Perceiving meaningful activities in a long video sequence is a challengingproblem due to ambiguous definition of 'meaningfulness' as well as clutters inthe scene. We approach this problem by learning a generative model for regularmotion patterns, termed as regularity, using multiple sources with very limitedsupervision. Specifically, we propose two methods that are built upon theautoencoders for their ability to work with little to no supervision. We firstleverage the conventional handcrafted spatio-temporal local features and learna fully connected autoencoder on them. Second, we build a fully convolutionalfeed-forward autoencoder to learn both the local features and the classifiersas an end-to-end learning framework. Our model can capture the regularitiesfrom multiple datasets. We evaluate our methods in both qualitative andquantitative ways - showing the learned regularity of videos in various aspectsand demonstrating competitive performance on anomaly detection datasets as anapplication.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| abnormal-event-detection-in-video-on-ubi | Hasan et al. | AUC: 0.528 Decidability: 0.194 EER: 0.466 |
| semi-supervised-anomaly-detection-on-ubi | Hasan et al. | AUC: 0.528 Decidability: 0.194 EER: 0.466 |
| video-anomaly-detection-on-hr-avenue | Conv-AE | AUC: 84.8 |
| video-anomaly-detection-on-hr-shanghaitech | Conv-AE | AUC: 69.8 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.