Dingfeng Shi, Yujie Zhong, Qiong Cao, Lin Ma, Jia Li, Dacheng Tao

Abstract
In this paper, we present TriDet, a one-stage framework for temporal action detection. Existing methods often suffer from imprecise boundary predictions because action boundaries in videos are ambiguous. To alleviate this problem, we propose a novel Trident-head that models the action boundary via an estimated relative probability distribution around the boundary. In the feature pyramid of TriDet, we propose an efficient Scalable-Granularity Perception (SGP) layer that mitigates the rank loss problem of self-attention in video features and aggregates information across different temporal granularities. Benefiting from the Trident-head and the SGP-based feature pyramid, TriDet achieves state-of-the-art performance on three challenging benchmarks (THUMOS14, HACS and EPIC-KITCHENS 100) at a lower computational cost than previous methods. For example, TriDet achieves an average mAP of $69.3\%$ on THUMOS14, outperforming the previous best by $2.5\%$ with only $74.6\%$ of its latency. The code is available at https://github.com/sssste/TriDet.
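The abstract describes the Trident-head only at a high level. As a rough illustration of what "modeling a boundary via a probability distribution" can mean, the sketch below decodes a start time as the expected offset under a softmax over a few candidate bins next to an anchor instant. It is a simplified stand-in, not the Trident-head itself: the function names, the bin layout, and the `stride` parameter are hypothetical and do not come from the paper or its repository.

```python
import math


def expected_boundary_offset(logits):
    """Expected offset (in bins) under a softmax over candidate offsets.

    logits[i] scores the hypothesis that the boundary lies i bins away
    from the anchor. This bin layout is an assumption for illustration.
    """
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    return sum(i * p for i, p in enumerate(probs))


def decode_start(anchor_time, logits, stride=1.0):
    """Hypothetical decoding: start = anchor - expected offset * stride."""
    return anchor_time - expected_boundary_offset(logits) * stride


# A flat distribution over 3 bins puts the expected offset in the middle bin.
uniform_offset = expected_boundary_offset([0.0, 0.0, 0.0])

# A sharply peaked distribution behaves like a hard choice of the last bin.
peaked_offset = expected_boundary_offset([0.0, 0.0, 50.0])

start_time = decode_start(anchor_time=10.0, logits=[0.0, 0.0, 0.0], stride=0.5)
```

Taking an expectation rather than an argmax lets the estimate fall between bins, which is one plausible way for a distribution-based head to express an uncertain boundary.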
Benchmarks
| Benchmark | Method | Metrics |
|---|---|---|
| temporal-action-localization-on-activitynet | TriDet (TSP features) | mAP: 36.8; mAP IoU@0.5: 54.7; mAP IoU@0.75: 38.0; mAP IoU@0.95: 8.4 |
| temporal-action-localization-on-epic-kitchens | TriDet (verb) | Avg mAP (0.1:0.5): 25.4; mAP IoU@0.1: 28.6; mAP IoU@0.2: 27.4; mAP IoU@0.3: 26.1; mAP IoU@0.4: 24.2; mAP IoU@0.5: 20.8 |
| temporal-action-localization-on-hacs | TriDet (SlowFast) | Average mAP: 38.6; mAP@0.5: 56.7; mAP@0.75: 39.3; mAP@0.95: 11.7 |
| temporal-action-localization-on-hacs | TriDet (I3D RGB) | Average mAP: 36.8; mAP@0.5: 54.5; mAP@0.75: 36.8; mAP@0.95: 11.5 |
| temporal-action-localization-on-thumos14 | TriDet (I3D features) | Avg mAP (0.3:0.7): 69.3; mAP IoU@0.3: 83.6; mAP IoU@0.4: 80.1; mAP IoU@0.5: 72.9; mAP IoU@0.6: 62.4; mAP IoU@0.7: 47.4 |
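The metrics in the table rest on two simple quantities: the temporal IoU between a predicted and a ground-truth segment, and the mean of per-threshold mAP values. The sketch below shows both, and recomputes the average for the two rows that list every threshold (THUMOS14 and EPIC-KITCHENS verb). The helper names are illustrative, and the example segments are made up; only the mAP numbers are taken from the table.

```python
def temporal_iou(pred, gt):
    """Intersection over union of two (start, end) segments in seconds."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def average_map(map_by_threshold):
    """Mean of mAP values over the listed tIoU thresholds."""
    return sum(map_by_threshold.values()) / len(map_by_threshold)


# Per-threshold mAP values copied from the table.
thumos14 = {0.3: 83.6, 0.4: 80.1, 0.5: 72.9, 0.6: 62.4, 0.7: 47.4}
epic_verb = {0.1: 28.6, 0.2: 27.4, 0.3: 26.1, 0.4: 24.2, 0.5: 20.8}

thumos14_avg = round(average_map(thumos14), 1)    # 69.3, matching the table
epic_verb_avg = round(average_map(epic_verb), 1)  # 25.4, matching the table

# Made-up segments: 2 s of overlap over a 6 s union gives tIoU = 1/3.
example_iou = temporal_iou((2.0, 6.0), (4.0, 8.0))
```

The ActivityNet and HACS rows list only three of the thresholds behind their averages, so those averages cannot be rechecked from the table in the same way.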