Temporal Action Localization On Thumos14

评估指标

mAP IOU@0.1
mAP IOU@0.2
mAP IOU@0.3
mAP IOU@0.4
mAP IOU@0.5

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
AdaTAD (VideoMAEv2-giant)--89.786.780.9End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
RDFA-S6 (InternVideo2-6B)--88.784.678.2Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent Mechanism
ActionMamba(InternVideo2-6B)--86.8983.0976.90Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
TriDet (VideoMAE v2-g feature)--84.880.073.3Temporal Action Localization with Enhanced Instant Discriminability
ActionFormer (VideoMAE V2-g features)--84.079.673.0VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
TriDet (I3D features)--83.680.172.9TriDet: Temporal Action Detection with Relative Boundary Modeling
TemporalMaxer (I3D features)--82.878.971.8TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
ASL(I3D features)--83.179.071.7Action Sensitivity Learning for Temporal Action Localization-
ActionFormer (I3D features)--82.177.871.0ActionFormer: Localizing Moments of Actions with Transformers
DualDETR (I3D features)--82.978.070.4Dual DETRs for Multi-Label Temporal Action Detection-
BasicTAD (160,6,192,R50-SlowOnly)--75.570.863.5BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
TadML(two-stream)--73.2969.7362.53TadML: A fast temporal action detection with Mechanics-MLP
TadTR--74.869.160.1End-to-end Temporal Action Detection with Transformer
BasicTAD (112,3,96,R50-SlowOnly)--68.465.058.6BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
ReAct (TSN features)--69.265.057.1ReAct: Temporal Action Detection with Relational Queries
AVFusion--70.164.957.1Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization
TAGS (I3D)--68.663.857.0Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning
MUSES--68.964.056.9Multi-shot Temporal Event Localization: a Benchmark
TadML(rgb-only)--68.7864.6656.61TadML: A fast temporal action detection with Mechanics-MLP
E2E-TAD (SlowFast R50+TadTR)--69.464.356.0An Empirical Study of End-to-End Temporal Action Detection
0 of 42 row(s) selected.
Temporal Action Localization On Thumos14 | SOTA | HyperAI超神经