Audio Classification on FSD50K
Evaluation Metrics
mAP
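As a rough illustration of the metric, the sketch below computes mAP under the assumption that it means the unweighted mean of per-class average precision, as is usual for multi-label audio tagging. The function names and the toy labels and scores are hypothetical, and the tie handling is simplistic, so results may differ slightly from a given benchmark's evaluation script.

```python
from typing import List, Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Average precision for one class: mean of precision@k over the ranks k of the positives."""
    # Rank clips from highest to lowest score (ties keep their input order).
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx]:
            hits += 1
            precision_sum += hits / rank
    if hits == 0:
        raise ValueError("class has no positive examples")
    return precision_sum / hits


def mean_average_precision(
    y_true: List[Sequence[int]], y_score: List[Sequence[float]]
) -> float:
    """Unweighted mean of per-class AP; classes with no positives are skipped (an assumption)."""
    n_classes = len(y_true[0])
    aps = []
    for c in range(n_classes):
        labels = [row[c] for row in y_true]
        scores = [row[c] for row in y_score]
        if any(labels):
            aps.append(average_precision(labels, scores))
    if not aps:
        raise ValueError("no class has a positive example")
    return sum(aps) / len(aps)


# Toy data (hypothetical): 4 clips, 2 classes, multi-label ground truth.
y_true = [[1, 0], [0, 1], [1, 1], [0, 0]]
y_score = [[0.9, 0.2], [0.1, 0.8], [0.4, 0.3], [0.3, 0.7]]
toy_map = mean_average_precision(y_true, y_score)
```

The leaderboard values appear to be this quantity expressed as a percentage, so a score of 0.697 would be listed as 69.7.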
Evaluation Results
Performance of each model on this benchmark
| Model | mAP | Paper Title | Repository |
|---|---|---|---|
| ONE-PEACE | 69.7 | ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | |
| MN | 65.6 | Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models | |
| PaSST-S | 65.55 | Efficient Training of Audio Transformers with Patchout | |
| DyMN-L | 65.5 | Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models | |
| PaSST-N-S | 64.2 | Efficient Training of Audio Transformers with Patchout | |
| PSLA | 56.71 | PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation | |
| Temporal Knowledge Distillation for On-device Audio Classification | 54.8 | Temporal Knowledge Distillation for On-device Audio Classification | - | 
| Large 6-Layer Transformer with Pooling | 53.7 | Audio Transformers | - | 
| LHGNN | - | LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging | - | 