| Local Prior Matching (Large Model) | 20.84 | Semi-Supervised Speech Recognition via Local Prior Matching | |
| Local Prior Matching (Large Model, ConvLM LM) | 15.28 | Semi-Supervised Speech Recognition via Local Prior Matching | |
| TDNN + pNorm + speed up/down speech | 12.5 | - | - |
| Convolutional Speech Recognition | 10.47 | Fully Convolutional Speech Recognition | - |
| Jasper DR 10x5 (+ Time/Freq Masks) | 7.84 | Jasper: An End-to-End Convolutional Neural Acoustic Model | |
| Multi-Stream Self-Attention With Dilated 1D Convolutions | 5.80 | State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions | |