| TextFuseNet (ResNeXt-101) | 92.23 | - | 93.96 | 90.56 | TextFuseNet: Scene Text Detection with Richer Fused Features | - |
| CharNet H-88 (multi-scale) | 91.55 | - | 92.65 | 90.47 | Convolutional Character Networks | |
| CharNet H-88 (single-scale) | 90.97 | - | 89.99 | 91.98 | Convolutional Character Networks | |
| CharNet H-50 (multi-scale) | 90.16 | - | 90.9 | 89.44 | Convolutional Character Networks | |
| CharNet H-57 (multi-scale) | 90.06 | - | 91.43 | 88.74 | Convolutional Character Networks | |
| CharNet H-50 (single-scale) | 89.7 | - | 91.15 | 88.3 | Convolutional Character Networks | |
| CharNet H-57 (single-scale) | 89.66 | - | 88.88 | 90.45 | Convolutional Character Networks | |
| PMTD | 89.33 | - | 91.3 | 87.43 | Pyramid Mask Text Detector | |