| scene-text-recognition-on-coco-text | CLIP4STR-B | |
| scene-text-recognition-on-coco-text | CLIP4STR-L | |
| scene-text-recognition-on-cute80 | CLIP4STR-L | |
| scene-text-recognition-on-cute80 | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-cute80 | CLIP4STR-B | |
| scene-text-recognition-on-host | CLIP4STR-B | |
| scene-text-recognition-on-host | CLIP4STR-L | |
| scene-text-recognition-on-ic19-art | CLIP4STR-L | |
| scene-text-recognition-on-ic19-art | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-ic19-art | CLIP4STR-B | |
| scene-text-recognition-on-icdar2013 | CLIP4STR-L | |
| scene-text-recognition-on-icdar2013 | CLIP4STR-B | |
| scene-text-recognition-on-icdar2013 | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-icdar2015 | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-icdar2015 | CLIP4STR-L | |
| scene-text-recognition-on-icdar2015 | CLIP4STR-B | |
| scene-text-recognition-on-iiit5k | CLIP4STR-B (DataComp-1B) | |
| scene-text-recognition-on-iiit5k | CLIP4STR-L | |
| scene-text-recognition-on-iiit5k | CLIP4STR-B | |
| scene-text-recognition-on-iiit5k | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-svt | CLIP4STR-L | |
| scene-text-recognition-on-svt | CLIP4STR-B | |
| scene-text-recognition-on-svt | CLIP4STR-H (DFN-5B) | |
| scene-text-recognition-on-svt | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-svtp | CLIP4STR-L | |
| scene-text-recognition-on-svtp | CLIP4STR-B | |
| scene-text-recognition-on-svtp | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-uber-text | CLIP4STR-L (DataComp-1B) | |
| scene-text-recognition-on-uber-text | CLIP4STR-B | |
| scene-text-recognition-on-wost | CLIP4STR-L | |
| scene-text-recognition-on-wost | CLIP4STR-H (DFN-5B) | |
| scene-text-recognition-on-wost | CLIP4STR-B | |
| scene-text-recognition-on-wost | CLIP4STR-L (DataComp-1B) | |