Language Modelling on OpenWebText
Metrics
eval_loss
eval_perplexity
parameters
Results
Performance results of various models on this benchmark
| Model | eval_loss | eval_perplexity | parameters | Paper Title | Repository |
|---|---|---|---|---|---|
| GPT2-GELU | 2.95 | 19.24 | 124M | Polynomial, trigonometric, and tropical activations | |
| GPT2-Fourier | 2.93 | 18.72 | 124M | Polynomial, trigonometric, and tropical activations | |
| GPT2-Tropical | 2.92 | 18.64 | 124M | Polynomial, trigonometric, and tropical activations | |
| GPT2-Hermite | 2.91 | 18.39 | 124M | Polynomial, trigonometric, and tropical activations | |
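
The two reported metrics are usually related: perplexity is conventionally the exponential of the mean per-token cross-entropy loss. The sketch below checks the table against that convention. It assumes eval_loss is measured in nats and averaged per token, which this page does not state. The helper name `perplexity_from_loss` is hypothetical, chosen for illustration. The derived values land close to the reported ones but do not match exactly, which may reflect rounding of the loss or a different averaging scheme.

```python
import math

# Rows copied from the results table: (model, eval_loss, eval_perplexity).
ROWS = [
    ("GPT2-GELU", 2.95, 19.24),
    ("GPT2-Fourier", 2.93, 18.72),
    ("GPT2-Tropical", 2.92, 18.64),
    ("GPT2-Hermite", 2.91, 18.39),
]


def perplexity_from_loss(loss: float) -> float:
    """Convert a loss to perplexity.

    Assumption: `loss` is the mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


# Compare the derived perplexity with the value reported in the table.
for model, loss, reported in ROWS:
    derived = perplexity_from_loss(loss)
    print(f"{model:14s} exp({loss:.2f}) = {derived:6.2f}   reported = {reported:6.2f}")
```

Because the losses are listed to only two decimal places, small gaps between the derived and reported perplexities are expected.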