| bias-detection-on-stereoset-1 | OPT 175B | ICAT Score: 60 LMS: 74.8 SS: 59.9 |
| bias-detection-on-stereoset-1 | GAL 120B | ICAT Score: 65.6 LMS: 75 SS: 56.2 |
| bias-detection-on-stereoset-1 | GPT-3 (text-davinci-002) | ICAT Score: 60.8 LMS: 77.6 SS: 60.8 |
| common-sense-reasoning-on-arc-challenge | BLOOM (few-shot, k=5) | |
| common-sense-reasoning-on-arc-challenge | GAL 120B (zero-shot) | |
| common-sense-reasoning-on-arc-challenge | OPT (few-shot, k=5) | |
| common-sense-reasoning-on-arc-challenge | GPT-3 (zero-shot) | |
| common-sense-reasoning-on-arc-easy | GAL 120B (0-shot) | |
| common-sense-reasoning-on-arc-easy | BLOOM (5-shot) | |
| common-sense-reasoning-on-arc-easy | GPT-3 (zero-shot) | |
| common-sense-reasoning-on-arc-easy | OPT (5-shot) | |
| math-word-problem-solving-on-math | GAL 120B <work> | Accuracy: 16.6 Parameters (Billions): 120 |
| math-word-problem-solving-on-math | GAL 120B (5-shot) mCoT | Accuracy: 20.4 Parameters (Billions): 120 |
| math-word-problem-solving-on-math | Minerva 540B (5-shot) mCoT | Accuracy: 33.6 Parameters (Billions): 540 |
| math-word-problem-solving-on-math | GAL 30B <work> | Accuracy: 11.4 Parameters (Billions): 30 |
| math-word-problem-solving-on-math | PaLM 540B (5-shot) mCoT | Accuracy: 8.8 Parameters (Billions): 540 |
| math-word-problem-solving-on-math | GPT-3 175B (8-shot) | Accuracy: 5.2 Parameters (Billions): 175 |
| math-word-problem-solving-on-math | GAL 30B (5-shot) mCoT | Accuracy: 12.7 Parameters (Billions): 30 |
| mathematical-reasoning-on-mmlu-mathematics | GAL 120B <work> | |
| molecular-property-prediction-on-bace-1 | GAL 1.3B | |
| molecular-property-prediction-on-bace-1 | GAL 30B | |
| molecular-property-prediction-on-bace-1 | GAL 125M | |
| molecular-property-prediction-on-bace-1 | GAL 120B | |
| molecular-property-prediction-on-bace-1 | GAL 6.7B | |
| molecular-property-prediction-on-bbbp-1 | GAL 6.7B | |
| molecular-property-prediction-on-bbbp-1 | GAL 125M | |
| molecular-property-prediction-on-bbbp-1 | GAL 120B | |
| molecular-property-prediction-on-bbbp-1 | Uni-Mol | |
| molecular-property-prediction-on-bbbp-1 | GAL 30B | |
| molecular-property-prediction-on-bbbp-1 | GAL 1.3B | |
| molecular-property-prediction-on-clintox-1 | GAL 1.3B | Molecules (M): 2 ROC-AUC: 58.9 |
| molecular-property-prediction-on-clintox-1 | GAL 125M | Molecules (M): 2 ROC-AUC: 51.8 |
| molecular-property-prediction-on-clintox-1 | GAL 120B | Molecules (M): 2 ROC-AUC: 82.6 |
| molecular-property-prediction-on-clintox-1 | GAL 6.7B | Molecules (M): 2 ROC-AUC: 78.4 |
| molecular-property-prediction-on-clintox-1 | GAL 30B | Molecules (M): 2 ROC-AUC: 82.2 |
| molecular-property-prediction-on-hiv-dataset | GAL 30B | |
| molecular-property-prediction-on-hiv-dataset | GAL 1.3B | |
| molecular-property-prediction-on-hiv-dataset | GAL 125M | |
| molecular-property-prediction-on-hiv-dataset | GAL 6.7B | |
| molecular-property-prediction-on-hiv-dataset | Uni-Mol | |
| molecular-property-prediction-on-hiv-dataset | GAL 120B | |
| molecular-property-prediction-on-moleculenet | GAL 30B | |
| molecular-property-prediction-on-moleculenet | GAL 125M | |
| molecular-property-prediction-on-moleculenet | GAL 1.3B | |
| molecular-property-prediction-on-moleculenet | GAL 6.7B | |
| molecular-property-prediction-on-moleculenet | Uni-Mol | |
| molecular-property-prediction-on-sider-1 | GAL 125M | |
| molecular-property-prediction-on-sider-1 | GAL 1.3B | |
| molecular-property-prediction-on-sider-1 | GAL 6.7B | |
| molecular-property-prediction-on-sider-1 | GAL 120B | |
| molecular-property-prediction-on-sider-1 | GAL 30B | |
| molecular-property-prediction-on-tox21-1 | GAL 125M | |
| molecular-property-prediction-on-tox21-1 | GAL 120B | |
| molecular-property-prediction-on-tox21-1 | Uni-Mol | |
| molecular-property-prediction-on-tox21-1 | GAL 6.7B | |
| molecular-property-prediction-on-tox21-1 | GAL 30B | |
| molecular-property-prediction-on-tox21-1 | GAL 1.3B | |
| multi-task-language-understanding-on-mmlu | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-10 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-10 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-10 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-10 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-10 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-11 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-11 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-11 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-11 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-11 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-12 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-12 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-12 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-12 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-12 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-13 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-13 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-13 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-13 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-13 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-14 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-14 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-14 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-14 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-15 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-15 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-15 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-15 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-16 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-16 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-16 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-16 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-16 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-17 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-17 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-17 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-17 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-17 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-18 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-18 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-18 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-18 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-18 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-19 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-19 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-19 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-19 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-2 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-2 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-2 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-2 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-2 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-20 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-20 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-20 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-20 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-20 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-21 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-21 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-21 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-3 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-3 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-3 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-3 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-3 | GAL 30B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-4 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-4 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-4 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-4 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-4 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-5 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-5 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-5 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-5 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-5 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-6 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-6 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-6 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-6 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-7 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-7 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-7 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-7 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-7 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-8 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-8 | Chinchilla (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-8 | GAL 30B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-8 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-8 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-9 | BLOOM (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-9 | GAL 120B (zero-shot) | |
| multiple-choice-question-answering-mcqa-on-9 | Gopher (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-9 | OPT (few-shot, k=5) | |
| multiple-choice-question-answering-mcqa-on-9 | Chinchilla (few-shot, k=5) | |
| protein-function-prediction-on-caspsimseq | GAL 1.3B | |
| protein-function-prediction-on-caspsimseq | GAL 30B | |
| protein-function-prediction-on-caspsimseq | GAL 120B | |
| protein-function-prediction-on-caspsimseq | GAL 6.7B | |
| protein-function-prediction-on-caspsimseq | GAL 125M | |
| protein-function-prediction-on-paenseq | GAL 30B | |
| protein-function-prediction-on-paenseq | GAL 120B | |
| protein-function-prediction-on-paenseq | GAL 1.3B | |
| protein-function-prediction-on-paenseq | GAL 125M | |
| protein-function-prediction-on-paenseq | GAL 6.7B | |
| protein-function-prediction-on-uniprotseq | GAL 30B | |
| protein-function-prediction-on-uniprotseq | GAL 125M | |
| protein-function-prediction-on-uniprotseq | GAL 120B | |
| protein-function-prediction-on-uniprotseq | GAL 6.7B | |
| protein-function-prediction-on-uniprotseq | GAL 1.3B | |
| protein-structure-prediction-on-caspseq | GAL 6.7B | Validation perplexity: 17.29 |
| protein-structure-prediction-on-caspseq | GAL 1.3B | Validation perplexity: 17.58 |
| protein-structure-prediction-on-caspseq | GAL 30B | Validation perplexity: 17.27 |
| protein-structure-prediction-on-caspseq | GAL 125M | Validation perplexity: 20.62 |
| protein-structure-prediction-on-caspseq | GAL 120B | Validation perplexity: 17.26 |
| protein-structure-prediction-on-caspsimseq | GAL 1.3B | Validation perplexity: 17.04 |
| protein-structure-prediction-on-caspsimseq | GAL 30B | Validation perplexity: 15.42 |
| protein-structure-prediction-on-caspsimseq | GAL 125M | Validation perplexity: 19.18 |
| protein-structure-prediction-on-caspsimseq | GAL 6.7B | Validation perplexity: 16.35 |
| protein-structure-prediction-on-caspsimseq | GAL 120B | Validation perplexity: 12.77 |
| protein-structure-prediction-on-paenseq | GAL 30B | Validation perplexity: 4.28 |
| protein-structure-prediction-on-paenseq | GAL 6.7B | Validation perplexity: 7.76 |
| protein-structure-prediction-on-paenseq | GAL 120B | Validation perplexity: 3.14 |
| protein-structure-prediction-on-paenseq | GAL 1.3B | Validation perplexity: 12.53 |
| protein-structure-prediction-on-paenseq | GAL 125M | Validation perplexity: 16.35 |
| protein-structure-prediction-on-uniprotseq | GAL 6.7B | Validation perplexity: 11.58 |
| protein-structure-prediction-on-uniprotseq | GAL 125M | Validation perplexity: 19.05 |
| protein-structure-prediction-on-uniprotseq | GAL 1.3B | Validation perplexity: 15.82 |
| protein-structure-prediction-on-uniprotseq | GAL 120B | Validation perplexity: 5.54 |
| protein-structure-prediction-on-uniprotseq | GAL 30B | Validation perplexity: 8.23 |
| question-answering-on-bioasq | GAL 120B (zero-shot) | |
| question-answering-on-bioasq | BLOOM (zero-shot) | |
| question-answering-on-bioasq | OPT (zero-shot) | |
| question-answering-on-medqa-usmle | GAL 120B (zero-shot) | |
| question-answering-on-medqa-usmle | OPT (few-shot, k=5) | |
| question-answering-on-medqa-usmle | BLOOM (few-shot, k=5) | |
| question-answering-on-pubmedqa | GAL 120B (zero-shot) | |
| question-answering-on-pubmedqa | BLOOM (zero-shot) | |
| question-answering-on-pubmedqa | OPT (zero-shot) | |
| question-answering-on-truthfulqa | GAL 6.7B | |
| question-answering-on-truthfulqa | GAL 30B | |
| question-answering-on-truthfulqa | GAL 1.3B | |
| question-answering-on-truthfulqa | GAL 120B | |
| question-answering-on-truthfulqa | GAL 125M | |
| question-answering-on-truthfulqa | OPT 175B | |
| stereotypical-bias-analysis-on-crows-pairs | GAL 120B | Age: 69 Disability: 66.7 Gender: 51.9 Nationality: 51.6 Overall: 60.5 Physical Appearance: 58.7 Race/Color: 59.9 Religion: 51.9 Sexual Orientation: 77.4 Socioeconomic status: 65.7 |
| tdc-admet-benchmarking-group-on-tdcommons | Galactica-GAL-120B | |
| tdc-admet-benchmarking-group-on-tdcommons | Galactica-GAL-125M | |
| tdc-admet-benchmarking-group-on-tdcommons | Galactica-GAL-30B | |
| tdc-admet-benchmarking-group-on-tdcommons | Galactica-GAL-6.7B | |
| tdc-admet-benchmarking-group-on-tdcommons | Galactica-GAL-1.3B | |
| word-sense-disambiguation-on-big-bench | GAL 120B (few-shot, k=5) | |
| word-sense-disambiguation-on-big-bench | BLOOM 176B | |
| word-sense-disambiguation-on-big-bench | GAL 30B (few-shot, k=5) | |
| word-sense-disambiguation-on-big-bench | OPT 175B | |