Command Palette
Search for a command to run...
Guillaume Sanchez; Honglu Fan; Alexander Spangher; Elad Levi; Pawan Sasanka Ammanamanchi; Stella Biderman

Abstract
Classifier-Free Guidance (CFG) has recently emerged in text-to-image generation as a lightweight technique to encourage prompt-adherence in generations. In this work, we demonstrate that CFG can be used broadly as an inference-time technique in pure language modeling. We show that CFG (1) improves the performance of Pythia, GPT-2 and LLaMA-family models across an array of tasks: Q\&A, reasoning, code generation, and machine translation, achieving SOTA on LAMBADA with LLaMA-7B over PaLM-540B; (2) brings improvements equivalent to a model with twice the parameter-count; (3) can stack alongside other inference-time methods like Chain-of-Thought and Self-Consistency, yielding further improvements in difficult tasks; (4) can be used to increase the faithfulness and coherence of assistants in challenging form-driven and content-driven prompts: in a human evaluation we show a 75\% preference for GPT4All using CFG over baseline.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| common-sense-reasoning-on-arc-easy | LLaMA 13B + CFG (0-shot) | Accuracy: 79.1 |
| common-sense-reasoning-on-arc-easy | LLaMA 65B + CFG (0-shot) | Accuracy: 84.2 |
| common-sense-reasoning-on-arc-easy | LLaMA 30B + CFG (0-shot) | Accuracy: 83.2 |
| common-sense-reasoning-on-arc-easy | LLaMA 7B + CFG (0-shot) | Accuracy: 58.9 |
| language-modelling-on-lambada | LLaMA-30B+CFG (zero-shot) | Accuracy: 83.9 |
| language-modelling-on-lambada | LLaMA-13B+CFG (zero-shot) | Accuracy: 82.2 |
| language-modelling-on-lambada | LLaMA-65B+CFG (Zero-Shot) | Accuracy: 84.0 |
| text-generation-on-sciq | LLaMA-13B+CFG (zero-shot) | Accuracy: 95.1 |
| text-generation-on-sciq | LLaMA-30B+CFG (zero-shot) | Accuracy: 96.4 |
| text-generation-on-sciq | LLaMA-65B+CFG (zero-shot) | Accuracy: 96.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.