HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Stay on topic with Classifier-Free Guidance

Guillaume Sanchez; Honglu Fan; Alexander Spangher; Elad Levi; Pawan Sasanka Ammanamanchi; Stella Biderman

Stay on topic with Classifier-Free Guidance

Abstract

Classifier-Free Guidance (CFG) has recently emerged in text-to-image generation as a lightweight technique to encourage prompt-adherence in generations. In this work, we demonstrate that CFG can be used broadly as an inference-time technique in pure language modeling. We show that CFG (1) improves the performance of Pythia, GPT-2 and LLaMA-family models across an array of tasks: Q\&A, reasoning, code generation, and machine translation, achieving SOTA on LAMBADA with LLaMA-7B over PaLM-540B; (2) brings improvements equivalent to a model with twice the parameter-count; (3) can stack alongside other inference-time methods like Chain-of-Thought and Self-Consistency, yielding further improvements in difficult tasks; (4) can be used to increase the faithfulness and coherence of assistants in challenging form-driven and content-driven prompts: in a human evaluation we show a 75\% preference for GPT4All using CFG over baseline.

Benchmarks

BenchmarkMethodologyMetrics
common-sense-reasoning-on-arc-easyLLaMA 13B + CFG (0-shot)
Accuracy: 79.1
common-sense-reasoning-on-arc-easyLLaMA 65B + CFG (0-shot)
Accuracy: 84.2
common-sense-reasoning-on-arc-easyLLaMA 30B + CFG (0-shot)
Accuracy: 83.2
common-sense-reasoning-on-arc-easyLLaMA 7B + CFG (0-shot)
Accuracy: 58.9
language-modelling-on-lambadaLLaMA-30B+CFG (zero-shot)
Accuracy: 83.9
language-modelling-on-lambadaLLaMA-13B+CFG (zero-shot)
Accuracy: 82.2
language-modelling-on-lambadaLLaMA-65B+CFG (Zero-Shot)
Accuracy: 84.0
text-generation-on-sciqLLaMA-13B+CFG (zero-shot)
Accuracy: 95.1
text-generation-on-sciqLLaMA-30B+CFG (zero-shot)
Accuracy: 96.4
text-generation-on-sciqLLaMA-65B+CFG (zero-shot)
Accuracy: 96.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Stay on topic with Classifier-Free Guidance | Papers | HyperAI