HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Llemma: An Open Language Model For Mathematics

Zhangir Azerbayev Hailey Schoelkopf Keiran Paster Marco Dos Santos Stephen McAleer Albert Q. Jiang Jia Deng Stella Biderman Sean Welleck

Llemma: An Open Language Model For Mathematics

Abstract

We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma. On the MATH benchmark Llemma outperforms all known open base models, as well as the unreleased Minerva model suite on an equi-parameter basis. Moreover, Llemma is capable of tool use and formal theorem proving without any further finetuning. We openly release all artifacts, including 7 billion and 34 billion parameter models, the Proof-Pile-2, and code to replicate our experiments.

Code Repositories

eleutherai/gpt-neox
Official
pytorch
Mentioned in GitHub
EleutherAI/math-lm
Official
Mentioned in GitHub
wellecks/llmstep
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
arithmetic-reasoning-on-gsm8kLlemma 7B
Accuracy: 36.4
Parameters (Billion): 7
arithmetic-reasoning-on-gsm8kLlemma 34B
Accuracy: 51.5
Parameters (Billion): 34
automated-theorem-proving-on-minif2f-testLLEMMA-7b
ITP: Lean
Pass@32: 26.2
cumulative: 26.2
automated-theorem-proving-on-minif2f-testLLEMMA-34b
ITP: Lean
Pass@32: 25.8
cumulative: 25.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp