HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Formal Mathematics Statement Curriculum Learning

Stanislas Polu Jesse Michael Han Kunhao Zheng Mantas Baksys Igor Babuschkin Ilya Sutskever

Formal Mathematics Statement Curriculum Learning

Abstract

We explore the use of expert iteration in the context of language modeling applied to formal mathematics. We show that at same compute budget, expert iteration, by which we mean proof search interleaved with learning, dramatically outperforms proof search only. We also observe that when applied to a collection of formal statements of sufficiently varied difficulty, expert iteration is capable of finding and solving a curriculum of increasingly difficult problems, without the need for associated ground-truth proofs. Finally, by applying this expert iteration to a manually curated set of problem statements, we achieve state-of-the-art on the miniF2F benchmark, automatically solving multiple challenging problems drawn from high school olympiads.

Code Repositories

openai/lean-gym
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
automated-theorem-proving-on-minif2f-testLean Expert Iteration
ITP: Lean
Pass@1: 29.6
Pass@32: 34.5
Pass@64: 36.6
cumulative: 36.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Formal Mathematics Statement Curriculum Learning | Papers | HyperAI