7 months ago

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li

Abstract

Translating natural language mathematical statements into formal, executablecode is a fundamental challenge in automated theorem proving. While prior workhas focused on generation and compilation success, little attention has beenpaid to the critic phase-the evaluation of whether generated formalizationstruly capture the semantic intent of the original problem. In this paper, weintroduce CriticLean, a novel critic-guided reinforcement learning frameworkthat elevates the role of the critic from a passive validator to an activelearning component. Specifically, first, we propose the CriticLeanGPT, trainedvia supervised fine-tuning and reinforcement learning, to rigorously assess thesemantic fidelity of Lean 4 formalizations. Then, we introduce CriticLeanBench,a benchmark designed to measure models' ability to distinguish semanticallycorrect from incorrect formalizations, and demonstrate that our trainedCriticLeanGPT models can significantly outperform strong open- andclosed-source baselines. Building on the CriticLean framework, we constructFineLeanCorpus, a dataset comprising over 285K problems that exhibits richdomain diversity, broad difficulty coverage, and high correctness based onhuman evaluation. Overall, our findings highlight that optimizing the criticphase is essential for producing reliable formalizations, and we hope ourCriticLean will provide valuable insights for future advances in formalmathematical reasoning.

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

7 months ago

Benchmarks

Supervised Fine-Tuning

Reinforcement Learning

AI Infra

Method/Architecture

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

7 months ago

Benchmarks

Supervised Fine-Tuning

Reinforcement Learning

AI Infra

Method/Architecture

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li

Abstract

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li9 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li9 more

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li9 more

Abstract

Build AI with AI

HyperAI Newsletters

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li

Zhongyuan Peng Yifan Yao Kaijing Ma Shuyue Guo Yizhe Li Yichi Zhang Chenchen Zhang Yifan Zhang Zhouliang Yu Luming Li