4 months ago

ReCode: Updating Code API Knowledge with Reinforcement Learning

Haoze Wu Yunzhi Yao Wenhao Yu Huajun Chen Ningyu Zhang

Abstract

Large Language Models (LLMs) exhibit remarkable code generation capabilitiesbut falter when adapting to frequent updates in external library APIs. Thiscritical limitation, stemming from reliance on outdated API knowledge fromtheir training data, even with access to current documentation, impedesreliable code generation in dynamic environments. To tackle this issue, wepropose ReCode (rule-based Reinforcement learning for Code Update), a novelframework that mimics human programmer adaptation to API changes. Specifically,we construct a dataset of approximately 2,000 data entries to train the LLMs toperform version migration based on updated information. Then, we introduce amodified string similarity metric for code evaluation as the reward forreinforcement learning. Our experiments demonstrate that ReCode substantiallyboosts LLMs' code generation performance in dynamic API scenarios, especiallyon the unseen CodeUpdateArena task. Crucially, compared to supervisedfine-tuning, ReCode has less impact on LLMs' general code generation abilities.We apply ReCode on various LLMs and reinforcement learning algorithms (GRPO andDAPO), all achieving consistent improvements. Notably, after training,Qwen2.5-Coder-7B outperforms that of the 32B parameter code instruction-tunedmodel and the reasoning model with the same architecture. Code is available athttps://github.com/zjunlp/ReCode.

Code Repositories

zjunlp/recode

Official

pytorch

Mentioned in GitHub

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning