HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning

Jiashuo Sun Hang Zhang Chen Lin Xiangdong Su Yeyun Gong Jian Guo

APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning

Abstract

Long-form numerical reasoning in financial analysis aims to generate a reasoning program to calculate the correct answer for a given question. Previous work followed a retriever-generator framework, where the retriever selects key facts from a long-form document, and the generator generates a reasoning program based on retrieved facts. However, they treated all facts equally without considering the different contributions of facts with and without numbers. Meanwhile, the program consistency were ignored under supervised training, resulting in lower training accuracy and diversity. To solve these problems, we proposed APOLLO to improve the long-form numerical reasoning framework. For the retriever, we adopt a number-aware negative sampling strategy to enable the retriever to be more discriminative on key numerical facts. For the generator, we design consistency-based reinforcement learning and target program augmentation strategy based on the consistency of program execution results. Experimental results on the FinQA and ConvFinQA leaderboard verify the effectiveness of our proposed method, achieving the new state-of-the-art.

Code Repositories

gasolsun36/dynamicrag
pytorch
Mentioned in GitHub
gasolsun36/iter-cot
Mentioned in GitHub
gasolsun36/apollo
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
conversational-question-answering-onAPOLLO
Execution Accuracy: 78.76
Program Accuracy: 77.19
question-answering-on-finqaAPOLLO
Execution Accuracy: 71.07
Program Accuracy: 68.94

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning | Papers | HyperAI