ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection

Yihao Fang; Xianzhi Li; Stephen W. Thomas; Xiaodan Zhu

Abstract

Open intent detection, a crucial aspect of natural language understanding, involves identifying previously unseen intents in user-generated text. Despite progress in this field, challenges persist in handling new combinations of language components, an ability that is essential for compositional generalization. In this paper, we present a case study exploring the use of ChatGPT as a data augmentation technique to enhance compositional generalization in open intent detection tasks. We begin by discussing the limitations of existing benchmarks in evaluating this problem, highlighting the need to construct datasets that address compositional generalization in open intent detection tasks. By incorporating synthetic data generated by ChatGPT into the training process, we demonstrate that our approach can effectively improve model performance. Rigorous evaluation on multiple benchmarks reveals that our method outperforms existing techniques and significantly enhances open intent detection capabilities. Our findings underscore the potential of large language models like ChatGPT for data augmentation in natural language understanding tasks.

Code Repositories

fangyihao/gptaug (Official, PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
open-intent-detection-on-banking-cg | ADB+GPTAUG-F4 | F1 Score: 66.45
open-intent-detection-on-oos-cg | ADB+GPTAUG-F4 | F1 Score: 56.18
open-intent-detection-on-stackoverflow-cg | DA-ADB | F1 Score: 77.77
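
The listing reports only "F1 Score" and does not say how it is averaged. As a rough illustration of how such a number can be computed for open intent detection, the sketch below assumes a macro-averaged F1 over the known intents plus a single "open" label for unseen intents. The label names and toy predictions are hypothetical and are not taken from the paper or its repository.

```python
def per_class_f1(y_true, y_pred, label):
    """F1 for one label, treating it as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label that appears."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(per_class_f1(y_true, y_pred, l) for l in labels) / len(labels)


# Hypothetical toy data: two known intents plus the "open" (unseen) class.
y_true = ["transfer", "transfer", "balance", "open", "open"]
y_pred = ["transfer", "open", "balance", "open", "balance"]

score = macro_f1(y_true, y_pred)
print(f"Macro F1: {100 * score:.2f}")
```

Under this assumption, a model that over-predicts or under-predicts the "open" class is penalized in the same way as one that confuses two known intents, because every label contributes equally to the average.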
