A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU

Guanhua Chen, Yutong Yao, Derek F. Wong, Lidia S. Chao

Abstract

Multi-intent natural language understanding (NLU) is a formidable challenge because multiple intents within a single utterance confuse the model. Previous works train the model contrastively to increase the margin between different multi-intent labels, but they are poorly suited to the nuances of multi-intent NLU: they ignore the rich information carried by shared intents, which helps construct a better embedding space, especially in low-data scenarios. We introduce a two-stage Prediction-Aware Contrastive Learning (PACL) framework for multi-intent NLU to harness this knowledge. Our approach exploits shared intent information by combining word-level pre-training with prediction-aware contrastive fine-tuning. We first construct a pre-training dataset using a word-level data augmentation strategy. Then, during contrastive fine-tuning, our framework dynamically assigns roles to instances and applies a prediction-aware contrastive loss to maximize the impact of contrastive learning. Experimental results and empirical analysis on three widely used datasets show that our method outperforms three prominent baselines in both low-data and full-data scenarios.
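To make the idea of using shared intents in a contrastive objective concrete, here is a minimal sketch. It is not the paper's PACL loss: the abstract does not specify the prediction-aware term or the dynamic role assignment, so both are omitted. The sketch assumes a soft supervised contrastive loss in which each pair of utterances is weighted by the Jaccard overlap of their intent sets. The function names, the Jaccard weighting, and the `temperature` default are all assumptions made for illustration.

```python
import math


def jaccard(a, b):
    """Overlap between two intent sets, in [0, 1] (assumed pair weight)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v)


def shared_intent_contrastive_loss(embeddings, intent_sets, temperature=0.1):
    """Illustrative loss: pull together utterances that share intents.

    For each anchor i, every other instance j contributes the negative
    log-softmax of their similarity, weighted by the intent overlap of
    (i, j). Anchors that share no intent with any other instance are skipped.
    """
    n = len(embeddings)
    losses = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        if not others:
            continue
        logits = {
            j: cosine(embeddings[i], embeddings[j]) / temperature for j in others
        }
        # Log-sum-exp with max subtraction for numerical stability.
        peak = max(logits.values())
        log_denom = peak + math.log(
            sum(math.exp(value - peak) for value in logits.values())
        )
        weights = {j: jaccard(intent_sets[i], intent_sets[j]) for j in others}
        total_weight = sum(weights.values())
        if total_weight == 0:
            continue
        weighted_log_prob = sum(
            w * (logits[j] - log_denom) for j, w in weights.items()
        )
        losses.append(-weighted_log_prob / total_weight)
    return sum(losses) / len(losses) if losses else 0.0


# Toy batch: utterances 0 and 1 share intent "a"; utterance 2 is unrelated.
intents = [{"a", "b"}, {"a"}, {"c"}]
aligned = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]     # shared intents are close
misaligned = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]  # shared intents are far
loss_aligned = shared_intent_contrastive_loss(aligned, intents)
loss_misaligned = shared_intent_contrastive_loss(misaligned, intents)
```

In this toy batch the loss is lower when utterances sharing an intent sit close together in the embedding space, which is the behavior a shared-intent contrastive objective is meant to reward. In practice such a loss would be written in a tensor library and backpropagated through the encoder.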

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| intent-detection-on-mixatis | TFMN (PACL) | Accuracy: 82.9 |
| intent-detection-on-mixatis | RoBERTa (PACL) | Accuracy: 79.1 |
| intent-detection-on-mixatis | SLIM (PACL) | Accuracy: 81.9 |
| intent-detection-on-mixsnips | RoBERTa (PACL) | Accuracy: 96.5 |
| intent-detection-on-mixsnips | SLIM (PACL) | Accuracy: 96.9 |
| intent-detection-on-mixsnips | TFMN (PACL) | Accuracy: 97.4 |
| slot-filling-on-mixatis | TFMN (PACL) | Micro F1: 86.7 |
| slot-filling-on-mixatis | RoBERTa (PACL) | Micro F1: 86.0 |
| slot-filling-on-mixatis | SLIM (PACL) | Micro F1: 87.3 |
| slot-filling-on-mixsnips | RoBERTa (PACL) | Micro F1: 96.2 |
| slot-filling-on-mixsnips | TFMN (PACL) | Micro F1: 96.3 |
| slot-filling-on-mixsnips | SLIM (PACL) | Micro F1: 96.8 |
