HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Efficient Dynamic Hard Negative Sampling for Dialogue Selection

{and Honglak Lee. Stanley Jungkyu Choi SeongHwan Kim Jeesoo Bang Hyunkyung Bae Joongbo Shin Dongkyu Lee Janghoon Han}

Efficient Dynamic Hard Negative Sampling for Dialogue Selection

Abstract

Recent studies have demonstrated significant improvements in selection tasks, and a considerable portion of this success is attributed to incorporating informative negative samples during training. While traditional methods for constructing hard negatives provide meaningful supervision, they depend on static samples that do not evolve during training, leading to sub-optimal performance. Dynamic hard negative sampling addresses this limitation by continuously adapting to the model’s changing state throughout training. However, the high computational demands of this method restrict its applicability to certain model architectures. To overcome these challenges, we introduce an efficient dynamic hard negative sampling (EDHNS). EDHNS enhances efficiency by pre-filtering easily discriminable negatives, thereby reducing the number of candidates the model needs to compute during training. Additionally, it excludes question-candidate pairs where the model already exhibits high confidence from loss computations, further reducing training time. These approaches maintain learning quality while minimizing computation and streamlining the training process. Extensive experiments on DSTC9, DSTC10, Ubuntu, and E-commerce benchmarks demonstrate that EDHNS significantly outperforms baseline models, proving its effectiveness in dialogue selection tasks.

Benchmarks

BenchmarkMethodologyMetrics
conversational-response-selection-on-eBERT-FP+EDHNS
R10@1: 0.957
R10@2: 0.986
R10@5: 0.997
conversational-response-selection-on-ubuntu-1BERT-FP+EDHNS
R10@1: 0.917
R10@2: 0.965
R10@5: 0.994

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Efficient Dynamic Hard Negative Sampling for Dialogue Selection | Papers | HyperAI