HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Modeling Multi-turn Conversation with Deep Utterance Aggregation

Zhuosheng Zhang; Jiangtong Li; Pengfei Zhu; Hai Zhao; Gongshen Liu

Modeling Multi-turn Conversation with Deep Utterance Aggregation

Abstract

Multi-turn conversation understanding is a major challenge for building intelligent dialogue systems. This work focuses on retrieval-based response matching for multi-turn conversation whose related work simply concatenates the conversation utterances, ignoring the interactions among previous utterances for context modeling. In this paper, we formulate previous utterances into context using a proposed deep utterance aggregation model to form a fine-grained context representation. In detail, a self-matching attention is first introduced to route the vital information in each utterance. Then the model matches a response with each refined utterance and the final matching score is obtained after attentive turns aggregation. Experimental results show our model outperforms the state-of-the-art methods on three multi-turn conversation benchmarks, including a newly introduced e-commerce dialogue corpus.

Code Repositories

cooelf/DeepUtteranceAggregation
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
conversational-response-selection-on-douban-1DUA
MAP: 0.551
MRR: 0.599
P@1: 0.421
R10@1: 0.243
R10@2: 0.421
R10@5: 0.780
conversational-response-selection-on-eDUA
R10@1: 0.501
R10@2: 0.700
R10@5: 0.921
conversational-response-selection-on-ubuntu-1DUA
R10@1: 0.752
R10@2: 0.868
R10@5: 0.962

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Modeling Multi-turn Conversation with Deep Utterance Aggregation | Papers | HyperAI