Command Palette
Search for a command to run...
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
Xu Zunnan ; Lin Yukang ; Han Haonan ; Yang Sicheng ; Li Ronghui ; Zhang Yachao ; Li Xiu

Abstract
Gesture synthesis is a vital realm of human-computer interaction, withwide-ranging applications across various fields like film, robotics, andvirtual reality. Recent advancements have utilized the diffusion model andattention mechanisms to improve gesture synthesis. However, due to the highcomputational complexity of these techniques, generating long and diversesequences with low latency remains a challenge. We explore the potential ofstate space models (SSMs) to address the challenge, implementing a two-stagemodeling strategy with discrete motion priors to enhance the quality ofgestures. Leveraging the foundational Mamba block, we introduce MambaTalk,enhancing gesture diversity and rhythm through multimodal integration.Extensive experiments demonstrate that our method matches or exceeds theperformance of state-of-the-art models.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-face-animation-on-beat2 | MambaTalk | MSE: 6.289 |
| gesture-generation-on-beat2 | MambaTalk | FGD: 0.5366 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.