HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022

Zhang Yuanhang ; Liang Susan ; Yang Shuang ; Shan Shiguang

UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at
  ActivityNet Challenge 2022

Abstract

This report presents a brief description of our winning solution to the AVAActive Speaker Detection (ASD) task at ActivityNet Challenge 2022. Ourunderlying model UniCon+ continues to build on our previous work, the UnifiedContext Network (UniCon) and Extended UniCon which are designed for robustscene-level ASD. We augment the architecture with a simple GRU-based modulethat allows information of recurring identities to flow across scenes throughread and update operations. We report a best result of 94.47% mAP on theAVA-ActiveSpeaker test set, which continues to rank first on this year'schallenge leaderboard and significantly pushes the state-of-the-art.

Benchmarks

BenchmarkMethodologyMetrics
audio-visual-active-speaker-detection-on-avaUniCon+
validation mean average precision: 94.5%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022 | Papers | HyperAI