HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Broadcasted Residual Learning for Efficient Keyword Spotting

Byeonggeun Kim Simyung Chang Jinkyu Lee Dooyong Sung

Broadcasted Residual Learning for Efficient Keyword Spotting

Abstract

Keyword spotting is an important research field because it plays a key role in device wake-up and user interaction on smart devices. However, it is challenging to minimize errors while operating efficiently in devices with limited resources such as mobile phones. We present a broadcasted residual learning method to achieve high accuracy with small model size and computational load. Our method configures most of the residual functions as 1D temporal convolution while still allows 2D convolution together using a broadcasted-residual connection that expands temporal output to frequency-temporal dimension. This residual mapping enables the network to effectively represent useful audio features with much less computation than conventional convolutional neural networks. We also propose a novel network architecture, Broadcasting-residual network (BC-ResNet), based on broadcasted residual learning and describe how to scale up the model according to the target device's resources. BC-ResNets achieve state-of-the-art 98.0% and 98.7% top-1 accuracy on Google speech command datasets v1 and v2, respectively, and consistently outperform previous approaches, using fewer computations and parameters. Code is available at https://github.com/Qualcomm-AI-research/bcresnet.

Code Repositories

Qualcomm-AI-research/bcresnet
Official
pytorch
Mentioned in GitHub
re9ulus/BC-ResNet
pytorch
Mentioned in GitHub
roman-vygon/BCResNet
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
keyword-spotting-on-google-speech-commandsBC-ResNet-8
Google Speech Commands V1 12: 98.0
Google Speech Commands V2 12: 98.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Broadcasted Residual Learning for Efficient Keyword Spotting | Papers | HyperAI