Hello Edge: Keyword Spotting on Microcontrollers

Yundong Zhang; Naveen Suda; Liangzhen Lai; Vikas Chandra

Abstract

Keyword spotting (KWS) is a critical component for enabling speech-based user interactions on smart devices. It requires real-time response and high accuracy for a good user experience. Recently, neural networks have become an attractive choice for KWS architectures because of their superior accuracy compared to traditional speech processing algorithms. Due to its always-on nature, a KWS application has a highly constrained power budget and typically runs on tiny microcontrollers with limited memory and compute capability. The design of a neural network architecture for KWS must consider these constraints. In this work, we evaluate and explore neural network architectures for running KWS on resource-constrained microcontrollers. We train various neural network architectures for keyword spotting published in the literature to compare their accuracy and memory/compute requirements. We show that it is possible to optimize these neural network architectures to fit within the memory and compute constraints of microcontrollers without sacrificing accuracy. We further explore the depthwise separable convolutional neural network (DS-CNN) and compare it against other neural network architectures. DS-CNN achieves an accuracy of 95.4%, which is ~10% higher than the DNN model with a similar number of parameters.
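
The abstract compares models by parameter count, and the depthwise separable convolution is the building block that keeps DS-CNN small. As a rough illustration only, the sketch below counts the weights of a standard 2-D convolution against a depthwise separable one (a per-channel depthwise filter followed by a 1x1 pointwise convolution). The layer shape used here (3x3 kernel, 64 input and 64 output channels) and the function names are hypothetical and are not taken from the paper; biases are ignored.

```python
def standard_conv_params(k_h, k_w, c_in, c_out):
    """Weight count of a standard 2-D convolution (biases ignored)."""
    return k_h * k_w * c_in * c_out


def ds_conv_params(k_h, k_w, c_in, c_out):
    """Weight count of a depthwise separable convolution (biases ignored).

    Depthwise step: one k_h x k_w filter per input channel.
    Pointwise step: a 1x1 convolution mixing c_in channels into c_out.
    """
    depthwise = k_h * k_w * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


# Hypothetical layer shape, chosen only for illustration.
std = standard_conv_params(3, 3, 64, 64)
ds = ds_conv_params(3, 3, 64, 64)
print(f"standard: {std} weights, depthwise separable: {ds} weights")
print(f"reduction factor: {std / ds:.2f}x")
```

For this assumed layer the separable form needs several times fewer weights, which is the kind of saving that matters under a microcontroller's memory budget.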

Code Repositories

robin-ai-ml/ML-KWS-for-MCU (tf; mentioned in GitHub)
Jaskaran197/KWS-TF-MCU (tf; mentioned in GitHub)
Paulonso/ML-KWS-for-MCU (tf; mentioned in GitHub)
magahub/KWS-ARM (tf; mentioned in GitHub)
ARM-software/ML-KWS-for-MCU (official; tf; mentioned in GitHub)
UT2UH/ML-KWS-for-ESP32 (tf; mentioned in GitHub)
hustxsl/C-ML-KWS-for-MCU (tf; mentioned in GitHub)
fracorti/reds (tf; mentioned in GitHub)
widzemin/audio_project (tf; mentioned in GitHub)
Paulonso/ARM-software (tf; mentioned in GitHub)
jayant766/MIDAS-IIITD (tf; mentioned in GitHub)
etrommer/dcsr (tf; mentioned in GitHub)
BgLoveXixi/Keyword_recognition (tf; mentioned in GitHub)
Lebhoryi/ML-KWS-for-MCU (tf; mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
keyword-spotting-on-google-speech-commands | LSTM | Google Speech Commands V1 12: 92.9
keyword-spotting-on-google-speech-commands | GRU | Google Speech Commands V1 12: 93.5
keyword-spotting-on-google-speech-commands | DNN | Google Speech Commands V1 12: 91.6
keyword-spotting-on-google-speech-commands | Basic LSTM | Google Speech Commands V1 12: 92.0
keyword-spotting-on-google-speech-commands | CNN | Google Speech Commands V1 12: 84.6
keyword-spotting-on-google-speech-commands | DS-CNN | Google Speech Commands V1 12: 94.4
