Efficient keyword spotting using time delay neural networks

Samuel Myer; Vikrant Singh Tomar

Abstract

This paper describes a novel method of live keyword spotting using a two-stage time delay neural network. The model is trained using transfer learning: initial training with phone targets from a large speech corpus is followed by training with keyword targets from a smaller dataset. The accuracy of the system is evaluated on two separate tasks. The first is the freely available Google Speech Commands dataset. The second is an in-house task specifically developed for keyword spotting. The results show significant improvements in false accept and false reject rates in both clean and noisy environments when compared with previously known techniques. Furthermore, we investigate various techniques to reduce computation, measured in multiplications per second of audio. Compared to recently published work, the proposed system provides up to 89% savings in computational complexity.
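
To make the cost metric in the abstract concrete, the sketch below shows one generic time delay layer and a count of its multiplications per second of audio. It is an illustration under stated assumptions, not the authors' architecture: the function names, the layer dimensions (40 inputs, 64 outputs), the context offsets (-1, 0, 1), and the rate of 100 frames per second are all hypothetical choices made for this example.

```python
from typing import List, Sequence

Vector = List[float]


def tdnn_layer(frames: Sequence[Vector], weights: Sequence[Vector],
               bias: Vector, offsets: Sequence[int]) -> List[Vector]:
    """One generic time delay layer (hypothetical helper, not from the paper).

    For each valid time step t, the input frames at t + offset are
    concatenated ("spliced") and passed through an affine map and a ReLU.
    Each row of `weights` has length len(offsets) * in_dim.
    """
    start = max(0, -min(offsets))              # first t with full left context
    stop = len(frames) - max(0, max(offsets))  # one past last t with full right context
    outputs = []
    for t in range(start, stop):
        spliced = [x for off in offsets for x in frames[t + off]]
        outputs.append([
            max(0.0, b + sum(w * x for w, x in zip(row, spliced)))
            for row, b in zip(weights, bias)
        ])
    return outputs


def layer_multiplies_per_second(in_dim: int, out_dim: int,
                                offsets: Sequence[int],
                                outputs_per_second: int) -> int:
    """Multiplications per second of audio for one layer.

    Each output frame costs len(offsets) * in_dim * out_dim multiplications,
    so evaluating the layer less often reduces the cost proportionally.
    """
    return len(offsets) * in_dim * out_dim * outputs_per_second


if __name__ == "__main__":
    # Toy input: four 1-dimensional frames, a single output unit summing its context.
    frames = [[1.0], [2.0], [3.0], [4.0]]
    print(tdnn_layer(frames, weights=[[1.0, 1.0, 1.0]], bias=[0.0],
                     offsets=(-1, 0, 1)))

    # Assumed sizes: 40-dim input, 64 outputs, evaluated at 100 and 50 frames/s.
    print(layer_multiplies_per_second(40, 64, (-1, 0, 1), 100))
    print(layer_multiplies_per_second(40, 64, (-1, 0, 1), 50))
```

With these assumed sizes, halving how often the layer is evaluated halves its multiplication count. This is one generic way to trade computation against temporal resolution; the abstract does not specify which reduction techniques the paper uses.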

Benchmarks

Benchmark: keyword-spotting-on-google-speech-commands
Methodology: TDNN
Metrics: 10-keyword Speech Commands dataset: 94.3
