HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions

Yan Ru Pei

Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions

Abstract

We introduce Centaurus, a class of networks composed of generalized state-space model (SSM) blocks, where the SSM operations can be treated as tensor contractions during training. The optimal order of tensor contractions can then be systematically determined for every SSM block to maximize training efficiency. This allows more flexibility in designing SSM blocks beyond the depthwise-separable configuration commonly implemented. The new design choices will take inspiration from classical convolutional blocks including group convolutions, full convolutions, and bottleneck blocks. We architect the Centaurus network with a mixture of these blocks, to balance between network size and performance, as well as memory and computational efficiency during both training and inference. We show that this heterogeneous network design outperforms its homogeneous counterparts in raw audio processing tasks including keyword spotting, speech denoising, and automatic speech recognition (ASR). For ASR, Centaurus is the first network with competitive performance that can be made fully state-space based, without using any nonlinear recurrence (LSTMs), explicit convolutions (CNNs), or (surrogate) attention mechanism. The source code is available as supplementary material on https://openreview.net/forum?id=PkpNRmBZ32

Benchmarks

BenchmarkMethodologyMetrics
speech-enhancement-on-demandCentaurus (0.51M)
PESQ (wb): 3.25
speech-recognition-on-librispeech-test-cleanCentaurus (30 M)
Word Error Rate (WER): 4.4
speech-recognition-on-speech-commands-2Centaurus
Accuracy (%): 98.53

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions | Papers | HyperAI