3 months ago

MUXConv: Information Multiplexing in Convolutional Neural Networks

Zhichao Lu Kalyanmoy Deb Vishnu Naresh Boddeti

Abstract

Convolutional neural networks have witnessed remarkable improvements in computational efficiency in recent years. A key driving force has been the idea of trading-off model expressivity and efficiency through a combination of $1\times 1$ and depth-wise separable convolutions in lieu of a standard convolutional layer. The price of the efficiency, however, is the sub-optimal flow of information across space and channels in the network. To overcome this limitation, we present MUXConv, a layer that is designed to increase the flow of information by progressively multiplexing channel and spatial information in the network, while mitigating computational complexity. Furthermore, to demonstrate the effectiveness of MUXConv, we integrate it within an efficient multi-objective evolutionary algorithm to search for the optimal model hyper-parameters while simultaneously optimizing accuracy, compactness, and computational efficiency. On ImageNet, the resulting models, dubbed MUXNets, match the performance (75.3% top-1 accuracy) and multiply-add operations (218M) of MobileNetV3 while being 1.6$\times$ more compact, and outperform other mobile models in all the three criteria. MUXNet also performs well under transfer learning and when adapted to object detection. On the ChestX-Ray 14 benchmark, its accuracy is comparable to the state-of-the-art while being $3.3\times$ more compact and $14\times$ more efficient. Similarly, detection on PASCAL VOC 2007 is 1.2% more accurate, 28% faster and 6% more compact compared to MobileNetV2. Code is available from https://github.com/human-analysis/MUXConv

Code Repositories

human-analysis/MUXConv

Official

pytorch

Benchmarks

Benchmark	Methodology	Metrics
architecture-search-on-cifar-10-image	MUXNet-m	FLOPS: 200M Params: 2.1M Percentage error: 2.0
image-classification-on-cifar-10	MUXNet-m	Percentage correct: 98.0 Top-1 Accuracy: 98.0
image-classification-on-cifar-100	MUXNet-m	PARAMS: 2.1M Percentage correct: 86.1
image-classification-on-imagenet	MUXNet-s	GFLOPs: 0.234 Number of params: 2.4M Top 1 Accuracy: 71.6%
image-classification-on-imagenet	MUXNet-l	GFLOPs: 0.636 Number of params: 4.0M Top 1 Accuracy: 76.6%
image-classification-on-imagenet	MUXNet-m	GFLOPs: 0.436 Number of params: 3.4M Top 1 Accuracy: 75.3%
image-classification-on-imagenet	MUXNet-xs	GFLOPs: 0.132 Number of params: 1.8M Top 1 Accuracy: 66.7%
neural-architecture-search-on-cifar-10	MUXNet-m	FLOPS: 200M Parameters: 2.1M Top-1 Error Rate: 2.0%
neural-architecture-search-on-cifar-100-1	MUXNet-m	FLOPS: 200M PARAMS: 2.1M Percentage Error: 13.9
neural-architecture-search-on-imagenet	MUXNet-l	Accuracy: 76.6 MACs: 318M Params: 4.0M Top-1 Error Rate: 23.4
neural-architecture-search-on-imagenet	MUXNet-m	Accuracy: 75.3 MACs: 218M Params: 3.4M Top-1 Error Rate: 24.7
neural-architecture-search-on-imagenet	MUXNet-s	Accuracy: 71.6 MACs: 117M Params: 2.4M Top-1 Error Rate: 28.4
neural-architecture-search-on-imagenet	MUXNet-xs	Accuracy: 66.7 MACs: 66M Params: 1.8M Top-1 Error Rate: 33.3
pneumonia-detection-on-chestx-ray14	MUXNet-m	AUROC: 0.841 FLOPS: 200M Params: 2.1M
semantic-segmentation-on-ade20k	MUXNet-m + PPM	Validation mIoU: 35.8
semantic-segmentation-on-ade20k	MUXNet-m + C1	Validation mIoU: 32.42

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette