A Dilated Inception Network for Visual Saliency Prediction

Sheng Yang; Guosheng Lin; Qiuping Jiang; Weisi Lin

Abstract

With the advent of deep convolutional neural networks (DCNNs), visual saliency prediction has improved markedly. One promising direction for further improvement is to fully characterize the multi-scale factors that influence saliency using a computationally friendly module within DCNN architectures. In this work, we propose an end-to-end dilated inception network (DINet) for visual saliency prediction. It captures multi-scale contextual features effectively with very few extra parameters. Instead of using parallel standard convolutions with different kernel sizes, as in the existing inception module, our proposed dilated inception module (DIM) uses parallel dilated convolutions with different dilation rates, which significantly reduces the computational load while enriching the diversity of receptive fields in the feature maps. Moreover, the performance of our saliency model is further improved by using a set of linear normalization-based probability distribution distance metrics as loss functions. This lets us formulate saliency prediction as a probability distribution prediction task for global saliency inference, rather than as a typical pixel-wise regression problem. Experimental results on several challenging saliency benchmark datasets demonstrate that DINet with the proposed loss functions achieves state-of-the-art performance with shorter inference time.
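The two ideas in the abstract can be illustrated with a little arithmetic. The sketch below is not the authors' implementation. It assumes 3x3 kernels, hypothetical dilation rates of 4, 8 and 16, a hypothetical channel width of 256, and KL divergence as one example of a distribution distance; none of these specifics are stated in the abstract. All function names are illustrative.

```python
import math


def effective_kernel_size(kernel_size, dilation):
    """Side length of the area a dilated kernel spans: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def conv_weight_count(kernel_size, in_channels, out_channels):
    """Weights in one 2-D convolution, bias ignored. Dilation does not change this."""
    return kernel_size * kernel_size * in_channels * out_channels


def linear_normalize(saliency_map, eps=1e-8):
    """Divide every value by the map's sum so it reads as a probability
    distribution. Assumes non-negative inputs."""
    total = sum(sum(row) for row in saliency_map)
    return [[value / (total + eps) for value in row] for row in saliency_map]


def kl_divergence(pred, target, eps=1e-8):
    """KL(target || pred), computed after linearly normalizing both maps."""
    p = linear_normalize(pred, eps)
    q = linear_normalize(target, eps)
    return sum(
        qv * math.log(eps + qv / (pv + eps))
        for p_row, q_row in zip(p, q)
        for pv, qv in zip(p_row, q_row)
    )


if __name__ == "__main__":
    channels = 256  # hypothetical channel width
    for dilation in (4, 8, 16):  # hypothetical dilation rates
        span = effective_kernel_size(3, dilation)
        dilated = conv_weight_count(3, channels, channels)
        standard = conv_weight_count(span, channels, channels)
        print(f"dilation={dilation}: spans {span}x{span}, "
              f"{dilated} weights vs {standard} for a standard kernel of that size")

    uniform = [[1.0, 1.0], [1.0, 1.0]]
    peaked = [[4.0, 0.0], [0.0, 0.0]]
    print("KL(peaked || uniform) =", kl_divergence(uniform, peaked))
```

A 3x3 kernel keeps the same nine weights per channel pair at any dilation rate, while a standard kernel covering the same span grows quadratically with its side length. The loss operates on whole normalized maps, which is what makes the task a distribution prediction rather than a per-pixel regression.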

Code Repositories

ysyscool/DINet (official, TensorFlow; mentioned in GitHub)

Benchmarks

Benchmark: video-saliency-detection-on-msu-video
Methodology: DINet
Metrics:
AUC-J: 0.858
CC: 0.671
FPS: 4.85
KLDiv: 0.575
NSS: 1.85
SIM: 0.592
