3 months ago

Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis

Zhu Wang, Sourav Medya, Sathya N. Ravi

Abstract

Often, deep network models are purely inductive, both during training and when performing inference on unseen data. As a result, when such models are used for prediction, they are well known to often fail to capture the semantic information and implicit dependencies that exist among objects (or concepts) at the population level. Moreover, it is still unclear how domain or prior modal knowledge can be specified in a backpropagation-friendly manner, especially in large-scale and noisy settings. In this work, we propose an end-to-end vision and language model that incorporates explicit knowledge graphs. We also introduce an interactive out-of-distribution (OOD) layer based on an implicit network operator. The layer is used to filter the noise introduced by the external knowledge base. In practice, we apply our model to several vision and language downstream tasks, including visual question answering, visual reasoning, and image-text retrieval, on different datasets. Our experiments show that it is possible to design models that perform comparably to state-of-the-art results but with significantly fewer samples and less training time.
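The abstract does not describe how the OOD layer is implemented, so the following is only a rough sketch of the general idea of a differentiable outlier filter, not the authors' implicit-operator formulation. It swaps in a much simpler technique: each knowledge-node embedding gets a soft weight from a sigmoid of its distance to the centroid, so noisy nodes are down-weighted smoothly rather than dropped by a hard threshold. The function names, the median-based cutoff, and the `margin` and `temperature` parameters are all assumptions made for illustration.

```python
import math
import statistics


def _sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def soft_outlier_weights(embeddings, margin=2.0, temperature=0.5):
    """Return one weight in (0, 1) per embedding.

    Hypothetical scheme: vectors farther than `margin` times the median
    distance from the centroid receive weights close to 0.
    """
    n = len(embeddings)
    dim = len(embeddings[0])
    centroid = [sum(e[d] for e in embeddings) / n for d in range(dim)]
    dists = [
        math.sqrt(sum((e[d] - centroid[d]) ** 2 for d in range(dim)))
        for e in embeddings
    ]
    cutoff = margin * statistics.median(dists)
    # A sigmoid is a smooth stand-in for the hard test "dist < cutoff".
    return [_sigmoid((cutoff - dist) / temperature) for dist in dists]


def weighted_pool(embeddings, weights):
    """Weighted average of the embeddings, so low-weight nodes barely contribute."""
    total = sum(weights)
    dim = len(embeddings[0])
    return [
        sum(w * e[d] for w, e in zip(weights, embeddings)) / total
        for d in range(dim)
    ]


# Toy knowledge-node embeddings: four similar nodes and one noisy node.
nodes = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1], [5.0, 5.0]]
weights = soft_outlier_weights(nodes)
pooled = weighted_pool(nodes, weights)
```

In this toy example the noisy node receives a weight near zero, so the pooled vector stays close to the four similar nodes. In a deep learning framework the same soft weighting would let gradients flow through the filter, which is the property a hard threshold lacks.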

Code Repositories

ellenzhuwang/VK_OOD
Official
pytorch

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| visual-question-answering-on-ok-vqa | VK-OOD | Accuracy: 52.4 |
| visual-question-answering-on-vqa-v2-test-dev-1 | VK-OOD | Accuracy: 76.8 |
| visual-reasoning-on-nlvr2-dev | VK-OOD | Accuracy: 83.9 |
