HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Estimating Missing Data in Temporal Data Streams Using Multi-directional Recurrent Neural Networks

Jinsung Yoon; William R. Zame; Mihaela van der Schaar

Estimating Missing Data in Temporal Data Streams Using Multi-directional Recurrent Neural Networks

Abstract

Missing data is a ubiquitous problem. It is especially challenging in medical settings because many streams of measurements are collected at different - and often irregular - times. Accurate estimation of those missing measurements is critical for many reasons, including diagnosis, prognosis and treatment. Existing methods address this estimation problem by interpolating within data streams or imputing across data streams (both of which ignore important information) or ignoring the temporal aspect of the data and imposing strong assumptions about the nature of the data-generating process and/or the pattern of missing data (both of which are especially problematic for medical data). We propose a new approach, based on a novel deep learning architecture that we call a Multi-directional Recurrent Neural Network (M-RNN) that interpolates within data streams and imputes across data streams. We demonstrate the power of our approach by applying it to five real-world medical datasets. We show that it provides dramatically improved estimation of missing measurements in comparison to 11 state-of-the-art benchmarks (including Spline and Cubic Interpolations, MICE, MissForest, matrix completion and several RNN methods); typical improvements in Root Mean Square Error are between 35% - 50%. Additional experiments based on the same five datasets demonstrate that the improvements provided by our method are extremely robust.

Code Repositories

jsyoon0823/MRNN
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
multivariate-time-series-imputation-onM-RNN
MAE (PM2.5): 14.24
multivariate-time-series-imputation-on-1M-RNN
MAE (10% of data as GT): 0.451
multivariate-time-series-imputation-on-uciM-RNN
MAE (10% missing): 0.248

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Estimating Missing Data in Temporal Data Streams Using Multi-directional Recurrent Neural Networks | Papers | HyperAI