HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data

Nick Erickson Jonas Mueller Alexander Shirkov Hang Zhang Pedro Larroy Mu Li Alexander Smola

AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data

Abstract

We introduce AutoGluon-Tabular, an open-source AutoML framework that requires only a single line of Python to train highly accurate machine learning models on an unprocessed tabular dataset such as a CSV file. Unlike existing AutoML frameworks that primarily focus on model/hyperparameter selection, AutoGluon-Tabular succeeds by ensembling multiple models and stacking them in multiple layers. Experiments reveal that our multi-layer combination of many models offers better use of allocated training time than seeking out the best. A second contribution is an extensive evaluation of public and commercial AutoML platforms including TPOT, H2O, AutoWEKA, auto-sklearn, AutoGluon, and Google AutoML Tables. Tests on a suite of 50 classification and regression tasks from Kaggle and the OpenML AutoML Benchmark reveal that AutoGluon is faster, more robust, and much more accurate. We find that AutoGluon often even outperforms the best-in-hindsight combination of all of its competitors. In two popular Kaggle competitions, AutoGluon beat 99% of the participating data scientists after merely 4h of training on the raw data.

Code Repositories

jianzhnie/AutoTabular
pytorch
Mentioned in GitHub
Innixma/autogluon-benchmarking
Official
mxnet
Mentioned in GitHub
daekeun-ml/autogluon-on-aws
Mentioned in GitHub
aws-samples/autogluon-on-aws
Mentioned in GitHub
autogluon/autogluon
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
molecular-property-prediction-on-tox21-1Autogluon
ROC-AUC: 77.84

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data | Papers | HyperAI