HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Mixture-of-Linear-Experts for Long-term Time Series Forecasting

Ronghao Ni Zinan Lin Shuaiqi Wang Giulia Fanti

Mixture-of-Linear-Experts for Long-term Time Series Forecasting

Abstract

Long-term time series forecasting (LTSF) aims to predict future values of a time series given the past values. The current state-of-the-art (SOTA) on this problem is attained in some cases by linear-centric models, which primarily feature a linear mapping layer. However, due to their inherent simplicity, they are not able to adapt their prediction rules to periodic changes in time series patterns. To address this challenge, we propose a Mixture-of-Experts-style augmentation for linear-centric models and propose Mixture-of-Linear-Experts (MoLE). Instead of training a single model, MoLE trains multiple linear-centric models (i.e., experts) and a router model that weighs and mixes their outputs. While the entire framework is trained end-to-end, each expert learns to specialize in a specific temporal pattern, and the router model learns to compose the experts adaptively. Experiments show that MoLE reduces forecasting error of linear-centric models, including DLinear, RLinear, and RMLP, in over 78% of the datasets and settings we evaluated. By using MoLE existing linear-centric models can achieve SOTA LTSF results in 68% of the experiments that PatchTST reports and we compare to, whereas existing single-head linear-centric models achieve SOTA results in only 25% of cases.

Code Repositories

rogerni/mole
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
time-series-forecasting-on-electricity-192MoLE-DLinear
MSE: 0.147
time-series-forecasting-on-electricity-336MoLE-DLinear
MSE: 0.162
time-series-forecasting-on-electricity-720MoLE-RMLP
MSE: 0.178
time-series-forecasting-on-electricity-720MoLE-DLinear
MSE: 0.18
time-series-forecasting-on-electricity-96MoLE-DLinear
MSE: 0.131
time-series-forecasting-on-electricity-96MoLE-RMLP
MSE: 0.129
time-series-forecasting-on-etth1-192-1MoLE-DLinear
MSE: 0.453
time-series-forecasting-on-etth1-192-1MoLE-RLinear
MSE: 0.403
time-series-forecasting-on-etth1-336-1MoLE-DLinear
MSE: 0.469
time-series-forecasting-on-etth1-336-1MoLE-RLinear
MSE: 0.43
time-series-forecasting-on-etth1-720-1MoLE-DLinear
MSE: 0.505
time-series-forecasting-on-etth1-720-1MoLE-RLinear
MSE: 0.449
time-series-forecasting-on-etth1-96-1MoLE-DLinear
MSE: 0.377
time-series-forecasting-on-etth1-96-1MoLE-RLinear
MSE: 0.375
time-series-forecasting-on-etth2-192-1MoLE-RLinear
MSE: 0.336
time-series-forecasting-on-etth2-192-1MoLE-DLinear
MSE: 0.362
time-series-forecasting-on-etth2-336-1MoLE-RLinear
MSE: 0.371
time-series-forecasting-on-etth2-336-1MoLE-DLinear
MSE: 0.419
time-series-forecasting-on-etth2-720-1MoLE-DLinear
MSE: 0.605
time-series-forecasting-on-etth2-720-1MoLE-RLinear
MSE: 0.409
time-series-forecasting-on-etth2-96-1MoLE-RLinear
MSE: 0.273
time-series-forecasting-on-etth2-96-1MoLE-DLinear
MSE: 0.287
time-series-forecasting-on-ettm1-192-1MoLE-DLinear
MSE: 0.328
time-series-forecasting-on-ettm1-336-1MoLE-DLinear
MSE: 0.38
time-series-forecasting-on-ettm1-720-1MoLE-DLinear
MSE: 0.447
time-series-forecasting-on-ettm1-96-1MoLE-DLinear
MSE: 0.286
time-series-forecasting-on-ettm2-192-1MoLE-DLinear
MSE: 0.233
time-series-forecasting-on-ettm2-336-1MoLE-DLinear
MSE: 0.289
time-series-forecasting-on-ettm2-720-1MoLE-DLinear
MSE: 0.399
time-series-forecasting-on-ettm2-96-1MoLE-DLinear
MSE: 0.168
time-series-forecasting-on-weather-192MoLE-DLinear
MSE: 0.203
time-series-forecasting-on-weather-192MoLE-RMLP
MSE: 0.19
time-series-forecasting-on-weather-336MoLE-DLinear
MSE: 0.238
time-series-forecasting-on-weather-720MoLE-DLinear
MSE: 0.314
time-series-forecasting-on-weather-96MoLE-DLinear
MSE: 0.147
time-series-forecasting-on-weather2k114-192MoLE-DLinear
MSE: 0.405
time-series-forecasting-on-weather2k114-336MoLE-DLinear
MSE: 0.415
time-series-forecasting-on-weather2k114-720MoLE-DLinear
MSE: 0.425
time-series-forecasting-on-weather2k114-96MoLE-DLinear
MSE: 0.391
time-series-forecasting-on-weather2k1786-192MoLE-RLinear
MSE: 0.581
time-series-forecasting-on-weather2k1786-192MoLE-DLinear
MSE: 0.601
time-series-forecasting-on-weather2k1786-336MoLE-DLinear
MSE: 0.603
time-series-forecasting-on-weather2k1786-720MoLE-RLinear
MSE: 0.628
time-series-forecasting-on-weather2k1786-720MoLE-DLinear
MSE: 0.66
time-series-forecasting-on-weather2k1786-96MoLE-RLinear
MSE: 0.535
time-series-forecasting-on-weather2k1786-96MoLE-DLinear
MSE: 0.535
time-series-forecasting-on-weather2k79-192MoLE-DLinear
MSE: 0.566
time-series-forecasting-on-weather2k79-336MoLE-DLinear
MSE: 0.546
time-series-forecasting-on-weather2k79-720MoLE-DLinear
MSE: 0.535
time-series-forecasting-on-weather2k79-96MoLE-DLinear
MSE: 0.555
time-series-forecasting-on-weather2k850-192MoLE-DLinear
MSE: 0.484
time-series-forecasting-on-weather2k850-336MoLE-DLinear
MSE: 0.474
time-series-forecasting-on-weather2k850-720MoLE-DLinear
MSE: 0.461
time-series-forecasting-on-weather2k850-96MoLE-RLinear
MSE: 0.471
time-series-forecasting-on-weather2k850-96MoLE-DLinear
MSE: 0.474

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Mixture-of-Linear-Experts for Long-term Time Series Forecasting | Papers | HyperAI