Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting

Last update: Jan 08, 2023

Related tags

Overview

Autoformer (NeurIPS 2021)

Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting

Time series forecasting is a critical demand for real applications. Enlighted by the classic time series analysis and stochastic process theory, we propose the Autoformer as a general series forecasting model [paper]. Autoformer goes beyond the Transformer family and achieves the series-wise connection for the first time.

In long-term forecasting, Autoformer achieves SOTA, with a 38% relative improvement on six benchmarks, covering five practical applications: energy, traffic, economics, weather and disease.

Autoformer vs. Transformers

1. Deep decomposition architecture

We renovate the Transformer as a deep decomposition architecture, which can progressively decompose the trend and seasonal components during the forecasting process.

Figure 1. Overall architecture of Autoformer.

2. Series-wise Auto-Correlation mechanism

Inspired by the stochastic process theory, we design the Auto-Correlation mechanism, which can discover period-based dependencies and aggregate the information at the series level. This empowers the model with inherent log-linear complexity. This series-wise connection contrasts clearly from the previous self-attention family.

Figure 2. Auto-Correlation mechansim.

Get Started

Install Python 3.6, PyTorch 1.9.0.
Download data. You can obtain all the six benchmarks from Tsinghua Cloud or Google Drive. All the datasets are well pre-processed and can be used easily.
Train the model. We provide the experiment scripts of all benchmarks under the folder ./scripts. You can reproduce the experiment results by:

bash ./scripts/ETT_script/Autoformer_ETTm1.sh
bash ./scripts/ECL_script/Autoformer.sh
bash ./scripts/Exchange_script/Autoformer.sh
bash ./scripts/Traffic_script/Autoformer.sh
bash ./scripts/Weather_script/Autoformer.sh
bash ./scripts/ILI_script/Autoformer.sh

Sepcial-designed implementation

Speedup Auto-Correlation: We built the Auto-Correlation mechanism as a batch-normalization-style block to make it more memory-access friendly. See the paper for details.
Without the position embedding: Since the series-wise connection will inherently keep the sequential information, Autoformer does not need the position embedding, which is different from Transformers.

Main Results

We experiment on six benchmarks, covering five main-stream applications. We compare our model with ten baselines, including Informer, N-BEATS, etc. Generally, for the long-term forecasting setting, Autoformer achieves SOTA, with a 38% relative improvement over previous baselines.

Citation

If you find this repo useful, please cite our paper.

@inproceedings{wu2021autoformer,
  title={Autoformer: Decomposition Transformers with {Auto-Correlation} for Long-Term Series Forecasting},
  author={Haixu Wu and Jiehui Xu and Jianmin Wang and Mingsheng Long},
  booktitle={Advances in Neural Information Processing Systems},
  year={2021}
}

Contact

If you have any question or want to use the code, please contact [email protected] .

Acknowledgement

We appreciate the following github repos a lot for their valuable code base or datasets:

https://github.com/zhouhaoyi/Informer2020

https://github.com/zhouhaoyi/ETDataset

https://github.com/laiguokun/multivariate-time-series-data

Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting

Related tags

Overview

Autoformer (NeurIPS 2021)

Autoformer vs. Transformers

Get Started

Main Results

Citation

Contact

Acknowledgement

Owner

THUML @ Tsinghua University

All course materials for the Zero to Mastery Deep Learning with TensorFlow course.

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Official Implementation of DDOD (Disentangle your Dense Object Detector), ACM MM2021

You Only 👀 One Sequence

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

Activity tragle - Google is tracking everything, we just look at it

A simple Neural Network that predicts the label for a series of handwritten digits

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

OCR Post Correction for Endangered Language Texts

This is a collection of simple PyTorch implementations of neural networks and related algorithms. These implementations are documented with explanations,

RANZCR-CLiP 7th Place Solution

Codes for the AAAI'22 paper "TransZero: Attribute-guided Transformer for Zero-Shot Learning"

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Implementation of Axial attention - attending to multi-dimensional data efficiently

Space robot - (Course Project) Using the space robot to capture the target satellite that is disabled and spinning, then stabilize and fix it up

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Replication attempt for the Protein Folding Model

A basic neural network for image segmentation.