Abstract
Many real-world applications require the prediction of long sequence
time-series, such as electricity consumption planning. Long sequence
time-series forecasting (LSTF) demands a high prediction capacity of the model,
which is the ability to capture precise long-range dependency coupling between
output and input efficiently. Recent studies have shown the potential of
Transformer to increase the prediction capacity. However, there are several
severe issues with Transformer that prevent it from being directly applicable
to LSTF, including quadratic time complexity, high memory usage, and the
inherent limitations of the encoder-decoder architecture. To address these issues, we
design an efficient transformer-based model for LSTF, named Informer, with
three distinctive characteristics: (i) a $ProbSparse$ self-attention mechanism,
which achieves $O(L \log L)$ time complexity and memory usage and has
comparable performance on sequences' dependency alignment; (ii) self-attention
distilling, which highlights dominating attention by halving the cascading
layer input and efficiently handles extremely long input sequences; (iii) a
generative-style decoder, which, while conceptually simple, predicts long
time-series sequences in one forward operation rather than step by step,
drastically improving the inference speed of long-sequence predictions.
Extensive experiments on four large-scale datasets demonstrate that Informer
significantly outperforms existing methods and provides a new solution to the
LSTF problem.
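The core idea behind the ProbSparse mechanism described above can be sketched in a few lines: rank queries by a max-minus-mean "sparsity" score estimated on a random sample of keys, give only the top-scoring queries a full softmax attention pass, and let the remaining "lazy" queries default to the mean of the values. The sketch below is a hypothetical, simplified single-head NumPy illustration of that selection scheme, not the paper's implementation; the function name, the sampling sizes, and the mean-of-V fallback are all assumptions for illustration.

```python
import numpy as np

def probsparse_attention(Q, K, V, u=None, sample_k=None, rng=None):
    """Simplified sketch of ProbSparse-style self-attention.

    Only the top-u "active" queries, ranked by a max-minus-mean score
    over a random sample of keys, receive full softmax attention; the
    remaining lazy queries fall back to the mean of V.
    """
    rng = np.random.default_rng(0) if rng is None else rng
    L_q, d = Q.shape
    L_k = K.shape[0]
    # O(log L) selection sizes, giving O(L log L) overall cost.
    u = max(1, int(np.ceil(np.log(L_q)))) if u is None else u
    sample_k = max(1, int(np.ceil(np.log(L_k)))) if sample_k is None else sample_k

    # Score each query on a sampled subset of keys: M = max - mean.
    idx = rng.choice(L_k, size=sample_k, replace=False)
    scores_sample = Q @ K[idx].T / np.sqrt(d)      # shape (L_q, sample_k)
    M = scores_sample.max(axis=1) - scores_sample.mean(axis=1)
    top = np.argsort(M)[-u:]                       # indices of active queries

    # Lazy queries output the mean of V; active queries get full attention.
    out = np.tile(V.mean(axis=0), (L_q, 1))
    s = Q[top] @ K.T / np.sqrt(d)                  # shape (u, L_k)
    w = np.exp(s - s.max(axis=1, keepdims=True))   # stable softmax
    w /= w.sum(axis=1, keepdims=True)
    out[top] = w @ V
    return out
```

Because only about $\log L$ queries attend to all keys while the rest take a constant-time fallback, the dominant cost drops from the full $O(L^2)$ of standard self-attention toward the $O(L \log L)$ regime the abstract refers to.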
Description
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting
Links and resources
Tags
community