Deep learning based state estimation: incorporating Transformer and LSTM to Kalman Filter with EM algorithm

Overview

Kalman Filter requires the true parameters of the model and solves optimal state estimation recursively. Expectation Maximization (EM) algorithm is applicable for estimating the parameters of the model that are not available before Kalman filtering, which is EM-KF algorithm.
To improve the preciseness of EM-KF algorithm, the author presents a state estimation method by combining the Long-Short Term Memory network (LSTM), Transformer and EM-KF algorithm in the framework of Encoder-Decoder in Sequence to Sequence (seq2seq).
Simulation on a linear mobile robot model demonstrates that the new method is more accurate.
Please read our paper on arXiv: Incorporating Transformer and LSTM to Kalman Filter with EM algorithm for state estimation, for understanding the details w.r.t. theoretical analysis and experiment in our method.

Usage

python main.py

Requirements

The code has been tested running under Python3, with package PyTorch, NumPy, Matplotlib, PyKalman and their dependencies installed.

Methodology

We proposed encoder-decoder framework in seq2seq for state estimation, that state estimation is equivalent to encode and decode observation.

Previous works incorporating LSTM to KF, are adopting LSTM encoder and KF decoder. We proposed LSTM-KF adopting LSTM encoder and EM-KF decoder.
Before EM-KF decoder, replace LSTM encoder by Transformer encoder, we call this Transformer-KF.
Integrating Transformer and LSTM, we call this TL-KF.

Integrating Transformer and LSTM to encode observation before filtering, makes it easier for EM algorithm to estimate parameters.

Conclusions

Combining Transformer and LSTM as an encoder-decoder framework for observation, can depict state more effectively, attenuate noise interference, and weaken the assumption of Markov property of states, and conditional independence of observations. This can enhance the preciseness and robustness of state estimation.
Transformer, based on multi-head self attention and residual connection, can capture long-term dependency, while LSTM-encoder can model time-series. TL-KF, a combination of Transformer, LSTM and EM-KF, is precise for state estimation in systems with unknown parameters.
Kalman smoother can ameliorate Kalman filter, but in TL-KF, filtering is precise enough. Therefore, after offline training for parameter estimation, KF for online estimation can be adopted.

Citation

@article{shi2021kalman,
    author={Zhuangwei Shi},
    title={Incorporating Transformer and LSTM to Kalman Filter with EM algorithm for state estimation},
    journal={arXiv preprint arXiv:2105.00250},
    year={2021},
}

Incorporating Transformer and LSTM to Kalman Filter with EM algorithm

Related tags

Overview

Deep learning based state estimation: incorporating Transformer and LSTM to Kalman Filter with EM algorithm

Overview

Usage

Requirements

Methodology

Conclusions

Citation

Owner

zshicode

A 35mm camera, based on the Canonet G-III QL17 rangefinder, simulated in Python.

Pytorch-diffusion - A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'

Chinese Advertisement Board Identification(Pytorch)

This repository is an implementation of our NeurIPS 2021 paper (Stylized Dialogue Generation with Multi-Pass Dual Learning) in PyTorch.

A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks)

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

CenterFace(size of 7.3MB) is a practical anchor-free face detection and alignment method for edge devices.

Multiple style transfer via variational autoencoder

Repo for "Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks"

RetinaFace: Deep Face Detection Library in TensorFlow for Python

Code for all the Advent of Code'21 challenges mostly written in python

The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.

Implementation of character based convolutional neural network

Randstad Artificial Intelligence Challenge (powered by VGEN). Soluzione proposta da Stefano Fiorucci (anakin87) - primo classificato

Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge

Implementation of QuickDraw - an online game developed by Google, combined with AirGesture - a simple gesture recognition application

Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting

Meshed-Memory Transformer for Image Captioning. CVPR 2020

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain