Stochastic Positional Encoding (SPE)

This is the source code repository for the ICML 2021 paper Relative Positional Encoding for Transformers with Linear Complexity by Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang and Gaël Richard.

In this paper, we propose Stochastic Positional Encoding (SPE), which provably behaves like relative PE while being compatible with linear-complexity Transformers. We do this by drawing a connection between positional encoding and cross-covariance structures of correlated Gaussian processes.
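The idea can be illustrated with a small NumPy sketch. It is a simplified toy version written for this README, not the code of the packages in this repository. The function names, the sinusoidal kernel parameterization (`freqs`, `gains`, `phases`) and all numeric values are assumptions chosen for illustration. Queries and keys receive random position features built from the same Gaussian noise, so that their cross-covariance depends only on the lag m - n.

```python
import numpy as np


def sinusoidal_features(positions, freqs, gains, phases=None):
    """Scaled cos/sin features, shape (len(positions), 2 * K)."""
    angles = 2.0 * np.pi * np.outer(positions, freqs)
    if phases is not None:
        angles = angles + phases
    return np.concatenate([gains * np.cos(angles), gains * np.sin(angles)], axis=1)


def draw_stochastic_pe(n_positions, freqs, gains, phases, n_realizations, rng):
    """Draw random position features for queries and keys (toy sketch)."""
    positions = np.arange(n_positions)
    omega_q = sinusoidal_features(positions, freqs, gains, phases)
    omega_k = sinusoidal_features(positions, freqs, gains)
    # Queries and keys share the same Gaussian noise, which is what makes
    # them correlated across positions.
    z = rng.standard_normal((omega_q.shape[1], n_realizations))
    scale = 1.0 / np.sqrt(n_realizations)
    return scale * omega_q @ z, scale * omega_k @ z


def target_kernel(n_positions, freqs, gains, phases):
    """Relative kernel: sum_k gains[k]**2 * cos(2*pi*freqs[k]*(m - n) + phases[k])."""
    idx = np.arange(n_positions)
    lags = idx[:, None] - idx[None, :]
    return sum(
        g ** 2 * np.cos(2.0 * np.pi * f * lags + p)
        for f, g, p in zip(freqs, gains, phases)
    )


# Hypothetical example values, chosen only for this demonstration.
rng = np.random.default_rng(0)
freqs = np.array([0.05, 0.1, 0.2])
gains = np.ones(3)
phases = np.array([0.0, 0.3, -0.5])

q_bar, k_bar = draw_stochastic_pe(16, freqs, gains, phases, 20000, rng)
# The N x N product is formed here only to check the approximation.
estimate = q_bar @ k_bar.T
max_error = np.max(np.abs(estimate - target_kernel(16, freqs, gains, phases)))
print(f"max abs deviation from target kernel: {max_error:.3f}")
```

The deviation shrinks as `n_realizations` grows. Because the relative kernel is only ever represented through the two factors `q_bar` and `k_bar`, the full position-by-position matrix does not need to be materialized, which is what makes this kind of encoding usable with linear-complexity attention.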

Also check out the companion website, which features music examples.

Citation:

@inproceedings{pmlr-v139-liutkus21a,
  title = 	 {Relative Positional Encoding for {Transformers} with Linear Complexity},
  author =       {Liutkus, Antoine and C{\'i}fka, Ond{\v r}ej and Wu, Shih-Lun and {\c S}im{\c s}ekli, Umut and Yang, Yi-Hsuan and Richard, Ga{\"e}l},
  booktitle = 	 {Proceedings of the 38th International Conference on Machine Learning},
  pages = 	 {7067--7079},
  year = 	 {2021},
  editor = 	 {Meila, Marina and Zhang, Tong},
  volume = 	 {139},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {18--24 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v139/liutkus21a/liutkus21a.pdf},
  url = 	 {http://proceedings.mlr.press/v139/liutkus21a.html}
}

SPE implementation

We have implemented SPE in PyTorch and JAX/Flax. Each implementation is available as a separate Python package under src.

Experiments

Each of the three experiments (Long-Range Arena (LRA), pop piano generation, groove continuation) has a dedicated directory under experiments. See the README files there for how to set up the environment and prepare the datasets. To make sure you have the custom dependencies for each experiment, clone this repository with --recurse-submodules, or run git submodule init && git submodule update after cloning.
