Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Last update: Nov 15, 2022

Overview

Recurrent Fast Weight Programmers

This is the official repository containing the code we used to produce the experimental results reported in the paper:

Going Beyond Linear Transformers with Recurrent Fast Weight Programmers

algorithmic directory for code execution and ListOps
language_modeling directory for language modeling
reinforcement_learning directory for RL

Separate license files can be found under each directory.

General instructions

Please refer to the readme file in each directory for further instructions.

In all tasks, our custom CUDA kernels will be automatically compiled. To avoid recompiling the code multiple times, we recommend to specify the path to a directory to store the compiled code via:

export TORCH_EXTENSIONS_DIR="/home/me/torch_extensions/lm"

Such a line is already included in the example scripts we provide. Please change the path to a safe directory of your choice.

Important: separate paths should be used for different tasks (i.e. here, one for language modeling, one for code execution, one for ListOps, and one for RL).

BibTex

@article{irie2021going,
      title={Going Beyond Linear Transformers with Recurrent Fast Weight Programmers}, 
      author={Kazuki Irie and Imanol Schlag and R\'obert Csord\'as and J\"urgen Schmidhuber},
      journal={Preprint arXiv:2106.06295},
      year={2021}
}

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Related tags

Overview

Recurrent Fast Weight Programmers

Contents

General instructions

BibTex

Links

Owner

IDSIA

Repo for CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

Deep Surface Reconstruction from Point Clouds with Visibility Information

SymmetryNet: Learning to Predict Reflectional and Rotational Symmetries of 3D Shapes from Single-View RGB-D Images

Snscrape-jsonl-urls-extractor - Extracts urls from jsonl produced by snscrape

Codebase for Image Classification Research, written in PyTorch.

Code for NeurIPS 2021 paper 'Spatio-Temporal Variational Gaussian Processes'

DNA sequence classification by Deep Neural Network

Generate vibrant and detailed images using only text.

Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)

Source code of the paper "Deep Learning of Latent Variable Models for Industrial Process Monitoring".

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

PyGCL: A PyTorch Library for Graph Contrastive Learning

Context Axial Reverse Attention Network for Small Medical Objects Segmentation

Julia and Matlab codes to simulated all problems in El-Hachem, McCue and Simpson (2021)

A boosting-based Multiple Instance Learning (MIL) package that includes MIL-Boost and MCIL-Boost

repro_eval is a collection of measures to evaluate the reproducibility/replicability of system-oriented IR experiments

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

NeurIPS 2021 paper 'Representation Learning on Spatial Networks' code