PyTorch implementation of DCT fast weight RNNs

Last update: Dec 24, 2022

Overview

DCT based fast weights

This repository contains the official code for the paper: Training and Generating Neural Networks in Compressed Weight Space.

The main code includes:

DCT LSTM: LSTMs whose weights are encoded by discrete cosine transform (DCT).
DCT fast weight RNN: RNNs whose weights are encoded by DCT, and the DCT coefficients are parameterized by LSTMs.

The language modeling experiments reported in the paper were produced by porting code (with minor changes due to some clean-up) of this repository in a fork of this toolkit.

Requirements

torch_dct (can be installed via pip install torch_dct)
PyTorch with a version compatible with torch_dct.

Our experiments were conducted using PyTorch version 1.6.0 . More recent versions are apparently not compatible with torch_dct (at least at the time of writing this file). We recommend to run python custom_layer.py to check the compatibility.

References

If you make use of this toolkit for your experiments, please cite:

@inproceedings{irie2021training,
  title={Training and Generating Neural Networks in Compressed Weight Space},
  author={Kazuki Irie and J{\"u}rgen Schmidhuber},
  booktitle={Neural Compression: From Information Theory to Applications -- Workshop @ ICLR 2021},
  year={2021},
  address={Virtual only},
  month=may
}

PyTorch implementation of DCT fast weight RNNs

Related tags

Overview

DCT based fast weights

Requirements

References

Owner

Kazuki Irie

Code for BMVC2021 "MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation"

Help you understand Manual and w/ Clutch point while driving.

Steer OpenAI's Jukebox with Music Taggers

[CVPR 2022] TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

Weakly-supervised object detection.

GAN-based 3D human pose estimation model for 3DV'17 paper

End-to-end image segmentation kit based on PaddlePaddle.

Spherical CNNs

The implementation of the paper "A Deep Feature Aggregation Network for Accurate Indoor Camera Localization".

Using deep learning model to detect breast cancer.

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Clustering is a popular approach to detect patterns in unlabeled data

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

FS-Mol: A Few-Shot Learning Dataset of Molecules

Some code of the implements of Geological Modeling Using 3D Pixel-Adaptive and Deformable Convolutional Neural Network

This repository gives an example on how to preprocess the data of the HECKTOR challenge

[NeurIPS-2021] Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation

piSTAR Lab is a modular platform built to make AI experimentation accessible and fun. (pistar.ai)

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Codes for the ICCV'21 paper "FREE: Feature Refinement for Generalized Zero-Shot Learning"