Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Last update: Jan 07, 2023

Related tags

Overview

Decision Transformer

Lili Chen*, Kevin Lu*, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas†, and Igor Mordatch†

*equal contribution, †equal advising

A link to our paper can be found on arXiv.

Overview

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. Contains scripts to reproduce experiments.

Instructions

We provide code in two sub-directories: atari containing code for Atari experiments and gym containing code for OpenAI Gym experiments. See corresponding READMEs in each folder for instructions; scripts should be run from the respective directories. It may be necessary to add the respective directories to your PYTHONPATH.

Citation

Please cite our paper as:

@article{chen2021decisiontransformer,
  title={Decision Transformer: Reinforcement Learning via Sequence Modeling},
  author={Lili Chen and Kevin Lu and Aravind Rajeswaran and Kimin Lee and Aditya Grover and Michael Laskin and Pieter Abbeel and Aravind Srinivas and Igor Mordatch},
  journal={arXiv preprint arXiv:2106.01345},
  year={2021}
}

Note: this is not an official Google or Facebook product.

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Related tags

Overview

Decision Transformer

Overview

Instructions

Citation

Owner

Kevin Lu

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

The PyTorch implementation of paper REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

MMFlow is an open source optical flow toolbox based on PyTorch

Meaningful titles for tabs and PDF downloads! Also supports tab search.

Project code for weakly supervised 3D object detectors using wide-baseline multi-view traffic camera data: WIBAM.

Invert and perturb GAN images for test-time ensembling

Official PyTorch implementation of BlobGAN: Spatially Disentangled Scene Representations

🛰️ List of earth observation companies and job sites

Data reduction pipeline for KOALA on the AAT.

Using OpenAI's CLIP to upscale and enhance images

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution

Pytorch implementation of paper Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

Diabetes-Feature-Engineering - A machine learning model that can predict whether people have diabetes when their characteristics are specified

Quasi-Dense Similarity Learning for Multiple Object Tracking, CVPR 2021 (Oral)

An implementation of the efficient attention module.

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

A tight inclusion function for continuous collision detection

Highway networks implemented in PyTorch.

Biomarker identification for COVID-19 Severity in BALF cells Single-cell RNA-seq data

Add-on for importing and auto setup of character creator 3 character exports.