Do you want a RL agent nicely moving on Atari?

Rainbow is all you need!

This is a step-by-step tutorial from DQN to Rainbow. Every chapter contains both of theoretical backgrounds and object-oriented implementation. Just pick any topic in which you are interested, and learn! You can execute them right away with Colab even on your smartphone.

Please feel free to open an issue or a pull-request if you have any idea to make it better. :)

If you want a tutorial for policy gradient methods, please see PG is All You Need.

DQN [NBViewer] [Colab]
DoubleDQN [NBViewer] [Colab]
PrioritizedExperienceReplay [NBViewer] [Colab]
DuelingNet [NBViewer] [Colab]
NoisyNet [NBViewer] [Colab]
CategoricalDQN [NBViewer] [Colab]
N-stepLearning [NBViewer] [Colab]
Rainbow [NBViewer] [Colab]

Prerequisites

This repository is tested on Anaconda virtual environment with python 3.7+

$ conda create -n rainbow-is-all-you-need python=3.7
$ conda activate rainbow-is-all-you-need

Installation

First, clone the repository.

git clone https://github.com/Curt-Park/rainbow-is-all-you-need.git
cd rainbow-is-all-you-need

Secondly, install packages required to execute the code. Just type:

make setup

Contributors

Thanks goes to these wonderful people (emoji key):

_{Jinwoo Park (Curt)}

_{Kyunghwan Kim}

_{Wei Chen}

_{WANG Lei}

_leeyaf

_ahmadF

This project follows the all-contributors specification. Contributions of any kind welcome!

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Related tags

Overview

Rainbow is all you need!

Contents

Prerequisites

Installation

Related Papers

Contributors

Owner

Jinwoo Park (Curt)

Implementation of popular bandit algorithms in batch environments.

The dataset of tweets pulling from Twitters with keyword: Hydroxychloroquine, location: US, Time: 2020

Code for the Active Speakers in Context Paper (CVPR2020)

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

Code and Experiments for ACL-IJCNLP 2021 Paper Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering.

Qt-GUI implementation of the YOLOv5 algorithm (ver.6 and ver.5)

Semi-supervised Implicit Scene Completion from Sparse LiDAR

Joint detection and tracking model named DEFT, or ``Detection Embeddings for Tracking.

MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python

[IEEE Transactions on Computational Imaging] Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Attention-guided gan for synthesizing IR images

Refactoring dalle-pytorch and taming-transformers for TPU VM

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

BRepNet: A topological message passing system for solid models

TextBPN Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Official Implementation of "Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras"

RSNA Intracranial Hemorrhage Detection with python

This GitHub repo consists of Code and Some results of project- Diabetes Treatment using Gold nanoparticles. These Consist of ML Models used for prediction Diabetes and further the basic theory and working of Gold nanoparticles.

a reimplementation of Holistically-Nested Edge Detection in PyTorch

EGNN - Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch