A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Last update: Dec 28, 2022

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website.

These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Disclaimer: The code is a little messy. I'd written this when I was not a Pythonista. If you would like to clean them up and want to make it into a nice interface, feel free to contact me. I will be very pleased to collaborate. If you use them then please cite the source and also mention the credits as listed below. Also, email me with ways to improve, let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Related tags

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Owner

Pulkit Khandelwal

Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation

AdaDM: Enabling Normalization for Image Super-Resolution

Transformer part of 12th place solution in Riiid! Answer Correctness Prediction

Implementation of C-RNN-GAN.

Implemenets the Contourlet-CNN as described in C-CNN: Contourlet Convolutional Neural Networks, using PyTorch

[v1 (ISBI'21) + v2] MedMNIST: A Large-Scale Lightweight Benchmark for 2D and 3D Biomedical Image Classification

NAVER BoostCamp Final Project

Asterisk is a framework to generate high-quality training datasets at scale

mlpack: a scalable C++ machine learning library --

Automatically creates genre collections for your Plex media

The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.

Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

Read and write layered TIFF ImageSourceData and ImageResources tags

Tensorflow 2 implementations of the C-SimCLR and C-BYOL self-supervised visual representation methods from "Compressive Visual Representations" (NeurIPS 2021)

Collaborative forensic timeline analysis

A PyTorch implementation of EventProp [https://arxiv.org/abs/2009.08378], a method to train Spiking Neural Networks

Open source hardware and software platform to build a small scale self driving car.

E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation

Medical Insurance Cost Prediction using Machine earning

Official implementation of our paper "Learning to Bootstrap for Combating Label Noise"