BasicRL: easy and fundamental codes for deep reinforcement learning。It is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

Last update: Apr 28, 2022

Overview

BasicRL: easy and fundamental codes for deep reinforcement learning

BasicRL is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

It is developped for beginner in DRL with the following advantages:

Practical: it fills the gap between the theory and practice of DRL.
Easy: the codes is easier than OpenAI Spinning Up in terms of achieving the same functionality.
Lightweight: the core codes <1,500 lines, using Pytorch ans OpenAI Gym.

The following DRL algorithms is contained in BasicRL:

DQN, DoubleDQN, DuelingDQN, NoisyDQN, DistributionalDQN
REINFORCE, VPG, PPO, DDPG, TD3 and SAC
PerDQN, N-step-learning DQN and Rainbow are coming

The differences compared to OpenAI Spinning Up:

Pros: BasicRL is currently can be used on Windows and Linux (it hasn't been extensively tested on OSX). However, Spinning Up is only supported on Linux and OSX.
Cons: OpenMPI is not used in BasicRL so it is slower than Spinning Up.
Others: BasicRL considers an agent as a class.

The differences compared to rainbow-is-all-you-need:

Pros: BasicRL reuse the common codes, so it is lightwight. Besides, BasicRL modifies the form of output and plot, it can use the Spinning Up's log file.
Others: BasicRL uses inheritance of classes, so you can see key differences between each other.

File Structure

BasicRL:

├─pg    
│  └─reinforce/vpg/ppo/ddpg/td3/sac.py    
│  └─utils.py      
│  └─logx.py     
├─pg_cpu     
│  └─reinforce/vpg/ppo/ddpg/td3/sac.py  
│  └─utils.py  
│  └─logx.py  
├─rainbow     
│  └─dqn/double_dqn/dueling_dqn/moisy_dqn/distributional_dqn.py  
│  └─utils.py   
│  └─logx.py   
├─requirements.txt  
└─plot.py

Code Structure

Core code

xxx.py(dqn.py...)

- agent class:
  - init
  - compute loss
  - update
  - get action
  - test agent
  - train
- main

Common code

utils.py

- expereience replay buffer: On-policy/Off-policy replay buffer
- network

logx.py

- Logger
- EpochLogger

plot.py

- plot data
- get datasets
- get all datasets
- make plots
- main

Installation

BasicRL is tested on Anaconda virtual environment with Python3.7+

conda create -n BasicRL python=3.7
conda activate BasicRL

Clone the repository:

git clone [email protected]:RayYoh/BasicRL.git
cd BasicRL

Install required libraries:

pip install -r requirements.txt

BasicRL code library makes local experiments easy to do, and there are two ways to run them: either from the command line, or through function calls in scripts.

Experiment

After testing, Basic RL runs perfectly, but its performance has not been tested. Users can tweak the parameters and change the experimental environment to output final results for comparison. Possible outputs are shown below:

Contribution

BasicRL is not yet complete and I will continue to maintain it. To any interested in making BasicRL better, any contribution is warmly welcomed. If you want to contribute, please send a Pull Request.
If you are not familiar with creating a Pull Request, here are some guides:

Citation

To cite this repository:

@misc{lei,
  author = {Lei Yao},
  title = {BasicRL: easy and fundamental codes for deep reinforcement learning},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/RayYoh/BasicRL}},
}

BasicRL: easy and fundamental codes for deep reinforcement learning。It is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

Related tags

Overview

BasicRL: easy and fundamental codes for deep reinforcement learning

File Structure

Code Structure

Core code

Common code

Installation

Experiment

Contribution

Related Link

Citation

Owner

RayYoh

A3C LSTM Atari with Pytorch plus A3G design

Joint project of the duo Hacker Ninjas

Generative Exploration and Exploitation - This is an improved version of GENE.

Vehicle Detection Using Deep Learning and YOLO Algorithm

PyTorch Lightning + Hydra. A feature-rich template for rapid, scalable and reproducible ML experimentation with best practices. ⚡🔥⚡

It's A ML based Web Site build with python and Django to find the breed of the dog

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Pytorch reimplement of the paper "A Novel Cascade Binary Tagging Framework for Relational Triple Extraction" ACL2020. The original code is written in keras.

Implementation of the state of the art beat-detection, downbeat-detection and tempo-estimation model

Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Official PyTorch implementation for "Low Precision Decentralized Distributed Training with Heterogenous Data"

XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

Plover-tapey-tape: an alternative to Plover’s built-in paper tape

A Fast Monotone Rotating Shallow Water model

3D Generative Adversarial Network

Source code for the BMVC-2021 paper "SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation".

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning