Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but to get it to work in video games (RL competitions and whatnot), there are some nifty little tricks involved that need bit of expertise in the domain. This includes reward shaping, curriculum learning, splitting task into subtasks by hand and guiding agent's actions. We took some of these tricks and tried them on three environments with DQN. With right setup you get more out of DQN.

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, checkout the repository you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described the respective branches. You should have three directories

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.

Evaluating different engineering tricks that make RL work

Related tags

Overview

Reinforcement Learning Tricks, Index

Owner

Anssi

[CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Codes and scripts for "Explainable Semantic Space by Grounding Languageto Vision with Cross-Modal Contrastive Learning"

Code for Talk-to-Edit (ICCV2021). Paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

salabim - discrete event simulation in Python

A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).

なりすまし検出(anti-spoof-mn3)のWebカメラ向けデモ

PyTorch implementation of the TTC algorithm

This repo is customed for VisDrone.

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

InterfaceGAN++: Exploring the limits of InterfaceGAN

To SMOTE, or not to SMOTE?

Training vision models with full-batch gradient descent and regularization

An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.

Gradient representations in ReLU networks as similarity functions

Modelisation on galaxy evolution using PEGASE-HR

Clockwork Convnets for Video Semantic Segmentation

A commany has recently introduced a new type of bidding, the average bidding, as an alternative to the bid given to the current maximum bidding

This is a five-step framework for the development of intrusion detection systems (IDS) using machine learning (ML) considering model realization, and performance evaluation.

Where2Act: From Pixels to Actions for Articulated 3D Objects