Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Last update: Dec 13, 2022

Related tags

Deep Learning auto-drac

Overview

Auto-DrAC: Automatic Data-Regularized Actor-Critic

This is a PyTorch implementation of the methods proposed in

Automatic Data Augmentation for Generalization in Deep Reinforcement Learning by

Roberta Raileanu, Max Goldstein, Denis Yarats, Ilya Kostrikov, and Rob Fergus.

Citation

If you use this code in your own work, please cite our paper:

@article{raileanu2020automatic,
  title={Automatic Data Augmentation for Generalization in Deep Reinforcement Learning},
  author={Raileanu, Roberta and Goldstein, Max and Yarats, Denis and Kostrikov, Ilya and Fergus, Rob},
  journal={arXiv preprint arXiv:2006.12862},
  year={2020}
}

Requirements

The code was run on a GPU with CUDA 10.2. To install all the required dependencies:

conda create -n auto-drac python=3.7
conda activate auto-drac

git clone [email protected]:rraileanu/auto-drac.git
cd auto-drac
pip install -r requirements.txt

git clone https://github.com/openai/baselines.git
cd baselines 
python setup.py install 

pip install procgen

Instructions

cd auto-drac

Train DrAC with crop augmentation on BigFish

python train.py --env_name bigfish --aug_type crop

Train UCB-DrAC on BigFish

python train.py --env_name bigfish --use_ucb

Train RL2-DrAC on BigFish

python train.py --env_name bigfish --use_rl2

Train Meta-DrAC on BigFish

python train.py --env_name bigfish --use_meta

Procgen Results

UCB-DrAC achieves state-of-the-art performance on the Procgen benchmark (easy mode), significantly improving the agent's generalization ability over standard RL methods such as PPO.

Test Results on Procgen

Train Results on Procgen

Agent Videos

You can find some videos of the agent's behavior while training on our website.

Acknowledgements

This code was based on an open sourced PyTorch implementation of PPO.

We also used kornia for some of the augmentations.

Automatic Data-Regularized Actor-Critic (Auto-DrAC)

Related tags

Overview

Auto-DrAC: Automatic Data-Regularized Actor-Critic

Citation

Requirements

Instructions

Train DrAC with crop augmentation on BigFish

Train UCB-DrAC on BigFish

Train RL2-DrAC on BigFish

Train Meta-DrAC on BigFish

Procgen Results

Agent Videos

Acknowledgements

Owner

Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)

DNA-RECON { Automatic Web Reconnaissance Tool }

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

A light and fast one class detection framework for edge devices. We provide face detector, head detector, pedestrian detector, vehicle detector......

SplineConv implementation for Paddle.

Object detection using yolo-tiny model and opencv used as backend

This repo provides code for QB-Norm (Cross Modal Retrieval with Querybank Normalisation)

Distance Encoding for GNN Design

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021

Repository for MeshTalk supplemental material and code once the (already approved) 16 GHS captures our lab will make publicly available are released.

Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

PyTorch implementation for 3D human pose estimation

Shared Attention for Multi-label Zero-shot Learning

一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.

[CVPR 2021] Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Text-Based Ideal Points

Iran Open Source Hackathon