DrQ-v2: Improved Data-Augmented Reinforcement Learning

Last update: Jan 01, 2023

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

DrQ-v2 is a model-free off-policy algorithm for image-based continuous control. DrQ-v2 builds on DrQ, an actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements including:

Switch the base RL learner from SAC to DDPG.
Incorporate n-step returns to estimate TD error.
Introduce a decaying schedule for exploration noise.
Make implementation 3.5 times faster.
Find better hyper-parameters.

These changes allow us to significantly improve sample efficiency and wall-clock training time on a set of challenging tasks from the DeepMind Control Suite compared to prior methods. Furthermore, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from pixel observations, previously unattained by model-free RL.
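Two of the improvements listed above, n-step returns and a decaying exploration-noise schedule, can be illustrated with a short sketch. This is not the repository's code: the function names `n_step_return` and `linear_schedule`, the linear shape of the decay, and all numeric values are assumptions chosen for illustration only.

```python
def n_step_return(rewards, bootstrap_value, discount):
    """Compute an n-step TD target.

    target = r_0 + g*r_1 + ... + g^(n-1)*r_(n-1) + g^n * bootstrap_value,
    where bootstrap_value stands in for the critic's estimate at the
    state reached after n steps (hypothetical input here).
    """
    target = bootstrap_value
    # Fold rewards in from the last step back to the first.
    for reward in reversed(rewards):
        target = reward + discount * target
    return target


def linear_schedule(init, final, duration, step):
    """Linearly interpolate from init to final over `duration` steps.

    A linear decay is an assumption; the text above only says the
    exploration noise follows a decaying schedule.
    """
    frac = min(max(step / duration, 0.0), 1.0)
    return (1.0 - frac) * init + frac * final


if __name__ == "__main__":
    # Illustrative values only, not the paper's hyper-parameters.
    print(n_step_return([1.0, 1.0, 1.0], bootstrap_value=10.0, discount=0.5))
    for step in (0, 50, 100, 200):
        print(step, linear_schedule(1.0, 0.1, duration=100, step=step))
```

In a DDPG-style learner, the n-step target would replace the usual one-step target in the critic loss, and the scheduled value would set the standard deviation of the noise added to the actor's action during data collection.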

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{yarats2021drqv2,
  title={Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning},
  author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
  journal={arXiv preprint arXiv:},
  year={2021}
}

Instructions

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent:

python train.py task=quadruped_walk

Monitor results:

tensorboard --logdir exp_local

License

The majority of DrQ-v2 is licensed under the MIT license; however, portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

Owner

Facebook Research
