DrQ-v2: Improved Data-Augmented Reinforcement Learning

Last update: Jan 01, 2023

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

DrQ-v2 is a model-free off-policy algorithm for image-based continuous control. DrQ-v2 builds on DrQ, an actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements including:

Switch the base RL learner from SAC to DDPG.
Incorporate n-step returns to estimate TD error.
Introduce a decaying schedule for exploration noise.
Make implementation 3.5 times faster.
Find better hyper-parameters.

These changes allow us to significantly improve sample efficiency and wall-clock training time on a set of challening tasks from the DeepMind Control Suite compared to prior methods. Furthermore, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from pixel observations, previously unattained by model-free RL.

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{yarats2021drqv2,
  title={Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning},
  author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
  journal={arXiv preprint arXiv:},
  year={2021}
}

Instructions

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent:

python train.py task=quadruped_walk

Monitor results:

tensorboard --logdir exp_local

License

The majority of DrQ-v2 is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

Citation

Instructions

License

Owner

Facebook Research

To prepare an image processing model to classify the type of disaster based on the image dataset

Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'

A library of extension and helper modules for Python's data analysis and machine learning libraries.

A minimalist implementation of score-based diffusion model

Securetar - A streaming wrapper around python tarfile and allow secure handling files and support encryption

The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter

Cognate Detection Repository

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space

Download from Onlyfans.com.

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

MaskTrackRCNN for video instance segmentation based on mmdetection

PyTorch implementation of Octave Convolution with pre-trained Oct-ResNet and Oct-MobileNet models

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes (AAAI2022)

Repo for the Video Person Clustering dataset, and code for the associated paper

Employee-Managment - Company employee registration software in the face recognition system

D²Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos

Implementation of the Paper: "Parameterized Hypercomplex Graph Neural Networks for Graph Classification" by Tuan Le, Marco Bertolini, Frank Noé and Djork-Arné Clevert

Boostcamp AI Tech 3rd / Basic Paper reading w.r.t Embedding

Fine-grained Control of Image Caption Generation with Abstract Scene Graphs