Official repository for: Continuous Control With Ensemble DeepDeterministic Policy Gradients

Last update: Dec 06, 2021

Related tags

Overview

Continuous Control With Ensemble Deep Deterministic Policy Gradients

This repository is the official implementation of Continuous Control With Ensemble Deep Deterministic Policy Gradients.

Requirements

Before installation, please make sure you have MuJoCo engine set up on your machine. We use mujoco150 in order to be comparable with previous benchmarks on v2 environments. See this issue

To install requirements:

pip install -r requirements.txt

Training

To train the model(s) in the paper, run this command:

python run.py <experiment_specification path>

Logger automatically stops training and evaluates current policy every log_every environment interactions. The data is printed to standard output and stored on drive.

We include specifications for our most important experiments.

Path	Description
specs/ed2_on_mujoco.py	Benchmark of our method
specs/sac_on_mujoco.py	Benchmark of our implementation of SAC
specs/sunrise_on_mujoco.py	Benchmark of our implementation of SUNRISE
specc/sop_on_mujoco.py	Benchmark of our implementation of SOP

Results

Our model achieves the following performance on the MuJoCo suite:

Official repository for: Continuous Control With Ensemble DeepDeterministic Policy Gradients

Related tags

Overview

Continuous Control With Ensemble Deep Deterministic Policy Gradients

Requirements

Training

Results

Owner

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Convert Pytorch model to onnx or tflite, and the converted model can be visualized by Netron

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

Autoencoder - Reducing the Dimensionality of Data with Neural Network

1st-in-MICCAI2020-CPM - Combined Radiology and Pathology Classification

PyContinual (An Easy and Extendible Framework for Continual Learning)

Fast and Simple Neural Vocoder, the Multiband RNNMS

Official code of paper "PGT: A Progressive Method for Training Models on Long Videos" on CVPR2021

Evaluating Cross-lingual Sentence Representations

ShapeGlot: Learning Language for Shape Differentiation

Official PyTorch code for Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling (HCFlow, ICCV2021)

Reliable probability face embeddings

DPC: Unsupervised Deep Point Correspondence via Cross and Self Construction (3DV 2021)

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

Code for the KDD 2021 paper 'Filtration Curves for Graph Representation'

Download from Onlyfans.com.

LineBoard - Python+React+MySQL-白板即時系統改善人群行為

CodeContests is a competitive programming dataset for machine-learning

Contrastive Language-Image Pretraining