On Effective Scheduling of Model-based Reinforcement Learning

Last update: Oct 07, 2022

Related tags

Deep Learning autombpo

Overview

On Effective Scheduling of Model-based Reinforcement Learning

Code to reproduce the experiments in On Effective Scheduling of Model-based Reinforcement Learning.

Requirements

To install requirements:

pip install -r requirements.txt

Mujoco license is required to run the experiments on the Mujoco environments.

Training

To train the hyper-controller of the paper, run this command:

python train.py --env=

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python train.py --env=hopper

The trained hyper-controller will be saved in saved-models/. The computing infrastructure used in our experiments and the around computation time to train the hyper-controller is provided in Appendix G.

Evaluation

After training, to evaluate the trained hyper-controller, run:

python eval.py --config=config.
   
     --model_path=saved-models

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python eval.py --config=config.hopper --model_path=saved-models

Notice this command can only be run after finishing training the hyper-controller on the corresponding environments.

Pre-trained Models

We provided our pre-trained hyper-controller in pre-trained-models/ to better reproduce the experiments. To evaluate the pre-trained models, run:

python eval.py --config=config.
   
     --model_path=pre-trained-models

The env_name can be selected from [hopper,ant,humanoid,hopperbullet,walker2dbullet,halfcheetahbullet]. For example: python eval.py --config=config.hopper --model_path=pre-trained-models

On Effective Scheduling of Model-based Reinforcement Learning

Related tags

Overview

On Effective Scheduling of Model-based Reinforcement Learning

Requirements

Training

Evaluation

Pre-trained Models

Owner

laihang

This a classic fintech problem that introduces real life difficulties such as data imbalance. Check out the notebook to find out more!

Aquarius - Enabling Fast, Scalable, Data-Driven Virtual Network Functions

Tidy interface to polars

Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

An open source python library for automated feature engineering

This is the open-source reference implementation of the SIGGRAPH 2021 paper Intersection-free Rigid Body Dynamics.

An evaluation toolkit for voice conversion models.

A python3 tool to take a 360 degree survey of the RF spectrum (hamlib + rotctld + RTL-SDR/HackRF)

Implementation of Bottleneck Transformer in Pytorch

GNPy: Optical Route Planning and DWDM Network Optimization

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

A PyTorch implementation of "Capsule Graph Neural Network" (ICLR 2019).

Robot Servers and Server Manager software for robo-gym

Distributed Arcface Training in Pytorch

DeepSTD: Mining Spatio-temporal Disturbances of Multiple Context Factors for Citywide Traffic Flow Prediction

Codes for [NeurIPS'21] You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership.

Implementation of Axial attention - attending to multi-dimensional data efficiently

Python package to add text to images, textures and different backgrounds

Markov Attention Models