PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Last update: Dec 14, 2022

Related tags

Deep Learning HIGL

Overview

HIGL

This is a PyTorch implementation for our paper: Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning (NeurIPS 2021).

Our code is based on official implementation of HRAC (NeurIPS 2020) and Map-planner (NeurIPS 2019)

Installation

conda create -n higl python=3.6
conda activate higl
./install_all.sh

Also, to run the MuJoCo experiments, a license is required (see here).

Usage

Training & Evaluation

Point Maze

./scripts/point_maze_sparse.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/point_maze_sparse.sh dense 5e5 0 2
./scripts/point_maze_sparse.sh sparse 5e5 0 2

Ant Maze (U-shape)

./scripts/higl_ant_maze_u.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/higl_ant_maze_u.sh dense 10e5 0 2
./scripts/higl_ant_maze_u.sh sparse 10e5 0 2

Ant Maze (W-shape)

./scripts/higl_ant_maze_w.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/higl_ant_maze_w.sh dense 10e5 0 2
./scripts/higl_ant_maze_w.sh sparse 10e5 0 2

Reacher & Pusher

./scripts/higl_fetch.sh ${env} ${timesteps} ${gpu} ${seed}
./scripts/higl_fetch.sh Reacher3D-v0 5e5 0 2
./scripts/higl_fetch.sh Pusher-v0 10e5 0 2

Stochastic Ant Maze (U-shape)

./scripts/higl_ant_maze_u_stoch.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/higl_ant_maze_u_stoch.sh dense 10e5 0 2
./scripts/higl_ant_maze_u_stoch.sh sparse 10e5 0 2

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Related tags

Overview

HIGL

Installation

Usage

Training & Evaluation

Owner

Junsu Kim

NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework

Kaggle: Cell Instance Segmentation

We simulate traveling back in time with a modern camera to rephotograph famous historical subjects.

Tensor-based approaches for fMRI classification

Arbitrary Distribution Modeling with Censorship in Real Time 59 2 60 3 Bidding Advertising for KDD'21

OpenMMLab Model Deployment Toolset

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks

Read and write layered TIFF ImageSourceData and ImageResources tags

Python library for science observations from the James Webb Space Telescope

Spatial Contrastive Learning for Few-Shot Classification (SCL)

MLP-Like Vision Permutator for Visual Recognition (PyTorch)

Physics-Informed Neural Networks (PINN) and Deep BSDE Solvers of Differential Equations for Scientific Machine Learning (SciML) accelerated simulation

Fast sparse deep learning on CPUs

A large dataset of 100k Google Satellite and matching Map images, resembling pix2pix's Google Maps dataset.

Unofficial TensorFlow implementation of Protein Interface Prediction using Graph Convolutional Networks.

Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments

Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps

Self-Supervised Learning with Kernel Dependence Maximization

SeqAttack: a framework for adversarial attacks on token classification models