Flexible Option Learning - NeurIPS 2021

Last update: Nov 09, 2022

Flexible Option Learning

This repository contains the code for the paper Flexible Option Learning, presented as a Spotlight at NeurIPS 2021. The implementation builds on gym-miniworld, OpenAI Baselines, and the tabular implementation of Option-Critic.

Contents:

Tabular Experiments (Four-Rooms)
Continuous Control (MuJoCo)
Maze Navigation (MiniWorld)
Cite

Tabular Experiments (Four-Rooms)

Installation and Launch code

pip install gym==0.12.1
cd diagnostic_experiments/
python main_fixpol.py --multi_option # for experiments with fixed options
python main.py --multi_option # for experiments with learned options

Continuous Control (MuJoCo)

Installation

virtualenv moc_cc --python=python3
source moc_cc/bin/activate
pip install tensorflow==1.12.0 
cd continuous_control
pip install -e . 
pip install gym==0.9.3
pip install mujoco-py==0.5.1

Launch

cd baselines/ppoc_int
python run_mujoco.py --switch --nointfc --env AntWalls --eta 0.9 --mainlr 8e-5 --intlr 8e-5 --piolr 8e-5

Maze Navigation (MiniWorld)

Installation

virtualenv moc_vision --python=python3
source moc_vision/bin/activate
pip install tensorflow==1.13.1
cd vision_miniworld
pip install -e .
pip install gym==0.15.4

Launch

cd baselines/
# Run agent in first task
python run.py --alg=ppo2_options --env=MiniWorld-WallGap-v0 --num_timesteps 2500000 --save_interval 1000 --num_env 8 --noptions 4 --eta 0.7

# Load and run agent in transfer task
python run.py --alg=ppo2_options --env=MiniWorld-WallGapTransfer-v0 --load_path path/to/model --num_timesteps 2500000 --save_interval 1000 --num_env 8 --noptions 4 --eta 0.7
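
The two commands above differ only in the environment name and the optional --load_path argument, so they can be generated from one helper. The following is a minimal sketch rather than part of the repository: launch_cmd, run_or_print and the DRY_RUN variable are hypothetical names introduced here, and path/to/model remains the placeholder used above. By default the script only prints the commands, so it is safe to run outside baselines/.

```shell
#!/usr/bin/env bash
# Sketch: print (or run) the two MiniWorld launch commands from this README.
# launch_cmd, run_or_print and DRY_RUN are hypothetical helpers, not repository code.

# Flags shared by both runs, copied from the launch commands in this README.
COMMON_FLAGS="--num_timesteps 2500000 --save_interval 1000 --num_env 8 --noptions 4 --eta 0.7"

# launch_cmd ENV [LOAD_PATH]: print the run.py command line for one task.
launch_cmd() {
    env_name="$1"
    load_path="${2:-}"
    cmd="python run.py --alg=ppo2_options --env=${env_name}"
    # Only the transfer run passes a previously saved model.
    if [ -n "${load_path}" ]; then
        cmd="${cmd} --load_path ${load_path}"
    fi
    printf '%s %s\n' "${cmd}" "${COMMON_FLAGS}"
}

# With DRY_RUN=1 (the default) the command is printed; with DRY_RUN=0 it is executed.
run_or_print() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$1"
    else
        eval "$1"
    fi
}

run_or_print "$(launch_cmd MiniWorld-WallGap-v0)"                        # first task
run_or_print "$(launch_cmd MiniWorld-WallGapTransfer-v0 path/to/model)"  # transfer task
```

To actually launch training, run the script from baselines/ with DRY_RUN=0 after replacing path/to/model with the location of the saved model.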

Cite

If you find this work useful, please consider adding it to your references.

@inproceedings{
klissarov2021flexible,
title={Flexible Option Learning},
author={Martin Klissarov and Doina Precup},
booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
year={2021},
url={https://openreview.net/forum?id=L5vbEVIePyb}
}

Owner

Martin Klissarov
