CoRe: Contrastive Recurrent State-Space Models

Last update: Aug 11, 2022

Related tags

Overview

CoRe: Contrastive Recurrent State-Space Models

This code implements the CoRe model and reproduces experimental results found in
Robust Robotic Control from Pixels using Contrastive Recurrent State-Space models
NeurIPS Deep Reinforcement Learning Workshop 2021
Nitish Srivastava, Walter Talbott, Martin Bertran Lopez, Shuangfei Zhai & Joshua M. Susskind
[paper]

Requirements and Installation

Clone this repository and then execute the following steps. See setup.sh for an example of how to run these steps on a Ubuntu 18.04 machine.

Install dependencies.

apt install -y libgl1-mesa-dev libgl1-mesa-glx libglew-dev \
        libosmesa6-dev software-properties-common net-tools unzip \
        virtualenv wget xpra xserver-xorg-dev libglfw3-dev patchelf xvfb ffmpeg

Download the DAVIS 2017 dataset. Make sure to select the 2017 TrainVal - Images and Annotations (480p). The training images will be used as distracting backgrounds. The DAVIS directory should be in the same directory as the code. Check that ls ./DAVIS/JPEGImages/480p/... shows 90 video directories.
Install MuJoCo 2.1.
- Download MuJoCo version 2.1 binaries for Linux or macOS.
- Unzip the downloaded mujoco210 directory into ~/.mujoco/mujoco210.
Install MuJoCo 2.0 (For robosuite experiments only).
- Download MuJoCo version 2.0 binaries for Linux or macOS.
- Unzip the downloaded directory and move it into ~/.mujoco/.
- Symlink mujoco200_linux (or mujoco200_macos) to mujoco200.
```
ln -s ~/.mujoco/mujoco200_linux ~/.mujoco/mujoco200
```
- Place the license key at ~/.mujoco/mjkey.txt.
- Add the MuJoCo binaries to LD_LIBRARY_PATH.
```
export LD_LIBRARY_PATH=$HOME/.mujoco/mujoco200/bin:$LD_LIBRARY_PATH
```
Setup EGL GPU rendering (if a GPU is available).
- To ensure that the GPU is prioritized over the CPU for EGL rendering
```
cp 10_nvidia.json /usr/share/glvnd/egl_vendor.d/
```
- Create a dummy nvidia directory so that mujoco_py builds the extensions needed for GPU rendering.
```
mkdir -p /usr/lib/nvidia-000
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/lib/nvidia-000
```

Create a conda environment.

For Distracting Control Suite

conda env create -f conda_env.yml

For Robosuite

conda env create -f conda_env_robosuite.yml

Training

The CoRe model can be trained on the Distracting Control Suite as follows:

conda activate core
MUJOCO_GL=egl CUDA_VISIBLE_DEVICES=0 python train.py --config configs/dcs/core.yaml

The training artifacts, including tensorboard logs and videos of validation rollouts will be written in ./artifacts/.

To change the distraction setting, modify the difficulty parameter in configs/dcs/core.yaml. Possible values are ['easy', 'medium', 'hard', 'none', 'hard_bg'].

To change the domain, modify the domain parameter in configs/dcs/core.yaml. Possible values are ['ball_in_cup', 'cartpole', 'cheetah', 'finger', 'reacher', 'walker'].

To train on Robosuite (Door Task, Franka Panda Arm)

Using RGB image and proprioceptive inputs.

conda activate core_robosuite
MUJOCO_GL=egl CUDA_VISIBLE_DEVICES=0 python train.py --config configs/robosuite/core.yaml

Using RGB image inputs only.

conda activate core_robosuite
MUJOCO_GL=egl CUDA_VISIBLE_DEVICES=0 python train.py --config configs/robosuite/core_imageonly.yaml

Citation

@article{srivastava2021core,
    title={Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models}, 
    author={Nitish Srivastava and Walter Talbott and Martin Bertran Lopez and Shuangfei Zhai and Josh Susskind},
    journal={NeurIPS Deep Reinforcement Learning Workshop},
    year={2021}
}

License

This code is released under the LICENSE terms.

CoRe: Contrastive Recurrent State-Space Models

Related tags

Overview

CoRe: Contrastive Recurrent State-Space Models

Requirements and Installation

Training

Citation

License

Owner

Apple

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

An essential implementation of BYOL in PyTorch + PyTorch Lightning

🥈78th place in Riiid Solution🥈

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

StarGAN-ZSVC: Unofficial PyTorch Implementation

The comma.ai Calibration Challenge!

Hough Transform and Hough Line Transform Using OpenCV

Source code for "OmniPhotos: Casual 360° VR Photography"

Implementation of Basic Machine Learning Algorithms on small datasets using Scikit Learn.

This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)

Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"

🤗 Push your spaCy pipelines to the Hugging Face Hub

A Strong Baseline for Image Semantic Segmentation

🛠 All-in-one web-based IDE specialized for machine learning and data science.

Self-supervised learning algorithms provide a way to train Deep Neural Networks in an unsupervised way using contrastive losses

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.

AsymmetricGAN - Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

An example showing how to use jax to train resnet50 on multi-node multi-GPU