Visual Adversarial Imitation Learning using Variational Models (VMAIL)

This is the official implementation of the NeurIPS 2021 paper.

Method

VMAIL simultaneously learns a variational dynamics model and trains an on-policy adversarial imitation learning algorithm in the latent space using only model-based rollouts. This allows for stable and sample efficient training, as well as zero-shot imitation learning by transfering the learned dynamics model

Instructions

Get dependencies:

conda env create -f vmail.yml
conda activate vmail
cd robel_claw/robel
pip install -e .

To train agents for each environmnet download the expert data from the provided link and run:

python3 -u vmail.py --logdir .logdir --expert_datadir expert_datadir

The training will generate tensorabord plots and GIFs in the log folder:

tensorboard --logdir ./logdir

Citation

If you find this code useful, please reference in your paper:

@article{rafailov2021visual,
      title={Visual Adversarial Imitation Learning using Variational Models}, 
      author={Rafael Rafailov and Tianhe Yu and Aravind Rajeswaran and Chelsea Finn},
      year={2021},
      journal={Neural Information Processing Systems}
}

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

Related tags

Overview

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

Method

Instructions

Citation

Owner

You can draw the corresponding bounding box into the image and save it according to the result file (txt format) run by the tracker.

Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022

Point cloud processing tool library.

Visualize Camera's Pose Using Extrinsic Parameter by Plotting Pyramid Model on 3D Space

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Video-face-extractor - Video face extractor with Python

Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit

PyTorch implementations of the beta divergence loss.

A Real-Time-Strategy game for Deep Learning research

3D mesh stylization driven by a text input in PyTorch

Code for Towards Streaming Perception (ECCV 2020) :car:

Nested Graph Neural Network (NGNN) is a general framework to improve a base GNN's expressive power and performance

Tree LSTM implementation in PyTorch

Official code base for the poster "On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation" published in NeurIPS 2021 Workshop (SVRHM)

Implementation of Hierarchical Transformer Memory (HTM) for Pytorch

Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)

PyTorch implementation for STIN

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

Paaster is a secure by default end-to-end encrypted pastebin built with the objective of simplicity.

Code for Robust Contrastive Learning against Noisy Views