Discovering and Achieving Goals via World Models

Last update: Dec 22, 2022

Related tags

Overview

Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Russell Mendonca*¹, Oleh Rybkin*², Kostas Daniilidis², Danijar Hafner^3,4, Deepak Pathak¹
(* equal contribution, random order)

¹Carnegie Mellon University
²University of Pennsylvania
³Google Research, Brain Team
⁴University of Toronto

Official implementation of the Lexa agent from the paper Discovering and Achieving Goals via World Models.

Setup

Create the conda environment by running :

conda env create -f environment.yml

Clone the lexa-benchmark repo, and modify the python path
export PYTHONPATH= /lexa:

Export the following variables for rendering
export MUJOCO_RENDERER=egl; export MUJOCO_GL=egl

Training

First source the environment : source activate lexa

For training, run :

export CUDA_VISIBLE_DEVICES=
   
      
python train.py --configs defaults 
    
      --task 
     
       --logdir

where method can be lexa_temporal, lexa_cosine, ddl, diayn or gcsl
Supported tasks are dmc_walker_walk, dmc_quadruped_run, robobin, kitchen, joint

To view the graphs and gifs during training, run tensorboard --logdir

Bibtex

If you find this code useful, please cite:

@misc{lexa2021,
    title={Discovering and Achieving Goals via World Models},
    author={Mendonca, Russell and Rybkin, Oleh and
    Daniilidis, Kostas and Hafner, Danijar and Pathak, Deepak},
    year={2021},
    Booktitle={NeurIPS}
}

Acknowledgements

This code was developed using Dreamer V2 and Plan2Explore.

Discovering and Achieving Goals via World Models

Related tags

Overview

Discovering and Achieving Goals via World Models

[Project Website] [Benchmark Code] [Video (2min)] [Oral Talk (13min)] [Paper]

Setup

Training

Bibtex

Acknowledgements

Owner

Oleg Rybkin

Official implementation of Influence-balanced Loss for Imbalanced Visual Classification in PyTorch.

Code for layerwise detection of linguistic anomaly paper (ACL 2021)

An implementation of RetinaNet in PyTorch.

Analysis of rationale selection in neural rationale models

EsViT: Efficient self-supervised Vision Transformers

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Exploit ILP to learn symmetry breaking constraints of ASP programs.

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Source code for our paper "Do Not Trust Prediction Scores for Membership Inference Attacks"

9th place solution

An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.

Multi-View Radar Semantic Segmentation

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

CoMoGAN: continuous model-guided image-to-image translation. CVPR 2021 oral.

Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

Python with OpenCV - MediaPip Framework Hand Detection

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Voice assistant - Voice assistant with python

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.