A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

Last update: Jan 05, 2023

Related tags

Overview

This is a re-implementation of the model-based RL algorithm MBPO in pytorch as described in the following paper: When to Trust Your Model: Model-Based Policy Optimization.

This code is based on a previous paper in the NeurIPS reproducibility challenge that reproduces the result with a tensorflow ensemble model but shows a significant drop in performance with a pytorch ensemble model. This code re-implements the ensemble dynamics model with pytorch and closes the gap.

Reproduced results

The comparison are done on two tasks while other tasks are not tested. But on the tested two tasks, the pytorch implementation achieves similar performance compared to the official tensorflow code.

Dependencies

MuJoCo 1.5 & MuJoCo 2.0

Usage

python main_mbpo.py --env_name 'Walker2d-v2' --num_epoch 300 --model_type 'pytorch'

python main_mbpo.py --env_name 'Hopper-v2' --num_epoch 300 --model_type 'pytorch'

Reference

Official tensorflow implementation: https://github.com/JannerM/mbpo
Code to the reproducibility challenge paper: https://github.com/jxu43/replication-mbpo

A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

Related tags

Overview

Overview

Reproduced results

Dependencies

Usage

Reference

Owner

Xingyu Lin

Codes for TIM2021 paper "Anchor-Based Spatio-Temporal Attention 3-D Convolutional Networks for Dynamic 3-D Point Cloud Sequences"

The source code for Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

State-of-the-art data augmentation search algorithms in PyTorch

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

A collection of educational notebooks on multi-view geometry and computer vision.

Bayesian-Torch is a library of neural network layers and utilities extending the core of PyTorch to enable the user to perform stochastic variational inference in Bayesian deep neural networks

StarGAN-ZSVC: Unofficial PyTorch Implementation

EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit

Streamlit tool to explore coco datasets

Checkout some cool self-projects you can try your hands on to curb your boredom this December!

ArtEmis: Affective Language for Art

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

MonoRCNN is a monocular 3D object detection method for automonous driving

Learned Initializations for Optimizing Coordinate-Based Neural Representations

PyTorch Implementation for Deep Metric Learning Pipelines

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

Voice Gender Recognition

An image classification app boilerplate to serve your deep learning models asap!