PPO-EWMA

[Paper]

This is code for training agents using PPO-EWMA and PPG-EWMA, introduced in the paper Batch size-invariance for policy optimization (citation). It is based on the code for Phasic Policy Gradient.

Installation

Supported platforms: MacOS and Ubuntu, Python 3.7

Installation using Miniconda:

git clone https://github.com/openai/ppo-ewma.git
conda env update --name ppo-ewma --file ppo-ewma/environment.yml
conda activate ppo-ewma
pip install -e ppo-ewma

Alternatively, install the dependencies from environment.yml manually.

Visualize results

Results are stored in blob storage at https://openaipublic.blob.core.windows.net/rl-batch-size-invariance/, and can be visualized as in the paper using this Colab notebook.

Citation

Please cite using the following BibTeX entry:

@article{hilton2021batch,
  title={Batch size-invariance for policy optimization},
  author={Hilton, Jacob and Cobbe, Karl and Schulman, John},
  journal={arXiv preprint arXiv:2110.00641},
  year={2021}
}

Code for Mining the Benefits of Two-stage and One-stage HOI Detection

Related tags

Overview

PPO-EWMA

[Paper]

Installation

Visualize results

Citation

Owner

OpenAI

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

catch-22: CAnonical Time-series CHaracteristics

3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry

A curated list of neural network pruning resources.

CROSS-LINGUAL ABILITY OF MULTILINGUAL BERT: AN EMPIRICAL STUDY

PyTorch implementation of PP-LCNet: A Lightweight CPU Convolutional Neural Network

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

The official code for PRIMER: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

Implementation for "Manga Filling Style Conversion with Screentone Variational Autoencoder" (SIGGRAPH ASIA 2020 issue)

PyTorch implementation of federated learning framework based on the acceleration of global momentum

Code for testing convergence rates of Lipschitz learning on graphs

Implementation of ICCV2021(Oral) paper - VMNet: Voxel-Mesh Network for Geodesic-aware 3D Semantic Segmentation

Tensorflow 2 Object Detection API kurulumu, GPU desteği, custom model hazırlama

Instance-wise Feature Importance in Time (FIT)

Help you understand Manual and w/ Clutch point while driving.

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

Implementing DeepMind's Fast Reinforcement Learning paper

Cartoon-StyleGan2 🙃 : Fine-tuning StyleGAN2 for Cartoon Face Generation

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters