Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Last update: Sep 19, 2022

Related tags

Overview

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

This is the official codebase for Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL. Here, we provide a sample implementation of SAFARI on the cooperative navigation environment. This specific repository is untested; however, many of the given files match the code used to run experiments in the paper exactly. Refer to agents/safari.py.

Requirements

To install requirements, run:

pip install -r requirements.txt

Not all dependencies may be used; however, all dependencies that are needed can be found here.

Run

To kick off a training run of SAFARI, add a dataset into the data/ folder. Then running:

python main.py safari

will start the script from the entry point, main.py.

Data Format

SAFARI expects there to be a dataset present at data/ / for each parallel seed that is run. We expect three files:

actions.txt (Shape: [N, H])
rewards.txt (Shape: [N, H])
obs.txt (Shape: [N, H, O])

each of which expects each line to be an episodic trajectory. We convert each buffer into a list (1), cast them to str (2), and print them on separate lines of the file (3).

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Related tags

Overview

Sample Code for "Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL"

Requirements

Run

Data Format

Owner

Genetic feature selection module for scikit-learn

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

R interface to fast.ai

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

diablo2 resurrected loot filter

PyTorch implementation of the REMIND method from our ECCV-2020 paper "REMIND Your Neural Network to Prevent Catastrophic Forgetting"

End-to-end Temporal Action Detection with Transformer. [Under review]

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

Migration of Edge-based Distributed Federated Learning

Image Segmentation Evaluation

A colab notebook for training Stylegan2-ada on colab, transfer learning onto your own dataset.

HiddenMarkovModel implements hidden Markov models with Gaussian mixtures as distributions on top of TensorFlow

QR2Pass-project - A proof of concept for an alternative (passwordless) authentication system to a web server

A community run, 5-day PyTorch Deep Learning Bootcamp

NDE: Climate Modeling with Neural Diffusion Equation, ICDM'21

UDP++ (ECCVW 2020 Oral), (Winner of COCO 2020 Keypoint Challenge).

A template repository for submitting a job to the Slurm Cluster installed at the DISI - University of Bologna

Accelerated Multi-Modal MR Imaging with Transformers