A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Last update: Dec 28, 2022

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

A Pytorch implementation of the multi agent deep deterministic policy gradients(MADDPG) algorithm

This is my implementation of the algorithm presented in the paper: Multi Agent Actor Critic for Mixed Cooperative-Competitive Environments. You can find this paper here: https://arxiv.org/pdf/1706.02275.pdf

You will need to install the Multi Agent Particle Environment(MAPE), which you can find here: https://github.com/openai/multiagent-particle-envs

Make sure to create a virtual environment with the dependencies for the MAPE, since they are somewhat out of date. I also recommend running this with PyTorch version 1.4.0, as the latest version (1.8) seems to have an issue with an in place operation I use in the calculation of the critic loss.

It's probably easiest to just clone this repo into the same directory as the MAPE, as the main file requires the make_env function from that package.

The video for this tutorial is found here: https://youtu.be/tZTQ6S9PfkE

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

Related tags

Overview

Multi-Agent-Deep-Deterministic-Policy-Gradients

Owner

Phil Tabor

Tutorial materials for Part of NSU Intro to Deep Learning with PyTorch.

[ICCV 2021] Deep Hough Voting for Robust Global Registration

Reimplementation of Learning Mesh-based Simulation With Graph Networks

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Official implementation of "Open-set Label Noise Can Improve Robustness Against Inherent Label Noise" (NeurIPS 2021)

Graph parsing approach to structured sentiment analysis.

The Most Efficient Temporal Difference Learning Framework for 2048

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

Text mining project; Using distilBERT to predict authors in the classification task authorship attribution.

This is a repository for a No-Code object detection inference API using the OpenVINO. It's supported on both Windows and Linux Operating systems.

Source code for The Power of Many: A Physarum Swarm Steiner Tree Algorithm

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

A library of extension and helper modules for Python's data analysis and machine learning libraries.

PyElecCL - Electron Monte Carlo Second Checks

Official Code for ICML 2021 paper "Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline"

Quickly comparing your image classification models with the state-of-the-art models (such as DenseNet, ResNet, ...)