A project studying the influence of communication in multi-objective normal-form games

Last update: Dec 17, 2021

Communication in Multi-Objective Normal-Form Games

This repo contains five types of agents that we used in our study of communication in multi-objective normal-form games. The settings that involve communication follow a leader-follower model, as in Stackelberg games. In these settings, agents alternate in a round-robin fashion between being the leader, who communicates a message, and being the follower, who observes it.
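
The round-robin role switch described above can be sketched as follows. This is a minimal illustration, not the repo's code: the function names and the choice of switching roles every episode are assumptions.

```python
def leader_for_episode(episode, num_agents=2):
    """Return the index of the agent acting as leader in this episode.

    Assumption: roles rotate once per episode, starting with agent 0.
    """
    return episode % num_agents


def roles(episode, num_agents=2):
    """Split the agents into one leader and the remaining followers."""
    leader = leader_for_episode(episode, num_agents)
    followers = [i for i in range(num_agents) if i != leader]
    return leader, followers


for episode in range(4):
    leader, followers = roles(episode)
    print(f"episode {episode}: leader={leader}, followers={followers}")
```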

No communication setting

In this setting, two agents play a normal-form game for a fixed number of episodes. This experiment serves as a baseline for all other experiments.
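
As a rough sketch of what such a baseline loop might look like, the snippet below has two agents with fixed policies repeatedly play a game with vector-valued payoffs. The payoff table, the assumption that both agents receive the same payoff vector, and all names are hypothetical. The agents in the repo learn their policies, which this sketch leaves out.

```python
import random

# Hypothetical 2-action, 2-objective game: PAYOFFS[a1][a2] is the payoff
# vector received after agent 1 plays a1 and agent 2 plays a2.
PAYOFFS = [
    [(4, 0), (2, 2)],
    [(2, 2), (0, 4)],
]


def sample_action(policy, rng):
    """Sample an action index from a list of action probabilities."""
    return rng.choices(range(len(policy)), weights=policy, k=1)[0]


def play(policy_1, policy_2, episodes, seed=0):
    """Play the game repeatedly and return the average payoff vector."""
    rng = random.Random(seed)
    totals = [0.0, 0.0]
    for _ in range(episodes):
        a1 = sample_action(policy_1, rng)
        a2 = sample_action(policy_2, rng)
        reward = PAYOFFS[a1][a2]
        totals = [t + r for t, r in zip(totals, reward)]
    return [t / episodes for t in totals]


average_payoff = play([0.5, 0.5], [0.5, 0.5], episodes=1000)
print(average_payoff)
```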

Cooperative action communication setting

In this setting, agents communicate the next action that they will play. The follower uses this message to pre-update their policy. This setting is similar to Iterated Best Response and attempts to find the optimal joint policy.
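
To illustrate the connection with Iterated Best Response, the sketch below computes a follower's greedy best response to an announced leader action. It is a simplification under stated assumptions: the payoff table and the product utility function are hypothetical, and the repo's follower pre-updates a learned policy rather than computing a best response directly.

```python
# Hypothetical payoff table: PAYOFFS[leader_action][follower_action].
PAYOFFS = [
    [(4, 0), (2, 2)],
    [(2, 2), (0, 4)],
]


def utility(vector):
    """Hypothetical nonlinear utility: the product of the two objectives."""
    return vector[0] * vector[1]


def best_response_to_action(leader_action):
    """Return the follower action with the highest utility, given the
    action the leader announced it will play."""
    row = PAYOFFS[leader_action]
    scores = [utility(payoff) for payoff in row]
    return max(range(len(scores)), key=scores.__getitem__)


for message in range(2):
    print(f"leader announces {message} -> follower plays "
          f"{best_response_to_action(message)}")
```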

Competitive action communication setting

This setting places the agents in a more competitive environment: agents learn a specific best-response policy for every possible message. As such, agents are not optimising for an optimal joint policy, but rather are acting in a self-interested manner.
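
One way to picture "a policy per message" is a table that maps each possible message to its own action distribution. The class below is a hypothetical sketch: the class name and the simple probability-shifting update are placeholders, not the learning rule used in the repo.

```python
class MessageConditionedPolicy:
    """Keeps one independent action distribution per possible message."""

    def __init__(self, num_messages, num_actions):
        uniform = [1.0 / num_actions] * num_actions
        self.policies = {m: list(uniform) for m in range(num_messages)}

    def policy_for(self, message):
        """Return the action probabilities used after seeing `message`."""
        return self.policies[message]

    def reinforce(self, message, action, step=0.1):
        """Shift probability mass toward `action` for this message only.

        Placeholder update: the other messages' policies are untouched.
        """
        policy = self.policies[message]
        self.policies[message] = [
            (1 - step) * p + (step if a == action else 0.0)
            for a, p in enumerate(policy)
        ]


follower = MessageConditionedPolicy(num_messages=2, num_actions=2)
follower.reinforce(message=0, action=1)
print(follower.policy_for(0), follower.policy_for(1))
```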

Cooperative policy communication setting

This setting follows the same dynamics as the cooperative action communication setting, but communicates the entire policy instead of the next action that will be played.

Optional communication setting

The last setting gives agents the chance to learn for themselves whether communication helps them. Each agent learns a top-level policy that decides whether it will communicate when it is the leader. Each agent also has two low-level agents: a "no communication agent" and an agent that does communicate. Which agent is used as the communicating agent is left open. When agents choose to communicate, they use their low-level communicating agent. When agents opt out of communication, they use their low-level no communication agent.
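
The two-level structure can be sketched as a wrapper that samples the top-level decision and then delegates to one of the two low-level agents. All names here are hypothetical, the low-level agents are stand-in strings, and the top-level policy is a fixed probability rather than a learned one.

```python
import random


class OptionalCommunicationAgent:
    """Top-level agent that picks between two low-level agents."""

    def __init__(self, comm_agent, no_comm_agent, communicate_prob=0.5, seed=None):
        self.comm_agent = comm_agent
        self.no_comm_agent = no_comm_agent
        # Assumption: the top-level policy is one probability of communicating.
        self.communicate_prob = communicate_prob
        self.rng = random.Random(seed)

    def choose_low_level(self):
        """Sample the top-level decision and return (communicate, active agent)."""
        communicate = self.rng.random() < self.communicate_prob
        active = self.comm_agent if communicate else self.no_comm_agent
        return communicate, active


agent = OptionalCommunicationAgent("comm agent", "no comm agent", seed=0)
print(agent.choose_low_level())
```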

Getting Started

Experiments can be run from the MONFG.py file. There are 5 MONFGs available, with different equilibrium properties under the scalarised expected returns (SER) optimisation criterion when using the specified nonlinear utility functions. You can also specify the type of experiment to run and other parameters.
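
Under SER, an agent applies its utility function to the expected payoff vector of the joint policy. The sketch below shows that calculation for a hypothetical game and a hypothetical product utility. Neither is taken from MONFG.py.

```python
# Hypothetical 2-action, 2-objective payoff table, indexed [a1][a2].
PAYOFFS = [
    [(4, 0), (2, 2)],
    [(2, 2), (0, 4)],
]


def expected_payoff(policy_1, policy_2):
    """Expected payoff vector when both agents play mixed policies."""
    num_objectives = len(PAYOFFS[0][0])
    expected = [0.0] * num_objectives
    for a1, p1 in enumerate(policy_1):
        for a2, p2 in enumerate(policy_2):
            for k in range(num_objectives):
                expected[k] += p1 * p2 * PAYOFFS[a1][a2][k]
    return expected


def utility(vector):
    """Hypothetical nonlinear utility: the product of the two objectives."""
    return vector[0] * vector[1]


def ser(policy_1, policy_2):
    """Scalarised expected returns: the utility of the expected payoff vector."""
    return utility(expected_payoff(policy_1, policy_2))


print(ser([1.0, 0.0], [1.0, 0.0]))
print(ser([0.5, 0.5], [0.5, 0.5]))
```

With a nonlinear utility, a mixed joint policy can score higher under SER than a pure one, as the two calls above show for this particular game.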

License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.

Owner

Willem Röpke
