An experiment on the performance of homemade Q-learning AIs in Agar.io depending on their state representation and available actions

Last update: Jun 09, 2022

Overview

Agar.io_Q-Learning_AI


An image of the circle categorisation function in action. Food blobs are outlined in blue, edible cells in green and dangerous cells in red, according to where our program detects them. Detection is a bit off near the screen edges. The agent's action at this moment is marked by the green arrow.

States are calculated using the shortest Euclidean distance to each of the three circle types: food, edible cells and dangerous cells. Each distance is measured and then discretized according to the interval it falls within. The rulers in this image are to scale.
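The state calculation described above could look roughly like the sketch below. It is only an illustration, not the project's actual code: the function names, the interval boundaries in `BIN_EDGES`, and the choice to measure to circle centres are all assumptions made for the example.

```python
import bisect
import math

# Hypothetical interval boundaries in pixels; the real "ruler" marks are not given here.
BIN_EDGES = [50, 100, 200, 400]


def nearest_distance(agent_xy, circles):
    """Shortest Euclidean distance from the agent to any circle centre (inf if none)."""
    ax, ay = agent_xy
    return min(
        (math.hypot(cx - ax, cy - ay) for cx, cy in circles),
        default=math.inf,
    )


def discretize(distance, edges=BIN_EDGES):
    """Index of the interval the distance falls in; len(edges) means beyond the last edge."""
    return bisect.bisect_right(edges, distance)


def compute_state(agent_xy, food, edible, dangerous):
    """State = one discretized nearest distance per circle type (food, edible, dangerous)."""
    return tuple(
        discretize(nearest_distance(agent_xy, group))
        for group in (food, edible, dangerous)
    )


# Example: food 50 px away, an edible cell 150 px away, no dangerous cell in view.
state = compute_state((0, 0), food=[(30, 40)], edible=[(0, 150)], dangerous=[])
```

With four boundaries per circle type there are five intervals each, so this assumed setup gives 5 × 5 × 5 = 125 possible states, which keeps a lookup table small.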

Currently the agent can't press any keyboard buttons; it can only move around using the mouse. Keyboard input could be added without too much hassle, but it would require a rework of some aspects of the code and a ton of training, which already takes ages. The Q-learning part could also do with a proper implementation of stochastic Q-learning instead of our generic iterative Q-learning, if I knew how to do it. I look forward to learning that at a later point.
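For readers unfamiliar with the idea, here is a minimal sketch of the textbook tabular Q-learning update with epsilon-greedy action selection. It is not necessarily how this project implements it: the eight mouse directions in `ACTIONS`, the function names, and the default values for `alpha`, `gamma` and `epsilon` are assumptions made for the example.

```python
import random
from collections import defaultdict

# Hypothetical action set: eight mouse directions, indexed 0..7.
ACTIONS = list(range(8))


def make_q_table():
    """Q-table mapping a state tuple to one value per action, all starting at 0."""
    return defaultdict(lambda: [0.0] * len(ACTIONS))


def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise take the best known action."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    values = q[state]
    return max(ACTIONS, key=lambda a: values[a])


def update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


# Example: one rewarded step from state (1, 2, 4) to state (0, 2, 4) using action 3.
q = make_q_table()
update(q, (1, 2, 4), 3, reward=10.0, next_state=(0, 2, 4))
```

Adding keyboard actions would mean extending `ACTIONS`, which grows the table and the amount of training needed to fill it.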

Feel free to ask any questions about the code or the project. I hope you enjoy!

The humans in the experiment were restricted to the same move set as the bots and agents: mouse movement only.
