evolvingrl

Supplementary Data for Evolving Reinforcement Learning Algorithms

This dataset contains 1000 loss graphs from two experiments: 500 unique graphs learned from scratch, and 500 unique graphs seeded by the DQN loss.

There are two csv files: from_scratch.csv and dqn_seeded.csv. Each has two columns, id and reward, and is sorted by reward from highest to lowest. The graph with a given id is visualized in a png file named <id>.png. These png files are under the folders from_scratch_graphs/ and dqn_seeded_graphs/.
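As a minimal sketch of how the csv files and graph folders fit together, the helper below lists the first few rows of a csv along with the png each row would point to. The function name top_graphs is hypothetical, and the code assumes that each csv has a header row with the column names id and reward, that reward parses as a number, and that each image is named after its graph id.

```python
import csv
from itertools import islice
from pathlib import Path


def top_graphs(csv_path, graph_dir, k=5):
    """Return (id, reward, png_path) for the first k rows of a results csv.

    Assumptions (not confirmed by the dataset itself):
    - the csv has a header row with columns `id` and `reward`;
    - `reward` is numeric;
    - each graph image is stored as <graph_dir>/<id>.png.
    """
    results = []
    with open(csv_path, newline="") as f:
        # The files are already sorted by reward, so the first k rows
        # are the k highest-reward graphs.
        for row in islice(csv.DictReader(f), k):
            png_path = Path(graph_dir) / f"{row['id']}.png"
            results.append((row["id"], float(row["reward"]), png_path))
    return results
```

For example, top_graphs("dqn_seeded.csv", "dqn_seeded_graphs", k=3) would return the three highest-reward DQN-seeded graphs together with the paths of their images, provided the assumptions above hold.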

Notes on reading the graph:

Input nodes are in green, the output node is in blue.
The directed edges represent the data flow. A red edge marks the 2nd input of a binary operator; all other edges are black. This coloring scheme is necessary to encode the input order for non-commutative operators such as - and /.
It’s common to have isolated input nodes and intermediate nodes that do not contribute to the final output. We can ignore these nodes.
As an example, Q(s_{t-1}, a_{t-1}) is represented by 5 nodes:
- Q_param → QValueListOp ← s_tm1. This gives Q(s_{t-1}, -).
- QValueListOp → SelectList ← a_{t-1}. This uses a_{t-1} to index into Q(s_{t-1}, -).
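The two steps above can be sketched in plain Python. The functions below are hypothetical stand-ins for the QValueListOp and SelectList nodes, not the implementations used to produce the dataset, and Q_param is assumed to be a small table indexed by state and then action. The sub function illustrates why the order of inputs matters for a non-commutative operator: its second argument plays the role of the input carried by a red edge.

```python
# Hypothetical stand-ins for the node types named above. The operator
# implementations behind the dataset are not part of this README.

def q_value_list_op(q_param, state):
    """Return Q(state, -): one value per action (tabular Q assumed)."""
    return q_param[state]


def select_list(values, index):
    """Return the entry of `values` at position `index`."""
    return values[index]


def sub(first, second):
    """Non-commutative operator; `second` is the red-edge input."""
    return first - second


# Toy table with 2 states and 3 actions (made-up numbers).
q_param = [
    [0.1, 0.5, 0.2],
    [0.7, 0.3, 0.9],
]
s_tm1, a_tm1 = 1, 2

q_row = q_value_list_op(q_param, s_tm1)  # Q(s_{t-1}, -)
q_sa = select_list(q_row, a_tm1)         # Q(s_{t-1}, a_{t-1})

# Swapping the two inputs of a non-commutative operator changes the result,
# which is why the graphs mark the 2nd input with a red edge.
assert sub(3, 1) != sub(1, 3)
```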

Owner

John Co-Reyes
