Multi-Objective Reinforced Active Learning

Last update: Nov 19, 2022

Related tags

Deep Learning moral_rl

Overview

Multi-Objective Reinforced Active Learning

Dependencies

wandb
tqdm
pytorch >= 1.7.0
numpy >= 1.20.0
scipy >= 1.1.0
pycolab == 1.2

Weights and Biases

Our code depends on for visualizing and logging results during training. As a result, we call wandb.init(), which will prompt to add an API key for linking the training runs with your personal wandb account. This can be done by pasting the WANDB_API_KEY into the respective box when running the code for the first time.

Environments

Our gridworlds (Emergency: randomized_v2.py, Delivery: randomized_v3.py) build on the game engine with a custom wrapper to provide similar functionality as the gym . This engine comes with a user interface and any environment can be played in the console using python environment.py with arrow keys and w, a, s, d as controls.

Training

There are four training scripts for

manually training a PPO agent on custom rewards (ppo_train.py),
training AIRL on a single expert dataset (airl_train.py),
active MORL with custom/automatic preferences (moral_train.py) and
training DRLHP with custom/automatic preferences (drlhp_train.py).

When using automatic preferences, a desired ratio can be passed as an argument. For example,

python moral_train.py --ratio a b c

will run MORAL using a (real-valued) ratio of a:b:c among the three explicit objectives in Delivery.

Hyperparameters

Hyperparameters are passed as arguments to wandb.init() and can be changed by modifying the respective training files.

Multi-Objective Reinforced Active Learning

Related tags

Overview

Multi-Objective Reinforced Active Learning

Dependencies

Weights and Biases

Environments

Training

Hyperparameters

Owner

Markus Peschl

BT-Unet: A-Self-supervised-learning-framework-for-biomedical-image-segmentation-using-Barlow-Twins

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.

The PyTorch implementation for paper "Neural Texture Extraction and Distribution for Controllable Person Image Synthesis" (CVPR2022 Oral)

Code for paper: Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight

Torch-ngp - A pytorch implementation of the hash encoder proposed in instant-ngp

The Self-Supervised Learner can be used to train a classifier with fewer labeled examples needed using self-supervised learning.

Ranger deep learning optimizer rewrite to use newest components

Certifiable Outlier-Robust Geometric Perception

Predicting Event Memorability from Contextual Visual Semantics

Official pytorch implementation of paper "Image-to-image Translation via Hierarchical Style Disentanglement".

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

Real time Human Detection Counting

The goal of the exercises below is to evaluate the candidate knowledge and problem solving expertise regarding the main development focuses for the iFood ML Platform team: MLOps and Feature Store development.

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning.

mmfewshot is an open source few shot learning toolbox based on PyTorch

Tensorflow implementation for "Improved Transformer for High-Resolution GANs" (NeurIPS 2021).

Explainable Zero-Shot Topic Extraction

PyTorch implementation of DreamerV2 model-based RL algorithm

Iowa Project - My second project done at General Assembly, focused on feature engineering and understanding Linear Regression as a concept