Multimodal Reinforcement Learning

JAX implementations of the following multimodal reinforcement learning approaches.

Dual-coding Episodic Memory from "Grounded Language Learning Fast and Slow"

The goal in this setting is for the agent to be presented with multiple objects with made up names following "This is a _____" statements and to then carry out an instruction such as "Move the wazzle to the table." This task requires the agent to learn long-term language and vision representations for concepts like "This is a" and objects that carry over between episodes such as "table" while also being able to learn one-shot representations of novel objects and their names.

Usage

Start by setting up the environment locally by running

poetry install
poetry shell

The learning environment depends on Docker and requires that the Docker Desktop program is running (on Mac). Once that's done you can run the default environment (fast mapping with 3 objects from the paper).

python fast_slow_learning/main.py

Solving reinforcement learning tasks which require language and vision

Related tags

Overview

Multimodal Reinforcement Learning

Usage

Owner

Henry Prior

A forwarding MPI implementation that can use any other MPI implementation via an MPI ABI

Kaggle | 9th place single model solution for TGS Salt Identification Challenge

A universal memory dumper using Frida

A semismooth Newton method for elliptic PDE-constrained optimization

Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images

Learning Optical Flow from a Few Matches (CVPR 2021)

deep_image_prior_extension

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

Convolutional Neural Network for 3D meshes in PyTorch

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Codes for CIKM'21 paper 'Self-Supervised Graph Co-Training for Session-based Recommendation'.

This reposityory contains the PyTorch implementation of our paper "Generative Dynamic Patch Attack".

Qcover is an open source effort to help exploring combinatorial optimization problems in Noisy Intermediate-scale Quantum(NISQ) processor.

⚡ H2G-Net for Semantic Segmentation of Histopathological Images

Instance-wise Occlusion and Depth Orders in Natural Scenes (CVPR 2022)

Custom TensorFlow2 implementations of forward and backward computation of soft-DTW algorithm in batch mode.

AugLiChem - The augmentation library for chemical systems.

[RSS 2021] An End-to-End Differentiable Framework for Contact-Aware Robot Design

Diverse Image Generation via Self-Conditioned GANs

Safe Control for Black-box Dynamical Systems via Neural Barrier Certificates