Multi Task RL Baselines

Last update: Jan 09, 2023

Related tags

Deep Learning mtrl

Overview

MTRL

Multi Task RL Algorithms

Introduction
Setup
Usage
Documentation
Contributing to MTRL
Community
Acknowledgements

Introduction

MTRL is a library of multi-task reinforcement learning algorithms. It has two main components:

Building blocks and agents that implement the multi-task RL algorithms.
Experiment setups that enable training/evaluation on different setups.

Together, these two components enable use of MTRL across different environments and setups.

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

License

Citing MTRL

If you use MTRL in your research, please use the following BibTeX entry:

@Misc{Sodhani2021MTRL,
  author =       {Shagun Sodhani and Amy Zhang},
  title =        {MTRL - Multi Task RL Algorithms},
  howpublished = {Github},
  year =         {2021},
  url =          {https://github.com/facebookresearch/mtrl}
}

Setup

Clone the repository: git clone [email protected]:facebookresearch/mtrl.git.
Install dependencies: pip install -r requirements/dev.txt

Usage

MTRL supports 8 different multi-task RL algorithms as described here.
MTRL supports multi-task environments using MTEnv. These environments include MetaWorld and multi-task variants of DMControl Suite
Refer the tutorial to get started with MTRL.

Documentation

https://mtrl.readthedocs.io

Contributing to MTRL

There are several ways to contribute to MTRL.

Use MTRL in your research.
Contribute a new algorithm. We currently support 8 multi-task RL algorithms and are looking forward to adding more environments.
Check out the good-first-issues on GitHub and contribute to fixing those issues.
Check out additional details here.

Community

Ask questions in the chat or github issues:

Chat
Issues

Acknowledgements

Our implementation of SAC is inspired by Denis Yarats' implementation of SAC.
Project file pre-commit, mypy config, towncrier config, circleci etc are based on same files from Hydra.

Multi Task RL Baselines

Related tags

Overview

MTRL

Contents

Introduction

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

License

Citing MTRL

Setup

Usage

Documentation

Contributing to MTRL

Community

Acknowledgements

Owner

Facebook Research

Tutorial repo for an end-to-end Data Science project

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

The Adapter-Bot: All-In-One Controllable Conversational Model

Implementation of Online Label Smoothing in PyTorch

Sign-to-Speech for Sign Language Understanding: A case study of Nigerian Sign Language

A Multi-modal Perception Tracker (MPT) for speaker tracking using both audio and visual modalities

PyTorch implementation of Barlow Twins.

Repo for FUZE project. I will also publish some Linux kernel LPE exploits for various real world kernel vulnerabilities here. the samples are uploaded for education purposes for red and blue teams.

Locally Differentially Private Distributed Deep Learning via Knowledge Distillation (LDP-DL)

Demos of essentia classifiers hosted on replicate.ai

Code for the paper "Adversarially Regularized Autoencoders (ICML 2018)" by Zhao, Kim, Zhang, Rush and LeCun

Markov Attention Models

Analysis of Smiles through reservoir sampling & RDkit

A state-of-the-art semi-supervised method for image recognition

Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)

DAN: Unfolding the Alternating Optimization for Blind Super Resolution

FairFuzz: AFL extension targeting rare branches

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

Benchmarks for Object Detection in Aerial Images