Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Related tags

Overview

Official code for Continual Learning In Environments With Polynomial Mixing Times

Continual Learning in Environments with Polynomial Mixing Times

This repository provides official code base for the paper "Continual Learning in Environments with Polynomial Mixing Times"

Basic Setup

Clone this repository and then follow this command

cd polynomial-mixing-times

Create either use a python virtualenv or a conda environment and activate it.

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a running script with all relevant hyperparameters used for both baselines and our proposed model. One can run run_bottleneck.sh to run all the models.

To run the experiments of the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution and a random seed of 1, use the following command

bash run_bottleneck.sh 1 4 "random"

Available Models

Online Q learning
Q learning with Replay
Q learning w/ Dyna
Model based n-step TD
Vanilla Policy Gradient
Onpolicy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used python 3.7 version to run all our experiments.

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Related tags

Overview

Continual Learning in Environments with Polynomial Mixing Times

Basic Setup

Running the experiments

Available Models

List of Environments

System requirements

Owner

Sharath Raparthy

Real life contra a deep learning project built using mediapipe and openc

Trading Strategies for Freqtrade

Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Official Implement of CVPR 2021 paper “Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting”

Evaluating Cross-lingual Sentence Representations

Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)

[CVPR'22] COAP: Learning Compositional Occupancy of People

A complete, self-contained example for training ImageNet at state-of-the-art speed with FFCV

Dashboard for the COVID19 spread

Official repo for AutoInt: Automatic Integration for Fast Neural Volume Rendering in CVPR 2021

Over9000 optimizer

OptNet: Differentiable Optimization as a Layer in Neural Networks

A python package to perform same transformation to coco-annotation as performed on the image.

MAU: A Motion-Aware Unit for Video Prediction and Beyond, NeurIPS2021

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

Code for ACL 2019 Paper: "COMET: Commonsense Transformers for Automatic Knowledge Graph Construction"