AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Last update: Dec 19, 2022

Related tags

Overview

SimSR

Code and dataset for the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning (AAAI-22).

Requirements

We assume you have access to a gpu that can run CUDA 11. All of the dependencies are in the conda_env.yml file.

conda env create -f conda_env.yml

After the instalation ends you can activate your environment with

conda activate simsr

Instructions

To train a SimSR agent on the cartpole swingup task from image-based observations run bash run.sh from the root of this directory. The run.sh file contains the following command, which you can modify to try different environments / hyperparamters.

DOMAIN=cartpole
TASK=swingup
SEED=1

MUJOCO_GL="egl" CUDA_VISIBLE_DEVICES=0 nohup python -u train.py \
	--domain_name ${DOMAIN} \
	--task_name ${TASK} \
	--encoder_type pixel \
	--action_repeat 4 \
	--pre_transform_image_size 84 \
	--image_size 84 \
	--work_dir ./tmp \
	--agent simsr_sac \
	--frame_stack 3\
	--seed ${SEED} --critic_lr 1e-3 \
	--actor_lr 1e-3 \
	--eval_freq 10000 \
	--batch_size 128 \
	--num_train_steps 260000 > ${DOMAIN}_${TASK}_${SEED}.log &

Note that the MuJoCo Python bindings support three different OpenGL rendering backends: "glfw", "egl", or "osmesa". You can also specify a particular backend to use by setting the MUJOCO_GL= environment variable to one of them.

To visualize progress with tensorboard run:

tensorboard --logdir ./path/to/your/log --port 6006

References

Please cite the paper SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning if you found the resources in the repository useful.

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning

Related tags

Overview

SimSR

Requirements

Instructions

References

Owner

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

“Robust Lightweight Facial Expression Recognition Network with Label Distribution Training”, AAAI 2021.

Source code for our Paper "Learning in High-Dimensional Feature Spaces Using ANOVA-Based Matrix-Vector Multiplication"

MaskTrackRCNN for video instance segmentation based on mmdetection

Deep Surface Reconstruction from Point Clouds with Visibility Information

[WACV21] Code for our paper: Samuel, Atzmon and Chechik, "From Generalized zero-shot learning to long-tail with class descriptors"

A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.

Vision-Language Pre-training for Image Captioning and Question Answering

SuRE Evaluation: A Supplementary Material

Grammar Induction using a Template Tree Approach

This repository contains the files for running the Patchify GUI.

TipToiDog - Tip Toi Dog With Python

Semantic code search implementation using Tensorflow framework and the source code data from the CodeSearchNet project

Using deep actor-critic model to learn best strategies in pair trading

Deep Learning Specialization by Andrew Ng, deeplearning.ai.

[ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning.

3DIAS: 3D Shape Reconstruction with Implicit Algebraic Surfaces (ICCV 2021)

Predicting future trajectories of people in cameras of novel scenarios and views.

Tracking Pipeline helps you to solve the tracking problem more easily

🕺Full body detection and tracking