Robot Reinforcement Learning on the Constraint Manifold

Last update: Dec 05, 2022

Overview

Acting on the Tangent Space of the Constraint Manifold

Implementation of "Robot Reinforcement Learning on the Constraint Manifold"

[paper] [website]
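To make the phrase "acting on the tangent space" concrete, here is a minimal NumPy sketch, not taken from this repository. It assumes a single equality constraint c(q) = 0 (a point kept on the unit circle, chosen only for illustration) and builds a basis of the null space of the constraint Jacobian. An action expressed in that basis yields a velocity that keeps the constraint satisfied to first order. The names `tangent_basis`, `constraint_jacobian`, `alpha`, and `q_dot` are hypothetical and do not correspond to identifiers in the code base.

```python
import numpy as np


def tangent_basis(jacobian, tol=1e-10):
    """Orthonormal basis of the null space of the constraint Jacobian.

    The columns of the returned matrix span {v : J v = 0}, i.e. the
    tangent space of the constraint manifold at the current point.
    """
    _, singular_values, vt = np.linalg.svd(jacobian)
    rank = int(np.sum(singular_values > tol))
    return vt[rank:].T


def constraint_jacobian(q):
    """Jacobian of the illustrative constraint c(q) = q.q - 1 (unit circle)."""
    return 2.0 * q.reshape(1, -1)


q = np.array([1.0, 0.0])                 # a point on the unit circle
N = tangent_basis(constraint_jacobian(q))

alpha = np.array([0.5])                  # action in tangent-space coordinates
q_dot = N @ alpha                        # velocity tangent to the manifold

# The constraint's time derivative J(q) q_dot vanishes by construction.
constraint_rate = constraint_jacobian(q) @ q_dot
```

Because the agent only chooses `alpha`, every action it can take is mapped to a motion along the constraint rather than across it. See the paper for the method actually implemented here.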

Install

pip install -e .

Run Examples

cd examples

CircularMotion Environment.

Environment options: [A, E, T]. Algorithm options: [TRPO, PPO, SAC, DDPG, TD3].

python circle_exp.py --render --env A --alg TRPO

PlanarAirHockey Environment.

Environment options: [H, D, UH, UD]. Algorithm options: [TRPO, PPO, SAC, DDPG, TD3].

python planar_air_hockey_exp.py --debug-gui --env H --alg SAC

IiwaAirHockey Environment.

Environment options: [7H, RMP]. Algorithm options: [TRPO, PPO, SAC, DDPG, TD3].

python iiwa_air_hockey_exp.py --debug-gui --env 7H --alg SAC

CollisionAvoidance Environment.

Environment options: [C]. Algorithm options: [TRPO, PPO, SAC, DDPG, TD3].

python collision_avoidance_exp.py --render --env C --alg SAC
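To run more than one combination, the option lists above can be expanded into explicit command lines. The following is a small helper sketch, not part of the repository. It only reuses the script names, flags, and option values shown above, and assumes each script accepts every listed environment/algorithm pair. `EXPERIMENTS` and `build_commands` are hypothetical names.

```python
from itertools import product

# Algorithm options shared by all examples above.
ALGS = ["TRPO", "PPO", "SAC", "DDPG", "TD3"]

# Script -> (visualization flag used in the example above, environment options).
EXPERIMENTS = {
    "circle_exp.py": ("--render", ["A", "E", "T"]),
    "planar_air_hockey_exp.py": ("--debug-gui", ["H", "D", "UH", "UD"]),
    "iiwa_air_hockey_exp.py": ("--debug-gui", ["7H", "RMP"]),
    "collision_avoidance_exp.py": ("--render", ["C"]),
}


def build_commands():
    """Return one argument list per (script, environment, algorithm) combination."""
    commands = []
    for script, (gui_flag, envs) in EXPERIMENTS.items():
        for env, alg in product(envs, ALGS):
            commands.append(["python", script, gui_flag, "--env", env, "--alg", alg])
    return commands


commands = build_commands()
for cmd in commands:
    print(" ".join(cmd))
```

Each printed line can be pasted into a shell from inside the examples directory, or the argument lists can be passed to `subprocess.run` one at a time.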

Bibtex

@inproceedings{CORL_2021_Learning_on_the_Manifold,
  author =      "Liu, P. and Tateo, D. and Bou-Ammar, H. and Peters, J.",
  year =        "2021",
  title =       "Robot Reinforcement Learning on the Constraint Manifold",
  booktitle =   "Proceedings of the Conference on Robot Learning (CoRL)",
  key =         "robot learning, constrained reinforcement learning, safe exploration",
}
