Offline Reinforcement Learning with Implicit Q-Learning

This repository contains the official implementation of Offline Reinforcement Learning with Implicit Q-Learning by Ilya Kostrikov, Ashvin Nair, and Sergey Levine.

If you use this code for your research, please consider citing the paper:

@article{kostrikov2021iql,
    title={Offline Reinforcement Learning with Implicit Q-Learning},
    author={Ilya Kostrikov and Ashvin Nair and Sergey Levine},
    year={2021},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

How to run the code

Install dependencies

pip install -r requirements.txt

See instructions for CUDA.

Run training

Locomotion

python train_offline.py --env_name=halfcheetah-medium-expert-v2 --config=configs/mujoco_config.py

AntMaze

python train_offline.py --env_name=antmaze-large-play-v0 --config=configs/antmaze_config.py --eval_episodes=100 --eval_interval=100000

Kitchen and Adroit

python train_offline.py --env_name=pen-human-v0 --config=configs/kitchen_config.py

Misc

The implementation is based on JAXRL.

Offline Reinforcement Learning with Implicit Q-Learning

Related tags

Overview

Offline Reinforcement Learning with Implicit Q-Learning

How to run the code

Install dependencies

Run training

Misc

Owner

Ilya Kostrikov

Scaling and Benchmarking Self-Supervised Visual Representation Learning

Implementation of Auto-Conditioned Recurrent Networks for Extended Complex Human Motion Synthesis

Research code for Arxiv paper "Camera Motion Agnostic 3D Human Pose Estimation"

Implementation of ProteinBERT in Pytorch

This repository is an implementation of our NeurIPS 2021 paper (Stylized Dialogue Generation with Multi-Pass Dual Learning) in PyTorch.

Deep Latent Force Models

PyTorch code for our ECCV 2018 paper "Image Super-Resolution Using Very Deep Residual Channel Attention Networks"

Creating multimodal multitask models

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

Zeyuan Chen, Yangchao Wang, Yang Yang and Dong Liu.

Code for "ATISS: Autoregressive Transformers for Indoor Scene Synthesis", NeurIPS 2021

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

Calibrated Hyperspectral Image Reconstruction via Graph-based Self-Tuning Network.

Nonnegative spatial factorization for multivariate count data

Convnext-tf - Unofficial tensorflow keras implementation of ConvNeXt

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Keras attention models including botnet,CoaT,CoAtNet,CMT,cotnet,halonet,resnest,resnext,resnetd,volo,mlp-mixer,resmlp,gmlp,levit

Hyperbolic Hierarchical Clustering.

Source code of all the projects of Udacity Self-Driving Car Engineer Nanodegree.

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing