Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022

Last update: Jan 03, 2023

Related tags

Overview

MetaMorph: Learning Universal Controllers with Transformers

This is the code for the paper

MetaMorph: Learning Universal Controllers with Transformers
Agrim Gupta, Linxi Fan, Surya Ganguli, Fei-Fei Li

Multiple domains like vision, natural language, and audio are witnessing tremendous progress by leveraging Transformers for large scale pre-training followed by task specific fine tuning. In contrast, in robotics we primarily train a single robot for a single task. However, modular robot systems now allow for the flexible combination of general-purpose building blocks into task optimized morphologies. However, given the exponentially large number of possible robot morphologies, training a controller for each new design is impractical. In this work, we propose MetaMorph, a Transformer based approach to learn a universal controller over a modular robot design space. MetaMorph is based on the insight that robot morphology is just another modality on which we can condition the output of a Transformer. Through extensive experiments we demonstrate that large scale pre-training on a variety of robot morphologies results in policies with combinatorial generalization capabilities, including zero shot generalization to unseen robot morphologies. We further demonstrate that our pre-trained policy can be used for sample-efficient transfer to completely new robot morphologies and tasks.

Code Structure

The code consists of two main components:

Metamorph: Code for joint pre-training of different robots.
Environments and evaluation tasks: Three pre-training environments and two evaluation environments.

Benchmark

We also provide Unimal-100 benchmark. The benchmark consists of 100 train morphologies, 1600 morphologies with dynamics variations, 800 morphologies with kinematics variations, and 100 test morphologies.

# Install gdown
pip install gdown
# Download data
gdown 1LyKYTCevnqWrDle1LTBMlBF58RmCjSzM
# Unzip
unzip unimals_100.zip

Setup

We provide Dockerfile for easy installation and development. If you prefer to work without docker please take a look at Dockerfile and ensure that your local system has all the necessary dependencies installed.

Training

# Build docker container. Ensure that MuJoCo license is present: docker/mjkey.txt
./scripts/build_docker.sh
# Joint pre-training. Please change MOUNT_DIR location inside run_docker_gpu.sh
# Finally ensure that ENV.WALKER_DIR points to benchmark files and is accessible
# from docker.
./scripts/run_docker_gpu.sh python tools/train_ppo.py --cfg ./configs/ft.yaml

The default parameters assume that you are running the code on a machine with atlesat 1 GPU.

Citation

If you find this code useful, please consider citing:

@inproceedings{
    gupta2022metamorph,
    title={MetaMorph: Learning Universal Controllers with Transformers},
    author={Agrim Gupta and Linxi Fan and Surya Ganguli and Li Fei-Fei},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=Opmqtk_GvYL}
}

Credit

This codebase would not have been possible without the following amazing open source codebases:

Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022

Related tags

Overview

MetaMorph: Learning Universal Controllers with Transformers

Code Structure

Benchmark

Setup

Training

Citation

Credit

Owner

Agrim Gupta

Oriented Response Networks, in CVPR 2017

(CVPR2021) DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation

Help you understand Manual and w/ Clutch point while driving.

This repository implements WGAN_GP.

A LiDAR point cloud cluster for panoptic segmentation

The official repository for paper ''Domain Generalization for Vision-based Driving Trajectory Generation'' submitted to ICRA 2022

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

An Artificial Intelligence trying to drive a car by itself on a user created map

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'

Real-time Object Detection for Streaming Perception, CVPR 2022

This repo is for segmentation of T2 hyp regions in gliomas.

Neural Scene Graphs for Dynamic Scene (CVPR 2021)

Barlow Twins and HSIC

render sprites into your desktop environment as shaped windows using GTK

Head and Neck Tumour Segmentation and Prediction of Patient Survival Project

OBBDetection is a oriented object detection library, which is based on MMdetection.

Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022

Related tags

Overview

MetaMorph: Learning Universal Controllers with Transformers

Code Structure

Benchmark

Setup

Training

Citation

Credit

Owner

Agrim Gupta

Oriented Response Networks, in CVPR 2017

(CVPR2021) DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation

Help you understand Manual and w/ Clutch point while driving.

This repository implements WGAN_GP.

A LiDAR point cloud cluster for panoptic segmentation

The official repository for paper ''Domain Generalization for Vision-based Driving Trajectory Generation'' submitted to ICRA 2022

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

An Artificial Intelligence trying to drive a car by itself on a user created map

NuPIC Studio is an all­-in-­one tool that allows users create a HTM neural network from scratch

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'

Real-time Object Detection for Streaming Perception, CVPR 2022

This repo is for segmentation of T2 hyp regions in gliomas.

Neural Scene Graphs for Dynamic Scene (CVPR 2021)

Barlow Twins and HSIC

render sprites into your desktop environment as shaped windows using GTK

Head and Neck Tumour Segmentation and Prediction of Patient Survival Project

OBBDetection is a oriented object detection library, which is based on MMdetection.

NuPIC Studio is an all-in-one tool that allows users create a HTM neural network from scratch