Embodied Intelligence via Learning and Evolution

This is the code for the paper

Embodied Intelligence via Learning and Evolution
Agrim Gupta, Silvio Savarese, Surya Ganguli, Fei-Fei Li

The intertwined processes of learning and evolution in complex environmental niches have resulted in a remarkable diversity of morphological forms. Moreover, many aspects of animal intelligence are deeply embodied in these evolved morphologies. However, the principles governing relations between environmental complexity, evolved morphology, and the learnability of intelligent control, remain elusive, partially due to the substantial challenge of performing large-scale in silico experiments on evolution and learning. We introduce Deep Evolutionary Reinforcement Learning (DERL): a novel computational framework which can evolve diverse agent morphologies to learn challenging locomotion and manipulation tasks in complex environments using only low level egocentric sensory information. Leveraging DERL we demonstrate several relations between environmental complexity, morphological intelligence and the learnability of control.

Code Structure

The code consists of three main components:

UNIMAL Design Space: A UNIversal aniMAL morphological design space that is both highly expressive yet also enriched for useful controllable morphologies.
DERL: An efficient asynchronous method for parallelizing computations underlying learning and evolution across many compute nodes.
Evolutionary environments and evaluation tasks: A set of three evolutionary environments and eight evaluation tasks.

Setup

We provide Dockerfile for easy installation and development. If you prefer to work without docker please take a look at Dockerfile and ensure that your local system has all the necessary dependencies installed.

Evolving Unimals

# Build docker container. Ensure that MuJoCo license is present: docker/mjkey.txt
./scripts/build_docker.sh
# Evolve unimals. Please change MOUNT_DIR location inside run_docker_cpu.sh
./scripts/run_docker_cpu.sh python tools/evolution.py --cfg ./configs/evo/ft_test.yml NODE_ID 0

The default parameters assume that you are running the code on 16 machines. Please ensure that each machine has a minimum of 72 CPUs. While running the script on multiple nodes you would have to ensure that NODE_ID on each machine is unique and between [0, NUM_NODES - 1].

Visualizing Environments

If you have installed all dependencies in your local machine. You can visualize the environment as follows:

python tools/terrain_builder.py --cfg configs/evo/mvt.yml

Credit

This codebase would not have been possible without the following amazing open source codebases:

Embodied Intelligence via Learning and Evolution

Related tags

Overview

Embodied Intelligence via Learning and Evolution

Code Structure

Setup

Evolving Unimals

Visualizing Environments

Credit

Owner

Agrim Gupta

HiPAL: A Deep Framework for Physician Burnout Prediction Using Activity Logs in Electronic Health Records

Reinforcement learning algorithms in RLlib

CPPE - 5 (Medical Personal Protective Equipment) is a new challenging object detection dataset

FreeSOLO for unsupervised instance segmentation, CVPR 2022

Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

Autonomous Movement from Simultaneous Localization and Mapping

The devkit of the nuScenes dataset.

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

DSL for matching Python ASTs

GLANet - The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

Official implementation of the paper DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

Using machine learning to predict undergrad college admissions.

Classic Papers for Beginners and Impact Scope for Authors.

This is a classifier which basically predicts whether there is a gun law in a state or not, depending on various things like murder rates etc.

The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color

Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.

Mememoji - A facial expression classification system that recognizes 6 basic emotions: happy, sad, surprise, fear, anger and neutral.

Pytorch version of SfmLearner from Tinghui Zhou et al.