Unsupervised Learning of Compositional Energy Concepts

This is the pytorch code for the paper Unsupervised Learning of Compositional Energy Concepts.

Demo

Please download a pretrained model at this link and then execute the following code to test a pretrained CelebA-HQ 128x128 COMET model

python demo.py im_path=im0.png

Global Factor Decomposition

Please utilize the following command to run global factor decomposition on CelebA-HQ (or other datasets)

python train.py --exp=celebahq --batch_size=12 --gpus=1 --cuda --train --dataset=celebahq --step_lr=500.0

You may further run the code on high-resolution 128x128 images below

python train.py --exp=celebahq_128 --batch_size=12 --gpus=1 --cuda --train --dataset=celebahq_128 --step_lr=500.0

Local Factor Decomposition

Please utilize the following command to run local factor decomposition on CLEVR

python train.py --exp=clevr_local_decomp --num_steps=5 --step_lr=1000.0 --components=4 --dataset=clevr --cuda --train --batch_size=24 --latent_dim=16 --recurrent_model --pos_embed

Dataset Download

Please utilize the following link to download the CLEVR dataset utilized in our experiments. Downloads for additional datasets will be posted soon. Feel free to raise an issue if there is a particular dataset you would like downloaded

Citing our Paper

If you find our code useful for your research, please consider citing

@inproceedings{du2021comet,
  title={Unsupervised Learning of Compositional Energy Concepts},
  author={Du, Yilun and Li, Shuang and Sharma, Yash and Tenenbaum, B. Joshua
  and Mordatch, Igor},
  booktitle={Advances in Neural Information Processing Systems},
  year={2021}
}

[NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts

Related tags

Overview

Unsupervised Learning of Compositional Energy Concepts

Demo

Global Factor Decomposition

Local Factor Decomposition

Dataset Download

Citing our Paper

Owner

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

This repository is an official implementation of the paper MOTR: End-to-End Multiple-Object Tracking with TRansformer.

A solution to the 2D Ising model of ferromagnetism, implemented using the Metropolis algorithm

Parasite: a tool allowing you to compress and decompress files, to reduce their size

This repository is the official implementation of the Hybrid Self-Attention NEAT algorithm.

Official Tensorflow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation (ICLR 2020)

Accurate Phylogenetic Inference with Symmetry-Preserving Neural Networks

Instance-wise Occlusion and Depth Orders in Natural Scenes (CVPR 2022)

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Apply AnimeGAN-v2 across frames of a video clip

ISBI 2022: Cross-level Contrastive Learning and Consistency Constraint for Semi-supervised Medical Image.

3D HourGlass Networks for Human Pose Estimation Through Videos

Code for Mesh Convolution Using a Learned Kernel Basis

Controlling the MicriSpotAI robot from scratch

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

SpanNER: Named EntityRe-/Recognition as Span Prediction

Code for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming

CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.

Global Rhythm Style Transfer Without Text Transcriptions