Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

Last update: Dec 27, 2022

Overview

[MM'21] Constrained Graphic Layout Generation via Latent Optimization

This repository provides the official code for the paper "Constrained Graphic Layout Generation via Latent Optimization", especially the code for:

LayoutGAN++: generative adversarial networks for layout generation
CLG-LO: a framework for generating layouts that satisfy constraints
Layout evaluation: measuring the quantitative metrics of Layout FID, Maximum IoU, Alignment, and Overlap for generated layouts

Installation

Clone this repository

git clone https://github.com/ktrk115/const_layout.git
cd const_layout

Create a new conda environment (Python 3.8)

conda create -n const_layout python=3.8
conda activate const_layout

Install PyTorch 1.8.* and the corresponding versoin of PyTorch Geometric
Install the other dependent libraries
```
pip install -r requirements.txt
```
Prepare data (see this instruction)
Download pre-trained models
```
./download_model.sh
```

Development environment

Ubuntu 18.04, CUDA 11.1

LayoutGAN++

Architecture

Training animation

Generate layouts with LayoutGAN++

python generate.py pretrained/layoutganpp_rico.pth.tar --out_path output/generated_layouts.pkl --num_save 5

Train LayoutGAN++ model

python train.py --dataset rico --batch_size 64 --iteration 200000 --latent_size 4 --lr 1e-05 --G_d_model 256 --G_nhead 4 --G_num_layers 8 --D_d_model 256 --D_nhead 4 --D_num_layers 8

CLG-LO

w/ beautification constraints	w/ relational constraints

Generate layouts with beautification constraints

python generate_const.py pretrained/layoutganpp_publaynet.pth.tar --const_type beautify --out_path output/beautify/generated_layouts.pkl --num_save 5

Generate layouts with relational constraints

python generate_const.py pretrained/layoutganpp_publaynet.pth.tar --const_type relation --out_path output/relation/generated_layouts.pkl --num_save 5

Layout evaluation

Evaluate generated layouts

python eval.py rico output/generated_layouts.pkl

A pickle file should be a list of layouts, where each layout is a tuple of bounding boxes and labels. The bounding box is represented by [x, y, width, height] in normalized coordinates, and the label is represented by an index. An example is shown below.

In [x]: layouts
Out[x]:
[(array([[0.47403812, 0.11276676, 0.6250037 , 0.02210438],
         [0.49971417, 0.8550553 , 0.81388366, 0.03492427],
         [0.49919674, 0.47857162, 0.81024694, 0.7070079 ]], dtype=float32),
  array([0, 0, 3]),
  ...

Citation

If this repository helps your research, please consider citing our paper.

@inproceedings{Kikuchi2021,
    title = {Constrained Graphic Layout Generation via Latent Optimization},
    author = {Kotaro Kikuchi and Edgar Simo-Serra and Mayu Otani and Kota Yamaguchi},
    booktitle = {Proceedings of the ACM International Conference on Multimedia},
    series = {MM '21},
    volume = {},
    year = {2021},
    pages = {},
    doi = {10.1145/3474085.3475497}
}

Licence

GNU AGPLv3

Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

Related tags

Overview

[MM'21] Constrained Graphic Layout Generation via Latent Optimization

Installation

Development environment

LayoutGAN++

Generate layouts with LayoutGAN++

Train LayoutGAN++ model

CLG-LO

Generate layouts with beautification constraints

Generate layouts with relational constraints

Layout evaluation

Evaluate generated layouts

Citation

Licence

Related repositories

Owner

Kotaro Kikuchi

This is the source code for: Context-aware Entity Typing in Knowledge Graphs.

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

A Traffic Sign Recognition Project which can help the driver recognise the signs via text as well as audio. Can be used at Night also.

Tensors and neural networks in Haskell

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

Reinforcement Learning for Portfolio Management

ParaGen is a PyTorch deep learning framework for parallel sequence generation

Fusion-in-Decoder Distilling Knowledge from Reader to Retriever for Question Answering

(AAAI2020)Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

Simple Pixelbot for Diablo 2 Resurrected written in python and opencv.

AsymmetricGAN - Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

ICSS - Interactive Continual Semantic Segmentation

This is a package for LiDARTag, described in paper: LiDARTag: A Real-Time Fiducial Tag System for Point Clouds

Efficient Online Bayesian Inference for Neural Bandits

CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection

Official implementation of the method ContIG, for self-supervised learning from medical imaging with genomics

Recognize Handwritten Digits using Deep Learning on the browser itself.

Deep generative modeling for time-stamped heterogeneous data, enabling high-fidelity models for a large variety of spatio-temporal domains.

library for nonlinear optimization, wrapping many algorithms for global and local, constrained or unconstrained, optimization

RobustVideoMatting and background composing in one model by using onnxruntime.