Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Last update: Aug 09, 2022

Related tags

Overview

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Code for the paper Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning (TMM 2021).

Introduction

Automatic typography is important because it helps designers avoid highly repetitive tasks and amateur users achieve high-quality textual layout designs. However, there are often many parameters and complicated aesthetic rules that need to be adjusted in automatic typography work. In this paper, we propose an efficient deep aesthetics learning approach to generate harmonious textual layout over natural images, which can be decomposed into two stages, saliency-aware text region proposal and aesthetics-based textual layout selection. Our method incorporates both semantic features and visual perception principles. First, we propose a semantic visual saliency detection network combined with a text region proposal algorithm to generate candidate text anchors with various positions and sizes. Second, a discriminative deep aesthetics scoring model is developed to assess the aesthetic quality of the candidate textual layouts. The results demonstrate that our method can generate harmonious textual layouts in various actual scenarios with better performance.

Dependencies and Installation

Python 3
PyTorch >= 1.0

Notes of compilation

For Python3 users, before you start to build the source code and install the packages, please specify the architecture of your GPU card and CUDA_HOME path in both ./roi_align/make.sh and ./rod_align/make.sh
Build and install by running:
```
bash make_all.sh
```

Usage

Download the source code and the pretrained models: gdi-basnet and SMT.
Make sure your device is CUDA enabled. Build and install source code of roi_align_api and rod_align_api.
Run SmartText_demo.py to test the pretrained model on your images.
```
python SmartText_demo.py -opt test_opt.yml
```

Acknowledgement

This work is the extension of our conference version (ICME 2020). Some codes of this repository benefit from BASNet and GAIC. Thanks for their excellent work!

Citation

If you find this work useful, please cite our paper:

@article{li2021harmonious,
    title     = {Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning},
    author    = {Li, Chenhui and Zhang, Peiying and Wang, Changbo},
    journal   = {IEEE Transactions on Multimedia},
    year      = {2021},
    publisher = {IEEE}
}

Contact

If you have any question, contact us through email at [email protected].

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Related tags

Overview

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

Introduction

Dependencies and Installation

Notes of compilation

Usage

Acknowledgement

Citation

Contact

Owner

Code for "Single-view robot pose and joint angle estimation via render & compare", CVPR 2021 (Oral).

paper: Hyperspectral Remote Sensing Image Classification Using Deep Convolutional Capsule Network

This repository is an official implementation of the paper MOTR: End-to-End Multiple-Object Tracking with TRansformer.

Semi-Supervised Graph Prototypical Networks for Hyperspectral Image Classification, IGARSS, 2021.

A collection of easy-to-use, ready-to-use, interesting deep neural network models

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Official Implementation of DAFormer: Improving Network Architectures and Training Strategies for Domain-Adaptive Semantic Segmentation

CS5242_2021 - Neural Networks and Deep Learning, NUS CS5242, 2021

[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation

Permeability Prediction Via Multi Scale 3D CNN

DataCLUE: 国内首个以数据为中心的AI测评（含模型分析报告）

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Code for paper "Learning to Reweight Examples for Robust Deep Learning"

Jupyter notebooks showing best practices for using cx_Oracle, the Python DB API for Oracle Database

In this project we predict the forest cover type using the cartographic variables in the training/test datasets.

Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes, ICCV 2017

Pytorch implementation of Deep Recursive Residual Network for Super Resolution (DRRN)

A flexible submap-based framework towards spatio-temporally consistent volumetric mapping and scene understanding.

We utilize deep reinforcement learning to obtain favorable trajectories for visual-inertial system calibration.

Neural network for stock price prediction