Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Last update: Dec 01, 2022

Overview

PortraitNet

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device". @ CAD&Graphics 2019

Introduction

We propose a real-time portrait segmentation model, called PortraitNet, that can run effectively and efficiently on mobile device. PortraitNet is based on a lightweight U-shape architecture with two auxiliary losses at the training stage, while no additional cost is required at the testing stage for portrait inference.

Portrait segmentation applications on mobile device.

Experimental setup

Requirements

python 2.7
PyTorch 0.3.0.post4
Jupyter Notebook
pip install easydict matplotlib tqdm opencv-python scipy pyyaml numpy

Download datasets

EG1800 Since several image URL links are invalid in the original EG1800 dataset, we finally use 1447 images for training and 289 images for validation.
Supervise-Portrait Supervise-Portrait is a portrait segmentation dataset collected from the public human segmentation dataset Supervise.ly using the same data process as EG1800.

Training

Network Architecture

Overview of PortraitNet.

Training Steps

Download the datasets (EG1800 or Supervise-Portriat). If you want to training at your own dataset, you need to modify data/datasets.py and data/datasets_portraitseg.py.
Prepare training/testing files, like data/select_data/eg1800_train.txt and data/select_data/eg1800_test.txt.
Select and modify the parameters in the folder of config.
Start the training with single gpu:

cd myTrain
python2.7 train.py

Testing

In the folder of myTest:

you can use EvalModel.ipynb to test on testing datasets.
you can use VideoTest.ipynb to test on a single image or video.

Visualization

Using tensorboard to visualize the training process:

cd path_to_save_model
tensorboard --logdir='./log'

Download models

from Dropbox:

mobilenetv2_eg1800_with_two_auxiliary_losses(Training on EG1800 with two auxiliary losses)
mobilenetv2_supervise_portrait_with_two_auxiliary_losses(Training on Supervise-Portrait with two auxiliary losses)
mobilenetv2_total_with_prior_channel(Training on Human with prior channel)

from Baidu Cloud:

mobilenetv2_eg1800_with_two_auxiliary_losses(Training on EG1800 with two auxiliary losses)
mobilenetv2_supervise_portrait_with_two_auxiliary_losses(Training on Supervise-Portrait with two auxiliary losses)
mobilenetv2_total_with_prior_channel(Training on Human with prior channel)

Code for the paper "PortraitNet: Real-time portrait segmentation network for mobile device" @ CAD&Graphics2019

Related tags

Overview

PortraitNet

Introduction

Experimental setup

Requirements

Download datasets

Training

Network Architecture

Training Steps

Testing

Visualization

Download models

Owner

This is an official implementation of our CVPR 2021 paper "Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression" (https://arxiv.org/abs/2104.02300)

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

Council-GAN - Implementation for our paper Breaking the Cycle - Colleagues are all you need (CVPR 2020)

Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

Company clustering with K-means/GMM and visualization with PCA, t-SNE, using SSAN relation extraction

Implementation of the paper "Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning"

A2LP for short, ECCV2020 spotlight, Investigating SSL principles for UDA problems

Using some basic methods to show linkages and transformations of robotic arms

Combining Diverse Feature Priors

Alias-Free Generative Adversarial Networks (StyleGAN3) Official PyTorch implementation

Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"

Accuracy Aligned. Concise Implementation of Swin Transformer

An Artificial Intelligence trying to drive a car by itself on a user created map

🛰️ Awesome Satellite Imagery Datasets

nextPARS, a novel Illumina-based implementation of in-vitro parallel probing of RNA structures.

TransCD: Scene Change Detection via Transformer-based Architecture

Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild

This is the official code for the paper "Ad2Attack: Adaptive Adversarial Attack for Real-Time UAV Tracking".

Python 3 module to print out long strings of text with intervals of time inbetween

Fantasy Points Prediction and Dream Team Formation