Exploring the Dual-task Correlation for Pose Guided Person Image Generation

Last update: Dec 15, 2022

Overview

Dual-task Pose Transformer Network

The source code for our paper "Exploring Dual-task Correlation for Pose Guided Person Image Generation“ (CVPR2022)

Get Start

1) Requirement

Python 3.7.9
Pytorch 1.7.1
torchvision 0.8.2
CUDA 11.1
NVIDIA A100 40GB PCIe

2) Data Preperation

Following PATN, the dataset split files and extracted keypoints files can be obtained as follows:

DeepFashion

Download the DeepFashion dataset in-shop clothes retrival benchmark, and put them under the ./dataset/fashion directory.
Download train/test pairs and train/test keypoints annotations from Google Drive, including fasion-resize-pairs-train.csv, fasion-resize-pairs-test.csv, fasion-resize-annotation-train.csv, fasion-resize-annotation-train.csv, train.lst, test.lst, and put them under the ./dataset/fashion directory.
Split the raw image into the training set (./dataset/fashion/train) and test set (./dataset/fashion/test):

python data/generate_fashion_datasets.py

Market1501

Download the Market1501 dataset from here. Rename bounding_box_train and bounding_box_test as train and test, and put them under the ./dataset/market directory.
Download train/test key points annotations from Google Drive including market-pairs-train.csv, market-pairs-test.csv, market-annotation-train.csv, market-annotation-train.csv. Put these files under the ./dataset/market directory.

3) Train a model

DeepFashion

python train.py --name=DPTN_fashion --model=DPTN --dataset_mode=fashion --dataroot=./dataset/fashion --batchSize 32 --gpu_id=0

Market1501

python train.py --name=DPTN_market --model=DPTN --dataset_mode=market --dataroot=./dataset/market --dis_layer=3 --lambda_g=5 --lambda_rec 2 --t_s_ratio=0.8 --save_latest_freq=10400 --batchSize 32 --gpu_id=0

4) Test the model

You can directly download our test results from Google Drive: Deepfashion, Market1501.

DeepFashion

python test.py --name=DPTN_fashion --model=DPTN --dataset_mode=fashion --dataroot=./dataset/fashion --which_epoch latest --results_dir ./results/DPTN_fashion --batchSize 1 --gpu_id=0

Market1501

python test.py --name=DPTN_market --model=DPTN --dataset_mode=market --dataroot=./dataset/market --which_epoch latest --results_dir=./results/DPTN_market  --batchSize 1 --gpu_id=0

5) Evaluation

We adopt SSIM, PSNR, FID and LPIPS for the evaluation.

DeepFashion

python -m  metrics.metrics --gt_path=./dataset/fashion/test --distorated_path=./results/DPTN_fashion --fid_real_path=./dataset/fashion/train --name=./fashion

Market1501

python -m  metrics.metrics --gt_path=./dataset/market/test --distorated_path=./results/DPTN_market --fid_real_path=./dataset/market/train --name=./market --market

6) Pre-trained Model

Our pre-trained model can be downloaded from Google Drive: Deepfashion, Market1501.

Citation

Acknowledgement

We build our project based on pix2pix. Some dataset preprocessing methods are derived from PATN.

Exploring the Dual-task Correlation for Pose Guided Person Image Generation

Related tags

Overview

Dual-task Pose Transformer Network

Get Start

1) Requirement

2) Data Preperation

3) Train a model

4) Test the model

5) Evaluation

6) Pre-trained Model

Citation

Acknowledgement

Owner

3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A - Continual Learning Classification Challenge

Jarvis Project is a basic virtual assistant that uses TensorFlow for learning.

YOLOX + ROS(1, 2) object detection package

OpenDILab Multi-Agent Environment

Semantic Edge Detection with Diverse Deep Supervision

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Stock-Prediction - prediction of stock market movements using sentiment analysis and deep learning.

A knowledge base construction engine for richly formatted data

Code for our paper "Interactive Analysis of CNN Robustness"

The official TensorFlow implementation of the paper Action Transformer: A Self-Attention Model for Short-Time Pose-Based Human Action Recognition

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

Official implement of "CAT: Cross Attention in Vision Transformer".

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.

From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

Toontown House CT Edition

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

implicit displacement field

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Exploring the Dual-task Correlation for Pose Guided Person Image Generation

Related tags

Overview

Dual-task Pose Transformer Network

Get Start

1) Requirement

2) Data Preperation

3) Train a model

4) Test the model

5) Evaluation

6) Pre-trained Model

Citation

Acknowledgement

Owner

3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A - Continual Learning Classification Challenge

Jarvis Project is a basic virtual assistant that uses TensorFlow for learning.

YOLOX + ROS(1, 2) object detection package

OpenDILab Multi-Agent Environment

Semantic Edge Detection with Diverse Deep Supervision

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Stock-Prediction - prediction of stock market movements using sentiment analysis and deep learning.

A knowledge base construction engine for richly formatted data

Code for our paper "Interactive Analysis of CNN Robustness"

The official TensorFlow implementation of the paper Action Transformer: A Self-Attention Model for Short-Time Pose-Based Human Action Recognition

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Pytorch implementation of Cut-Thumbnail in the paper Cut-Thumbnail:A Novel Data Augmentation for Convolutional Neural Network.

Official implement of "CAT: Cross Attention in Vision Transformer".

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren*, Raymond A. Yeh*, Alexander G. Schwing.

From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

Toontown House CT Edition

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

implicit displacement field

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.