(AAAI2020)Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

Last update: Aug 04, 2022

Overview

This repository contains the PyTorch source code for the AAAI 2020 oral paper "Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing" by Haoyu He, Jing Zhang, Qiming Zhang, and Dacheng Tao.

Grapy-ML:

Getting Started:

Environment:

PyTorch = 1.1.0
torchvision
scipy
tensorboardX
numpy
opencv-python
matplotlib
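
A quick way to confirm the packages above are installed is to look up their import names. The snippet below is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and it only checks that each module can be found, not that the PyTorch version is 1.1.0.

```python
import importlib.util

# Import names for the packages listed above. opencv-python is imported
# as cv2; the other import names match the package names.
REQUIRED_MODULES = ["torch", "torchvision", "scipy", "tensorboardX",
                    "numpy", "cv2", "matplotlib"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the entries of `names` that cannot be found in this environment."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_modules()
    print("missing:", ", ".join(missing) if missing else "none")
```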

Data Preparation:

You need to download the three datasets: CIHP, PASCAL-Person-Part, and ATR. The CIHP and ATR datasets can be found in this repository, from which our code also borrows heavily.

Then arrange the datasets under the following folder, with the images organized according to the provided file structure.

/data/dataset/

Testing:

The pretrained models and some trained models are provided here for testing and training.

Model Name	Description	Derived from
deeplab_v3plus_v3.pth	Deeplab v3+ pretrained weights
CIHP_pretrain.pth	The reproduced Deeplab v3+ model trained on CIHP dataset	deeplab_v3plus_v3.pth
CIHP_trained.pth	GPM model trained on CIHP dataset	CIHP_pretrain.pth
deeplab_multi-dataset.pth	The reproduced multi-task learning Deeplab v3+ model trained on CIHP, PASCAL-Person-Part and ATR dataset	deeplab_v3plus_v3.pth
GPM-ML_multi-dataset.pth	Grapy-ML model trained on CIHP, PASCAL-Person-Part and ATR dataset	deeplab_multi-dataset.pth
GPM-ML_finetune_PASCAL.pth	Grapy-ML model finetuned on PASCAL-Person-Part dataset	GPM-ML_multi-dataset.pth
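
The "Derived from" column describes a chain of checkpoints. As an illustration only, the sketch below encodes that column as a dictionary and walks it back to the root weights; the names `DERIVED_FROM` and `lineage` are hypothetical and do not appear in the repository.

```python
# The "Derived from" column of the table above. None marks a checkpoint
# for which the table lists no parent.
DERIVED_FROM = {
    "deeplab_v3plus_v3.pth": None,
    "CIHP_pretrain.pth": "deeplab_v3plus_v3.pth",
    "CIHP_trained.pth": "CIHP_pretrain.pth",
    "deeplab_multi-dataset.pth": "deeplab_v3plus_v3.pth",
    "GPM-ML_multi-dataset.pth": "deeplab_multi-dataset.pth",
    "GPM-ML_finetune_PASCAL.pth": "GPM-ML_multi-dataset.pth",
}


def lineage(name):
    """Return the chain of checkpoints leading to `name`, root first."""
    chain = []
    while name is not None:
        chain.append(name)
        name = DERIVED_FROM[name]
    return chain[::-1]


if __name__ == "__main__":
    print(" -> ".join(lineage("GPM-ML_finetune_PASCAL.pth")))
```

Reading the chain root-first gives the order in which the checkpoints would have to be produced when training from scratch.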

To test, run the following two scripts:

bash eval_gpm.sh
bash eval_gpm_ml.sh

Training:

GPM:

Before training GPM, you first need a Deeplab pretrained model (e.g. CIHP_dlab.pth) for each dataset. This step provides a reliable initial raw result for the GSA operation in GPM.

bash train_dlab.sh

The ImageNet pretrained model is listed in the table in the Testing section. In the script, switch the dataset name and the number of target classes to match the dataset you want (CIHP: 20 classes, PASCAL: 7 classes, ATR: 18 classes).

Next, use the Deeplab pretrained model to further train the GPM model.
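
A mismatch between the dataset and the number of target classes is easy to introduce when editing the script. The sketch below records the class counts stated above and flags label ids that do not fit. It assumes labels are 0-based integers with every class counted in the total, which this README does not state, and the helper name `out_of_range_labels` is hypothetical.

```python
# Class counts as stated in this README.
NUM_CLASSES = {"CIHP": 20, "PASCAL": 7, "ATR": 18}


def out_of_range_labels(label_ids, dataset):
    """Return the sorted label ids outside [0, NUM_CLASSES[dataset]).

    Assumption: labels are 0-based integers and every class, including
    any background class, is counted in NUM_CLASSES.
    """
    n = NUM_CLASSES[dataset]
    return sorted({i for i in label_ids if not 0 <= i < n})


if __name__ == "__main__":
    # Ids 7 and 19 do not fit a 7-class PASCAL setup.
    print(out_of_range_labels([0, 6, 7, 19], "PASCAL"))
```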

bash train_gpm.sh

To reproduce the results, we recommend following the training settings in our paper.

GPM-ML:

First, run the Deeplab pretraining with the following script:

bash train_dlab_ml.sh

The multi-dataset Deeplab v3+ is formulated as a simple multi-task learning problem.

Then, you can train the GPM-ML model on the training sets of all three datasets:

bash train_gpm_ml_all.sh

After this phase, the first two levels of the GPM-ML model should be more robust and generalize better.

Finally, you can finetune on each dataset starting from the unified pretrained model, for example on PASCAL-Person-Part:

bash train_gpm_ml_pascal.sh
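
The three GPM-ML stages above have to run in order, since each one starts from the model produced by the previous stage. The following is a minimal sketch of a driver that runs them in sequence. It assumes the scripts sit in the current working directory and already point at the right checkpoints; `stage_commands` and `run_stages` are hypothetical helpers, not part of the repository.

```python
import subprocess

# Script names taken from the steps above, in the order they are described.
GPM_ML_STAGES = ["train_dlab_ml.sh", "train_gpm_ml_all.sh", "train_gpm_ml_pascal.sh"]


def stage_commands(stages=GPM_ML_STAGES):
    """Build one `bash <script>` command per stage."""
    return [["bash", script] for script in stages]


def run_stages(commands, dry_run=True):
    """Print each command, or run it and stop at the first failure."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True raises CalledProcessError on a non-zero exit code,
            # so later stages never start from a failed one.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_stages(stage_commands(), dry_run=True)
```

With `dry_run=True` the driver only prints the commands, which is a cheap way to check the order before committing to a long training run.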

Citation:

@inproceedings{he2020grapy,
title={Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing},
author={He, Haoyu and Zhang, Jing and Zhang, Qiming and Tao, Dacheng},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
year={2020}
}

Maintainer:

[email protected]
