Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral)

Last update: Dec 30, 2022

Overview

Joint Discriminative and Generative Learning for Person Re-identification

[Project] [Paper] [YouTube] [Bilibili] [Poster] [Supp]

Joint Discriminative and Generative Learning for Person Re-identification, CVPR 2019 (Oral)
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz

News
Features
Prerequisites
Getting Started
DG-Market
Tips
Citation
Related Work
License

News

02/18/2021: We release DG-Net++: the extention of DG-Net for unsupervised cross-domain re-id.
08/24/2019: We add the direct transfer learning results of DG-Net here.
08/01/2019: We add the support of multi-GPU training: python train.py --config configs/latest.yaml --gpu_ids 0,1.

Features

We have supported:

Multi-GPU training (fp32)
APEX to save GPU memory (fp16/fp32)
Multi-query evaluation
Random erasing
Visualize training curves
Generate all figures in the paper

Prerequisites

Python 3.6
GPU memory >= 15G (fp32)
GPU memory >= 10G (fp16/fp32)
NumPy
PyTorch 1.0+
[Optional] APEX (fp16/fp32)

Getting Started

Installation

Install PyTorch
Install torchvision from the source:

git clone https://github.com/pytorch/vision
cd vision
python setup.py install

[Optional] You may skip it. Install APEX from the source:

git clone https://github.com/NVIDIA/apex.git
cd apex
python setup.py install --cuda_ext --cpp_ext

Clone this repo:

git clone https://github.com/NVlabs/DG-Net.git
cd DG-Net/

Our code is tested on PyTorch 1.0.0+ and torchvision 0.2.1+ .

Dataset Preparation

Download the dataset Market-1501 [Google Drive] [Baidu Disk]

Preparation: put the images with the same id in one folder. You may use

python prepare-market.py          # for Market-1501

Note to modify the dataset path to your own path.

Testing

Download the trained model

We provide our trained model. You may download it from Google Drive (or Baidu Disk password: rqvf). You may download and move it to the outputs.

├── outputs/
│   ├── E0.5new_reid0.5_w30000
├── models
│   ├── best/

Person re-id evaluation

Supervised learning

	Market-1501	DukeMTMC-reID	MSMT17	CUHK03-NP
[email protected]	94.8%	86.6%	77.2%	65.6%
mAP	86.0%	74.8%	52.3%	61.1%

Direct transfer learning
To verify the generalizability of DG-Net, we train the model on dataset A and directly test the model on dataset B (with no adaptation). We denote the direct transfer learning protocol as A→B.

	Market→Duke	Duke→Market	Market→MSMT	MSMT→Market	Duke→MSMT	MSMT→Duke
[email protected]	42.62%	56.12%	17.11%	61.76%	20.59%	61.89%
[email protected]	58.57%	72.18%	26.66%	77.67%	31.67%	75.81%
[email protected]	64.63%	78.12%	31.62%	83.25%	37.04%	80.34%
mAP	24.25%	26.83%	5.41%	33.62%	6.35%	40.69%

Image generation evaluation

Please check the README.md in the ./visual_tools.

You may use the ./visual_tools/test_folder.py to generate lots of images and then do the evaluation. The only thing you need to modify is the data path in SSIM and FID.

Training

Train a teacher model

You may directly download our trained teacher model from Google Drive (or Baidu Disk password: rqvf). If you want to have it trained by yourself, please check the person re-id baseline repository to train a teacher model, then copy and put it in the ./models.

├── models/
│   ├── best/                   /* teacher model for Market-1501
│       ├── net_last.pth        /* model file
│       ├── ...

Train DG-Net

Setup the yaml file. Check out configs/latest.yaml. Change the data_root field to the path of your prepared folder-based dataset, e.g. ../Market-1501/pytorch.
Start training

python train.py --config configs/latest.yaml

Or train with low precision (fp16)

python train.py --config configs/latest-fp16.yaml

Intermediate image outputs and model binary files are saved in outputs/latest.

Check the loss log

 tensorboard --logdir logs/latest

DG-Market

We provide our generated images and make a large-scale synthetic dataset called DG-Market. This dataset is generated by our DG-Net and consists of 128,307 images (613MB), about 10 times larger than the training set of original Market-1501 (even much more can be generated with DG-Net). It can be used as a source of unlabeled training dataset for semi-supervised learning. You may download the dataset from Google Drive (or Baidu Disk password: qxyh).

	DG-Market	Market-1501 (training)
#identity	-	751
#images	128,307	12,936

Tips

Note the format of camera id and number of cameras. For some datasets (e.g., MSMT17), there are more than 10 cameras. You need to modify the preparation and evaluation code to read the double-digit camera id. For some vehicle re-id datasets (e.g., VeRi) having different naming rules, you also need to modify the preparation and evaluation code.

Citation

Please cite this paper if it helps your research:

@inproceedings{zheng2019joint,
  title={Joint discriminative and generative learning for person re-identification},
  author={Zheng, Zhedong and Yang, Xiaodong and Yu, Zhiding and Zheng, Liang and Yang, Yi and Kautz, Jan},
  booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2019}
}

Related Work

Other GAN-based methods compared in the paper include LSGAN, FDGAN and PG2GAN. We forked the code and made some changes for evaluatation, thank the authors for their great work. We would also like to thank to the great projects in person re-id baseline, MUNIT and DRIT.

License

Copyright (C) 2019 NVIDIA Corporation. All rights reserved. Licensed under the CC BY-NC-SA 4.0 (Attribution-NonCommercial-ShareAlike 4.0 International). The code is released for academic research use only. For commercial use, please contact [email protected].

Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral)

Related tags

Overview

Joint Discriminative and Generative Learning for Person Re-identification

Table of contents

News

Features

Prerequisites

Getting Started

Installation

Dataset Preparation

Testing

Download the trained model

Person re-id evaluation

Image generation evaluation

Training

Train a teacher model

Train DG-Net

DG-Market

Tips

Citation

Related Work

License

Owner

NVIDIA Research Projects

Official Pytorch implementation of "CLIPstyler:Image Style Transfer with a Single Text Condition"

particle tracking model, works with the ROMS output file(qck.nc, his.nc)

Revisting Open World Object Detection

This is the source code of the 1st place solution for segmentation task (with Dice 90.32%) in 2021 CCF BDCI challenge.

Решения, подсказки, тесты и утилиты для тренировки по алгоритмам от Яндекса.

Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

Memory Defense: More Robust Classificationvia a Memory-Masking Autoencoder

One Million Scenes for Autonomous Driving

Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

This repository contains the code and models for the following paper.

A distributed deep learning framework that supports flexible parallelization strategies.

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

PyTorch implementation of Neural Dual Contouring.

Equivariant GNN for the prediction of atomic multipoles up to quadrupoles.

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

ByteTrack(Multi-Object Tracking by Associating Every Detection Box)のPythonでのONNX推論サンプル

Predict stock movement with Machine Learning and Deep Learning algorithms

A collection of IPython notebooks covering various topics.

MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning