Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Last update: Dec 06, 2022

Overview

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Geonmo Gu*¹, Byungsoo Ko*¹, Han-Gyu Kim² (* Authors contributed equally.)

¹@NAVER/LINE Vision, ²@NAVER Clova Speech

Paper | Presentation Video | PPT | Poster

Overview

Proxy Synthesis

Proxy Synthesis (PS) is a novel regularizer for any softmax variants and proxy-based losses in deep metric learning.

How it works?

Proxy Synthesis exploits synthetic classes and improves generalization by considering class relations and obtaining smooth decision boundaries.
Synthetic classes mimic unseen classes during training phase as described in below Figure.

Experimental results

Proxy Synthesis improves performance for every loss and benchmark dataset.

Getting Started

Installation

Clone the repository locally

$ git clone https://github.com/navervision/proxy-synthesis

Create conda virtual environment

$ conda create -n proxy_synthesis python=3.7 anaconda
$ conda activate proxy_synthesis

Install pytorch

Install pytorch according to your CUDA version

$ conda install pytorch torchvision cudatoolkit=<YOUR_CUDA_VERSION> -c pytorch

Install faiss

We use faiss library for faster evaluation
Install faiss according to your CUDA version

$ conda install faiss-gpu cudatoolkit=<YOUR_CUDA_VERSION> -c pytorch

Install requirements

$ pip install -r requirements.txt

Prepare Data

Download CARS196 dataset and unzip

$ wget http://imagenet.stanford.edu/internal/car196/car_ims.tgz
$ tar zxvf car_ims.tgz -C ./dataset

Rearrange CARS196 directory by following structure

# Dataset structure
/dataset/carDB/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  test/
    class1/
      img3.jpeg
    class2/
      img4.jpeg

# Rearrange dataset structure
$ python dataset/prepare_cars.py

Train models

Norm-SoftMax loss with CARS196

# Norm-SoftMax
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_norm_softmax \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Norm_SoftMax \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=23.0 --check_epoch=5

PS + Norm-SoftMax loss with CARS196

# PS + Norm-SoftMax
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_PS_norm_softmax \
--data=./dataset/carDB --data_name=cars196 \
 --dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Norm_SoftMax \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=23.0 --check_epoch=5 \
--ps_alpha=0.40 --ps_mu=1.0

Proxy-NCA loss with CARS196

# Proxy-NCA
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_proxy_nca \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Proxy_NCA \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=12.0 --check_epoch=5

PS + Proxy-NCA loss with CARS196

# PS + Proxy-NCA
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_PS_proxy_nca \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Proxy_NCA \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=12.0 --check_epoch=5 \
--ps_alpha=0.40 --ps_mu=1.0

Check Test Results

$ tensorboard --logdir=logs --port=10000

Experimental results

We report [email protected], RP and MAP performances of each loss, which are trained with CARS196 dataset for 8 runs.

[email protected]

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	83.38	83.25	83.25	83.18	83.05	82.90	82.83	82.79	83.08 ± 0.21
PS + Norm-SoftMax	84.69	84.58	84.45	84.35	84.22	83.95	83.91	83.89	84.25 ± 0.31
Proxy-NCA	83.74	83.69	83.62	83.32	83.06	83.00	82.97	82.84	83.28 ± 0.36
PS + Proxy-NCA	84.52	84.39	84.32	84.29	84.22	84.12	83.94	83.88	84.21 ± 0.21

RP

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	35.85	35.51	35.28	35.28	35.24	34.95	34.87	34.84	35.23 ± 0.34
PS + Norm-SoftMax	37.01	36.98	36.92	36.74	36.74	36.73	36.54	36.45	36.76 ± 0.20
Proxy-NCA	36.08	35.85	35.79	35.66	35.66	35.63	35.47	35.43	35.70 ± 0.21
PS + Proxy-NCA	36.97	36.84	36.72	36.64	36.63	36.60	36.43	36.41	36.66 ± 0.18

MAP

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	25.56	25.56	25.00	24.93	24.90	24.59	24.57	24.56	24.92 ± 0.35
PS + Norm-SoftMax	26.71	26.67	26.65	26.56	26.53	26.52	26.30	26.17	26.51 ± 0.18
Proxy-NCA	25.66	25.52	25.37	25.36	25.33	25.26	25.22	25.04	25.35 ± 0.18
PS + Proxy-NCA	26.77	26.63	26.50	26.42	26.37	26.31	26.25	26.12	26.42 ± 0.20

Performance Graph

Below figure shows performance graph of test set during training.

Reference

Our code is based on SoftTriple repository (Arxiv, Github)

Citation

If you find Proxy Synthesis useful in your research, please consider to cite the following paper.

@inproceedings{gu2020proxy,
    title={Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning},
    author={Geonmo Gu, Byungsoo Ko, and Han-Gyu Kim},
    booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
    year={2021}
}

License

Copyright 2021-present NAVER Corp.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Related tags

Overview

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Overview

Proxy Synthesis

How it works?

Experimental results

Getting Started

Installation

Prepare Data

Train models

Norm-SoftMax loss with CARS196

PS + Norm-SoftMax loss with CARS196

Proxy-NCA loss with CARS196

PS + Proxy-NCA loss with CARS196

Check Test Results

Experimental results

[email protected]

RP

MAP

Performance Graph

Reference

Citation

License

Owner

NAVER/LINE Vision

Classify the disease status of a plant given an image of a passion fruit

Neural Network to colorize grayscale images

(EI 2022) Controllable Confidence-Based Image Denoising

Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization (CVPR 2020, Oral)

code and models for "Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation"

PyTorch Implementation of AnimeGANv2

A object detecting neural network powered by the yolo architecture and leveraging the PyTorch framework and associated libraries.

Pytorch Lightning code guideline for conferences

Deep universal probabilistic programming with Python and PyTorch

Meli Data Challenge 2021 - First Place Solution

[ECCV 2020] XingGAN for Person Image Generation

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

For IBM Quantum Challenge 2021 (May 20 - 26)

Jupyter notebooks for using & learning Keras

Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(2021) paper

Pytorch implementation for DFN: Distributed Feedback Network for Single-Image Deraining.

UFPR-ADMR-v2 Dataset

This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.