An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

Last update: Nov 05, 2022

Related tags

Overview

PyTorch implementation of Learning by Aligning (ICCV 2021)

This is an official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

For more details, visit our project site or see our paper.

Requirements

Python 3.8
PyTorch 1.7.1
GPU memory >= 11GB

Getting started

First, clone our git repository.

git clone https://github.com/cvlab-yonsei/LbA.git
cd LbA

Docker

You can use docker pull sanghslee/ps:1.7.1-cuda11.0-cudnn8-runtime

Prepare datasets

SYSU-MM01: download from this link.
- For SYSU-MM01, you need to preprocess the .jpg files into .npy files by running:
  - python utils/pre_preprocess_sysu.py --data_dir /path/to/SYSU-MM01
- Modify the dataset directory below accordingly.
  - L63 of train.py
  - L54 of test.py

Train

run python train.py --method full
Important:
- Performances reported during training does not reflect exact performances of your model. This is due to 1) evaluation protocols of the datasets and 2) random seed configurations.
- Make sure you seperately run test.py to obtain correct results to be reported in your paper.

Test

run python test.py --method full
The results should be around:

dataset	method	mAP	rank-1
SYSU-MM01	baseline	49.54	50.43
SYSU-MM01	full	54.14	55.41

Pretrained weights

Download [SYSU-MM01]
The results should be:

dataset	method	mAP	rank-1
SYSU-MM01	full	55.22	56.31

Bibtex

@article{park2021learning,
  title={Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences},
  author={Park, Hyunjong and Lee, Sanghoon and Lee, Junghyup and Ham, Bumsub},
  journal={arXiv preprint arXiv:2108.07422},
  year={2021}
}

Credits

Our implementation is based on Mang Ye's code here.

Comments

something about run this code

thanks for your code, there is something wrong when i run you code,in this line: loss = torch.mean(comask_pos * self.criterion(feat, feat_recon_pos, feat_recon_neg)) the wrong is:RuntimeError: The size of tensor a (9) must match the size of tensor b (18) at non-singleton dimension 3 could you give me some help?

opened by zhuchuanleiqq 12
When running "train. Py", there is a problem on line 132 of the "model. Py" file:

When running "train. Py", there is a problem on line（loss = torch.mean(comask_pos * self.criterion(feat, feat_recon_pos, feat_recon_neg))） 132 of the "model. Py" file: Traceback：RuntimeError: The size of tensor a (9) must match the size of tensor b (18) at non-singleton dimension 3

opened by redsoup 1
Question about the training speed

Thanks for your work.

When I tried to reproduce your results with an Nvidia 2080Ti (as recommended by the paper), however, the training speed seemed very slow. It nearly took 20 minutes for each epoch on SYSU-MM01, which mismatched with the reported 8 hours training time.

I have already used cuda for acceleration. Thus, I wonder how did this happen. Thank you.

opened by hansonchen1996 1
Problems about the performance

I have run your source code on both SYSU and RegDB datasets, but I didn't get the performance of your paper. So I want to know how to set the hyper-parameter to get the performance of your paper?

opened by Mrkkew 1
Visualization problem

Hello， Thanks for your great work, I am wondering about the visualization part, use mask and comask matrix in SYSU-MM01 dataset. Can I get some details about the steps of your visualization method? Thank you very much.

opened by sunset233 0

Releases(v1.0)

v1.0(Aug 22, 2021)

Source code(tar.gz)
Source code(zip)
sysu_pretrained.t(273.10 MB)

Owner

CV Lab @ Yonsei University

GitHub Repository

This pyhton script converts a pdf to Image then using tesseract as OCR engine converts Image to Text

Script_Convertir_PDF_IMG_TXT Este script de pyhton convierte un pdf en Imagen luego utilizando tesseract como motor OCR convierte la Imagen a Texto. p

1 Jan 27, 2022

Program created with opencv that allows you to automatically count your repetitions on several fitness exercises.

Virtual partner of gym Description Program created with opencv that allows you to automatically count your repetitions on several fitness exercises li

1 Jan 04, 2022

This project is basically to draw lines with your hand, using python, opencv, mediapipe.

Paint Opencv 📷 This project is basically to draw lines with your hand, using python, opencv, mediapipe. Screenshoots 📱 Tools ⚙️ Python Opencv Mediap

3 Nov 17, 2021

FastOCR is a desktop application for OCR API.

FastOCR FastOCR is a desktop application for OCR API. Installation Arch Linux fastocr-git @ AUR Build from AUR or install with your favorite AUR helpe

58 Jan 07, 2023

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection For more details, please refer to our paper. Citing Please cite the related works

102 Jun 29, 2022

A Python wrapper for Google Tesseract

Python Tesseract Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded i

4.6k Jan 06, 2023

Aloception is a set of package for computer vision: aloscene, alodataset, alonet.

86 Dec 28, 2022

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

PDFImage2TXT - DOWNLOAD INSTALLER HERE What can you do with it? Convert scanned PDFs to TXT. Convert scanned Documents to TXT. No coding required!! In

2 Feb 22, 2022

轻量级公式 OCR 小工具：一键识别各类公式图片，并转换为 LaTeX 格式

QC-Formula | 青尘公式 OCR 介绍轻量级开源公式 OCR 小工具：一键识别公式图片，并转换为 LaTeX 格式。支持从电脑本地导入公式图片；（后续版本将支持直接从网页导入图片）公式图片支持 .png / .jpg / .bmp，大小为 4M 以内均可；支持印刷体及手写体，前

26 Jan 07, 2023

Balabobapy - Using artificial intelligence algorithms to continue the text

1 Feb 04, 2022

color detection using python

colordetection color detection using python In this color detection Python project, we are going to build an application through which you can automat

1 Nov 04, 2021

Provides OCR (Optical Character Recognition) services through web applications

OCR4all As suggested by the name one of the main goals of OCR4all is to allow basically any given user to independently perform OCR on a wide variety

174 Dec 31, 2022

CellProfiler is a open-source application for biological image analysis

CellProfiler is a free open-source software designed to enable biologists without training in computer vision or programming to quantitatively measure phenotypes from thousands of images automaticall

732 Dec 23, 2022

https://arxiv.org/abs/1904.01941

Character-Region-Awareness-for-Text-Detection- https://arxiv.org/abs/1904.01941 Train You can train SynthText data use python source/train_SynthText.p

120 Dec 28, 2022

Intruder detection systems are common place now, and readily available in industry, but how do they work? They must detect people and large animals, but not generate false alarms in the presence of small animals, changes in lighting, environmental motion such as trees, or melting snow. To work correctly, the system must learn the background, in order to differentiate foreground objects.

Intruder-Detection Intruder detection systems are common place now, and readily available in industry, but how do they work? They must detect people a

4 Jul 18, 2021

Corner-based Region Proposal Network

Corner-based Region Proposal Network CRPN is a two-stage detection framework for multi-oriented scene text. It employs corners to estimate the possibl

140 Nov 04, 2022

3点クリックで円を指定し、極座標変換を行うサンプルプログラム

click-warpPolar 3点クリックで円を指定し、極座標変換を行うサンプルプログラムです。 Requirements OpenCV 3.4.2 or Later Usage 実行方法は以下です。起動後、マウスで3点をクリックし円を指定してください。 python click-warpPol

17 Dec 30, 2022

This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.

Handwritten Text Recognition (OCR) with MXNet Gluon These notebooks have been created by Jonathan Chung, as part of his internship as Applied Scientis

422 Jan 03, 2023

SceneCollisionNet This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more info

31 Nov 22, 2022

Deep LearningImage Captcha 2

滑动验证码深度学习识别本项目使用深度学习 YOLOV3 模型来识别滑动验证码缺口，基于 https://github.com/eriklindernoren/PyTorch-YOLOv3 修改。只需要几百张缺口标注图片即可训练出精度高的识别模型，识别效果样例：克隆项目运行命令： git cl

117 Dec 28, 2022