Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"

Last update: Dec 03, 2022

Overview

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

This is an implementation for our paper Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search. The code is modified from Github repositoty "pytorch implementation for ECCV2018 paper Deep Cross-Modal Projection Learning for Image-Text Matching".

Requirement

Python 3.7
Pytorch 1.0.0 & torchvision 0.2.1
numpy
matplotlib (not necessary unless the need for the result figure)
scipy 1.2.1
pytorch_transformers

Usage

Data Preparation

Please download CUHK-PEDES dataset .
Put reid_raw.json under project_directory/data/
run data.sh
Copy files test_reid.json, train_reid.json and val_reid.json under CUHK-PEDES/data/ to project_directory/data/processed_data/
Download pretrained Resnet50 model, bert-base-uncased model and vocabulary to project_directory/pretrained/

Training & Testing

You should firstly change the parameter BASE_ROOT to your current directory and IMAGE_DIR to the directory of CUHK-PEDES dataset. Run command sh scripts/train.sh to train the model. Run command sh scripts/test.sh to evaluate the model.

Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"

Related tags

Overview

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

Requirement

Usage

Data Preparation

Training & Testing

Model Framework

Model Performance

Owner

Tencent YouTu Research

Hl classification bc - A Network-Based High-Level Data Classification Algorithm Using Betweenness Centrality

Some methods for comparing network representations in deep learning and neuroscience.

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

SimDeblur is a simple framework for image and video deblurring, implemented by PyTorch

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugginface And DeepSpeed

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

This repository contains the code for the paper "Hierarchical Motion Understanding via Motion Programs"

StrongSORT: Make DeepSORT Great Again

An Exact Solver for Semi-supervised Minimum Sum-of-Squares Clustering

ColossalAI-Benchmark - Performance benchmarking with ColossalAI

Barlow Twins and HSIC

Official implementation of "Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform", ICCV 2021

This project intends to use SVM supervised learning to determine whether or not an individual is diabetic given certain attributes.

Evaluating Cross-lingual Sentence Representations

PyTorch implementation of Barlow Twins.

Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Revisting Open World Object Detection

WatermarkRemoval-WDNet-WACV2021

Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)

An addon uses SMPL's poses and global translation to drive cartoon character in Blender.