Code for "Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier"

Last update: May 26, 2022

Overview

Deep-RTC [project page]

This repository contains the source code accompanying our ECCV 2020 paper.

Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier
Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, Nuno Vasconcelos

@inproceedings{Wu20DeepRTC,
	title={Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier},
	author={Tz-Ying Wu and Pedro Morgado and Pei Wang and Chih-Hui Ho and Nuno Vasconcelos},
	booktitle={European Conference on Computer Vision (ECCV)},
	year={2020}
}

Dependencies

Python (3.5.6)
PyTorch (1.2.0)
torchvision (0.4.0)
NumPy (1.15.2)
Pillow (5.2.0)
PyYAML (5.1.2)
tensorboardX (1.8)

Data preparation

CIFAR100 [Raw images] [Long-tail version]
AWA2 [Raw images]
ImageNet [Raw images] [Long-tail version]
iNaturalist [Raw images]

These datasets can be downloaded from the links above. Arrange the images in nested folders that mirror the dataset's taxonomy, and place the root folder under prepro/raw. For example,

prepro/raw/imagenet
--abstraction
----bubble
------ILSVRC2012_val_00014026.JPEG
------ILSVRC2012_val_00000697.JPEG
...
--physical_entity
----object
...
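
To check that a dataset has been laid out as expected before running the preparation script, the folder tree can be walked and each image paired with the chain of folders above it. The following is a minimal sketch, not part of this repository: the function name list_hierarchical_images and the set of accepted file extensions are assumptions made for illustration.

```python
import os

# Assumed set of image extensions; adjust for your dataset.
IMAGE_EXTENSIONS = (".jpeg", ".jpg", ".png")


def list_hierarchical_images(root):
    """Return sorted (relative_image_path, taxonomy_path) pairs under root.

    taxonomy_path is the tuple of folder names from the root down to the
    folder that holds the image, e.g. ("abstraction", "bubble").
    """
    samples = []
    for dirpath, _dirnames, filenames in os.walk(root):
        rel_dir = os.path.relpath(dirpath, root)
        # Images placed directly under the root have an empty taxonomy path.
        taxonomy_path = () if rel_dir == "." else tuple(rel_dir.split(os.sep))
        for name in filenames:
            if name.lower().endswith(IMAGE_EXTENSIONS):
                rel_path = os.path.relpath(os.path.join(dirpath, name), root)
                samples.append((rel_path, taxonomy_path))
    return sorted(samples)


if __name__ == "__main__":
    # Hypothetical usage on the example layout shown above.
    for path, taxonomy in list_hierarchical_images("prepro/raw/imagenet"):
        print(path, "/".join(taxonomy))
```

An image whose taxonomy path is empty or unexpectedly short usually indicates a folder that was placed at the wrong level.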

CIFAR100 and iNaturalist come with released taxonomies; for AWA2 and ImageNet, we built tree-type taxonomies with WordNet. All taxonomies are provided in prepro/data/{dataset}/tree.npy, and the data splits are provided in prepro/splits/{dataset}/{split}.json. Please refer to prepro/README.md for more details. Once the raw images are organized hierarchically, run

$ ./prepare_data.sh {dataset}

where {dataset} is one of awa2, cifar100, imagenet, or inaturalist. This automatically generates the data lists for all splits and builds the codeword matrices needed for training Deep-RTC. Note that our code can be applied to other datasets once they are organized hierarchically.
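
The codeword matrices are produced by prepare_data.sh, and their exact layout is not described in this README. As a rough illustration of how a tree taxonomy can be encoded as a matrix, the sketch below builds a binary ancestor matrix in which row i marks node i and all of its ancestors. The parent-list input format and the name build_ancestor_matrix are assumptions for illustration only; they are not the format of tree.npy or of the matrices this repository generates.

```python
def build_ancestor_matrix(parents):
    """Encode a tree as a binary node-by-node matrix.

    parents[i] is the index of node i's parent, or -1 if node i is the root
    (an assumed input format). Entry [i][j] is 1 when node j is node i
    itself or one of its ancestors, and 0 otherwise.
    """
    num_nodes = len(parents)
    matrix = [[0] * num_nodes for _ in range(num_nodes)]
    for node in range(num_nodes):
        current = node
        # Climb from the node to the root, marking every node on the way.
        while current != -1:
            matrix[node][current] = 1
            current = parents[current]
    return matrix


if __name__ == "__main__":
    # Toy taxonomy: node 0 is the root, nodes 1 and 2 are its children,
    # and node 3 is a child of node 1.
    for row in build_ancestor_matrix([-1, 0, 0, 1]):
        print(row)
```

With an encoding of this kind, a node's row selects exactly the nodes on its path to the root, which is one way to let a class share information with its ancestors.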

Training and evaluation

To train and evaluate Deep-RTC, run

$ export PYTHONPATH=${PWD}/prepro:${PYTHONPATH}
$ ./run.sh {dataset}

where {dataset} is one of awa2, cifar100, imagenet, or inaturalist. Our pretrained models can be downloaded here.

Owner

Gina Wu
