Datasets and source code for our paper Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach

Last update: Sep 30, 2022

Introduction

Datasets and source code for our paper Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach

Datasets: WebFG-496 & WebiNat-5089

WebFG-496

WebFG-496 contains 200 subcategories of "Bird" (Web-bird), 100 subcategories of "Aircraft" (Web-aircraft), and 196 subcategories of "Car" (Web-car). In total, it has 53339 web training images.

Download the dataset:

wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-aircraft.tar.gz
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-bird.tar.gz
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-car.tar.gz

WebiNat-5089

WebiNat-5089 is a large-scale webly supervised fine-grained dataset, which consists of 5089 subcategories and 1184520 web training images.

Download the dataset:

wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-00
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-01
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-02
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-03
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-04
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-05
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-06
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-07
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-08
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-09
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-10
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-11
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-12
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/web-iNat.tar.gz.part-13
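
The fourteen part files listed above share one naming pattern (part-00 through part-13), so the downloads can be scripted. The following is a minimal sketch, assuming a POSIX shell; the part_urls helper, the DOWNLOAD switch, and the script name fetch_inat.sh are illustrative names that are not part of the repository.

```shell
BASE_URL="https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com"

# Print the URL of every archive part, part-00 through part-13.
part_urls() {
  i=0
  while [ "$i" -le 13 ]; do
    printf '%s/web-iNat.tar.gz.part-%02d\n' "$BASE_URL" "$i"
    i=$((i + 1))
  done
}

# Download only when explicitly requested, e.g. DOWNLOAD=1 sh fetch_inat.sh
# (DOWNLOAD and fetch_inat.sh are hypothetical names).
if [ "${DOWNLOAD:-0}" = "1" ]; then
  part_urls | while read -r url; do
    wget -c "$url"   # -c resumes a partially downloaded part
  done
fi
```

Without DOWNLOAD=1 the script only defines part_urls, which makes it possible to inspect the generated URLs before fetching anything.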

Dataset Briefing

Statistics of popular fine-grained datasets and our datasets. “Supervision” indicates whether the training data is manually labeled (“Manual”) or collected from the web (“Web”).

Detailed construction process of training data in WebFG-496 and WebiNat-5089. “Testing Source” indicates where testing images come from. “Imbalance” is the number of images in the largest class divided by the number of images in the smallest.

Rough label accuracy of training data estimated by random sampling for WebFG-496 and WebiNat-5089.
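
The “Imbalance” figure defined above (images in the largest class divided by images in the smallest) could be recomputed locally. The following is a rough sketch that assumes the training images sit in one directory per class, i.e. <root>/<class>/<image files>; that layout and the imbalance function name are assumptions, not something this README confirms.

```shell
# Print the ratio of the largest to the smallest class size for a directory
# laid out as <root>/<class>/<image files> (assumed layout).
imbalance() {
  root="$1"
  for class_dir in "$root"/*/; do
    # One line per class: the number of regular files under it.
    find "$class_dir" -type f | wc -l
  done | awk '
    NR == 1 || $1 > max { max = $1 }
    NR == 1 || $1 < min { min = $1 }
    END { if (NR > 0 && min > 0) printf "%.2f\n", max / min }'
}
```

Calling imbalance with the path of an extracted training directory prints a single number; nothing is printed if a class directory is empty.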

Peer-learning model

Network Architecture

The architecture of our proposed peer-learning model is as follows

Installation

After creating a Python 3.5 virtual environment, run pip install -r requirements.txt to install all dependencies.

How to use

The code has currently been tested only on GPU.

Data Preparation
- WebFG-496
  
  Download the archives into the PLM root directory and decompress them using
```
tar -xvf web-aircraft.tar.gz
tar -xvf web-bird.tar.gz
tar -xvf web-car.tar.gz
```
- WebiNat-5089
  
  Download all archive parts into the PLM root directory and decompress them using
```
cat web-iNat.tar.gz.part-* | tar -zxv
```
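
Because the WebiNat-5089 archive is split into fourteen parts, a missing or empty part will make the concatenated tar stream fail partway through extraction. The following is a small sketch of a pre-check, assuming the parts are named as in the download links (part-00 through part-13); the check_parts function is a hypothetical helper, not part of the repository.

```shell
# Return success only if all 14 parts exist and are non-empty in the given
# directory (defaults to the current directory).
check_parts() {
  dir="${1:-.}"
  missing=0
  i=0
  while [ "$i" -le 13 ]; do
    part=$(printf '%s/web-iNat.tar.gz.part-%02d' "$dir" "$i")
    if [ ! -s "$part" ]; then
      echo "missing or empty: $part" >&2
      missing=$((missing + 1))
    fi
    i=$((i + 1))
  done
  [ "$missing" -eq 0 ]
}
```

It could be chained in front of the extraction command, for example check_parts . && cat web-iNat.tar.gz.part-* | tar -zxv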
Source Code
- To train the whole network from scratch on the WebFG-496 dataset, follow these steps:
  - In Web496_train.sh
    - Set CUDA_VISIBLE_DEVICES to the appropriate CUDA device ID.
    - Set DATA to web-aircraft, web-bird, or web-car as needed, then set N_CLASSES accordingly.
  - Activate the virtual environment (e.g. conda), then run the script
```
bash Web496_train.sh
```
- To train the whole network from scratch on the WebiNat-5089 dataset, follow these steps:
  - Set CUDA_VISIBLE_DEVICES to the appropriate CUDA device ID in Web5089_train.sh.
  - Activate the virtual environment (e.g. conda), then run the script
```
bash Web5089_train.sh
```
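
For the WebFG-496 training step, N_CLASSES has to match the chosen DATA value. The subcategory counts given in the dataset description (200 for Web-bird, 100 for Web-aircraft, 196 for Web-car) suggest the mapping sketched below. The n_classes_for function is a hypothetical helper; how Web496_train.sh actually consumes DATA and N_CLASSES is not shown in this README.

```shell
# Print the N_CLASSES value matching a WebFG-496 DATA setting.
# Counts come from the dataset description; the helper itself is illustrative.
n_classes_for() {
  case "$1" in
    web-bird)     echo 200 ;;
    web-aircraft) echo 100 ;;
    web-car)      echo 196 ;;
    *) echo "unknown DATA: $1" >&2; return 1 ;;
  esac
}

# Example: derive N_CLASSES from DATA instead of editing both by hand.
DATA="web-car"
N_CLASSES=$(n_classes_for "$DATA")
echo "DATA=$DATA N_CLASSES=$N_CLASSES"
```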
Demo
- For a quick test of the model and its final fine-grained recognition performance on the WebFG-496 dataset, follow these steps:
  - Download one of the following trained models into model/ using
```
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/Models/plm_web-aircraft_bcnn_best-epoch_74.38.pth
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/Models/plm_web-bird_bcnn_best-epoch_76.48.pth
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/Models/plm_web-car_bcnn_best-epoch_78.52.pth
```
  - Activate the virtual environment (e.g. conda)
  - In Web496_demo.sh
    - Set CUDA_VISIBLE_DEVICES to the appropriate CUDA device ID.
    - Set the model name to match the downloaded model.
    - Set DATA to web-aircraft, web-bird, or web-car to match the downloaded model, then set N_CLASSES accordingly.
  - Run the demo using bash Web496_demo.sh
- For a quick test of the model and its final fine-grained recognition performance on the WebiNat-5089 dataset, follow these steps:
  - Download the trained model into model/ using
```
wget https://web-fgvc-496-5089-sh.oss-cn-shanghai.aliyuncs.com/Models/plm_web-inat_resnet50_best-epoch_54.56.pth
```
  - Activate the virtual environment (e.g. conda)
  - In Web5089_demo.sh
    - Set CUDA_VISIBLE_DEVICES to the appropriate CUDA device ID.
    - Set the model name to match the downloaded model.
  - Run demo using bash Web5089_demo.sh
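
Before running either demo script, it can help to confirm that the checkpoint for the chosen dataset is actually present in model/. The following is a minimal sketch; the file names are copied from the download links in this section, while the model_for and have_model helpers and the web-inat key are hypothetical names introduced for illustration.

```shell
MODEL_DIR="model"

# Print the checkpoint file name for a dataset key.
# File names are taken from the download links; "web-inat" is an assumed key.
model_for() {
  case "$1" in
    web-aircraft) echo "plm_web-aircraft_bcnn_best-epoch_74.38.pth" ;;
    web-bird)     echo "plm_web-bird_bcnn_best-epoch_76.48.pth" ;;
    web-car)      echo "plm_web-car_bcnn_best-epoch_78.52.pth" ;;
    web-inat)     echo "plm_web-inat_resnet50_best-epoch_54.56.pth" ;;
    *) echo "unknown dataset: $1" >&2; return 1 ;;
  esac
}

# Succeed only if the checkpoint for the given dataset exists in MODEL_DIR.
have_model() {
  name=$(model_for "$1") || return 1
  [ -f "$MODEL_DIR/$name" ]
}
```

For example, have_model web-bird && bash Web496_demo.sh would run the demo only when the Web-bird checkpoint has been downloaded.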

Results

The comparison of classification accuracy (%) for benchmark methods and webly supervised baselines (Decoupling, Co-teaching, and our Peer-learning) on the WebFG-496 dataset.

The comparison of classification accuracy (%) of benchmarks and our proposed webly supervised baseline Peer-learning on the WebiNat-5089 dataset.

Comparison of our Peer-learning model (PLM), VGG-19, B-CNN, Decoupling (DP), and Co-teaching (CT) on the sub-datasets Web-aircraft, Web-bird, and Web-car of the WebFG-496 dataset. The value for each sub-dataset is plotted as a dotted line and the average value as a solid line. Note that the classification accuracy is the result of the second stage of the two-step training strategy. Since the basic network VGG-19 was trained for 60 epochs in the second stage, we compare only the first 60 epochs of the second stage of our approach with VGG-19.

Citation

If you find this useful in your research, please consider citing:

@inproceedings{sun2021webly,
title={Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach},
author={Zeren Sun and Yazhou Yao and Xiu-Shen Wei and Yongshun Zhang and Fumin Shen and Jianxin Wu and Jian Zhang and Heng Tao Shen},
booktitle={IEEE International Conference on Computer Vision (ICCV)},
year={2021}
}
