Generalized Category Discovery

This repo is a placeholder for code for our paper: Generalized Category Discovery

Abstract: In this paper, we consider a highly general image recognition setting wherein, given a labelled and unlabelled set of images, the task is to categorize all images in the unlabelled set. Here, the unlabelled images may come from labelled classes or from novel ones. Existing recognition methods are not able to deal with this setting, because they make several restrictive assumptions, such as the unlabelled instances only coming from known --- or unknown --- classes and the number of unknown classes being known a-priori. We address the more unconstrained setting, naming it `Generalized Category Discovery', and challenge all these assumptions. We first establish strong baselines by taking state-of-the-art algorithms from novel category discovery and adapting them for this task. Next, we propose the use of vision transformers with contrastive representation learning for this open world setting. We then introduce a simple yet effective semi-supervised $k$-means method to cluster the unlabelled data into seen and unseen classes automatically, substantially outperforming the baselines. Finally, we also propose a new approach to estimate the number of classes in the unlabelled data. We thoroughly evaluate our approach on public datasets for generic object classification including CIFAR10, CIFAR100 and ImageNet-100, and for fine-grained visual recognition including CUB, Stanford Cars and Herbarium19, benchmarking on this new setting to foster future research.

Code for our paper 'Generalized Category Discovery'

Related tags

Overview

Generalized Category Discovery

Code Coming Soon!

Owner

A system for quickly generating training data with weak supervision

A 3D sparse LBM solver implemented using Taichi

Convert ONNX model graph to Keras model format.

Implementation of "Scaled-YOLOv4: Scaling Cross Stage Partial Network" using PyTorch framwork.

Super Pix Adv - Offical implemention of Robust Superpixel-Guided Attentional Adversarial Attack (CVPR2020)

D2Go is a toolkit for efficient deep learning

A code repository associated with the paper A Benchmark for Rough Sketch Cleanup by Chuan Yan, David Vanderhaeghe, and Yotam Gingold from SIGGRAPH Asia 2020.

[NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"

本步态识别系统主要基于GaitSet模型进行实现

This repository is an implementation of our NeurIPS 2021 paper (Stylized Dialogue Generation with Multi-Pass Dual Learning) in PyTorch.

Seeing if I can put together an interactive version of 3b1b's Manim in Streamlit

Code for LIGA-Stereo Detector, ICCV'21

3D mesh stylization driven by a text input in PyTorch

This repository contains codes of ICCV2021 paper: SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

Adversarial Framework for (non-) Parametric Image Stylisation Mosaics

Adaptive Denoising Training (ADT) for Recommendation.

CTF challenges from redpwnCTF 2021

A collection of papers about Transformer in the field of medical image analysis.

Improving Factual Completeness and Consistency of Image-to-text Radiology Report Generation

Next-gen Rowhammer fuzzer that uses non-uniform, frequency-based patterns.