The PASS dataset: pretrained models and how to get the data - PASS: Pictures without humAns for Self-Supervised Pretraining

Last update: Dec 22, 2022

Overview

PASS: Pictures without humAns for Self-Supervised Pretraining

TL;DR: An ImageNet replacement dataset for self-supervised pretraining without humans

Content

PASS is a large-scale image dataset that does not include any humans, human parts, or other personally identifiable information that can be used for high-quality pretraining while significantly reducing privacy concerns.

Download the dataset

Generally: all information is on our webpage.

For downloading the dataset, please visit our dataset on zenodo. There you can download it in tar files and find the meta-data.

You can also download the images from their AWS urls, from here.

Pretrained models

Pretraining	Method	Epochs	Places205 lin. Acc.	Model weights
IN-1k	MoCo-v2	200	50.1	R50 weights
PASS	MoCo-v2	200	52.8	R50 weights
PASS	MoCo-v2-CLD	200	53.1	R50 weights
PASS	SwAV	200	55.5	R50 weights
PASS	DINO	100	X	ViT S16 weights
PASS	DINO	300		coming soon
PASS	MoCo-v2	800		coming soon

Contribute your models

Please let us know if you have a model pretrained on this dataset and I will add this to the list above.

Citation

@Article{asano21pass,
author = "Yuki M. Asano and Christian Rupprecht and Andrew Zisserman and Andrea Vedaldi",
title = "PASS: An ImageNet replacement for self-supervised pretraining without humans",
journal = "NeurIPS Track on Datasets and Benchmarks",
year = "2021"
}

The PASS dataset: pretrained models and how to get the data - PASS: Pictures without humAns for Self-Supervised Pretraining

Related tags

Overview

PASS: Pictures without humAns for Self-Supervised Pretraining

Content

Download the dataset

Pretrained models

Contribute your models

Citation

Owner

Yuki M. Asano

Code for our NeurIPS 2021 paper Mining the Benefits of Two-stage and One-stage HOI Detection

Official implementation of the paper 'Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution'

Embeds a story into a music playlist by sorting the playlist so that the order of the music follows a narrative arc.

EmoTag helps you train emotion detection model for Chinese audios

QA-GNN: Question Answering using Language Models and Knowledge Graphs

Train Dense Passage Retriever (DPR) with a single GPU

A simple python module to generate anchor (aka default/prior) boxes for object detection tasks.

Implementation of our paper 'RESA: Recurrent Feature-Shift Aggregator for Lane Detection' in AAAI2021.

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

The first public PyTorch implementation of Attentive Recurrent Comparators

Video Frame Interpolation without Temporal Priors (a general method for blurry video interpolation)

A project studying the influence of communication in multi-objective normal-form games

ML From Scratch

[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang

WeakVRD-Captioning - Implementation of paper Improving Image Captioning with Better Use of Caption

DexterRedTool - Dexter's Red Team Tool that creates cronjob/task scheduler to consistently creates users

Pairwise model for commonlit competition

(to be released) [NeurIPS'21] Transformers Generalize DeepSets and Can be Extended to Graphs and Hypergraphs

Posterior predictive distributions quantify uncertainties ignored by point estimates.