Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Authors: Sercan O. Arik and Tomas Pfister

Paper: Sercan O. Arik and Tomas Pfister, "ProtoAttend: Attention-Based Prototypical Learning" Link: https://arxiv.org/abs/1902.06292

We propose a novel inherently interpretable machine learning method that bases decisions on few relevant examples that we call prototypes. Our method, ProtoAttend, can be integrated into a wide range of neural network architectures including pre-trained models. It utilizes an attention mechanism that relates the encoded representations to samples in order to determine prototypes. The resulting model outperforms state of the art in three high impact problems without sacrificing accuracy of the original model: (1) it enables high-quality interpretability that outputs samples most relevant to the decision-making (i.e. a sample-based interpretability method); (2) it achieves state of the art confidence estimation by quantifying the mismatch across prototype labels; and (3) it obtains state of the art in distribution mismatch detection. All this can be achieved with minimal additional test time and a practically viable training time computational cost.

This codebase exemplifies the ProtoAttend training and evaluation pipeline for Fashion-MNIST dataset, using ResNet as the image encoder model.

To run the training pipeline, simply use python3 main_protoattend.py. The results and visualizations will be ported to Tensorboard.

To modify the experiment to other datasets and models:

Implement data batching and preprocessing functions (modify input_data.py and data iterators like iter_train etc.).
Integrate the encoder model function suitable for the data type (modify cnn_encoder in model.py).
Reoptimize the learning hyperparameters for the new dataset.

Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Related tags

Overview

Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Owner

47

Rendering color and depth images for ShapeNet models.

Converts geometry node attributes to built-in attributes

Finetune SSL models for MOS prediction

Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021

A deep learning CNN model to identify and classify and check if a person is wearing a mask or not.

Voice Conversion by CycleGAN (语音克隆/语音转换)：CycleGAN-VC3

Vignette is a face tracking software for characters using osu!framework.

Voice assistant - Voice assistant with python

Unified learning approach for egocentric hand gesture recognition and fingertip detection

PyTorch code for DriveGAN: Towards a Controllable High-Quality Neural Simulation

labelpix is a graphical image labeling interface for drawing bounding boxes

Confident Semantic Ranking Loss for Part Parsing

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

[ICCV 2021] Official PyTorch implementation for Deep Relational Metric Learning.

An end-to-end machine learning library to directly optimize AUC loss

Pretty Tensor - Fluent Neural Networks in TensorFlow

Using this codebase as a tool for my own research. Making some modifications to the original repo for my own purposes.

Awesome Transformers in Medical Imaging

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

BERT model training impelmentation using 1024 A100 GPUs for MLPerf Training v1.1