PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

Last update: Nov 09, 2022

Related tags

Deep Learning smile-mi-estimator

Overview

Smoothed Mutual Information ``Lower Bound'' Estimator

PyTorch implementation for the ICLR 2020 paper Understanding the Limitations of Variational Mutual Information Estimators.

by Jiaming Song and Stefano Ermon, Stanford Artificial Intelligence Laboratory.

Running the experiments

The code depends on PyTorch >= 1.2, numpy, pandas and matplotlib. It has been tested on both Python 3.7.

We implement several mutual information estimators, including:

InfoNCE: Contrastive predictive coding / Info Noise Contrastive Estimation.
NWJ: Variational representation of the KL divergence (lower bound).
NWJ (JS): Train with variational representation of JS divergence lower bound, evaluate with KL.
MINE / DV: Variational representation of the KL divergence based on Donsker-Varadhan inequality.
SMILE: our method with clipping for estimating partition functions.

These functions are implemented in estimators.py.

See demo.ipynb for the procedures to produce the figures in the paper.

Citation

If you use this code for your research, please cite our paper:

@article{song2020understanding,
  title="Understanding the Limitations of Variational Mutual Information Estimators",
  author="Song, Jiaming and Ermon, Stefano",
  conference="International Conference on Learning Representations",
  year="2020"
}

Contact

[email protected]

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

Related tags

Overview

Smoothed Mutual Information ``Lower Bound'' Estimator

Running the experiments

Citation

Contact

Owner

Python wrapper of LSODA (solving ODEs) which can be called from within numba functions.

An implementation of Deep Graph Infomax (DGI) in PyTorch

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)

Categorical Depth Distribution Network for Monocular 3D Object Detection

quantize aware training package for NCNN on pytorch

Rethinking of Pedestrian Attribute Recognition: A Reliable Evaluation under Zero-Shot Pedestrian Identity Setting

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

Compact Bidirectional Transformer for Image Captioning

This code is a near-infrared spectrum modeling method based on PCA and pls

SSPNet: Scale Selection Pyramid Network for Tiny Person Detection from UAV Images.

Semi-supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks

CATE: Computation-aware Neural Architecture Encoding with Transformers

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

A Simple Example for Imitation Learning with Dataset Aggregation (DAGGER) on Torcs Env

Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement

Human-Pose-and-Motion History

A 10000+ hours dataset for Chinese speech recognition

How to use TensorLayer

Minimal implementation and experiments of "No-Transaction Band Network: A Neural Network Architecture for Efficient Deep Hedging".

An implementation of the BADGE batch active learning algorithm.