Contrastive Learning for Neural Topic Model

This repository contains the implementation of the paper Contrastive Learning for Neural Topic Model.

Thong Nguyen, Luu Anh Tuan (NeurIPS 2021)

In this work, we target the problem of capturing meaningful representations through modeling the relations among samples from a mathematical perspective and propose a novel contrastive objective to train the neural topic model, along with the optimization of the variational lower bound. In our contrastive learning framework, we introduce a novel sampling strategy that is motivated by human behavior when comparing numerous documents. Our results show that capturing mutual information between the prototype and its positive sample provides a strong foundation for constructing coherent topics, while differentiating the prototype from the negative samples plays a less fundamental role.

@inproceedings{
nguyen2021contrastive,
title={Contrastive Learning for Neural Topic Model},
author={Thong Thanh Nguyen and Anh Tuan Luu},
booktitle={Advances in Neural Information Processing Systems},
editor={A. Beygelzimer and Y. Dauphin and P. Liang and J. Wortman Vaughan},
year={2021},
url={https://openreview.net/forum?id=NEgqO9yB7e}
}

Requirements

python3
pandas
gensim
numpy
torchvision
pytorch 1.7.0
scipy

How to Run

Download and put the dataset in the data folder: https://drive.google.com/file/d/1JeeUCzBRQqJUvdWGDN7aMRvIoBAIbZIc/view?usp=sharing
Train the model by running ./scripts/train_models/run_{dataset}_{topk}.sh
Evaluate the model via executing ./scripts/evaluate/run_{dataset}_npmi_{topk}.sh

Acknowledgement

Our implementation is based on the official code of SCHOLAR.

CLNTM - Contrastive Learning for Neural Topic Model

Related tags

Overview

Contrastive Learning for Neural Topic Model

Requirements

How to Run

Acknowledgement

Owner

Thong Thanh Nguyen

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

Source codes of CenterTrack++ in 2021 ICME Workshop on Big Surveillance Data Processing and Analysis

An Object Oriented Programming (OOP) interface for Ontology Web language (OWL) ontologies.

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

Tgbox-bench - Simple TGBOX upload speed benchmark

UMPNet: Universal Manipulation Policy Network for Articulated Objects

labelpix is a graphical image labeling interface for drawing bounding boxes

Official implementation of "Learning Forward Dynamics Model and Informed Trajectory Sampler for Safe Quadruped Navigation" (RSS 2022)

Python Fanduel API (2021) - Lineup Automation

Flax is a neural network ecosystem for JAX that is designed for flexibility.

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Model Quantization Benchmark

Probabilistic Cross-Modal Embedding (PCME) CVPR 2021

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

Facial detection, landmark tracking and expression transfer library for Windows, Linux and Mac

Cross-Image Region Mining with Region Prototypical Network for Weakly Supervised Segmentation

Self-Learning - Books Papers, Courses & more I have to learn soon

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids

Neural Dynamic Policies for End-to-End Sensorimotor Learning