(SIGIR 2020) "Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback"

Last update: Dec 01, 2022

Overview

Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback

About

This repository accompanies the real-world experiments conducted in the paper "Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback" by Yuta Saito, which was accepted at SIGIR 2020 as a full paper.

If you find this code useful in your research, please cite:

@inproceedings{saito2020asymmetric,
  title={Asymmetric tri-training for debiasing missing-not-at-random explicit feedback},
  author={Saito, Yuta},
  booktitle={Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval},
  year={2020}
}

Dependencies

numpy==1.17.2
pandas==0.25.1
scikit-learn==0.22.1
tensorflow==1.15.2
optuna==0.17.0
pyyaml==5.1.2

Running the code

To run the simulations with real-world datasets, first prepare the data:

Download the Coat dataset from https://www.cs.cornell.edu/~schnabts/mnar/ and put the train.ascii and test.ascii files into the ./data/coat/ directory.
Download the Yahoo! R3 dataset from https://webscope.sandbox.yahoo.com/catalog.php?datatype=r and put the train.txt and test.txt files into the ./data/yahoo/ directory.
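As a quick sanity check on the downloaded Coat files, the sketch below converts a dense rating matrix into (user, item, rating) triples. It assumes that train.ascii stores one user per line as whitespace-separated ratings and that 0 marks an unobserved entry; the function name dense_to_triples is hypothetical and is not part of this repository.

```python
def dense_to_triples(text):
    """Convert a dense rating matrix (one user per line) into
    zero-based (user, item, rating) triples."""
    triples = []
    for user, line in enumerate(text.strip().splitlines()):
        for item, value in enumerate(line.split()):
            rating = int(float(value))
            if rating != 0:  # assumption: 0 marks an unobserved entry
                triples.append((user, item, rating))
    return triples


if __name__ == "__main__":
    # Tiny stand-in for the file contents: 2 users, 3 items.
    sample = "0 3 0\n5 0 1\n"
    for triple in dense_to_triples(sample):
        print(triple)
```

For the real file, the text could be read with open("./data/coat/train.ascii").read(), assuming the directory layout described above.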

Then, run the following commands in the ./src/ directory:

For the MF-IPS models without asymmetric tri-training:

for data in yahoo coat
do
  for model in uniform user item both nb nb_true
  do
    python main.py -d $data -m $model
  done
done

For the MF-IPS models with asymmetric tri-training (our proposal):

for data in coat yahoo
do
  for model in uniform-at user-at item-at both-at nb-at nb_true-at
  do
    python main.py -d $data -m $model
  done
done

where (uniform, user, item, both, nb, nb_true) correspond to (uniform propensity, user propensity, item propensity, user-item propensity, NB (uniform), NB (true)), respectively.
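To make the propensity options more concrete, here is a minimal sketch of one common way to compute count-based propensities and use them in an inverse-propensity-scored (IPS) squared error. It is an illustration under stated assumptions, not the code in main.py: the function names are hypothetical, the uniform option is assumed to be the observed density, the user and item options are assumed to be observation counts normalized by their maximum, and the nb and nb_true options are not covered.

```python
from collections import Counter


def estimate_propensities(triples, num_users, num_items, kind):
    """Return one propensity per observed (user, item, rating) triple."""
    if kind == "uniform":
        # Assumption: every user-item pair is equally likely to be observed.
        density = len(triples) / (num_users * num_items)
        return [density] * len(triples)
    user_counts = Counter(u for u, _, _ in triples)
    item_counts = Counter(i for _, i, _ in triples)
    max_user = max(user_counts.values())
    max_item = max(item_counts.values())
    if kind == "user":
        return [user_counts[u] / max_user for u, _, _ in triples]
    if kind == "item":
        return [item_counts[i] / max_item for _, i, _ in triples]
    if kind == "both":
        # Assumption: the user-item propensity is the product of the two.
        return [(user_counts[u] / max_user) * (item_counts[i] / max_item)
                for u, i, _ in triples]
    raise ValueError(f"unknown propensity kind: {kind}")


def ips_squared_error(triples, predictions, propensities, num_users, num_items):
    """IPS estimate of the mean squared error over all user-item pairs."""
    total = sum((rating - pred) ** 2 / p
                for (_, _, rating), pred, p
                in zip(triples, predictions, propensities))
    return total / (num_users * num_items)


if __name__ == "__main__":
    observed = [(0, 0, 5), (0, 1, 3), (1, 1, 4)]
    weights = estimate_propensities(observed, 2, 2, "both")
    print(weights)
    print(ips_squared_error(observed, [4, 3, 4], weights, 2, 2))
```

Dividing each squared error by its propensity up-weights ratings that were unlikely to be observed, which is what lets the weighted loss compensate for missing-not-at-random feedback.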

These commands run the real-world experiments described in Section 5 of the paper. The tuned hyperparameters for all models can be found in ./hyper_params.yaml.
(Adding the -t option to the above commands re-runs the hyperparameter tuning procedure with Optuna.)
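The two loops and the optional -t flag can also be generated from Python, for example when scripting a full re-run. The helper below is a hypothetical sketch that only builds the argument lists; it relies on nothing beyond the -d, -m, and -t options and the -at model suffix described above.

```python
def build_commands(datasets, models, tri_training=False, tune=False):
    """Build one argument list per (dataset, model) pair for main.py."""
    commands = []
    for data in datasets:
        for model in models:
            # Models trained with asymmetric tri-training carry an "-at" suffix.
            name = f"{model}-at" if tri_training else model
            argv = ["python", "main.py", "-d", data, "-m", name]
            if tune:
                argv.append("-t")  # re-run hyperparameter tuning
            commands.append(argv)
    return commands


if __name__ == "__main__":
    models = ["uniform", "user", "item", "both", "nb", "nb_true"]
    for argv in build_commands(["coat", "yahoo"], models, tri_training=True):
        print(" ".join(argv))
```

Each argument list could then be passed to subprocess.run(argv, cwd="src", check=True), assuming the script is launched from the repository root.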

Once the simulations have finished running, the summarized results can be obtained by running the following command in the ./src/ directory:

python summarize_results.py -d coat yahoo

This creates ./paper_results/.
