A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.

Last update: Nov 23, 2022

Overview

SOFA

This repository contains the implementation of SOFA, the Simulator for OFfline LeArning and evaluation, introduced in:

Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems. Jin Huang, Harrie Oosterhuis, Maarten de Rijke, Herke van Hoof. RecSys 2020.

In reinforcement learning for recommendation (RL4Rec), the policy typically interacts with a simulation-based environment: a state is the user's historical interactions, an action is an item recommended by the recommender system (RS), and a reward is derived from the user's feedback.
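
The interaction described above can be sketched as a small loop. This is a minimal illustration only, not the repository's API: `ToySimulator`, `run_episode`, and the per-item preference list `ratings` are hypothetical names, and the reward here is assumed to be binary feedback drawn from those preferences.

```python
import random


class ToySimulator:
    """Hypothetical stand-in for a simulation-based environment (not SOFA's API)."""

    def __init__(self, ratings, history_len=3, seed=0):
        # ratings[item] is the simulated user's preference for that item, in [0, 1].
        self.ratings = ratings
        self.history_len = history_len
        self.rng = random.Random(seed)
        self.history = []

    def reset(self):
        # Start a new session with an empty interaction history.
        self.history = []
        return tuple(self.history)

    def step(self, item):
        # Reward: 1 if the simulated user gives positive feedback, else 0.
        reward = 1 if self.rng.random() < self.ratings[item] else 0
        # State: the most recent (item, feedback) pairs.
        self.history = (self.history + [(item, reward)])[-self.history_len:]
        return tuple(self.history), reward


def run_episode(env, policy, num_steps):
    """Roll out a policy (a function from state to item id) and sum the rewards."""
    state = env.reset()
    total_reward = 0
    for _ in range(num_steps):
        action = policy(state)            # the item being recommended
        state, reward = env.step(action)  # simulated user feedback
        total_reward += reward
    return total_reward


if __name__ == "__main__":
    env = ToySimulator(ratings=[0.2, 0.9, 0.5])
    # A trivial policy that always recommends item 1.
    print(run_episode(env, lambda state: 1, num_steps=10))
```

A learned policy such as a DQN would replace the constant `lambda` and would be updated from the `(state, action, reward, next state)` transitions produced by the loop.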

To counter the effect of bias in logged data, we introduce a debiasing step in the simulation pipeline that corrects for these biases before the data is used to simulate user behavior.
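
One common way to correct for selection bias in logged ratings is inverse propensity scoring (IPS), where each observed rating is weighted by the inverse of its estimated probability of being observed. The sketch below assumes a simple popularity-based propensity estimate; the function names, the `(user, item, rating)` log format, and the choice of estimator are illustrative assumptions and are not taken from this repository.

```python
from collections import Counter


def estimate_propensities(logged, num_items, floor=1e-6):
    """Popularity-based propensities: items observed more often get a higher value.

    `logged` is a list of (user, item, rating) tuples. This estimator is an
    assumption made for illustration.
    """
    counts = Counter(item for _, item, _ in logged)
    max_count = max(counts.values())
    # Clip at `floor` so never-observed items do not cause division by zero.
    return [max(counts.get(i, 0) / max_count, floor) for i in range(num_items)]


def ips_weighted_mse(logged, predict, propensities, num_users, num_items):
    """IPS estimate of the mean squared error over the full user-item matrix.

    Each observed squared error is divided by the propensity of its item, so
    rarely observed items count more than frequently observed ones.
    """
    total = sum(
        (predict(user, item) - rating) ** 2 / propensities[item]
        for user, item, rating in logged
    )
    return total / (num_users * num_items)


if __name__ == "__main__":
    logged = [(0, 0, 5.0), (1, 0, 4.0), (0, 1, 1.0)]
    propensities = estimate_propensities(logged, num_items=2)
    loss = ips_weighted_mse(
        logged, lambda user, item: 3.0, propensities, num_users=2, num_items=2
    )
    print(propensities, loss)
```

A rating model trained to minimize a loss of this form could then supply the debiased preferences from which the simulator generates user feedback.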

Running the code

$ cd examples
$ python run_dqn.py

More details

We provide the details of the DQN-based policy used in the experiments and the related hyperparameters (see Appendix). We also provide the slides used for the presentation at RecSys 2020.

Cite

If you use our code, please cite our paper:

@inproceedings{huang2020keeping,
  title={Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems},
  author={Huang, Jin and Oosterhuis, Harrie and de Rijke, Maarten and van Hoof, Herke},
  booktitle={Fourteenth ACM Conference on Recommender Systems},
  pages={190--199},
  year={2020}
}
