Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations

Last update: Sep 09, 2022

Related tags

Overview

HierarchicyBandit

Introduction

This is the implementation of WSDM 2022 paper : Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations
The reference codes for HCB and pHCB, which are based on three different base bandit algorithms.

LinUCB from A contextual-bandit approach to personalized news article recommendation
epsilon-Greedy [This strategy, with random exploration on an epsilon fraction of the traffic and greedy exploitation on the rest]
Thompson Sampling from Thompson Sampling for Contextual Bandits with Linear Payoffs

Files in the folder

data/
- MIND/ and TaoBao/
  - item_info.pkl: processed item file, including item id, item feature and embeddings for simulator;
  - user_info.pkl: processed user file, including user id, and embeddings for simulator;
  - item_info_ts.pkl: processed item file for Thompson sampling;
algs/: implementations of PCB and pHCB based on LinUCB.
algsE/: implementations of PCB and pHCB based on epsilon-Greedy.
algsTS/: implementations of PCB and pHCB based on Thompson Sampling.

Note

Before testing the algorithms, you should modify the settings in config.py.
For thompson sampling, we provide another 16 dimensonal feature vectors to run the experiments, since it can be faster . The original feature vectors are also work with the algorithms.
the user_info.pkl and item_info.pkl is formated as dictionary type.
The implementation of ConUCB is released at ConUCB. HMAB and ICTRUCB are specical case of CB-Category and CB-Leaf.

Usage:

Download the HierarchicyBandit.zip and unzip. You will get five folders, they are algs/, algsE/, algsTS/, data/, and logger/.

Parameters:
The config.py file contains:

dataset: is the dataset used in the experiment, it can be 'MIND' or 'TaoBao';  
T: the number of rounds of each bandit algorithm;  
k: the number of items recommended to user at each round, default is 1;  
activate_num: the hyper-papamter p for pHCB;  
activate_prob: the hyper-papamter q for pHCB;  
epsilon: the epsilon value for greedy-based algorithms;  
new_tree_file: the tree file name;  
noise_scale: the standard deviation of environmental noise;  
keep_prob: sample ratio; default is 1.0, which means testing all users.
linucb_para: the hyper-parameters for linucb algorithm;
ts_para: the hyper-parameters for thompson sampling algorithm;
poolsize: the size of candidate pool;
random_choice: whether random choice an item to user;

Environment: python 3.6 with Anaconda To run the bandit codes based on LinUCB:

$ cd algs
$ python simulator_multi_process.py

To run the bandit codes based on epsilon-Greedy:

$ cd algsE
$ python simulator_multi_process.py

To run the bandit codes based on Thompson sampling:

$ cd algsTS
$ python simulator_multi_process.py

Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations

Related tags

Overview

HierarchicyBandit

Introduction

Files in the folder

Usage:

Owner

yu song

Pytorch implementation of MLP-Mixer with loading pre-trained models.

Apply Graph Self-Supervised Learning methods to graph-level task(TUDataset, MolculeNet Datset)

Leveraging Social Influence based on Users Activity Centers for Point-of-Interest Recommendation

A memory-efficient implementation of DenseNets

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

This is the official implementation of 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection, built on SECOND.

TensorRT examples (Jetson, Python/C++)(object detection)

A fast python implementation of Ray Tracing in One Weekend using python and Taichi

ZeroVL - The official implementation of ZeroVL

Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

GAN encoders in PyTorch that could match PGGAN, StyleGAN v1/v2, and BigGAN. Code also integrates the implementation of these GANs.

[ICCV 2021] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

CONditionals for Ordinal Regression and classification in PyTorch

IMBENS: class-imbalanced ensemble learning in Python.

InterfaceGAN++: Exploring the limits of InterfaceGAN

TorchGRL is the source code for our paper Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments for IV 2022.

Implementation for the paper: Invertible Denoising Network: A Light Solution for Real Noise Removal (CVPR2021).

Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks

A Python library for differentiable optimal control on accelerators.