The source code and dataset for the RecGURU paper (WSDM 2022)

Last update: Jan 07, 2023

Overview

RecGURU

About The Project

Source code and baselines for the RecGURU paper "RecGURU: Adversarial Learning of Generalized User Representations for Cross-Domain Recommendation (WSDM 2022)"

Code Structure

RecGURU  
├── README.md                                 Read me file 
├── data_process                              Data processing methods
│   ├── __init__.py                           Package initialization file     
│   └── amazon_csv.py                         Code for processing the amazon data (in .csv format)
│   └── business_process.py                   Code for processing the collected data
│   └── item_frequency.py                     Calculate item frequency in each domain
│   └── run.sh                                Shell script to perform data processing  
├── GURU                                      Scripts for modeling, training, and testing 
│   ├── data                                  Dataloader package      
│     ├── __init__.py                         Package initialization file 
│     ├── data_loader.py                      Customized dataloaders 
│   └── tools                                 Tools such as loss function, evaluation metrics, etc.
│     ├── __init__.py                         Package initialization file
│     ├── lossfunction.py                     Customized loss functions
│     ├── metrics.py                          Evaluation metrics
│     ├── plot.py                             Plot function
│     ├── utils.py                            Other tools
│  ├── Transformer                            Transformer package
│     ├── __init__.py                         Package initialization 
│     ├── transformer.py                      transformer module
│  ├── AutoEnc4Rec.py                         Autoencoder based sequential recommender
│  ├── AutoEnc4Rec_cross.py                   Cross-domain recommender modules
│  ├── config_auto4rec.py                     Model configuration file
│  ├── gan_training.py                        Training methods of the GAN framework
│  ├── train_auto.py                          Main function for training and testing single-domain sequential recommender
│  ├── train_gan.py                           Main function for training and testing cross-domain sequential recommender
└── .gitignore                                gitignore file

Dataset

The public datasets: Amazon view dataset at: https://nijianmo.github.io/amazon/index.html
Collected datasets: https://drive.google.com/file/d/1NbP48emGPr80nL49oeDtPDR3R8YEfn4J/view
Data processing:

Amazon dataset:

```shell
cd ../data_process
python amazon_csv.py   
```

Collected dataset

```shell
cd ../data_process
python business_process.py --rate 0.1  # portion of overlapping user = 0.1   
```

After data process, for each cross-domain scenario we have a dataset folder:

."a_domain"-"b_domain"
├── a_only.pickle         # users in domain a only
├── b_only.pickle         # users in domain b only
├── a.pickle              # all users in domain a
├── b.pickle              # all users in domain b
├── a_b.pickle            # overlapped users of domain a and b

Note: see the code for processing details and make modifications accordingly.

Run

Single-domain Methods:

# SAS
python train_auto.py --sas "True"
# AutoRec (ours)
python train_auto.py

Cross-Domain Methods:

# RecGURU
python train_gan.py --cross "True"

The source code and dataset for the RecGURU paper (WSDM 2022)

Related tags

Overview

RecGURU

About The Project

Code Structure

Dataset

Amazon dataset:

Collected dataset

Run

Owner

Chenglin Li

OpenVisionAPI server

Dataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection" accepted in CODS-COMAD 2020

Make differentially private training of transformers easy for everyone

ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Pytorch implementation of the paper DocEnTr: An End-to-End Document Image Enhancement Transformer.

Python PID Tuner - Based on a FOPDT model obtained using a Open Loop Process Reaction Curve

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).

Tools for computational pathology

Official Pytorch implementation for video neural representation (NeRV)

[CVPR'21] DeepSurfels: Learning Online Appearance Fusion

PyGAD, a Python 3 library for building the genetic algorithm and training machine learning algorithms (Keras & PyTorch).

PyTorch implementations of algorithms for density estimation

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"

Convolutional neural network that analyzes self-generated images in a variety of languages to find etymological similarities

Answering Open-Domain Questions of Varying Reasoning Steps from Text

PCGNN - Procedural Content Generation with NEAT and Novelty

RoboDesk A Multi-Task Reinforcement Learning Benchmark

Gradient Inversion with Generative Image Prior