This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

Last update: Jan 05, 2023

Overview

Behavior-Sequence-Transformer-Pytorch

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

This model is a novel recommender architecture based on seq2seq models. We translate user behaviour into sequences and predict a rating for each target item (movie).

Dataset

For this implementation we used Movielens 1M Dataset that contains timestamps per each rating, making it perfect to test in the sequence recommendation model.

Running

You can run it in colab here. If you prefer to run locally the model architecture is contained on pytorch-best.ipynb while data processing is on the prepare_data.ipynb notebook and should be run first.

Results

Training on all-1 user ratings and leaving the latest rating for test we obtain the following results

Dataset	MAE	RMSE
Train	0.72	0.84
Test	0.74	0.93

Here is a screenshot of training logs we we see overfitting from epoch 12-15.

References

Original paper 1
Keras implementation 2
Tensorflow implementation 3

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

Related tags

Overview

Behavior-Sequence-Transformer-Pytorch

Dataset

Running

Results

References

Owner

Jaime Ferrando Huertas

(ICCV 2021 Oral) Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation.

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

DLL: Direct Lidar Localization

Code for the RA-L (ICRA) 2021 paper "SeqNet: Learning Descriptors for Sequence-Based Hierarchical Place Recognition"

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

This thesis is mainly concerned with state-space methods for a class of deep Gaussian process (DGP) regression problems

"Inductive Entity Representations from Text via Link Prediction" @ The Web Conference 2021

Rank 3 : Source code for OPPO 6G Data Generation Challenge

Official Pytorch implementation of Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference (ICLR 2022)

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

Neural Scene Graphs for Dynamic Scene (CVPR 2021)

Code for Deep Single-image Portrait Image Relighting

Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision

Predictive AI layer for existing databases.

Contrastive Loss Gradient Attack (CLGA)

DWIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data.

DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes with Biharmonic Coordinates

Fbone (Flask bone) is a Flask (Python microframework) starter/template/bootstrap/boilerplate application.

PyTorch implementation of the Pose Residual Network (PRN)