Meli Data Challenge 2021 - First Place Solution

Overview

My solution for the Meli Data Challenge 2021, first place in both public and private leaderboards.

The Model

My final model is an ensemble combining recurrent neural networks and XGBoost regressors. The neural networks are trained to predict the stock-days probability distribution using the RPS (Ranked Probability Score) as the loss function. The XGBoost regressors are trained to predict stock days using different objectives; the intuition behind each is as follows (a minimal sketch follows the list):

  • MSE loss: the regressor trained with this loss will output values close to the expected mean.
  • Pseudo-Huber loss: a smooth alternative to the MAE loss; this regressor outputs values close to the expected median.
  • Quantile loss: 11 regressors are trained using a quantile loss with alpha 0, 0.1, 0.2, ..., 1. This helps to build the final probability distribution.
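Purely as an illustration (this is not the repo's actual training code, hyperparameters are omitted, and the pinball_objective helper is my own), the three objectives above can be set up with the scikit-learn interface of XGBoost roughly like this:

import numpy as np
import xgboost as xgb

def pinball_objective(alpha):
    # Custom pinball (quantile) objective; a constant Hessian is a common
    # simplification that works well enough for gradient boosting.
    def objective(y_true, y_pred):
        err = y_true - y_pred
        grad = np.where(err > 0, -alpha, 1.0 - alpha)  # d(loss)/d(prediction)
        hess = np.ones_like(y_pred)
        return grad, hess
    return objective

mean_model = xgb.XGBRegressor(objective="reg:squarederror")       # ~ conditional mean
median_model = xgb.XGBRegressor(objective="reg:pseudohubererror")  # ~ conditional median
quantile_models = [
    xgb.XGBRegressor(objective=pinball_objective(a))
    for a in np.linspace(0.0, 1.0, 11)  # alpha = 0, 0.1, ..., 1
]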

The outputs of all these level-0 models are concatenated and used to train a feedforward neural network, again with the RPS as the loss function (sketched below).
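For reference, here is a minimal Keras sketch of an RPS loss over a 30-day softmax output, assuming y_true is a one-hot encoding of the stock-out day; this is my reading of the setup, not code lifted from the repo, and the hidden layer size is an assumption:

import tensorflow as tf

def rps_loss(y_true, y_pred):
    # RPS compares cumulative distributions, so near-misses are penalized
    # less than distant ones. Some definitions also divide by (K - 1).
    cdf_true = tf.cumsum(y_true, axis=-1)
    cdf_pred = tf.cumsum(y_pred, axis=-1)
    return tf.reduce_mean(tf.reduce_sum(tf.square(cdf_pred - cdf_true), axis=-1))

level1 = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(30, activation="softmax"),
])
level1.compile(optimizer="adam", loss=rps_loss)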

[Diagram: level-0 RNNs and XGBoost regressors feeding the level-1 feedforward network]

The last 30 days of the train dataset are used to generate the labels and the target stock input; the remaining 29 days are used to generate the time-series input (a sketch of this windowing follows).
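Roughly, and purely as an illustration (the sales array, target_stock vector, and the 29/30 day shapes are my assumptions about the data layout):

import numpy as np

# sales: (n_skus, 59) daily sold quantities; target_stock: (n_skus,)
series_input = sales[:, :29]   # first 29 days -> time-series input
label_window = sales[:, 29:]   # last 30 days  -> label generation
cum_sales = np.cumsum(label_window, axis=1)
# Label: first day (1..30) on which cumulative sales reach the target stock.
# SKUs that never reach the target within the window need separate handling.
labels = np.argmax(cum_sales >= target_stock[:, None], axis=1) + 1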

The train/validation split is done at the SKU level (see the sketch after this list):

  • For level-0 models: 450,000 SKUs are used for training and the rest for validation.
  • For the level-1 model: the SKUs used for training the level-0 models are removed from the dataset, and the remaining SKUs are split again into train/validation.
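A SKU-level split just means every row belonging to a given SKU lands on the same side of the split. As a sketch (all_sku_ids and the pandas DataFrame df are placeholders):

import numpy as np

rng = np.random.default_rng(0)
shuffled = rng.permutation(all_sku_ids)
train_skus = set(shuffled[:450_000])    # level-0 training SKUs
holdout_skus = set(shuffled[450_000:])  # validation; later re-split for level-1
train_df = df[df["sku"].isin(train_skus)]
holdout_df = df[df["sku"].isin(holdout_skus)]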

Once all models are trained, the last 29 days of the train dataset, together with the provided target stock values, are used as input to generate the submission.

Disclaimer: the entire solution lacks some fine-tuning, since I came up with this little ensemble monster towards the end of the competition. I didn't have the time to fine-tune each model (there are technically 16 models to tune if each quantile regressor is counted as an independent model).

How to run the solution

Requirements

  • TensorFlow v2.
  • XGBoost.
  • Pandas.
  • NumPy.
  • Scikit-learn.

CUDA drivers and a CUDA-compatible GPU are required (I didn't have the time to test this on a CPU).

Some scripts require up to 30GB of RAM (again, I didn't have the time to implement a more memory-efficient solution).

The solution was tested on Ubuntu 20.04 with Python 3.8.10.

Downloading the dataset

Download the dataset files from https://ml-challenge.mercadolibre.com/downloads and put them into the dataset/ directory.

On Linux, you can do that by running:

cd dataset && wget \
https://meli-data-challenge.s3.amazonaws.com/2021/test_data.csv \
https://meli-data-challenge.s3.amazonaws.com/2021/train_data.parquet \
https://meli-data-challenge.s3.amazonaws.com/2021/items_static_metadata_full.jl

Running the scripts

All-in-one script

A convenient script to run the entire solution is provided:

cd src
./run-solution.sh

Note: the entire process may take more than 3 hours to run.

Step by step

If you run into trouble with the all-in-one script, you can run the solution step by step following the instructions below:

cd into the src directory:

cd src

Extract time series from the dataset:

python3 ./preprocessing/extract-time-series.py

Generate a supervised learning dataset:

python3 ./preprocessing/generate-sl-dataset.py

Train all level-0 models:

python3 ./train-all.py

Train the level-1 ensemble:

python3 ./train-ensemble.py

Generate the submission file and gzip it:

python3 ./generate-submission.py && gzip ./submission.csv

Utility scripts

The training_scripts directory contains scripts to train each model separately. Example usage:

python3 ./training_scripts/train-lstm.py