Extreme Dynamic Classifier Chains

Classifier chains are a key technique in multi-label classification, since they make it possible to take label dependencies into account effectively. However, the classifiers are aligned according to a static order of the labels. In dynamic classifier chains (DCC), the label ordering is instead chosen dynamically for each prediction, depending on the instance at hand. We combine this concept with extreme gradient boosted trees (XGBoost), an effective and scalable state-of-the-art technique, and incorporate DCC into a fast multi-label extension of XGBoost, which we make publicly available. As only positive labels have to be predicted, and these are usually few, the training costs can be reduced substantially further. Moreover, as experiments on ten datasets show, the length of the chain allows for more control over the usage of previous predictions and hence over the measure one wants to optimize.
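
To make the idea of a per-instance label order concrete, here is a minimal sketch of a dynamic chain prediction loop. It is not the XGBoost-based implementation from this repository: `score_fn` and `toy_score` are hypothetical stand-ins for the per-round models, and picking the highest-scoring label that has not been predicted yet is an assumption made for illustration.

```python
from typing import Callable, List, Sequence


def dynamic_chain_predict(
    x: Sequence[float],
    score_fn: Callable[[Sequence[float], Sequence[int]], Sequence[float]],
    num_labels: int,
    chain_length: int,
) -> List[int]:
    """Return label indices in the order they were predicted for instance x."""
    assigned = [0] * num_labels  # 1 marks labels predicted in earlier rounds
    order: List[int] = []
    for _ in range(min(chain_length, num_labels)):
        # Scores may depend on the features and on the previous predictions.
        scores = score_fn(x, assigned)
        remaining = [j for j in range(num_labels) if not assigned[j]]
        # Assumption: each round picks the best-scoring label not yet predicted.
        best = max(remaining, key=lambda j: scores[j])
        assigned[best] = 1
        order.append(best)
    return order


def toy_score(x, assigned):
    # Hypothetical scorer: label 2 becomes more likely once label 0 is set.
    bonus = 0.5 if assigned[0] else 0.0
    return [x[0], x[1], x[2] + bonus]


print(dynamic_chain_predict([0.9, 0.6, 0.3], toy_score, num_labels=3, chain_length=2))
print(dynamic_chain_predict([0.1, 0.6, 0.3], toy_score, num_labels=3, chain_length=2))
```

In this toy setup the two instances receive different label orders, and a shorter `chain_length` limits how many earlier predictions can influence later ones.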

Installation

First, build the modified multi-label version of XGBoost and install the resulting Python package, which is needed to build the dynamic chain model. This requires MinGW (i.e. the mingw32-make command) and Python 3. To start the build, run the following commands:

cd XGBoost_ML
mingw32-make -j4

After a successful build, the Python package can be installed:

cd python-package
python setup.py install

You should now be able to import the package into your Python project:

import xgboost as xgb

Training the Dynamic Chain Model

We recommend running the models by calling train_dcc.py from a console. Place all datasets as .arff files in the datasets directory. Append -train to the file name of the train set and -test to the file name of the test set.
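
As a quick way to check the naming convention before starting a run, the following sketch builds the expected file names for a dataset and reports which ones are missing. The helper names `dataset_paths` and `missing_files` are hypothetical and not part of train_dcc.py; the dataset name `emotions` is only an example.

```python
from pathlib import Path
from typing import List, Tuple


def dataset_paths(name: str, datasets_dir: str = "datasets") -> Tuple[Path, Path]:
    """Build the expected train/test file paths for a dataset name."""
    base = Path(datasets_dir)
    return base / f"{name}-train.arff", base / f"{name}-test.arff"


def missing_files(name: str, datasets_dir: str = "datasets") -> List[Path]:
    """Return the expected files that do not exist yet."""
    return [p for p in dataset_paths(name, datasets_dir) if not p.is_file()]


train_path, test_path = dataset_paths("emotions")
print(train_path.name, test_path.name)
print("missing:", [str(p) for p in missing_files("emotions")])
```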

Parameters:

The following parameters are available:

Parameter	Short	Description	Required
`--filename <string>`	`-f`	Name of your dataset .arff file located in the datasets sub-directory	yes
`--num_labels <int>`	`-l`	Number of labels in the dataset	yes
`--models <string>`	`-m`	Comma-separated list of all models that will be built. Available options: `dcc`: the proposed dynamic chain model; `sxgb`: a single multilabel XGBoost model; `cc-dcc`: a classifier chain with the label order of a previously built dynamic chain; `cc-freq`: a classifier chain with a label order sorted by label frequency (frequent to rare) in the train set; `cc-rare`: a classifier chain with a label order sorted by label frequency (rare to frequent) in the train set; `cc-rand`: a classifier chain with a random label order; `br`: a binary relevance model. Example: `-m "dcc,br"`	yes
`--validation <int>`	`-v`	Size of the validation set in percent. The first N% of the train set will be used for validating the model. If the parameter is not set, the test set will be used for evaluation. Example: with `--validation 20`, the first 20% will be used for evaluation and the last 80% for training. (default: 0)	no
`--max_depth <int>`	`-d`	Max depth of each XGBoost multilabel tree (default: 10)	no
`--num_rounds <int>`	`-r`	Number of boosting rounds of each XGBoost model (default: 10)	no
`--chain_length <int>`	`-c`	Length of the chain, i.e. the number of labeling rounds. Each round builds a new XGBoost model that will predict a single label per instance (default: num_labels)	no
`--split <int>`	`-s`	Index of the split method used for building the trees. Available options: maxGain: 1, maxWeight: 2, sumGain: 3, sumWeight: 4, maxAbsGain: 5, sumAbsGain: 6 (default: 1)	no
`--parameters <string>`	`-p`	XGBoost parameters used for each model in the chain. Example: `-p "{'silent':1, 'eta':0.1}"` (default: {})	no
`--features_to_transform <string>`	`-t`	A list of all features in the dataset that have to be encoded. XGBoost can only process numerical features. Use this parameter to encode categorical features. Example: `-t "featureA,featureB"`	no
`--output_extra`	`-o`	Write extended log and json files (default: True)	no
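
Two of the options above are easy to misread, so here is a small sketch of how they could be interpreted: `--validation` takes the first part of the train set for evaluation, and `cc-freq` / `cc-rare` order the labels by how often they occur in the train set. The function names are hypothetical, and the rounding and tie-breaking rules are assumptions; the actual behavior of train_dcc.py may differ.

```python
from typing import List, Sequence, Tuple


def split_validation(rows: Sequence, percent: int) -> Tuple[list, list]:
    """Return (validation, train): the first `percent` % and the remainder."""
    cut = len(rows) * percent // 100  # assumption: round down
    return list(rows[:cut]), list(rows[cut:])


def label_order(label_matrix: Sequence[Sequence[int]], rare_first: bool = False) -> List[int]:
    """Order label indices by frequency (frequent to rare unless rare_first)."""
    num_labels = len(label_matrix[0])
    counts = [sum(row[j] for row in label_matrix) for j in range(num_labels)]
    # Assumption: ties are broken by label index.
    return sorted(
        range(num_labels),
        key=lambda j: (counts[j] if rare_first else -counts[j], j),
    )


rows = list(range(10))
validation, train = split_validation(rows, 20)
print(validation, train)

# Toy binary label matrix: 4 instances, 3 labels.
labels = [[1, 0, 1], [1, 0, 0], [1, 1, 0], [0, 0, 1]]
print(label_order(labels))                   # frequent to rare, as in cc-freq
print(label_order(labels, rare_first=True))  # rare to frequent, as in cc-rare
```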

Example

We train two models, the dynamic chain and a binary relevance model, on a dataset called emotions with 6 labels. We therefore specify the models with -m "dcc, br" and the dataset with -f "emotions". The files for training and testing are placed in the datasets directory:

project
│   README.md
│   train_dcc.py   
│
└───datasets
│   │   emotions-train.arff
│   │   emotions-test.arff
│   
└───XGBoost_ML
    │   ...

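The dataset has 6 labels, so we use -l 6. The dcc model should build a full chain with 6 models, so we set -c 6 (this is also the default, since the chain length defaults to the number of labels). All XGBoost models, including the one for binary relevance, should train for 100 rounds with a maximum tree depth of 10 and a step size of 0.1. Therefore we add -p "{'eta':0.1}" -r 100 -d 10.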
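
The value passed to -p looks like a Python dictionary literal. As a sketch of how such a string could be turned into a parameter dictionary and combined with the depth setting, assuming it is parsed with `ast.literal_eval` (the actual parsing inside train_dcc.py is not shown in this README, and `build_params` is a hypothetical helper):

```python
import ast


def build_params(param_string: str, max_depth: int) -> dict:
    """Parse a dict literal such as "{'eta':0.1}" and add max_depth."""
    # literal_eval only evaluates Python literals, not arbitrary code.
    parsed = ast.literal_eval(param_string)
    if not isinstance(parsed, dict):
        raise ValueError("expected a dictionary literal")
    params = dict(parsed)
    # Assumption: an explicit max_depth inside the string takes precedence.
    params.setdefault("max_depth", max_depth)
    return params


params = build_params("{'eta':0.1}", max_depth=10)
print(params)
```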

The full command to train and evaluate both models is:

 train_dcc.py -p "{'eta':0.1}" -f "emotions" -l 6 -r 100 -d 10 -c 6 -m 'dcc, br'
