This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Last update: Aug 19, 2022

Overview

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Link to paper

Abstract

We study prediction of future outcomes with supervised models that use privileged information during learning. The privileged information comprises samples of time series observed between the baseline time of prediction and the future outcome; this information is only available at training time which differs from the traditional supervised learning. Our question is when using this privileged data leads to more sample-efficient learning of models that use only baseline data for predictions at test time. We give an algorithm for this setting and prove that when the time series are drawn from a non-stationary Gaussian-linear dynamical system of fixed horizon, learning with privileged information is more efficient than learning without it. On synthetic data, we test the limits of our algorithm and theory, both when our assumptions hold and when they are violated. On three diverse real-world datasets, we show that our approach is generally preferable to classical learning, particularly when data is scarce. Finally, we relate our estimator to a distillation approach both theoretically and empirically.

Requirements

Required libraries found in requirements.txt

Models

Baseline and LUPTS are implemented using sklearn, the code is found in /src/model/

Evaluation

Synthethic

To re-produce experiments, run /notebooks/synthetic.ipynb Necessary experiment code is found in /src/synthetic/

Forecasting Air Quality

To re-produce experiments, run /notebooks/fivecities.ipynb Necessary experiment code is found in /src/fivecities/

The data is found in /data/fivecities/, but can also be downloaded from here.

Modeling Progression of Chronic Disease

Note: For the Alzheimer’s and Multiple myeloma progression modeling tasks, the data is not publicly available, but the code which produced the results is still found in this repository.

Alzheimer's progression modelling

Code is found in /notebooks/ADNI.ipynb and /src/adni/

Multiple myeloma progression modelling

Code is found in /notebooks/mm-prfs.ipynb and /notebooks/mm-tr.ipynb

This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Related tags

Overview

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Link to paper

Abstract

Requirements

Models

Evaluation

Synthethic

Forecasting Air Quality

Modeling Progression of Chronic Disease

Alzheimer's progression modelling

Multiple myeloma progression modelling

Owner

Rickard Karlsson

Bridging Vision and Language Model

Combine Tacotron2 and Hifi GAN to generate speech from text

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".

Automatically align face images 🙃→🙂. Can also do windowing and warping.

Degree-Quant: Quantization-Aware Training for Graph Neural Networks.

An implementation of the research paper "Retina Blood Vessel Segmentation Using A U-Net Based Convolutional Neural Network"

Unofficial implementation of the paper: PonderNet: Learning to Ponder in TensorFlow

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Best practices for segmentation of the corporate network of any company

A new data augmentation method for extreme lighting conditions.

High-Fidelity Pluralistic Image Completion with Transformers (ICCV 2021)

This repository lets you interact with Lean through a REPL.

Image Segmentation using U-Net, U-Net with skip connections and M-Net architectures

Pure python implementations of popular ML algorithms.

Good Classification Measures and How to Find Them

Segmentation for medical image.

Sudoku solver - A sudoku solver with python

Learning Chinese Character style with conditional GAN

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).