tsflex - feature-extraction benchmarking

Last update: Mar 25, 2022

Overview

tsflex - feature-extraction benchmarking

This repository withholds the benchmark results and visualization code of the tsflex paper and toolkit.

Flow

The benchmark process follows these steps for each feature-extraction configuration:

The corresponding feature-extraction Python script is called. This is done 20 times to average out the memory usage and create upper memory bounds. Remark that by (re)calling the script sequentially, no caching or memory is shared among the separate script-executions.
In this script:
1. Load the data and store as a pd.DataFrame
2. VizTracer starts logging
3. Create the feature extraction configuration
4. Extract & store the features
5. VizTracer stops logging
6. Write the VizTracer results to a JSON-file

The existing benchmark JSONS were collected on a desktop with an Intel(R) Xeon(R) CPU E5-2650 v2 @ 2.60GHz CPU and SAMSUNG M393B1G73QH0-CMA DDR3 1600MT/s RAM, with Ubuntu 18.04.5 LTS x86_64 as operating system. Other running processes were limited to a minimum.

Instructions

To install the required dependencies, just run:

pip install -r requirements.txt

If you want to re-run the benchmarks, use the run_scripts notebook to generate new benchmark JSONs and then visualize them with the benchmark visualization notebook.

We are open to new-benchmark use-cases via pull-requests!
Examples of other interesting benchmarks are different sample rates, other feature extraction functions, other data properties, ...

Referencing our package

If you use tsflex in a scientific publication, we would highly appreciate citing us as:

@article{vanderdonckt2021tsflex,
    author = {Van Der Donckt, Jonas and Van Der Donckt, Jeroen and Deprost, Emiel and Van Hoecke, Sofie},
    title = {tsflex: flexible time series processing \& feature extraction},
    journal = {SoftwareX},
    year = {2021},
    url = {https://github.com/predict-idlab/tsflex},
    publisher={Elsevier}
}

👤 Jonas Van Der Donckt, Jeroen Van Der Donckt

tsflex - feature-extraction benchmarking

Related tags

Overview

tsflex - feature-extraction benchmarking

Flow

Instructions

Referencing our package

Owner

PreDiCT.IDLab

The CLRS Algorithmic Reasoning Benchmark

An AI made using artificial intelligence (AI) and machine learning algorithms (ML) .

Fake-user-agent-traffic-geneator - Python CLI Tool to generate fake traffic against URLs with configurable user-agents

RCT-ART is an NLP pipeline built with spaCy for converting clinical trial result sentences into tables through jointly extracting intervention, outcome and outcome measure entities and their relations.

Provide baselines and evaluation metrics of the task: traffic flow prediction

Solutions and questions for AoC2021. Merry christmas!

i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Evaluating different engineering tricks that make RL work

Open-sourcing the Slates Dataset for recommender systems research

Machine Translation Implement By Bi-GRU And Transformer

Keep CALM and Improve Visual Feature Attribution

Fast Soft Color Segmentation

Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay

CM building dataset Timisoara

Unofficial Implement PU-Transformer

DRIFT is a tool for Diachronic Analysis of Scientific Literature.

This is the reference implementation for "Coresets via Bilevel Optimization for Continual Learning and Streaming"

PyGAD, a Python 3 library for building the genetic algorithm and training machine learning algorithms (Keras & PyTorch).

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet.

Modification of convolutional neural net "UNET" for image segmentation in Keras framework