Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression

Last update: Mar 15, 2022

Related tags

Deep Learning LassoBench

Overview

LassoBench

LassoBench is a library for high-dimensional hyperparameter optimization benchmarks based on Weighted Lasso regression.

Note: LassoBench is under active construction. Follow for more benchmarks soon.

Install and work with the development version

From a console or terminal clone the repository and install LassoBench:

git clone https://github.com/ksehic/LassoBench.git
cd LassoBench/
pip install -e .

Overview

The objective is to optimize the multi-dimensional hyperparameter that balances the least-squares estimation and the penalty term that promotes the sparsity.

The ambient space bounds are defined between [-1, 1].

LassoBench comes with two classes SyntheticBenchmark and RealBenchmark. While RealBenchmark is based on real-world applications found in medicine and finance, SyntheticBenchmark covers synthetic well-defined conditions. The user can select one of the predefined synthetic benchmarks or create a different bechmark.

Each benchmark comes with .evaluate that is used to evaluate the objective function, .test that provides the post-processing metrics (such as MSE on the test data and the F-score for synt benchs) and the argument mf_opt to define the multi-fidelity framework that is evaluated via .fidelity_evaluate.

The results are compared with the baselines LassoCV (.run_LASSOCV), AdaptiveLassoCV (to be implemented soon) and Sparse-HO (.run_sparseho).

Simple experiments are provided in example.py. In hesbo_example.py and alebo_example.py, we demostrate how to use LassoBench with some well-known HPO algorithms for high-dimensional problems.

Please refer to the reference for more details.

.
├── ...
├── example                    # Examples how to use LassoBench for HDBO algorithms
│   ├── alebo_example.py       # ALEBO applied on synt bench
│   ├── example.py             # Simple cases how to run with synt, real, and multifidelity benchs
│   ├── hesbo_example.py        # HesBO applied on synt and real bench
│   ├── hesbo_lib.pu            # HesBO library
│
└── ...

License

LassoBench is distributed under the MIT license. More information on the license can be found here

Simple synthetic bench code

import numpy as np
import LassoBench
synt_bench = LassoBench.SyntheticBenchmark(pick_bench='synt_simple')
d = synt_bench.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
loss = synt_bench.evaluate(random_config)

Real-world bench code

import numpy as np
import LassoBench
real_bench = LassoBench.RealBenchmark(pick_data='rcv1')
d = real_bench.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
loss = real_bench.evaluate(random_config)

Multi-information source bench code

import numpy as np
import LassoBench
real_bench_mf = LassoBench.RealBenchmark(pick_data='rcv1', mf_opt='discrete_fidelity')
d = real_bench_mf.n_features
random_config = np.random.uniform(low=-1.0, high=1.0, size=(d,))
fidelity_pick = 0
loss = real_bench_mf.fidelity_evaluate(random_config, index_fidelity=fidelity_pick)

List of synthetic benchmarks

Name	Dimensionality	Axis-aligned Subspace
synt_simple	60	3
synt_medium	100	5
synt_high	300	15
synt_hard	1000	50

List of real world benchmarks

Name	Dimensionality	Approx. Axis-aligned Subspace
breast_cancer	10	3
diabetes	8	5
leukemia	7 129	22
dna	180	43
rcv1	19 959	75

Cite

If you use this code, please cite:


Šehić Kenan, Gramfort Alexandre, Salmon Joseph and Nardi Luigi. "LassoBench: A High-Dimensional Hyperparameter Optimization Benchmark Suite for Lasso", TBD, 2021.

Benchmark library for high-dimensional HPO of black-box models based on Weighted Lasso regression

Related tags

Overview

LassoBench

Install and work with the development version

Overview

License

Simple synthetic bench code

Real-world bench code

Multi-information source bench code

List of synthetic benchmarks

List of real world benchmarks

Cite

Owner

Kenan Šehić

Use evolutionary algorithms instead of gridsearch in scikit-learn

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL 2021.

Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021

Convert Table data to approximate values with GUI

Official Pytorch implementation for video neural representation (NeRV)

PySOT - SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Supplementary code for TISMIR paper "Sliding-Window Pitch-Class Histograms as a Means of Modeling Musical Form"

Detecting drunk people through thermal images using Deep Learning (CNN)

Metric learning algorithms in Python

Accuracy Aligned. Concise Implementation of Swin Transformer

Tutorial in Python targeted at Epidemiologists. Will discuss the basics of analysis in Python 3

A Kitti Road Segmentation model implemented in tensorflow.

Learning multiple gaits of quadruped robot using hierarchical reinforcement learning

Semantic similarity computation with different state-of-the-art metrics

Code for the paper "Relation of the Relations: A New Formalization of the Relation Extraction Problem"

Code-free deep segmentation for computational pathology

[TPDS'21] COSCO: Container Orchestration using Co-Simulation and Gradient Based Optimization for Fog Computing Environments

This is the pytorch code for the paper Curious Representation Learning for Embodied Intelligence.

Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”