slim-python is a package to learn customized scoring systems for decision-making problems.

Last update: Nov 02, 2022

Related tags

Overview

slim-python is a package to learn customized scoring systems for decision-making problems.

These are simple decision aids that let users make yes-no predictions by adding and subtracting a few small numbers.

SLIM is designed to learn the most accurate scoring system for a given dataset and set of constraints. These models are produced by solving a hard optimization problem that directly optimizes for accuracy, sparsity, and customized constraints (e.g., hard limits on model size, TPR, FPR).

Requirements

slim-python was developed using Python 2.7.11 and CPLEX 12.6.2.

CPLEX

CPLEX is cross-platform commercial optimization tool with a Pytho API. It is freely available to students and faculty members at accredited institutions as part of the IBM Academic Initiative. To get CPLEX:

Join the IBM Academic Initiative. Note that it may take up to a week to obtain approval.
Download IBM ILOG CPLEX Optimization Studio V12.6.1 (or higher) from the software catalog
Install the file on your computer. Note mac/unix users will need to install a .bin file.
Setup the CPLEX Python modules as described here here.

Please check the CPLEX user manual or the CPLEX forums if you have problems installing CPLEX.

Citation

If you use SLIM for academic research, please cite our paper!

@article{
    ustun2015slim,
    year = {2015},
    issn = {0885-6125},
    journal = {Machine Learning},
    doi = {10.1007/s10994-015-5528-6},
    title = {Supersparse linear integer models for optimized medical scoring systems},
    url = {http://dx.doi.org/10.1007/s10994-015-5528-6},
    publisher = { Springer US},
    author = {Ustun, Berk and Rudin, Cynthia},
    pages = {1-43},
    language = {English}
}

slim-python is a package to learn customized scoring systems for decision-making problems.

Related tags

Overview

Requirements

CPLEX

Citation

Owner

Berk Ustun

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Crypto-trading - ML techiques are used to forecast short term returns in 14 popular cryptocurrencies

Predicting diabetes over a five year period using logistic regression and the Pima First-Nation dataset

Built various Machine Learning algorithms (Logistic Regression, Random Forest, KNN, Gradient Boosting and XGBoost. etc)

Summer: compartmental disease modelling in Python

PennyLane is a cross-platform Python library for differentiable programming of quantum computers

Timeseries analysis for neuroscience data

Estudos e projetos feitos com PySpark.

Model search (MS) is a framework that implements AutoML algorithms for model architecture search at scale.

Bayesian Additive Regression Trees For Python

Python library for multilinear algebra and tensor factorizations

A collection of video resources for machine learning

About Solve CTF offline disconnection problem - based on python3's small crawler

A Python implementation of FastDTW

A Time Series Library for Apache Spark

Simple, fast, and parallelized symbolic regression in Python/Julia via regularized evolution and simulated annealing

TensorFlow implementation of an arbitrary order Factorization Machine

Datetimes for Humans™

This is a Machine Learning model which predicts the presence of Diabetes in Patients

STUMPY is a powerful and scalable Python library for computing a Matrix Profile, which can be used for a variety of time series data mining tasks