ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions

Last update: Dec 17, 2022

Related tags

Machine Learning eli5

Overview

ELI5

ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions.

It provides support for the following machine learning frameworks and packages:

scikit-learn. Currently ELI5 allows to explain weights and predictions of scikit-learn linear classifiers and regressors, print decision trees as text or as SVG, show feature importances and explain predictions of decision trees and tree-based ensembles. ELI5 understands text processing utilities from scikit-learn and can highlight text data accordingly. Pipeline and FeatureUnion are supported. It also allows to debug scikit-learn pipelines which contain HashingVectorizer, by undoing hashing.
Keras - explain predictions of image classifiers via Grad-CAM visualizations.
xgboost - show feature importances and explain predictions of XGBClassifier, XGBRegressor and xgboost.Booster.
LightGBM - show feature importances and explain predictions of LGBMClassifier, LGBMRegressor and lightgbm.Booster.
CatBoost - show feature importances of CatBoostClassifier, CatBoostRegressor and catboost.CatBoost.
lightning - explain weights and predictions of lightning classifiers and regressors.
sklearn-crfsuite. ELI5 allows to check weights of sklearn_crfsuite.CRF models.

ELI5 also implements several algorithms for inspecting black-box models (see Inspecting Black-Box Estimators):

TextExplainer allows to explain predictions of any text classifier using LIME algorithm (Ribeiro et al., 2016). There are utilities for using LIME with non-text data and arbitrary black-box classifiers as well, but this feature is currently experimental.
Permutation importance method can be used to compute feature importances for black box estimators.

Explanation and formatting are separated; you can get text-based explanation to display in console, HTML version embeddable in an IPython notebook or web dashboards, a pandas.DataFrame object if you want to process results further, or JSON version which allows to implement custom rendering and formatting on a client.

License is MIT.

Check docs for more.

Note

This is the same project as https://github.com/TeamHG-Memex/eli5/, but due to temporary github access issues, 0.11 release is prepared in https://github.com/eli5-org/eli5 (this repo).

ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions

Related tags

Overview

ELI5

Owner

Case studies with Bayesian methods

Python implementation of Weng-Lin Bayesian ranking, a better, license-free alternative to TrueSkill

The Simpsons and Machine Learning: What makes an Episode Great?

A collection of video resources for machine learning

Price Prediction model is used to develop an LSTM model to predict the future market price of Bitcoin and Ethereum.

Open source time series library for Python

Diabetes Prediction with Logistic Regression

A library of sklearn compatible categorical variable encoders

This repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

CobraML: Completely Customizable A python ML library designed to give the end user full control

A quick reference guide to the most commonly used patterns and functions in PySpark SQL

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

Warren - Stock Price Predictor

Traingenerator 🧙 A web app to generate template code for machine learning ✨

Predicting job salaries from ads - a Kaggle competition

This project used bitcoin, S&P500, and gold to construct an investment portfolio that aimed to minimize risk by minimizing variance.

Uses WiFi signals :signal_strength: and machine learning to predict where you are

Book Recommender System Using Sci-kit learn N-neighbours

CrayLabs and user contibuted examples of using SmartSim for various simulation and machine learning applications.

Short PhD seminar on Machine Learning Security (Adversarial Machine Learning)