scikit-multimodallearn is a Python package implementing algorithms for multimodal data.

Overview

scikit-multimodallearn

scikit-multimodallearn is a Python package implementing algorithms for multimodal data.

It is compatible with scikit-learn, a popular package for machine learning in Python.

Documentation

The documentation, including installation instructions, API documentation and examples, is available online.

Installation

Dependencies

scikit-multimodallearn works with Python 3.5 or later.

scikit-multimodallearn depends on scikit-learn (version >= 0.19).

Optionally, matplotlib is required to run the examples.
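
The requirements above can be checked before installing. The following is a minimal sketch, not part of the package: the helper names parse_version, meets_minimum and check_dependencies are hypothetical, and the version parsing only looks at the leading numeric components of the version string.

```python
import re
import sys

# Minimum versions stated in this README.
MIN_PYTHON = (3, 5)
MIN_SKLEARN = (0, 19)


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple.

    Hypothetical helper: suffixes such as ".dev0" or "rc1" are ignored.
    """
    match = re.match(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError("no numeric version found in {!r}".format(text))
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(version_text, minimum):
    """Tell whether version_text is at least the given minimum tuple."""
    return parse_version(version_text) >= minimum


def check_dependencies():
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []
    if sys.version_info[:2] < MIN_PYTHON:
        problems.append("Python 3.5 or later is required")
    try:
        import sklearn
    except ImportError:
        problems.append("scikit-learn is not installed")
    else:
        if not meets_minimum(sklearn.__version__, MIN_SKLEARN):
            problems.append("scikit-learn >= 0.19 is required")
    return problems


if __name__ == "__main__":
    for problem in check_dependencies():
        print(problem)
```

Running the script prints one line per unmet requirement and nothing when both requirements are satisfied.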

Installation using pip

scikit-multimodallearn is available on PyPI and can be installed using pip:

pip install scikit-multimodallearn
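
To confirm that the installation succeeded, the installed distribution can be looked up by its PyPI name. This is a small sketch under two assumptions: it relies on importlib.metadata, which is only in the standard library from Python 3.8 onward, and the function name installed_version is hypothetical.

```python
from importlib import metadata  # assumption: Python 3.8 or later


def installed_version(distribution="scikit-multimodallearn"):
    """Return the installed version of a distribution, or None if absent.

    Hypothetical helper; the default is the PyPI name used in this README.
    """
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("scikit-multimodallearn is not installed")
    else:
        print("scikit-multimodallearn", version)
```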

Development

The development of this package follows the guidelines provided by the scikit-learn community.

Refer to the Developer's Guide of the scikit-learn project for more details.

Source code

You can get the source code from the Git repository of the project:

git clone [email protected]:dev/multiconfusion.git

Testing

Running the test suite requires pytest and pytest-cov:

cd multimodal
pytest

A code coverage report is displayed in the terminal when running the tests. An HTML version of the report is also stored in the directory htmlcov.
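
The HTML report can be opened from Python once the tests have run. The sketch below assumes the report entry page is htmlcov/index.html, which is the usual layout of coverage HTML reports; the function name coverage_report_uri is hypothetical.

```python
import webbrowser
from pathlib import Path


def coverage_report_uri(project_dir="."):
    """Build a file:// URI for the HTML coverage report.

    Assumption: the entry page is htmlcov/index.html under project_dir.
    """
    report = Path(project_dir).resolve() / "htmlcov" / "index.html"
    return report.as_uri()


if __name__ == "__main__":
    # Opens the report in the default browser; run the tests first.
    webbrowser.open(coverage_report_uri())
```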

Generating the documentation

Generating the documentation requires sphinx, sphinx-gallery, numpydoc and matplotlib. Build it with:

python setup.py build_sphinx

The resulting files are stored in the directory build/sphinx/html.

Credits

scikit-multimodallearn is developed by the development team of the LIS.

If you use scikit-multimodallearn in a scientific publication, please cite the following paper:

@InProceedings{Koco:2011:BAMCC,
 author={Ko\c{c}o, Sokol and Capponi, C{\'e}cile},
 editor={Gunopulos, Dimitrios and Hofmann, Thomas and Malerba, Donato
         and Vazirgiannis, Michalis},
 title={A Boosting Approach to Multiview Classification with Cooperation},
 booktitle={Proceedings of the 2011 European Conference on Machine Learning
            and Knowledge Discovery in Databases - Volume Part II},
 year={2011},
 location={Athens, Greece},
 publisher={Springer-Verlag},
 address={Berlin, Heidelberg},
 pages={209--228},
 numpages = {20},
 isbn={978-3-642-23783-6},
 url={https://link.springer.com/chapter/10.1007/978-3-642-23783-6_14},
 keywords={boosting, classification, multiview learning,
           supervised learning},
}

@InProceedings{Huu:2019:BAMCC,
 author={Huusari, Riikka and Kadri, Hachem and Capponi, C{\'e}cile},
 editor={},
 title={Multi-view Metric Learning in Vector-valued Kernel Spaces},
 booktitle={arXiv:1803.07821v1},
 year={2018},
 location={Athens, Greece},
 publisher={},
 address={},
 pages={209--228},
 numpages = {12},
 isbn={978-3-642-23783-6},
 url={https://link.springer.com/chapter/10.1007/978-3-642-23783-6_14},
 keywords={boosting, classification, multiview learning,
           metric learning, vector-valued, kernel spaces},
}

References

  • Sokol Koço, Cécile Capponi, "Learning from Imbalanced Datasets with cross-view cooperation", in Linking and Mining Heterogeneous and Multi-view Data, Unsupervised and Semi-Supervised Learning series (series editor M. Emre Celebi), pp. 161-182, Springer.
  • Sokol Koço, Cécile Capponi, "A boosting approach to multiview classification with cooperation", Proceedings of the 2011 European Conference on Machine Learning (ECML), Athens, Greece, pp. 209-228, 2011, Springer-Verlag.
  • Sokol Koço, "Tackling the uneven views problem with cooperation based ensemble learning methods", PhD Thesis, Aix-Marseille Université, 2013.
  • Riikka Huusari, Hachem Kadri and Cécile Capponi, "Multi-View Metric Learning in Vector-Valued Kernel Spaces", International Conference on Artificial Intelligence and Statistics (AISTATS), 2018.

Copyright

Université d'Aix Marseille (AMU) - Centre National de la Recherche Scientifique (CNRS) - Université de Toulon (UTLN).

Copyright © 2017-2018 AMU, CNRS, UTLN

License

scikit-multimodallearn is free software: you can redistribute it and/or modify it under the terms of the New BSD License.
