Model factory is a ML training platform to help engineers to build ML models at scale

Last update: Sep 23, 2022

Related tags

Overview

Model Factory

Machine learning today is powering many businesses today, e.g., search engine, e-commerce, news or feed recommendation. Training high quality ML models is critical to all of these systems.

However, training a model is not trivial. Traditionally, engineers use single devvm to train models. It might be doable if you were only to build a few models. If you are interested in exploring hundreds or even thousands of ideas, repeating the workflow manually will be a painful process.

There are many issues with the above workflow:

Hard to scale
No tracking
No monitor
No end-to-end automation
Not easy to share with others
No centralized model management

The above pain points really slows engineers down when they are developing their ML models. Model factory is a project that targets at addressing the above issues.

Background

There are existing work in the industry which tries to address the above issues as well, e.g., Facebook fblearner, Google Kubeflow.

The key difference between model factory and other projects is that model factory promotes a pure python based authoring experience, while most others uses DAG (Directed Acyclic Graph). The philosophy gives model factory the following advantages:

Easy to learn: there is almost no learning curve. As long as you know how to write python, you know how to use model factory.
More flexible: control flow logic can be easily implemented on it.
Allow communication between nodes: free form communication can be done between operators, which opens up the possibility of building distributed training on top of model factory.

Installation

Please follow the Installation page to deploy model factory in your production or testing environment.

Development Guide

Please follow the Development Guide page to try out your first model factory pipeline.

Model factory is a ML training platform to help engineers to build ML models at scale

Related tags

Overview

Model Factory

Background

Installation

Development Guide

Owner

A Lightweight Hyperparameter Optimization Tool 🚀

Combines Bayesian analyses from many datasets.

🔬 A curated list of awesome machine learning strategies & tools in financial market.

A Lucid Framework for Transparent and Interpretable Machine Learning Models.

This is my implementation on the K-nearest neighbors algorithm from scratch using Python

Test symmetries with sklearn decision tree models

Bayesian optimization in JAX

Land Cover Classification Random Forest

Xeasy-ml is a packaged machine learning framework.

InfiniteBoost: building infinite ensembles with gradient descent

Stats, linear algebra and einops for xarray

Time series forecasting with PyTorch

Python factor analysis library (PCA, CA, MCA, MFA, FAMD)

Free MLOps course from DataTalks.Club

Python-based implementations of algorithms for learning on imbalanced data.

Datetimes for Humans™

Evaluate on three different ML model for feature selection using Breast cancer data.

Learning --> Numpy January 2022 - winter'22

Breast-Cancer-Classification - Using SKLearn breast cancer dataset which contains 569 examples and 32 features classifying has been made with 6 different algorithms

This handbook accompanies the course: Machine Learning with Hung-Yi Lee