Model Training as a CI/CD System

This project demonstrates the machine model training as a CI/CD system in GCP platform. You will see more detailed workflow in the below section, but it is about rebuilding and redeploying (continuous integration) the currently deployed machine learning pipeline based on changes in code. Such changes could happen in the training data, data pre-processing logic, model architecture and training code, custom pipeline components, and so on.

Workflow #1

We create initial code, or we make some changes in the existing codebase for pipeline.
Based on the changes in the step 2, a GitHub action gets triggered to initiate a Cloud Build process.
The Cloud Build runs unit tests to see if those components work without errors.
If there is no error at all, there are two common sub-workflows from this point.
- Cloud Build containerizes the current codebase. This is an optional step. If you have any custom components unchanges, this step might be omitted.
  - The Cloud Build compiles a new pipeline. It creates an updated docker image, and it uploads the new docker image to GCR
- If there is any codes changed in data preprocessing, modeling, training steps, we only have to upload those source files to designated GCS bucket
The final step of the Cloud Build is to execute a pipeline run on Vertex AI

Workflow #2

Workflow in a nutshell

We create initial code, or we make some changes in the existing codebase for modules.
Based on the changes in the step 2, a GitHub action gets triggered to initiate a Cloud Build process.
The Cloud Build runs unit tests to see if those components work without errors.
If there is no error at all, there are two common sub-workflows from this point.
- If there is any codes changed in data preprocessing and models, we only have to upload those source files to designated GCS bucket.
The final step of the Cloud Build is to execute a pipeline run on Vertex AI. Trainer and Transform TFX components will look up the changed modules accordingly.

Acknowledgements

ML-GDE program for providing GCP credits.

Demonstration of the Model Training as a CI/CD System in Vertex AI

Related tags

Overview

Model Training as a CI/CD System

Workflow #1

Workflow #2

Workflow in a nutshell

Acknowledgements

Owner

Chansung Park

Interactive Image Generation via Generative Adversarial Networks

A concise but complete implementation of CLIP with various experimental improvements from recent papers

Unofficial implementation of the Involution operation from CVPR 2021

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

LUKE -- Language Understanding with Knowledge-based Embeddings

Keras code and weights files for popular deep learning models.

Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)

Anchor Retouching via Model Interaction for Robust Object Detection in Aerial Images

CSD: Consistency-based Semi-supervised learning for object Detection

A configurable, tunable, and reproducible library for CTR prediction

AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation

This respository includes implementations on Manifoldron: Direct Space Partition via Manifold Discovery

Source codes for "Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs"

This is a simple framework to make object detection dataset very quickly

Funnels: Exact maximum likelihood with dimensionality reduction.

Graph Posterior Network: Bayesian Predictive Uncertainty for Node Classification (NeurIPS 2021)

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

Code to reproduce the results for Compositional Attention

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

Implementation of TimeSformer, a pure attention-based solution for video classification