Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.

Overview

Monk - A computer vision toolkit for everyone Tweet

Version Build_Status


Why use Monk

  • Issue: Want to begin learning computer vision

    • Solution: Start with Monk's hands-on study roadmap tutorials
  • Issue: Multiple libraries hence multiple syntaxes to learn

    • Solution: Monk's one syntax to rule them all - pytorch, keras, mxnet, etc
  • Issue: Tough to keep track of all the trial projects while participating in a deep learning competition

    • Solution: Use monk's project management and work on multiple prototyping experiments
  • Issue: Tough to set hyper-parameters while training a classifier

    • Solution: Try out hyper-parameter analyser to find the right fit
  • Issue: Looking for a library to build quick solutions for your customer

    • Solution: Train, Infer and deploy with monk's low-code syntax


Create real-world Image Classification applications

Medical Domain Fashion Domain Autonomous Vehicles Domain
Agriculture Domain Wildlife Domain Retail Domain
Satellite Domain Healthcare Domain Activity Analysis Domain

...... For more check out the Application Model Zoo!!!!



How does Monk make image classification easy

  • Write less code and create end to end applications.
  • Learn only one syntax and create applications using any deep learning library - pytorch, mxnet, keras, tensorflow, etc
  • Manage your entire project easily with multiple experiments


For whom this library is built

  • Students
    • Seamlessly learn computer vision using our comprehensive study roadmaps
  • Researchers and Developers
    • Create and Manage multiple deep learning projects
  • Competiton participants (Kaggle, Codalab, Hackerearth, AiCrowd, etc)
    • Expedite the prototyping process and jumpstart with a higher rank


Table of Contents




Sample Showcase - Quick Mode

Create an image classifier.

#Create an experiment
ptf.Prototype("sample-project-1", "sample-experiment-1")

#Load Data
ptf.Default(dataset_path="sample_dataset/", 
             model_name="resnet18", 
             num_epochs=2)
# Train
ptf.Train()

Inference

predictions = ptf.Infer(img_name="sample.png", return_raw=True);

Compare Experiments

#Create comparison project
ctf.Comparison("Sample-Comparison-1");

#Add all your experiments
ctf.Add_Experiment("sample-project-1", "sample-experiment-1");
ctf.Add_Experiment("sample-project-1", "sample-experiment-2");
   
# Generate statistics
ctf.Generate_Statistics();



Installation

  • CUDA 9.0          : pip install -U monk-cuda90
  • CUDA 9.0          : pip install -U monk-cuda92
  • CUDA 10.0        : pip install -U monk-cuda100
  • CUDA 10.1        : pip install -U monk-cuda101
  • CUDA 10.2        : pip install -U monk-cuda102
  • CPU (+Mac-OS) : pip install -U monk-cpu
  • Google Colab   : pip install -U monk-colab
  • Kaggle              : pip install -U monk-kaggle

For More Installation instructions visit: Link




Study Roadmaps




Documentation




TODO-2020

Features

  • Model Visualization
  • Pre-processed data visualization
  • Learned feature visualization
  • NDimensional data input - npy - hdf5 - dicom - tiff
  • Multi-label Image Classification
  • Custom model development

General

  • Functional Documentation
  • Tackle Multiple versions of libraries
  • Add unit-testing
  • Contribution guidelines
  • Python pip packaging support

Backend Support

  • Tensorflow 2.0 provision support with v1
  • Tensorflow 2.0 complete
  • Chainer

External Libraries

  • TensorRT Acceleration
  • Intel Acceleration
  • Echo AI - for Activation functions


Connect with the project contributors



Copyright

Copyright 2019 onwards, Tessellate Imaging Private Limited Licensed under the Apache License, Version 2.0 (the "License"); you may not use this project's files except in compliance with the License. A copy of the License is provided in the LICENSE file in this repository.

Owner
Tessellate Imaging
Computer Vision and Deep Learning Consultance and Development
Tessellate Imaging
Source code and dataset for ACL2021 paper: "ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning".

ERICA Source code and dataset for ACL2021 paper: "ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive L

THUNLP 75 Nov 02, 2022
Multi-resolution SeqMatch based long-term Place Recognition

MRS-SLAM for long-term place recognition In this work, we imply an multi-resolution sambling based visual place recognition method. This work is based

METASLAM 6 Dec 06, 2022
PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos

PyKale is a PyTorch library for multimodal learning and transfer learning as well as deep learning and dimensionality reduction on graphs, images, texts, and videos. By adopting a unified pipeline-ba

PyKale 370 Dec 27, 2022
ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss (HDCWNet)

ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss (HDCWNet) (

Wei-Ting Chen 49 Dec 27, 2022
Offical implementation of Shunted Self-Attention via Multi-Scale Token Aggregation

Shunted Transformer This is the offical implementation of Shunted Self-Attention via Multi-Scale Token Aggregation by Sucheng Ren, Daquan Zhou, Shengf

156 Dec 27, 2022
Real time Human Detection Counting

In this python project, we are going to build the Human Detection and Counting System through Webcam or you can give your own video or images. This is a deep learning project on computer vision, whic

Mir Nawaz Ahmad 2 Jun 17, 2022
Pytorch implementation of the paper "Topic Modeling Revisited: A Document Graph-based Neural Network Perspective"

Graph Neural Topic Model (GNTM) This is the pytorch implementation of the paper "Topic Modeling Revisited: A Document Graph-based Neural Network Persp

Dazhong Shen 8 Sep 14, 2022
Identify the emotion of multiple speakers in an Audio Segment

MevonAI - Speech Emotion Recognition Identify the emotion of multiple speakers in a Audio Segment Report Bug · Request Feature Try the Demo Here Table

Suyash More 110 Dec 03, 2022
An Api for Emotion recognition.

PLAYEMO Playemo was built from the ground-up with Flask, a python tool that makes it easy for developers to build APIs. Use Cases Is Python your langu

greek geek 2 Jul 16, 2022
[CVPR 2021] Pytorch implementation of Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs

Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs In this work, we propose a framework HijackGAN, which enables non-linear latent space travers

Hui-Po Wang 46 Sep 05, 2022
Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021

Hierarchical reinforcement learning with Timed Subgoals (HiTS) This repository contains code for reproducing experiments from our paper "Hierarchical

Autonomous Learning Group 21 Dec 03, 2022
Predicting future trajectories of people in cameras of novel scenarios and views.

Pedestrian Trajectory Prediction Predicting future trajectories of pedestrians in cameras of novel scenarios and views. This repository contains the c

8 Sep 03, 2022
Numerical-computing-is-fun - Learning numerical computing with notebooks for all ages.

As much as this series is to educate aspiring computer programmers and data scientists of all ages and all backgrounds, it is also a reminder to mysel

EKA foundation 758 Dec 25, 2022
Implementation of PersonaGPT Dialog Model

PersonaGPT An open-domain conversational agent with many personalities PersonaGPT is an open-domain conversational agent cpable of decoding personaliz

ILLIDAN Lab 42 Jan 01, 2023
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Realtime Multi-Person Pose Estimation By Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh. Introduction Code repo for winning 2016 MSCOCO Keypoints Cha

Zhe Cao 4.9k Dec 31, 2022
Code of paper "CDFI: Compression-Driven Network Design for Frame Interpolation", CVPR 2021

CDFI (Compression-Driven-Frame-Interpolation) [Paper] (Coming soon...) | [arXiv] Tianyu Ding*, Luming Liang*, Zhihui Zhu, Ilya Zharkov IEEE Conference

Tianyu Ding 95 Dec 04, 2022
EfficientNetv2 TensorRT int8

EfficientNetv2_TensorRT_int8 EfficientNetv2模型实现来自https://github.com/d-li14/efficientnetv2.pytorch 环境配置 ubuntu:18.04 cuda:11.0 cudnn:8.0 tensorrt:7

34 Apr 24, 2022
Multiview Dataset Toolkit

Multiview Dataset Toolkit Using multi-view cameras is a natural way to obtain a complete point cloud. However, there is to date only one multi-view 3D

11 Dec 22, 2022
Doods2 - API for detecting objects in images and video streams using Tensorflow

DOODS2 - Return of DOODS Dedicated Open Object Detection Service - Yes, it's a b

Zach 101 Jan 04, 2023
A hyperparameter optimization framework

Optuna: A hyperparameter optimization framework Website | Docs | Install Guide | Tutorial Optuna is an automatic hyperparameter optimization software

7.4k Jan 04, 2023