SemTorch

Last update: Dec 07, 2022

Related tags

Overview

SemTorch

This repository contains different deep learning architectures definitions that can be applied to image segmentation.

All the architectures are implemented in PyTorch and can been trained easily with FastAI 2.

In Deep-Tumour-Spheroid repository can be found and example of how to apply it with a custom dataset, in that case brain tumours images are used.

These architectures are classified as:

Semantic Segmentation: each pixel of an image is linked to a class label.
Instance Segmentation: is similar to semantic segmentation, but goes a bit deeper, it identifies , for each pixel, the object instance it belongs to.
Salient Object Detection (Binary clases only): detection of the most noticeable/important object in an image.

🚀 Getting Started

To start using this package, install it using pip:

For example, for installing it in Ubuntu use:

pip3 install SemTorch

👩‍💻 Usage

This package creates an abstract API to access a segmentation model of different architectures. This method returns a FastAI 2 learner that can be combined with all the fastai's functionalities.

# SemTorch
from semtorch import get_segmentation_learner

learn = get_segmentation_learner(dls=dls, number_classes=2, segmentation_type="Semantic Segmentation",
                                 architecture_name="deeplabv3+", backbone_name="resnet50", 
                                 metrics=[tumour, Dice(), JaccardCoeff()],wd=1e-2,
                                 splitter=segmentron_splitter).to_fp16()

You can find a deeper example in Deep-Tumour-Spheroid repository, in this repo the package is used for the segmentation of brain tumours.

def get_segmentation_learner(dls, number_classes, segmentation_type, architecture_name, backbone_name,
                             loss_func=None, opt_func=Adam, lr=defaults.lr, splitter=trainable_params, 
                             cbs=None, pretrained=True, normalize=True, image_size=None, metrics=None, 
                             path=None, model_dir='models', wd=None, wd_bn_bias=False, train_bn=True,
                             moms=(0.95,0.85,0.95)):

This function return a learner for the provided architecture and backbone

Parameters:

dls (DataLoader): the dataloader to use with the learner
number_classes (int): the number of clases in the project. It should be >=2
segmentation_type (str): just Semantic Segmentation accepted for now
architecture_name (str): name of the architecture. The following ones are supported: unet, deeplabv3+, hrnet, maskrcnn and u2^net
backbone_name (str): name of the backbone
loss_func (): loss function.
opt_func (): opt function.
lr (): learning rates
splitter (): splitter function for freazing the learner
cbs (List[cb]): list of callbacks
pretrained (bool): it defines if a trained backbone is needed
normalize (bool): if normalization is applied
image_size (int): REQUIRED for MaskRCNN. It indicates the desired size of the image.
metrics (List[metric]): list of metrics
path (): path parameter
model_dir (str): the path in which save models
wd (float): wieght decay
wd_bn_bias (bool):
train_bn (bool):
moms (Tuple(float)): tuple of different momentuns

Returns:

learner: value containing the learner object

Supported configs

Architecture	supported config	backbones
unet	`Semantic Segmentation`,`binary` `Semantic Segmentation`,`multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `xresnet18`, `xresnet34`, `xresnet50`, `xresnet101`, `xresnet152`, `squeezenet1_0`, `squeezenet1_1`, `densenet121`, `densenet169`, `densenet201`, `densenet161`, `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, `vgg19_bn`, `alexnet`
deeplabv3+	`Semantic Segmentation`,`binary` `Semantic Segmentation`,`multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `resnet50c`, `resnet101c`, `resnet152c`, `xception65`, `mobilenet_v2`
hrnet	`Semantic Segmentation`,`binary` `Semantic Segmentation`,`multiple`	`hrnet_w18_small_model_v1`, `hrnet_w18_small_model_v2`, `hrnet_w18`, `hrnet_w30`, `hrnet_w32`, `hrnet_w48`
maskrcnn	`Semantic Segmentation`,`binary`	`resnet50`
u2^net	`Semantic Segmentation`,`binary`	`small`, `normal`

📩 Contact

📧 [email protected]

💼 Linkedin David Lacalle Castillo

SemTorch

Related tags

Overview

SemTorch

🚀 Getting Started

👩‍💻 Usage

Parameters:

Returns:

Supported configs

📩 Contact

Owner

David Lacalle Castillo

This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

🔎 Like Chardet. 🚀 Package for encoding & language detection. Charset detection.

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

Dirty, ugly, and hopefully useful OCR of Facebook Papers docs released by Gizmodo

Recognizing cropped text in natural images.

A curated list of papers, code and resources pertaining to image composition

Characterizing possible failure modes in physics-informed neural networks.

Read Japanese manga inside browser with selectable text.

A machine learning software for extracting information from scholarly documents

Educational application aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using a variety of CV technologies in the backend such as OpenCV, PyAutoGUI and EasyOCR and a frontend coded in Typescript.

Single Shot Text Detector with Regional Attention

Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

Controlling Volume by Hand Gestures

Neural search engine for AI papers

Text-to-Image generation

A simple python program to record security cam footage by detecting a face and body of a person in the frame.

Captcha Recognition