SemTorch

Last update: Dec 07, 2022

This repository contains definitions of several deep learning architectures that can be applied to image segmentation.

All the architectures are implemented in PyTorch and can be trained easily with FastAI 2.

The Deep-Tumour-Spheroid repository contains an example of how to apply the package to a custom dataset; in that case, images of brain tumours are used.

These architectures are classified as:

Semantic Segmentation: each pixel of an image is linked to a class label.
Instance Segmentation: similar to semantic segmentation, but it goes a step further and identifies, for each pixel, the object instance it belongs to.
Salient Object Detection (binary classes only): detection of the most noticeable/important object in an image.
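
The difference between the three categories is easiest to see on the masks themselves. The following is a minimal sketch on a toy 3x4 image with two objects of the same class; the masks and the `distinct_labels` helper are made up for illustration and are not part of SemTorch.

```python
# Toy 3x4 image with two separate objects of the same class.
# In all masks, 0 means background.

# Semantic mask: each pixel stores a class id (1 = object class).
semantic_mask = [
    [0, 1, 0, 1],
    [0, 1, 0, 1],
    [0, 0, 0, 0],
]

# Instance mask: each pixel stores an object id, so the two objects differ.
instance_mask = [
    [0, 1, 0, 2],
    [0, 1, 0, 2],
    [0, 0, 0, 0],
]

# Salient object mask: binary, marks only the object assumed to be most
# noticeable (here, arbitrarily, the left one).
salient_mask = [
    [0, 1, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]


def distinct_labels(mask):
    """Return the sorted non-background labels found in a 2D mask."""
    return sorted({value for row in mask for value in row if value != 0})


print("semantic labels:", distinct_labels(semantic_mask))
print("instance labels:", distinct_labels(instance_mask))
print("salient labels:", distinct_labels(salient_mask))
```

The semantic mask cannot tell the two objects apart, while the instance mask assigns each one its own id.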

🚀 Getting Started

To start using this package, install it with pip. For example, on Ubuntu:

pip3 install SemTorch
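
One way to confirm that the installation worked is to check that the module can be located by Python. The sketch below uses only the standard library. The module name `semtorch` is taken from the import shown in the usage section, and `is_installed` is a hypothetical helper, not something the package provides.

```python
import importlib.util


def is_installed(module_name):
    """Return True if the import system can locate `module_name`."""
    return importlib.util.find_spec(module_name) is not None


print("semtorch importable:", is_installed("semtorch"))
```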

👩‍💻 Usage

This package provides an abstract API for accessing segmentation models of different architectures. The get_segmentation_learner function returns a FastAI 2 learner that can be combined with all of fastai's functionality.

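# SemTorch
from fastai.vision.all import Dice, JaccardCoeff  # fastai segmentation metrics
from semtorch import get_segmentation_learner

# dls, tumour and segmentron_splitter must be defined beforehand:
# dls is your fastai DataLoaders, tumour a custom metric, segmentron_splitter a custom splitter.
learn = get_segmentation_learner(dls=dls, number_classes=2, segmentation_type="Semantic Segmentation",
                                 architecture_name="deeplabv3+", backbone_name="resnet50",
                                 metrics=[tumour, Dice(), JaccardCoeff()], wd=1e-2,
                                 splitter=segmentron_splitter).to_fp16()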

A more complete example is available in the Deep-Tumour-Spheroid repository, where the package is used to segment brain tumours.
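
The usage example passes the fastai metrics `Dice()` and `JaccardCoeff()`. To show what these two scores measure, here is a minimal pure-Python sketch of the standard definitions on a pair of tiny binary masks. It is not fastai's implementation, which works on batches of tensors. The helper names and the convention of returning 0.0 for two empty masks are assumptions of this sketch.

```python
def overlap_counts(pred, target):
    """Count intersection, predicted and target foreground pixels of two binary masks."""
    inter = pred_sum = target_sum = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += p * t
            pred_sum += p
            target_sum += t
    return inter, pred_sum, target_sum


def dice(pred, target):
    """Dice coefficient: 2 * |A and B| / (|A| + |B|)."""
    inter, pred_sum, target_sum = overlap_counts(pred, target)
    total = pred_sum + target_sum
    return 2 * inter / total if total else 0.0  # 0.0 for two empty masks (sketch convention)


def jaccard(pred, target):
    """Jaccard coefficient (IoU): |A and B| / |A or B|."""
    inter, pred_sum, target_sum = overlap_counts(pred, target)
    union = pred_sum + target_sum - inter
    return inter / union if union else 0.0  # 0.0 for two empty masks (sketch convention)


# Hypothetical prediction and ground truth (1 = foreground, 0 = background).
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]

print("Dice:", dice(pred, target))
print("Jaccard:", jaccard(pred, target))
```

Both scores range from 0 (no overlap) to 1 (perfect overlap), and Dice is never lower than Jaccard for the same pair of masks.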

def get_segmentation_learner(dls, number_classes, segmentation_type, architecture_name, backbone_name,
                             loss_func=None, opt_func=Adam, lr=defaults.lr, splitter=trainable_params, 
                             cbs=None, pretrained=True, normalize=True, image_size=None, metrics=None, 
                             path=None, model_dir='models', wd=None, wd_bn_bias=False, train_bn=True,
                             moms=(0.95,0.85,0.95)):

This function returns a learner for the provided architecture and backbone.

Parameters:

dls (DataLoaders): the fastai DataLoaders to use with the learner
number_classes (int): the number of classes in the project. It should be >= 2
segmentation_type (str): only Semantic Segmentation is accepted for now
architecture_name (str): name of the architecture. The following are supported: unet, deeplabv3+, hrnet, maskrcnn and u2^net
backbone_name (str): name of the backbone
loss_func (): loss function
opt_func (): optimizer function
lr (): learning rate
splitter (): splitter function used for freezing the learner
cbs (List[cb]): list of callbacks
pretrained (bool): whether a pretrained backbone is used
normalize (bool): whether normalization is applied
image_size (int): REQUIRED for MaskRCNN. The desired size of the image.
metrics (List[metric]): list of metrics
path (): path parameter
model_dir (str): the directory in which models are saved
wd (float): weight decay
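wd_bn_bias (bool): whether weight decay is also applied to batch-norm and bias parameters (meaning taken from the fastai Learner argument of the same name)
train_bn (bool): whether batch-norm layers keep training in frozen layer groups (meaning taken from the fastai Learner argument of the same name)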
moms (Tuple[float]): tuple of momentum values

Returns:

learner: the FastAI 2 learner for the requested architecture and backbone
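
Several parameters carry constraints that are only stated in prose. As a rough sketch, they could be checked before building a learner as shown below. `check_learner_args` is a hypothetical helper written for this example; whether SemTorch performs equivalent checks internally is not stated here.

```python
# Architecture names listed in the parameter description.
SUPPORTED_ARCHITECTURES = ("unet", "deeplabv3+", "hrnet", "maskrcnn", "u2^net")


def check_learner_args(number_classes, segmentation_type, architecture_name, image_size=None):
    """Raise ValueError if an argument breaks a constraint from the parameter list."""
    if number_classes < 2:
        raise ValueError("number_classes should be >= 2")
    if segmentation_type != "Semantic Segmentation":
        raise ValueError("only 'Semantic Segmentation' is accepted for now")
    if architecture_name not in SUPPORTED_ARCHITECTURES:
        raise ValueError(f"unsupported architecture: {architecture_name!r}")
    if architecture_name == "maskrcnn" and image_size is None:
        raise ValueError("image_size is required for maskrcnn")
    return True


# Mirrors the arguments of the usage example.
print(check_learner_args(2, "Semantic Segmentation", "deeplabv3+"))
```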

Supported configs

Architecture	Supported config	Backbones
unet	`Semantic Segmentation`, `binary`; `Semantic Segmentation`, `multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `xresnet18`, `xresnet34`, `xresnet50`, `xresnet101`, `xresnet152`, `squeezenet1_0`, `squeezenet1_1`, `densenet121`, `densenet169`, `densenet201`, `densenet161`, `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, `vgg19_bn`, `alexnet`
deeplabv3+	`Semantic Segmentation`, `binary`; `Semantic Segmentation`, `multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `resnet50c`, `resnet101c`, `resnet152c`, `xception65`, `mobilenet_v2`
hrnet	`Semantic Segmentation`, `binary`; `Semantic Segmentation`, `multiple`	`hrnet_w18_small_model_v1`, `hrnet_w18_small_model_v2`, `hrnet_w18`, `hrnet_w30`, `hrnet_w32`, `hrnet_w48`
maskrcnn	`Semantic Segmentation`, `binary`	`resnet50`
u2^net	`Semantic Segmentation`, `binary`	`small`, `normal`
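
The table can also be expressed as a lookup structure, which makes it easy to check a combination before training. The sketch below copies the data from the table. The `is_supported` helper is hypothetical and not part of SemTorch, and it assumes that `binary` corresponds to `number_classes == 2` and `multiple` to `number_classes > 2`.

```python
# Backbones per architecture, copied from the supported configs table.
SUPPORTED_BACKBONES = {
    "unet": [
        "resnet18", "resnet34", "resnet50", "resnet101", "resnet152",
        "xresnet18", "xresnet34", "xresnet50", "xresnet101", "xresnet152",
        "squeezenet1_0", "squeezenet1_1",
        "densenet121", "densenet169", "densenet201", "densenet161",
        "vgg11_bn", "vgg13_bn", "vgg16_bn", "vgg19_bn", "alexnet",
    ],
    "deeplabv3+": [
        "resnet18", "resnet34", "resnet50", "resnet101", "resnet152",
        "resnet50c", "resnet101c", "resnet152c", "xception65", "mobilenet_v2",
    ],
    "hrnet": [
        "hrnet_w18_small_model_v1", "hrnet_w18_small_model_v2",
        "hrnet_w18", "hrnet_w30", "hrnet_w32", "hrnet_w48",
    ],
    "maskrcnn": ["resnet50"],
    "u2^net": ["small", "normal"],
}

# Architectures listed with the binary config only.
BINARY_ONLY = {"maskrcnn", "u2^net"}


def is_supported(architecture_name, backbone_name, number_classes=2):
    """Return True if the combination appears in the supported configs table."""
    backbones = SUPPORTED_BACKBONES.get(architecture_name)
    if backbones is None or backbone_name not in backbones:
        return False
    if number_classes > 2 and architecture_name in BINARY_ONLY:
        return False
    return number_classes >= 2


print(is_supported("deeplabv3+", "resnet50"))
print(is_supported("maskrcnn", "resnet50", number_classes=3))
```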

📩 Contact

📧 [email protected]

💼 LinkedIn: David Lacalle Castillo

Owner

David Lacalle Castillo
