Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Last update: Dec 19, 2022

Related tags

Overview

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Abstract: We introduce a method that allows to automatically segment images into semantically meaningful regions without human supervision. Derived regions are consistent across different images and coincide with human-defined semantic classes on some datasets. In cases where semantic regions might be hard for human to define and consistently label, our method is still able to find meaningful and consistent semantic classes. In our work, we use pretrained StyleGAN2 generative model: clustering in the feature space of the generative model allows to discover semantic classes. Once classes are discovered, a synthetic dataset with generated images and corresponding segmentation masks can be created. After that a segmentation model is trained on the synthetic dataset and is able to generalize to real images. Additionally, by using CLIP we are able to use prompts defined in a natural language to discover some desired semantic classes. We test our method on publicly available datasets and show state-of-the-art results.

This repository contains the official Pytorch implementation of the following paper:

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP
Daniil Pakhomov, Sanchit Hira, Narayani Wagle, Kemar E. Green, Nassir Navab
https://arxiv.org/abs/2107.12518

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Related tags

Overview

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Owner

Daniil Pakhomov

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

Code for layerwise detection of linguistic anomaly paper (ACL 2021)

SemiNAS: Semi-Supervised Neural Architecture Search

salabim - discrete event simulation in Python

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

A PyTorch-based library for semi-supervised learning

This package contains deep learning models and related scripts for RoseTTAFold

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

Simple Pixelbot for Diablo 2 Resurrected written in python and opencv.

Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments

Official implementation of the paper Momentum Capsule Networks (MoCapsNet)

BboxToolkit is a tiny library of special bounding boxes.

TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

TensorRT examples (Jetson, Python/C++)(object detection)

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

In this project, two programs can help you take full agvantage of time on the model training with a remote server

A embed able annotation tool for end to end cross document co-reference

Adjusting for Autocorrelated Errors in Neural Networks for Time Series

Spatial Contrastive Learning for Few-Shot Classification (SCL)

Atomistic Line Graph Neural Network