Navigating StyleGAN2 w latent space using CLIP

An attempt to build something with the official StyleGAN2-ADA PyTorch implementation, loosely inspired by "Generating Images from Prompts using CLIP and StyleGAN" and based on the original projector.py.

Things learned:

It is better to generate the initial w values from a well-converged sample than to start from random or median ones.
Optimizing both w and the noise inputs works better than optimizing w alone.
The default values of 0.02 for the learning rate and noise work fine for portraits.

Quick start

Clone the StyleGAN2-ADA PyTorch repo, copy the clip directory from the CLIP repo, and install PyTorch 1.7.1 and the remaining dependencies.
Pick a suitable StyleGAN2 network pickle (e.g. FFHQ).
Pick a seed.
Run:
python3 approach.py --network network-snapshot-ffhq.pkl --outdir project --num-steps 100 --text 'an image of a girl with a face resembling Paul Krugman' --psi 0.8 --seed 12345
Alternatively, start from a w vector stored as an .npz file:
python3 approach.py --network network-snapshot-ffhq.pkl --outdir project --num-steps 100 --text 'an image of a girl with a face resembling Paul Krugman' --w w-7660ca0b7e95428cac94c89459b5cebd8a7acbd4.npz
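
For the .npz starting point, here is a minimal sketch of how such a file might be written and read back with NumPy. It assumes the same convention as the original projector.py, which stores the latent under the key "w" with shape [1, num_ws, w_dim]; whether approach.py expects exactly this layout is an assumption, and the file name and the 18 x 512 shape (typical for a 1024x1024 FFHQ network) are only illustrative.

```python
import os
import tempfile

import numpy as np

# Assumed shape for a 1024x1024 FFHQ network: 18 style layers, 512-dim w.
num_ws, w_dim = 18, 512

# Stand-in latent; a real one would come from a previous run or a projection.
w = np.random.RandomState(0).randn(1, num_ws, w_dim).astype(np.float32)

# Hypothetical file name, written to a temporary directory.
path = os.path.join(tempfile.mkdtemp(), "w-example.npz")
np.savez(path, w=w)  # key "w" follows the projector.py convention (assumed here)

# Read the latent back; this array is what would seed the optimization.
with np.load(path) as data:
    w_loaded = data["w"]

print(w_loaded.shape, w_loaded.dtype)
```

Checking the shape and dtype before passing the file to --w is a cheap way to catch a latent saved from a network with a different resolution.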

FFHQ test

python3 approach.py --network stylegan2-ffhq-config-f.pkl --outdir ffhq --num-steps 100 --text 'an image of an Instagram influencer girl' --psi 0.7 --seed 32
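
The --seed and --psi options above presumably play the same role as in the StyleGAN2-ADA scripts: the seed fixes the random z vector, and psi is the truncation factor that pulls the mapped w toward the average latent. The sketch below illustrates the truncation step only. The random array stands in for the mapping network output and w_avg is a made-up average, so both are assumptions rather than what approach.py actually computes.

```python
import numpy as np


def truncate(w, w_avg, psi):
    """Interpolate w toward w_avg: psi=1 keeps w, psi=0 collapses to w_avg."""
    return w_avg + psi * (w - w_avg)


seed, psi, w_dim = 32, 0.7, 512

# Seeded random vector; in the real pipeline z would be passed through the
# generator's mapping network to obtain w. Here it is used as w directly.
rng = np.random.RandomState(seed)
w = rng.randn(1, w_dim)

# Hypothetical average latent (the real one is tracked during training).
w_avg = np.full((1, w_dim), 0.5)

w_trunc = truncate(w, w_avg, psi)

# The distance to the average shrinks by exactly the factor psi.
ratio = np.linalg.norm(w_trunc - w_avg) / np.linalg.norm(w - w_avg)
print(round(float(ratio), 3))
```

Lower psi values give a starting point closer to the average face, which trades diversity for fidelity.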

Owner

Mike K.
