A simple library that implements CLIP guided loss in PyTorch.

Last update: Dec 26, 2022

Overview

pytorch_clip_guided_loss: PyTorch implementation of the CLIP guided loss for Text-To-Image, Image-To-Image, or Image-To-Text generation.

Install package

pip install pytorch_clip_guided_loss

Install the latest version

pip install --upgrade git+https://github.com/bes-dev/pytorch_clip_guided_loss.git

Features

The library supports multiple prompts (images or texts) as targets for optimization.
The library automatically detects the language of the input text and translates it via Google Translate, so prompts can be written in multiple languages.
The library supports the original CLIP model by OpenAI and the ruCLIP model by SberAI.
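To make the idea of several prompts acting as optimization targets more concrete, here is a minimal sketch of one plausible way such a loss could be formed: the mean cosine distance between an image embedding and a set of prompt embeddings. This is an assumption for illustration only, not the library's actual formula. The function name multi_prompt_loss is hypothetical, and the random tensors stand in for real CLIP embeddings.

```python
import torch
import torch.nn.functional as F


def multi_prompt_loss(image_emb, prompt_embs):
    """Mean cosine distance between one image embedding and several prompt embeddings.

    image_emb:   tensor of shape (1, D)
    prompt_embs: tensor of shape (P, D), one row per text or image prompt
    NOTE: hypothetical helper, not part of pytorch_clip_guided_loss.
    """
    # Normalize so that the dot product equals the cosine similarity.
    image_emb = F.normalize(image_emb, dim=-1)
    prompt_embs = F.normalize(prompt_embs, dim=-1)
    # Cosine similarity of the image to every prompt: shape (P,)
    sims = (prompt_embs @ image_emb.t()).squeeze(-1)
    # Cosine distance, averaged over all prompts.
    return (1.0 - sims).mean()


torch.manual_seed(0)
# Random stand-ins for CLIP embeddings (assumed dimensionality of 512).
image_emb = torch.randn(1, 512, requires_grad=True)
prompt_embs = torch.randn(3, 512)

loss = multi_prompt_loss(image_emb, prompt_embs)
loss.backward()
print(loss.item(), image_emb.grad.shape)
```

Averaging keeps the loss on the same scale regardless of how many prompts are added; a real implementation might weight prompts differently.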

Usage

Simple code

import torch
from pytorch_clip_guided_loss import get_clip_guided_loss

# Build the loss module and freeze it so that only the input image receives gradients
loss_fn = get_clip_guided_loss(clip_type="ruclip", input_range=(-1, 1)).eval().requires_grad_(False)
# Text prompt
loss_fn.add_prompt(text="text description of what we would like to generate")
# Image prompt
loss_fn.add_prompt(image=torch.randn(1, 3, 224, 224))

# Image variable to optimize
var = torch.randn(1, 3, 224, 224).requires_grad_(True)
loss = loss_fn(image=var)["loss"]
loss.backward()
print(var.grad)
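The snippet above computes a single gradient. As a rough sketch of how that gradient might drive an actual optimization, the loop below repeatedly updates the image with Adam. It assumes only what the example shows: a callable that accepts image=... and returns a dict with a "loss" entry. The names optimize_image and standin_loss are hypothetical, and standin_loss is a simple placeholder so the sketch runs without downloading CLIP weights. The optimizer, learning rate, and clamping to (-1, 1) are illustrative choices, not settings recommended by the library.

```python
import torch


def optimize_image(loss_fn, steps=50, lr=0.1, shape=(1, 3, 224, 224)):
    """Optimize a random image against loss_fn(image=...)["loss"] (hypothetical helper)."""
    var = torch.randn(*shape).requires_grad_(True)
    optimizer = torch.optim.Adam([var], lr=lr)
    history = []
    for _ in range(steps):
        optimizer.zero_grad()
        loss = loss_fn(image=var)["loss"]
        loss.backward()
        optimizer.step()
        # Keep pixel values inside the (-1, 1) input range assumed above.
        with torch.no_grad():
            var.clamp_(-1.0, 1.0)
        history.append(loss.item())
    return var.detach(), history


def standin_loss(image):
    """Placeholder for the CLIP guided loss: mean squared distance to a zero image."""
    return {"loss": (image ** 2).mean()}


torch.manual_seed(0)
image, history = optimize_image(standin_loss, steps=20, shape=(1, 3, 8, 8))
print(history[0], history[-1])
```

With the real library, the loss_fn object built in the example above would presumably be passed in place of standin_loss.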

VQGAN-CLIP

We provide a tiny implementation of the VQGAN-CLIP pipeline for image generation as an example of how to use the library. To start using it, please follow the documentation.

Owner

Sergei Belousov
