The AugNet Python module contains functions for the fast computation of image similarity.

Last update: Dec 28, 2022

Overview

AugNet

AugNet: End-to-End Unsupervised Visual Representation Learning with Image Augmentation (arXiv:2106.06250)

We propose AugNet, a deep learning training paradigm that learns image features from a collection of unlabeled pictures. The method expresses the similarity between pictures as a distance in the embedding space by leveraging the inter-correlation between augmented versions of the samples. Our experiments show that it represents images in a low-dimensional space and performs competitively on downstream tasks such as image classification and image similarity comparison. Unlike many deep-learning-based image retrieval algorithms, our approach does not need external annotated datasets to train the feature extractor, yet it shows comparable or even better feature representation ability and is easy to use.

Install

pip install imgsim

Usage

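import imgsim
import cv2

# Create the vectorizer that turns images into embedding vectors
vtr = imgsim.Vectorizer()

# Read the two images to compare (cv2.imread returns None if a file cannot be read)
img0 = cv2.imread("img0.png")
img1 = cv2.imread("img1.png")

# Compute one embedding per image
vec0 = vtr.vectorize(img0)
vec1 = vtr.vectorize(img1)

# A smaller distance between embeddings means the images are more similar
dist = imgsim.distance(vec0, vec1)
print("distance =", dist)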

Image Comparison Examples:

Please download the STL10 dataset from: https://cs.stanford.edu/~acoates/stl10/ and put the files under "./data/stl10_binary".

Please download the pretrained model from: https://drive.google.com/file/d/1pV3EBZPDDc3z_YKdRJu6ZBF5yn_IHhsK/view?usp=sharing and put the .pth file under "./models".

Run "res34_model_training_with_STL.py" if you would like to train your own model. Run "kmeans_demo.ipynb" to test with K-Means clustering.
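For readers who want to see what clustering embeddings involves before opening the notebook, here is a minimal sketch of plain K-Means (Lloyd's algorithm) in NumPy. It is not the code from "kmeans_demo.ipynb"; the function name `kmeans`, its parameters, and the toy 2-D vectors are assumptions made for illustration, and real embeddings would come from the trained model.

```python
import numpy as np

def kmeans(vectors, k, n_iter=20, seed=0):
    """Cluster row vectors into k groups with Lloyd's algorithm (illustrative sketch)."""
    vectors = np.asarray(vectors, dtype=np.float64)
    rng = np.random.default_rng(seed)
    # Initialise the centers with k distinct input vectors (fancy indexing copies them).
    centers = vectors[rng.choice(len(vectors), size=k, replace=False)]
    for _ in range(n_iter):
        # Assignment step: each vector goes to its nearest center (Euclidean distance).
        dists = np.linalg.norm(vectors[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update step: move each center to the mean of its members, if it has any.
        for j in range(k):
            members = vectors[labels == j]
            if len(members) > 0:
                centers[j] = members.mean(axis=0)
    return labels, centers

# Toy stand-ins for image embeddings: two well-separated groups of three points.
toy_vectors = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]])
labels, centers = kmeans(toy_vectors, k=2)
print(labels)
```

Which group receives label 0 depends on the random initialisation, so only the grouping itself is meaningful.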

The following are some image comparison examples, covering Paris6k, anime illustrations, Pokémon, and human sketches. The leftmost image in each row is the query. The remaining images are the top-K most similar images that the algorithm found in the dataset, ranked by the distance between their embeddings and the query's embedding.
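The top-K ranking described above can be sketched as a nearest-neighbour search over precomputed embeddings. This is a minimal illustration, not the repository's retrieval code: the helper name `top_k_similar` is hypothetical, the toy 2-D vectors stand in for real embeddings, and Euclidean distance is an assumed metric.

```python
import numpy as np

def top_k_similar(query_vec, gallery_vecs, k=5):
    """Return indices and distances of the k gallery embeddings nearest to the query."""
    query_vec = np.asarray(query_vec, dtype=np.float64).ravel()
    gallery_vecs = np.asarray(gallery_vecs, dtype=np.float64)
    # Flatten each gallery embedding to one row so shapes line up with the query.
    gallery_vecs = gallery_vecs.reshape(len(gallery_vecs), -1)
    # Euclidean distance from the query to every gallery embedding (assumed metric).
    dists = np.linalg.norm(gallery_vecs - query_vec, axis=1)
    k = min(k, len(dists))
    # Sort ascending: the smallest distance is the most similar image.
    order = np.argsort(dists, kind="stable")[:k]
    return order, dists[order]

# Toy 2-D vectors standing in for image embeddings.
gallery = np.array([[0.0, 0.0], [1.0, 0.0], [5.0, 5.0], [0.1, 0.1]])
query = np.array([0.0, 0.2])
idx, d = top_k_similar(query, gallery, k=2)
print(idx, d)
```

In practice the gallery rows would be embeddings such as those returned by `vtr.vectorize` in the Usage section. Whether `imgsim.distance` uses the same metric is not stated here, so treat the choice of Euclidean distance as an assumption.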

If you find this work useful, please cite:

@misc{chen2021augnet,
    title={AugNet: End-to-End Unsupervised Visual Representation Learning with Image Augmentation},
    author={Mingxiang Chen and Zhanguo Chang and Haonan Lu and Bitao Yang and Zhuang Li and Liufang Guo and Zhecheng Wang},
    year={2021},
    eprint={2106.06250},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

TODO:

batch vectorization
multi-GPU support
