H&M Fashion Image similarity search with Weaviate and DocArray

This example shows how to do image similarity search using DocArray and Weaviate as Document Store.

How to use the notebook

This repository includes sample data, but you can download the full dataset from Kaggle (see below).

Before you start using the notebook, you need to start a Weaviate instance, by running docker compose up. Weaviate will be running on http://localhost:8080. Alternatively,, you can start a Weaviate instance for free with WCS: Weaviate Cloud Service. Make sure you adapt the Weaviate server in the notebook accordingly.

How to download data (optional - sample data included in this repository)

You need to download the fashion image data from the H&M dataset on Kaggle. You can download it and put in the right folder using:

$ mkdir data
& cd data
$ kaggle competitions download -c h-and-m-personalized-fashion-recommendations
$ unzip h-and-m-personalized-fashion-recommendations.zip

Optional: you can use resize_image.py to downscale the images before using them in the notebook.

Make sure to adapt the file location in the notebook.

Install requirements

The requirements will be installed in the first cell of the notebook. Alternatively, you can run pip install -r requirements.txt.

Embed, store and query

You can run the Jupyter notebook to embed, store and query fashion image data using ResNet50, DocArray and Weaviate.

H&M Fashion Image similarity search with Weaviate and DocArray

Related tags

Overview

H&M Fashion Image similarity search with Weaviate and DocArray

How to use the notebook

How to download data (optional - sample data included in this repository)

Install requirements

Embed, store and query

Owner

Laura Ham

Domain Generalization with MixStyle, ICLR'21.

Fashion Entity Classification

Medical image analysis framework merging ANTsPy and deep learning

Deep Text Search is an AI-powered multilingual text search and recommendation engine with state-of-the-art transformer-based multilingual text embedding (50+ languages).

Code for testing convergence rates of Lipschitz learning on graphs

N-Omniglot is a large neuromorphic few-shot learning dataset

A simple, unofficial implementation of MAE using pytorch-lightning

Pytorch implementation of the paper Progressive Growing of Points with Tree-structured Generators (BMVC 2021)

Traductor de lengua de señas al español basado en Python con Opencv y MedaiPipe

An OpenAI-Gym Package for Training and Testing Reinforcement Learning algorithms with OpenSim Models

a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version

Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

Proto-RL: Reinforcement Learning with Prototypical Representations

A Deep Reinforcement Learning Framework for Stock Market Trading

For holding anime-related object classification and detection models

Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022

Localized representation learning from Vision and Text (LoVT)

PyTorch implementation of our method for adversarial attacks and defenses in hyperspectral image classification.

Multi-task Multi-agent Soft Actor Critic for SMAC