INTRODUCTION

This is a modification of the OpenAI-CLIP repo of moein-shariatnia(https://github.com/moein-shariatnia/OpenAI-CLIP).

The current training dataset supports flicker-8k or flicker-30k, and the image encoder supports Resnet50 or ViT(vit_base_patch16_384).

Text encoder supports only DistilBert like moein-shariatnia.

ENVIRONTMENT SETTING

$ virtualenv .venv --python=python3.6
$ source .venv/bin/activate
$ pip install -r requirements.txt

EXECUTTION

Pretrain

$ python3 pretrain.py

Inference

$ python3 inference.py --qeury={YOUR QUERY}

CAUTION

You must set(or check) some options in config.py before pretrain & inference

ex1) dataset("8k" or "30k"): Train dataset(flicker-8k or flicker-30k)

ex2) model_name("resnet50" or "vit_base_patch16_384"): Type of image encoder

ex3) pretrained(True or False): Decide whether to learn by loading pretrain versions of text encoder(DistilBert) and image encoder(resnet50 or ViT)

ex4) batch_size: Set according to the capacity of the machine

This is a modification of the OpenAI-CLIP repository of moein-shariatnia

Related tags

Overview

INTRODUCTION

ENVIRONTMENT SETTING

EXECUTTION

CAUTION

Owner

Sangwon Beak

NLP topic mdel LDA - Gathered from New York Times website

Pangu-Alpha for Transformers

One Stop Anomaly Shop: Anomaly detection using two-phase approach: (a) pre-labeling using statistics, Natural Language Processing and static rules; (b) anomaly scoring using supervised and unsupervised machine learning.

Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Official Pytorch implementation of Test-Agnostic Long-Tailed Recognition by Test-Time Aggregating Diverse Experts with Self-Supervision.

Code for the paper "Are Sixteen Heads Really Better than One?"

Based on 125GB of data leaked from Twitch, you can see their monthly revenues from 2019-2021

VD-BERT: A Unified Vision and Dialog Transformer with BERT

Topic Modelling for Humans

Knowledge Graph,Question Answering System，基于知识图谱和向量检索的医疗诊断问答系统

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

Repository to hold code for the cap-bot varient that is being presented at the SIIC Defence Hackathon 2021.

Statistics and Mathematics for Machine Learning, Deep Learning , Deep NLP

Machine Learning Course Project, IMDB movie review sentiment analysis by lstm, cnn, and transformer

Training code for Korean multi-class sentiment analysis

COVID-19 Related NLP Papers

Korean stereoypte detector with TUNiB-Electra and K-StereoSet

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS)

Backend for the Autocomplete platform. An AI assisted coding platform.

Curso práctico: NLP de cero a cien 🤗