Frozen️ in Time
❄️
️️️️
⏳
A Joint Video and Image Encoder for End-to-End Retrieval
(arXiv)
Repository to contain the code, models, data for end-to-end retrieval.
Work in progress
Code provided to train end-to-end model on MSRVTT.
Set path locations in msrvtt_4f_i21k.json
conda env create -f requirements/frozen.yml
python train.py --config configs/msrvtt_4f_i21k.json
TODO:
[x] conda env
[ ] msrvtt data zip
[ ] pretrained models
[ ] webvid data
[ ] Other benchmarks
