STRIVE: Scene Text Replacement In Videos

Dataset Types:

RoboText
SynthText
RealWorld videos

RoboText : Videos of texts collected using navigation robot in indoor environment. The overall duration of these videos is 10hrs+ Each text's background can be extracted from the bottom rectangle of its text rectangle. The orginial unprocessed data is stored as RoboText-OriginalZip.7z. Around 200 preprocessed videos are stored as RoboTextZip1.7z

SynthText : Using unity, we have created paired videos from synthetic scenes. These videos are stored with similar naming convention in drive. File name : SynthText7Zip.7z

Note: Unity bbox are recorded as mirror values, hence the bbox extraction process will be different than other two video types.

Real World videos: We have collected videos using high resolution mobile camera to capture texts in different lighting conditions and motion blur. File name: RealWorld.7z

Preparing data

We have extracted text bounding box from RoboText and Real world videos using AWS Rekognition API. The code available as runAWS.py file. Synthetic videos bbox is recorded in unity environment

Data Preprocessing

Refer to the preprocessing python file for each dataset type to get crop images of text.

Data download

Data can be downloaded from here

Please contact Jeyasri Subramanian( [email protected] ) for any data queries

STRIVE: Scene Text Replacement In Videos

Related tags

Overview

STRIVE: Scene Text Replacement In Videos

Dataset Types:

Preparing data

Data Preprocessing

Data download

Owner

a reimplementation of Holistically-Nested Edge Detection in PyTorch

Accelerated SMPL operation, commonly used in generate 3D human mesh, STAR included.

Pytorch implementation for "Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion" (NeurIPS 2021)

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset.

PyTorch implementation of our ICCV 2019 paper: Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

PyTorch implementation of the ExORL: Exploratory Data for Offline Reinforcement Learning

DetCo: Unsupervised Contrastive Learning for Object Detection

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

kapre: Keras Audio Preprocessors

Repository aimed at compiling code, papers, demos etc.. related to my PhD on 3D vision and machine learning for fruit detection and shape estimation at the university of Lincoln

EGNN - Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch

Implementation of the bachelor's thesis "Real-time stock predictions with deep learning and news scraping".

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Perfect implement. Model shared. x0.5 (Top1:60.646) and 1.0x (Top1:69.402).

Everything about being a TA for ITP/AP course!

[NeurIPS'21] Shape As Points: A Differentiable Poisson Solver

Source code for the paper "Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect"

Automatically Build Multiple ML Models with a Single Line of Code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.