Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Last update: Dec 09, 2022

Overview

Character in Story Identification Network (CiSIN)

This project hosts the code for our paper.

Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung and Gunhee Kim. Character Grounding and Re-Identification in Story of Videos and Text Descriptions. In ECCV (spotlight), 2020.

This project is a winning solution for the LSMDC 2019 "Fill-in the Characters" task. For more information about LSMDC, visit the Large Scale Movie Description Challenge (LSMDC) 2019 website.

Reference

If you use this code as part of any published research, please cite the following paper:

@inproceedings{yu:2020:ECCV,
    title="{Character Grounding and Re-Identification in Story of Videos and Text Descriptions}",
    author={Yu, Youngjae and Kim, Jongseok and Yun, Heeseung and Chung, Jiwan and Kim, Gunhee},
    booktitle={ECCV},
    year=2020
}

System Requirements

The following dependencies should be installed:

Python 3.6
PyTorch 1.4.0
torchvision 0.5.0
A GPU that supports CUDA 10.0, with at least 12 GB of memory
See requirements.txt for more details.
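
Before training, it can be useful to confirm that the local environment matches the list above. The following is a minimal sketch of such a check, not part of the released code: the function names find_problems and detect_environment are hypothetical, and the version numbers are copied from this section. It does not verify the CUDA toolkit version, and it measures GPU memory in decimal gigabytes (an assumption, chosen so that a nominal 12 GB card passes).

```python
import sys

# Versions listed in the System Requirements section of this README.
REQUIRED = {"python": "3.6", "torch": "1.4.0", "torchvision": "0.5.0"}
MIN_GPU_MEMORY_GB = 12  # decimal gigabytes (assumption, see lead-in)


def find_problems(python, torch, torchvision, gpu_memory_gb):
    """Return a list of mismatch messages; an empty list means all checks pass.

    Hypothetical helper for illustration. Version arguments are strings
    (or None if the package is missing); gpu_memory_gb is a float or None.
    """
    problems = []
    found = {"python": python, "torch": torch, "torchvision": torchvision}
    for name, wanted in REQUIRED.items():
        have = found[name]
        if have is None:
            problems.append(f"{name} is not installed (expected {wanted})")
        elif not have.startswith(wanted):
            # Prefix match so that e.g. "1.4.0+cu100" or "3.6.9" is accepted.
            problems.append(f"{name} {have} found, expected {wanted}")
    if gpu_memory_gb is None:
        problems.append("no CUDA GPU detected")
    elif gpu_memory_gb < MIN_GPU_MEMORY_GB:
        problems.append(
            f"GPU has {gpu_memory_gb:.1f} GB, expected at least {MIN_GPU_MEMORY_GB} GB"
        )
    return problems


def detect_environment():
    """Collect (python, torch, torchvision, gpu_memory_gb) from this machine."""
    python = "{}.{}.{}".format(*sys.version_info[:3])
    try:
        import torch
    except ImportError:
        return python, None, None, None
    try:
        import torchvision
        torchvision_version = torchvision.__version__
    except ImportError:
        torchvision_version = None
    gpu_memory_gb = None
    if torch.cuda.is_available():
        # total_memory is reported in bytes for the first visible GPU.
        gpu_memory_gb = torch.cuda.get_device_properties(0).total_memory / 1e9
    return python, torch.__version__, torchvision_version, gpu_memory_gb


if __name__ == "__main__":
    issues = find_problems(*detect_environment())
    print("Environment OK" if not issues else "\n".join(issues))
```

Newer versions of PyTorch may still work; the prefix match here is deliberately strict so that any deviation from the tested configuration is reported rather than silently accepted.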

Data Setup

Coming soon.

CiSIN

To train our model, run:

python train.py

Acknowledgement

We thank the SNUVL lab members for their helpful comments. This research was supported by Seoul National University, the Brain Research Program by the National Research Foundation of Korea (NRF) (2017M3C7A1047860), and the AIR Lab (AI Research Lab) in Hyundai Motor Company through the HMC-SNU AI Consortium Fund.

License

See LICENSE.md.
