PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Last update: Dec 19, 2022

Stochastic CSLR

This is the PyTorch implementation for the ECCV 2020 paper: Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Quick Start

1. Installation

pip install git+https://github.com/zheniu/stochastic-cslr

You also need to install sclite for evaluation; see step 2 for instructions.

2. Prepare the dataset

Download the RWTH-PHOENIX-2014 dataset here.
Unzip it and note the path to the phoenix-2014-multisigner/ folder for later use.
Install sclite for evaluation. See phoenix-2014-multisigner/evaluation/NIST-sclite_sctk-2.4.0-20091110-0958.tar.bz2 for details.
After installing sclite, add it to your PATH.
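Once steps 1 and 2 are done, a short sanity check can catch setup problems before evaluation. The sketch below is not part of the repository: check_setup is a hypothetical helper, and the package name stochastic_cslr is assumed from the stochastic_cslr/ directory in the source tree.

```python
import importlib.util
import shutil
from pathlib import Path


def check_setup(data_root, package="stochastic_cslr", tool="sclite"):
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    # Assumption: the installed package is importable under this name.
    if importlib.util.find_spec(package) is None:
        problems.append(f"Python package '{package}' is not importable")
    # shutil.which searches the directories listed in PATH.
    if shutil.which(tool) is None:
        problems.append(f"'{tool}' was not found on PATH")
    if not Path(data_root).is_dir():
        problems.append(f"dataset folder '{data_root}' does not exist")
    return problems


if __name__ == "__main__":
    for problem in check_setup("your_path_to/phoenix-2014-multisigner"):
        print("WARNING:", problem)
```

If nothing is printed, the package, sclite, and the dataset folder were all found.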

3. Run a quick test

You can use the script quick_test.py for a quick test.

python3 quick_test.py --data-root your_path_to/phoenix-2014-multisigner

By specifying the model type (--model sfl or --model dfl), the data split (--split dev or --split test), and whether to use a language model (--use-lm), you can get the following results:

Model	WER (dev)	sub/del/ins (dev)	WER (test)	sub/del/ins (test)
DFL	27.1	12.7/7.4/7.0	27.7	13.8/7.3/6.6
SFL	26.2	12.7/6.9/6.7	26.6	13.7/6.5/6.4
DFL + LM	25.6	11.5/9.2/4.9	26.4	12.4/9.3/4.7
SFL + LM	24.3	11.4/8.5/4.4	25.3	12.4/8.5/4.3

Note that these results differ slightly from those in the paper because a different random seed was used.
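For readers unfamiliar with the columns: word error rate is conventionally defined as (substitutions + deletions + insertions) divided by the number of reference words, which is why each WER in the table equals the sum of its sub/del/ins rates up to rounding (for example, 12.7 + 7.4 + 7.0 = 27.1 for DFL on dev). The sketch below illustrates that definition with a plain edit-distance alignment. It is not sclite's implementation, and the gloss sequences are made-up examples.

```python
def error_counts(reference, hypothesis):
    """Return (substitutions, deletions, insertions) of a minimum-edit alignment."""
    rows, cols = len(reference) + 1, len(hypothesis) + 1
    # cell[i][j] = (total_errors, subs, dels, ins) for reference[:i] vs hypothesis[:j]
    cell = [[None] * cols for _ in range(rows)]
    cell[0][0] = (0, 0, 0, 0)
    for i in range(1, rows):
        cell[i][0] = (i, 0, i, 0)  # every reference word deleted
    for j in range(1, cols):
        cell[0][j] = (j, 0, 0, j)  # every hypothesis word inserted
    for i in range(1, rows):
        for j in range(1, cols):
            t, s, d, n = cell[i - 1][j - 1]
            if reference[i - 1] == hypothesis[j - 1]:
                diagonal = (t, s, d, n)          # correct word, no cost
            else:
                diagonal = (t + 1, s + 1, d, n)  # substitution
            t, s, d, n = cell[i - 1][j]
            deletion = (t + 1, s, d + 1, n)
            t, s, d, n = cell[i][j - 1]
            insertion = (t + 1, s, d, n + 1)
            # Tuples compare by total error count first.
            cell[i][j] = min(diagonal, deletion, insertion)
    return cell[-1][-1][1:]


def wer(reference, hypothesis):
    """Word error rate in percent."""
    subs, dels, ins = error_counts(reference, hypothesis)
    return 100.0 * (subs + dels + ins) / len(reference)


if __name__ == "__main__":
    ref = "MORGEN REGEN NORD WIND".split()  # hypothetical gloss sequence
    hyp = "MORGEN SONNE NORD".split()
    print(error_counts(ref, hyp), wer(ref, hyp))
```

When several alignments share the same total error count, the split between error types depends on tie-breaking, so per-type counts from this sketch may not match sclite exactly.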

You may also take a look at quick_test.py as it shows how to use the pretrained models.
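To cover every row and column of the table in one go, the eight flag combinations can be generated programmatically. This is a sketch, not a script shipped with the repository. It assumes --use-lm is a switch that takes no value, and quick_test_commands is a hypothetical helper name.

```python
import itertools
import subprocess


def quick_test_commands(data_root):
    """Build one quick_test.py command per (model, split, use_lm) combination."""
    commands = []
    for model, split, use_lm in itertools.product(
        ("dfl", "sfl"), ("dev", "test"), (False, True)
    ):
        cmd = ["python3", "quick_test.py", "--data-root", data_root,
               "--model", model, "--split", split]
        if use_lm:
            cmd.append("--use-lm")  # assumed to be a flag without an argument
        commands.append(cmd)
    return commands


if __name__ == "__main__":
    for cmd in quick_test_commands("your_path_to/phoenix-2014-multisigner"):
        print(" ".join(cmd))
        # Uncomment to run each evaluation from the repository root:
        # subprocess.run(cmd, check=True)
```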

4. Train your own model

The configuration files for deterministic and stochastic fine-grained labeling are located under config/. Training is driven by torchzq, a PyTorch experiment runner that reads the hyperparameters from the YAML file and passes them to stochastic_cslr/runner.py.

Before running, set data_root in each YAML configuration file to the path of your phoenix-2014-multisigner/ folder.
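Editing the file by hand is enough, but the change can also be scripted. The sketch below rewrites the value with a regular expression instead of a YAML parser, so it needs no extra dependencies. It assumes data_root appears as a plain `data_root: value` line, and the sample config keys other than data_root are invented for illustration. set_data_root is a hypothetical helper.

```python
import re

# Matches a whole `data_root: ...` line and captures its indentation.
_DATA_ROOT_LINE = re.compile(r"^([ \t]*)data_root[ \t]*:.*$", flags=re.MULTILINE)


def set_data_root(yaml_text, new_root):
    """Return yaml_text with every data_root value replaced by new_root.

    Anything after the colon on that line, including a trailing comment,
    is discarded.
    """
    return _DATA_ROOT_LINE.sub(
        lambda m: f"{m.group(1)}data_root: {new_root}", yaml_text
    )


if __name__ == "__main__":
    sample = "data_root: some/old/path\nbatch_size: 8\n"  # hypothetical config content
    print(set_data_root(sample, "your_path_to/phoenix-2014-multisigner"))
```

To apply it to a real file, read the text with pathlib.Path("config/dfl-fp16.yml").read_text(), pass it through set_data_root, and write the result back with write_text.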

Train (for instance, dfl):

tzq config/dfl-fp16.yml train

Test the trained model:

tzq config/dfl-fp16.yml test
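The two commands above can be chained so that testing starts only if training succeeds. This is a minimal sketch that shells out to the tzq command exactly as written above. tzq_command and train_then_test are hypothetical helper names, and only the train and test modes shown in this README are accepted.

```python
import subprocess


def tzq_command(config_path, mode):
    """Build the tzq command line for one config file and one mode."""
    if mode not in ("train", "test"):
        raise ValueError(f"unsupported mode: {mode!r}")
    return ["tzq", str(config_path), mode]


def train_then_test(config_path):
    """Train with the given config, then evaluate; stop if either step fails."""
    for mode in ("train", "test"):
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(tzq_command(config_path, mode), check=True)


if __name__ == "__main__":
    print(" ".join(tzq_command("config/dfl-fp16.yml", "train")))
    # Uncomment to launch training followed by testing:
    # train_then_test("config/dfl-fp16.yml")
```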

Citation

You may cite this work by:

@inproceedings{niu2020stochastic,
  title={Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition},
  author={Niu, Zhe and Mak, Brian},
  booktitle={European Conference on Computer Vision},
  pages={172--186},
  year={2020},
  organization={Springer}
}

Owner

Zhe Niu
