Sign Language Transformers (CVPR'20)

This repo contains the training and evaluation code for the paper Sign Language Transformers: Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation.

This code is based on Joey NMT but modified to realize joint continuous sign language recognition and translation. For text-to-text translation experiments, you can use the original Joey NMT framework.

Requirements

Download the feature files using the data/download.sh script.
[Optional] Create a conda or python virtual environment.
Install required packages using the requirements.txt file.

pip install -r requirements.txt

Usage

python -m signjoey train configs/sign.yaml

! Note that the default data directory is ./data. If you download them to somewhere else, you need to update the data_path parameters in your config file.

ToDo:

Initial code release.
Release image features for Phoenix2014T.
Share extensive qualitative and quantitative results & config files to generate them.
(Nice to have) - Guide to set up conda environment and docker image.

Reference

Please cite the paper below if you use this code in your research:

@inproceedings{camgoz2020sign,
  author = {Necati Cihan Camgoz and Oscar Koller and Simon Hadfield and Richard Bowden},
  title = {Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2020}
}

Acknowledgements

_{This work was funded by the SNSF Sinergia project "Scalable Multimodal Sign Language Technology for Sign Language Learning and Assessment" (SMILE) grant agreement number CRSII2 160811 and the European Union’s Horizon2020 research and innovation programme under grant agreement no. 762021 (Content4All). This work reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains. We would also like to thank NVIDIA Corporation for their GPU grant.}

Sign Language Transformers (CVPR'20)

Related tags

Overview

Sign Language Transformers (CVPR'20)

Requirements

Usage

ToDo:

Reference

Acknowledgements

Owner

Necati Cihan Camgoz

Classifies galaxy morphology with Bayesian CNN

This is the code used in the paper "Entity Embeddings of Categorical Variables".

Source codes for Improved Few-Shot Visual Classification (CVPR 2020), Enhancing Few-Shot Image Classification with Unlabelled Examples

Conversion between units used in magnetism

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

PyTorch code for our paper "Attention in Attention Network for Image Super-Resolution"

Establishing Strong Baselines for TripClick Health Retrieval; ECIR 2022

Segmentation models with pretrained backbones. PyTorch.

Spherical CNNs

This library contains a Tensorflow implementation of the paper Stability Analysis of Unfolded WMMSE for Power Allocation

Listing arxiv - Personalized list of today's articles from ArXiv

Using LSTM to detect spoofing attacks in an Air-Ground network

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

A multi-entity Transformer for multi-agent spatiotemporal modeling.

YOLO-v5 기반 단안 카메라의 영상을 활용해 차간 거리를 일정하게 유지하며 주행하는 Adaptive Cruise Control 기능 구현

KE-Dialogue: Injecting knowledge graph into a fully end-to-end dialogue system.

[ICCV'21] UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction

A Pytorch Implementation for Compact Bilinear Pooling.

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.

PyTorch implementation of the Flow Gaussian Mixture Model (FlowGMM) model from our paper