Non-Autoregressive Predictive Coding

This repository contains the implementation of Non-Autoregressive Predictive Coding (NPC) as described in the preprint paper submitted to ICASSP 2021.

A quick example for training NPC

python main.py --config config/self_supervised/npc_example.yml \
               --task self-learning

For more complete examples including downstream tasks, please see the example script.
For preparing data, please visit preprocess.
For detailed hyperparameters setting and description, please checkout example config file of NPC.
For all run-time options, use -h flag.
Implementation of Autoregressive Predictive Coding (APC, 2019, Chung et al.) and Vector-Quantized APC (VQ-APC, 2020, Chung et al.) are also available using similar training/downstream execution with example config files here.

Some notes

We found the unmasked feature produced by the last ConvBlock layer a better representation. In the phone classification tasks, switching to the unmasked feature (PER 25.6%) provided a 1.6% improvement over the masked feature (PER 27.2%). Currently, this is not included in the preprint version and will be updated to the paper in the future. Please refer to downstream examples to activate this option.
APC/VQ-APC are implemented with the following modifications for improvement (for the unmodified version, please visit the official implementation of APC / VQAPC)
- Multi-group VQ available for VQ-APC, but with VQ on last layer only
- Using utterance-wised CMVN surface feature（just as NPC did)
- Using Gumbel Softmax from official API of pytorch
See package requirement for toolkits used, tensorboard can be used to access log files in --logdir.

Contact

Feel free to contact me for questions or feedbacks, my email can be found in the paper or my personal page.

Citation

If you find our work and/or this repository helpful, please do consider citing us

@article{liu2020nonautoregressive,
  title   = {Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies},
  author  = {Liu, Alexander and Chung, Yu-An and Glass, James},
  journal = {arXiv preprint arXiv:2011.00406},
  year    = {2020}
}

Non-Autoregressive Predictive Coding

Related tags

Overview

Non-Autoregressive Predictive Coding

Some notes

Contact

Citation

Owner

Alexander H. Liu

A Plover python dictionary allowing for consistent symbol input with specification of attachment and capitalisation in one stroke.

Share constant definitions between programming languages and make your constants constant again

topic modeling on unstructured data in Space news articles retrieved from the Guardian (UK) newspaper using API

硕士期间自学的NLP子任务，供学习参考

Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision

Trained T5 and T5-large model for creating keywords from text

Python3 to Crystal Translation using Python AST Walker

Model parallel transformers in JAX and Haiku

PyKaldi is a Python scripting layer for the Kaldi speech recognition toolkit.

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

End-2-end speech synthesis with recurrent neural networks

jiant is an NLP toolkit

Lingtrain Aligner — ML powered library for the accurate texts alignment.

Dope Wars game engine on StarkNet L2 roll-up

This repo contains simple to use, pretrained/training-less models for speaker diarization.

gaiic2021-track3-小布助手对话短文本语义匹配复赛rank3、决赛rank4

Reproduction process of BERT on SST2 dataset

Code for the paper "A Simple but Tough-to-Beat Baseline for Sentence Embeddings".

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

HAIS_2GNN: 3D Visual Grounding with Graph and Attention