Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

Last update: Jan 09, 2023

Related tags

Overview

UniSpeech

The family of UniSpeech:

UniSpeech (ICML 2021): Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

UniSpeech-SAT (ICASSP 2022 Submission): Universal Speech Representation Learning with Speaker Aware Pre-Training

Pre-trained models

We strongly suggest using our UniSpeech-SAT model for speaker related tasks, since it shows very powerful performance on various speaker related benchmarks.

Model	Dataset	Model
UniSpeech Base	1500 hrs CommonVoice	download
UniSpeech Large	1500 hrs CommonVoice	download
UniSpeech-SAT Base	960 hrs LibriSpeech	download
UniSpeech-SAT Base+	60k hrs Libri-Light + 10k hrs GigaSpeech + 24k hrs VoxPopuli	download
UniSpeech-SAT Large	60k hrs Libri-Light + 10k hrs GigaSpeech + 24k hrs VoxPopuli	download

License

This project is licensed under the license found in the LICENSE file in the root directory of this source tree. Portions of the source code are based on the FAIRSEQ project.

Microsoft Open Source Code of Conduct

Contact Information

For help or issues using UniSpeech models, please submit a GitHub issue.

For other communications related to UniSpeech, please contact Yu Wu ([email protected]).

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

Related tags

Overview

UniSpeech

Pre-trained models

License

Contact Information

Owner

Microsoft

HuSpaCy: industrial-strength Hungarian natural language processing

The Illinois repository for Climatehack (https://climatehack.ai/). We won 1st place!

fastgradio is a python library to quickly build and share gradio interfaces of your trained fastai models.

For encoding a text longer than 512 tokens, for example 800. Set max_pos to 800 during both preprocessing and training.

The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Code for "Adversarial attack by dropping information." (ICCV 2021)

Dataset used in "PlantDoc: A Dataset for Visual Plant Disease Detection" accepted in CODS-COMAD 2020

Various operations like path tracking, counting, etc by using yolov5

Edge Restoration Quality Assessment

An official PyTorch Implementation of Boundary-aware Self-supervised Learning for Video Scene Segmentation (BaSSL)

Session-aware Item-combination Recommendation with Transformer Network

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

Improving Factual Consistency of Abstractive Text Summarization

TensorFlow implementation of Deep Reinforcement Learning papers

Code for the paper "On the Power of Edge Independent Graph Models"

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

MIM: MIM Installs OpenMMLab Packages

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)

NHS AI Lab Skunkworks project: Long Stayer Risk Stratification

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives