EfficientTTS

Unofficial PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture" (arXiv).

Disclaimer: Some people have mistakenly assumed that I am one of the authors. I am not on the author list of this paper; I am just a TTS enthusiast. The paper omits some important implementation details, so some model parameters in the current version are based on my own understanding and experiments and may not match those used by the authors.

Updates

2020/12/23: Mandarin Chinese samples uploaded. The experimental setting is exactly the same as for the LJSpeech example. A complete description of the usage will be uploaded soon.

2020/12/20: Using a HifiGAN fine-tuned on Tacotron2 GTA mel spectrograms can improve the quality of the generated samples; please see the newly generated samples.

Current status

Implementation of EFTS-CNN + HifiGAN

Setup with virtualenv

$ cd tools
$ make
# If you want to use distributed training, run the following
# command to install apex.
$ make apex

Note: To specify the Python, CUDA, or PyTorch version, run, for example:

$ make PYTHON=3.7 CUDA_VERSION=10.1 PYTORCH_VERSION=1.6
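
After the setup finishes, it can be useful to confirm which versions actually ended up in the environment. The following is a minimal sketch, not part of this repository: the helper name describe_environment is hypothetical, and it assumes you run it with the Python interpreter of the environment that make created. It only reports what it finds and does not fail if PyTorch is missing.

```python
import importlib.util
import sys


def describe_environment():
    """Return a dict summarising the Python, PyTorch and CUDA versions in use.

    Hypothetical helper for illustration; not part of the EfficientTTS code.
    """
    info = {
        "python": "{}.{}".format(*sys.version_info[:2]),
        "torch": None,           # stays None if PyTorch is not installed
        "cuda": None,            # CUDA version PyTorch was built against, if any
        "cuda_available": False,
    }
    # Only import torch if it is installed, so the check never crashes.
    if importlib.util.find_spec("torch") is None:
        return info

    import torch

    info["torch"] = torch.__version__
    info["cuda"] = torch.version.cuda  # None for CPU-only builds
    info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

If the reported versions differ from the ones passed to make, the script is probably being run with a different interpreter than the one set up in tools.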

Training

Go to the egs/lj folder and see run.sh for example usage.

Acknowledgement

The code framework is from https://github.com/kan-bayashi/ParallelWaveGAN

Owner

Liu Songxiang
