This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Last update: Dec 05, 2022

Overview

Non-autoregressive Deep Learning-Based TTS Template

This is a template for the Non-autoregressive TTS model. It contains

Data Preprocessing Pipeline
Data Loader
Model / Trainer
Logger, Postprocessing (logging, synthesizing, plotting, etc..)

How to use it?

Clone the repository.

git clone https://github.com/keonlee9420/Deep-Learning-TTS-Template
cd Deep-Learning-TTS-Template

Replace all MYMODEL strings in this repo with your model name and also rename the file model/MYMODEL.py.
Build your model on model/ and check train.py and synthesize.py.
Use README_template.md for the README.md file of your project.
Feel free to add /img for your model architecture and tensorboard examples. It would also be nice to show your model's output audio in /demo.
Don't forget to update requirements.txt and /config of your project.

Citation

@misc{lee2021deep_learning_tts_template,
  author = {Lee, Keon},
  title = {Deep-Learning-TTS-Template},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/keonlee9420/Deep-Learning-TTS-Template}}
}

References

ming024's FastSpeech2

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

DiffSinger - PyTorch Implementation PyTorch implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension). Status

152 Jan 2, 2023

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

GradTTS Unofficial Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech" (arxiv) About this repo This is an unoffic

103 Dec 23, 2022

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fast Symbolic Regression Symbolic Regression is a non-linear, non-parametric Machine Learning method capable of modeling complex data sets. fastsr aim

3 Jun 22, 2022

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

Confluence: A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection 1. 介绍用以替代 NMS，在所有 bbox 中挑选出最优的集合。 NMS 仅考虑了 bbox 的得分，然后根据 IOU 来

44 Sep 15, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image.

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

7 May 29, 2022

This project uses Template Matching technique for object detecting by detection of template image over base image

Object Detection Project Using OpenCV This project uses Template Matching technique for object detecting by detection the template image over base ima

4 Nov 16, 2021

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

English | 简体中文 Why Non-Euclidean Geometry Considering these simple graph structures shown below. Nodes with same color has 2-hop distance whereas 1-ho

123 Dec 12, 2022

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

53 Dec 29, 2022

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

One model to speak them all 🌎 Audio Language Text ▷ Chinese 人人生而自由，在尊严和权利上一律平等。 ▷ English All human beings are born free and equal in dignity and rig

60 Nov 14, 2022

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Related tags

Overview

Non-autoregressive Deep Learning-Based TTS Template

How to use it?

Citation

References

You might also like...

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

A Robust Non-IoU Alternative to Non-Maxima Suppression in Object Detection

This project uses Template Matching technique for object detecting by detection of template image over base image.

This project uses Template Matching technique for object detecting by detection of template image over base image

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Releases(v1.0.0)

v1.0.0(Jun 15, 2021)

Owner

Keon Lee

This code finds bounding box of a single human mouth.

Attention mechanism with MNIST dataset

A sequence of Jupyter notebooks featuring the 12 Steps to Navier-Stokes

Python Assignments for the Deep Learning lectures by Andrew NG on coursera with complete submission for grading capability.

Repository for the semantic WMI loss

Reproduces the results of the paper "Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations".

This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.

Earthquake detection via fiber optic cables using deep learning

GeneralOCR is open source Optical Character Recognition based on PyTorch.

Transformer Huffman coding - Complete Huffman coding through transformer

Json2Xml tool will help you convert from json COCO format to VOC xml format in Object Detection Problem.

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Disturbing Target Values for Neural Network regularization: attacking the loss layer to prevent overfitting

This is the 3D Implementation of 《Inconsistency-aware Uncertainty Estimation for Semi-supervised Medical Image Segmentation》

Just Randoms Cats with python

Face and Body Tracking for VRM 3D models on the web.

VOneNet: CNNs with a Primary Visual Cortex Front-End

A Keras implementation of YOLOv3 (Tensorflow backend)

Discovering and Achieving Goals via World Models

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)