Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Last update: Sep 22, 2022

Overview

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

This repo contains only model Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration paper.

Citation

@misc{tang2021zeroshot,
      title={Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration}, 
      author={Chuanxin Tang and Chong Luo and Zhiyuan Zhao and Dacheng Yin and Yucheng Zhao and Wenjun Zeng},
      year={2021},
      eprint={2109.05426},
      archivePrefix={arXiv},
      primaryClass={cs.SD}
}

Note

This repo only contain model implementation, not dataloader and training code, also it is not well tested from my side.
For more complete TTS or Speech Synthesis solution please visit DeepSync .

Owner

Rishikesh (ऋषिकेश)

GitHub Repository

Constituency Tree Labeling Tool

Constituency Tree Labeling Tool The purpose of this package is to solve the constituency tree labeling problem. Look from the dataset labeled by NLTK,

6 Dec 20, 2022

Python powered crossword generator with database with 20k+ polish words

crossword_generator Generate simple crossword puzzle from words and definitions fetched from krzyżowki.edu.pl endpoints -/ string:word - returns js

0 Jan 04, 2022

Korean Simple Contrastive Learning of Sentence Embeddings using SKT KoBERT and kakaobrain KorNLU dataset

KoSimCSE Korean Simple Contrastive Learning of Sentence Embeddings implementation using pytorch SimCSE Installation git clone https://github.com/BM-K/

34 Nov 24, 2022

APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets

APEACH - Korean Hate Speech Evaluation Datasets APEACH is the first crowd-generated Korean evaluation dataset for hate speech detection. Sentences of

70 Dec 06, 2022

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

2.9k Dec 31, 2022

easySpeech is an open-source Python wrapper for google speech to text API that doesn't require PyAudio(So you especially windows user don't have to deal with the errors while installing PyAudio) and also works with hugging face transformers

easySpeech easySpeech is an open source python wrapper for google speech to text api that doesn't require PyAaudio(So you specially windows user don't

14 May 24, 2022

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

Extracting OpenAI CLIP (Global/Grid) Features from Image and Text This repo aims at providing an easy to use and efficient code for extracting image &

13 Jan 06, 2023

Source code for the paper "TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations"

TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations Created by Jiahao Pang, Duanshun Li, and Dong Tian from InterDigital In

21 Dec 29, 2022

Machine translation models released by the Gourmet project

Gourmet Models Overview The Gourmet project has released several machine translation models to translate low-resource languages. This repository conta

5 Dec 08, 2021

This is the offline-training-pipeline for our project.

offline-training-pipeline This is the offline-training-pipeline for our project. We adopt the offline training and online prediction Machine Learning

0 Apr 22, 2022

PG-19 Language Modelling Benchmark

PG-19 Language Modelling Benchmark This repository contains the PG-19 language modeling benchmark. It includes a set of books extracted from the Proje

161 Oct 30, 2022

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

Coreferee Author: Richard Paul Hudson, Explosion AI 1. Introduction 1.1 The basic idea 1.2 Getting started 1.2.1 English 1.2.2 French 1.2.3 German 1.2

70 Dec 12, 2022

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Related tags

Overview

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Citation

Note

Owner

Rishikesh (ऋषिकेश)

Constituency Tree Labeling Tool

Python powered crossword generator with database with 20k+ polish words

Korean Simple Contrastive Learning of Sentence Embeddings using SKT KoBERT and kakaobrain KorNLU dataset

APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

easySpeech is an open-source Python wrapper for google speech to text API that doesn't require PyAudio(So you especially windows user don't have to deal with the errors while installing PyAudio) and also works with hugging face transformers

An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

Source code for the paper "TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations"

Machine translation models released by the Gourmet project

This is the offline-training-pipeline for our project.

PG-19 Language Modelling Benchmark

Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages

中文問句產生器；使用台達電閱讀理解資料集(DRCD)

This is an incredibly powerful calculator that is capable of many useful day-to-day functions.

glow-speak is a fast, local, neural text to speech system that uses eSpeak-ng as a text/phoneme front-end.

Multi Task Vision and Language

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Code examples for my Write Better Python Code series on YouTube.

Official PyTorch implementation of SegFormer

CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)