HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Last update: Dec 27, 2022

HiFiGAN Denoiser

This is an unofficial PyTorch implementation of the paper HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks.

Citations

@misc{su2020hifigan,
      title={HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks}, 
      author={Jiaqi Su and Zeyu Jin and Adam Finkelstein},
      year={2020},
      eprint={2006.05694},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
}

Requirements

Tested on Python 3.6

pip install -r requirements.txt

Training & TensorBoard

python train.py -c [config yaml file]
tensorboard --logdir log_dir

Inference

python inference.py -p [checkpoint path] -i [input wav path]
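To denoise several files, the command above can be repeated once per input. The following is a minimal sketch, not part of the repository: it assumes inference.py is in the current directory and accepts only the -p and -i flags shown above. The helper names (build_inference_command, denoise_directory) are hypothetical, and where inference.py writes its output is not stated here, so the sketch does not try to collect results.

```python
import subprocess
import sys
from pathlib import Path


def build_inference_command(checkpoint, wav_path, script="inference.py"):
    """Build the argv list for one inference.py call (hypothetical helper).

    Only the -p (checkpoint path) and -i (input wav path) flags from the
    README are used; no other options are assumed to exist.
    """
    return [sys.executable, script, "-p", str(checkpoint), "-i", str(wav_path)]


def denoise_directory(checkpoint, wav_dir, dry_run=False):
    """Call inference.py once per .wav file directly inside wav_dir.

    Files are processed in sorted order. With dry_run=True the commands
    are printed instead of executed. Returns the list of commands.
    """
    wav_files = sorted(Path(wav_dir).glob("*.wav"))
    commands = [build_inference_command(checkpoint, p) for p in wav_files]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops at the first file that fails.
            subprocess.run(cmd, check=True)
    return commands
```

Calling denoise_directory("[checkpoint path]", "[wav directory]", dry_run=True) first prints the commands so they can be checked before any model is loaded.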

Checkpoint :

References

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Denoising WaveNet Generator
StarGAN-VC Discriminator
MelGAN Multi-Scale Discriminator
Parallel WaveGAN
HiFi-GAN vocoder's MSD and multi-GPU training code

Owner

Rishikesh (ऋषिकेश)
