HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Last update: Dec 29, 2022

Related tags

Overview

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

This is the unofficial implementation of Vocoder part of HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement.

Currently, this repo is WIP but you can start your training without any error.

Training:

python train.py --config config_v2.json

Citations:

@misc{https://doi.org/10.48550/arxiv.2203.13086,
  doi = {10.48550/ARXIV.2203.13086},
  
  url = {https://arxiv.org/abs/2203.13086},
  
  author = {Andreev, Pavel and Alanov, Aibek and Ivanov, Oleg and Vetrov, Dmitry},
  
  keywords = {Sound (cs.SD), Machine Learning (cs.LG), Audio and Speech Processing (eess.AS), FOS: Computer and information sciences, FOS: Computer and information sciences, FOS: Electrical engineering, electronic engineering, information engineering, FOS: Electrical engineering, electronic engineering, information engineering},
  
  title = {HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement},
  
  publisher = {arXiv},
  
  year = {2022},
  
  copyright = {arXiv.org perpetual, non-exclusive license}
}

References:

https://github.com/jik876/hifi-gan

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Related tags

Overview

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Training:

Citations:

References:

Owner

Rishikesh (ऋषिकेश)

Paper: De-rendering Stylized Texts

An easy way to build PyTorch datasets. Modularly build datasets and automatically cache processed results

A PyTorch Library for Accelerating 3D Deep Learning Research

Agile SVG maker for python

Complete the code of prefix-tuning in low data setting

Computational Pathology Toolbox developed by TIA Centre, University of Warwick.

Official code for the publication "HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder".

Official pytorch implementation of "Scaling-up Disentanglement for Image Translation", ICCV 2021.

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

Captcha-tensorflow - Image Captcha Solving Using TensorFlow and CNN Model. Accuracy 90%+

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

Custom studies about block sparse attention.

PyTorch implementation of SwAV (Swapping Assignments between Views)

Tutorial page of the Climate Hack, the greatest hackathon ever

Deep functional residue identification

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation

3D Generative Adversarial Network

CRLT: A Unified Contrastive Learning Toolkit for Unsupervised Text Representation Learning

Set of methods to ensemble boxes from different object detection models, including implementation of "Weighted boxes fusion (WBF)" method.