Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

Last update: Dec 22, 2022

Overview

Alias-Free-Torch

Simple torch module implementation of Alias-Free GAN.

This repository including

Alias-Free GAN style lowpass sinc filter @filter.py
Alias-Free GAN style up/downsample @resample.py
Alias-Free activation @act.py
and test codes @./test

Note: Since this repository is unofficial, filter and upsample could be different with official implementation.

Note: 2d lowpass filter is applying sinc instead of jinc (first order Bessel function of the first kind) in paper

Requirements

Due to torch.kaiser_window and torch.i0 are implemeted after 1.7.0, our repository need torch>=1.7.0.

Pytorch>=1.7.0

TODO

2d sinc filter
2d resample
devide 1d and 2d modules
pip packaging

Test results 1d

Filter sine	Filter noise

upsample	downsample

Test results 2d

Filter L1 norm sine	Filter noise

upsample	downsample

Activation

References

Alias-Free GAN
adefossez/julius
A. V. Oppenheim and R. W. Schafer. Discrete-Time Signal Processing. Pearson, International Edition, 3rd edition, 2010

Acknowledgement

This work is done at MINDsLab Inc.

Thanks to teammates at MINDsLab Inc.

Comments

Batched resampling for the new implementation

Hi, thank you very much for the contribution.

I think the new implementation of resample.Upsample1d and resample.Downsample1d breaks batched resampling when using groups=C without expanding the filter to match the shape. Perhaps the implementation should be like the below (maybe similar goes to 2d):

Upsample1d.forward()

    # x: [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        x = F.pad(x, (self.pad, self.pad), mode='reflect')
        # TConv with filter expanded to C with C groups for depthwise op
        x = self.ratio * F.conv_transpose1d(
            x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C)
        pad_left = self.pad * self.stride + (self.kernel_size -
                                             self.stride) // 2
        pad_right = self.pad * self.stride + (self.kernel_size - self.stride +
                                              1) // 2
        x = x[..., pad_left:-pad_right]

LowPassFilter1d.forward()

    #input [B,C,T]
    def forward(self, x):
        B, C, T = x.shape
        if self.padding:
            x = F.pad(x, (self.left_pad, self.right_pad),
                      mode=self.padding_mode)
        # Conv with filter expanded to C with C groups for depthwise op
        out = F.conv1d(x, self.filter.expand(C, -1, -1), stride=self.stride, groups=C) # typo 'groupds' btw
        return out

Could you check the correctness? Thanks again for the implementation!

opened by L0SG 2

torch.speical.i1 typo

https://github.com/junjun3518/alias-free-torch/blob/f1fddd52fdd068ee475e82ae60c92e1bc24ffe02/src/alias_free_torch/filter.py#L22

At this line I believe you wanted torch.special.i1.

opened by torridgristle 2
"if self.pad / self.padding" in LowPassFilter2d

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L165 https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L173

In LowPassFilter2d it looks like if self.pad: should change to if self.padding:, or self.padding = padding should change to self.pad = padding to match LowPassFilter1d.

opened by torridgristle 1
Padding Bool typo

https://github.com/junjun3518/alias-free-torch/blob/258551410ff7bf02e06ece7c597466dc970fe5c7/src/alias_free_torch/filter.py#L73

padding: bool: True, should be padding: bool = True,

I'm not sure if this causes an error with every version of PyTorch, but it does with PyTorch 1.12.0+cu113 on Python 3.7.13

opened by torridgristle 1
2D Filter Jinc appears to be wrong

Here is a plot of the generated 1D sinc filter kernel.

Here is a plot of the generated 2D jinc filter kernel.

I'd expect it to look more like a series of rings or ripples, rather than a donut or torus.

The FFT output for randn noise put through the 2D filter doesn't look right either.

Changing filter_ = 2 * cutoff * window * jinc(2 * cutoff * time) to filter_ = 2 * cutoff * window * sinc(2 * cutoff * time) in kaiser_jinc_filter2d makes a more familiar kernel.

And the FFT output for randn noise put through this 2D filter looks about how I'd expect.

opened by torridgristle 3

Releases(v0.0.6)

v0.0.6(Jul 26, 2022)

https://pypi.org/project/alias-free-torch/0.0.6/

Tested version
Source code(tar.gz)
Source code(zip)
v0.0.3(Jul 18, 2022)

https://pypi.org/project/alias-free-torch/0.0.3/

Bug fix for torch.special / remove print / split pad from conv_transpose
Source code(tar.gz)
Source code(zip)
v0.0.2(Jun 22, 2022)

https://pypi.org/project/alias-free-torch/0.0.2/

Rewrite upsample, jinc applied
Source code(tar.gz)
Source code(zip)
v0.0.1(Nov 2, 2021)

v0.0.1 released https://pypi.org/project/alias-free-torch/
Source code(tar.gz)
Source code(zip)

Owner

이준혁(Junhyeok Lee)

Audio/Speech Deep Learning Researcher @mindslab-ai

GitHub Repository

The code of paper 'Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection'

Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection Pytorch implemetation of paper 'Learning to Aggregate and Personalize

136 Dec 29, 2022

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Hierarchical Token Semantic Audio Transformer Introduction The Code Repository for "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound

134 Jan 01, 2023

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

StyleGAR TODO: add arxiv link Implementation of Inverting Generative Adversarial Renderer for Face Reconstruction TODO: for test Currently, some model

155 Oct 27, 2022

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

MediumVC MediumVC is an utterance-level method towards any-to-any VC. Before that, we propose SingleVC to perform A2O tasks(Xi → Ŷi) , Xi means utter

47 Dec 25, 2022

Implementation of the paper Recurrent Glimpse-based Decoder for Detection with Transformer.

REGO-Deformable DETR By Zhe Chen, Jing Zhang, and Dacheng Tao. This repository is the implementation of the paper Recurrent Glimpse-based Decoder for

33 Nov 30, 2022

A curated list of awesome projects and resources related fastai

138 Dec 22, 2022

Improving Object Detection by Estimating Bounding Box Quality Accurately

Improving Object Detection by Estimating Bounding Box Quality Accurately Abstrac

2 Apr 14, 2022

Animation of solving the traveling salesman problem to optimality using mixed-integer programming and iteratively eliminating sub tours

tsp-streamlit Animation of solving the traveling salesman problem to optimality using mixed-integer programming and iteratively eliminating sub tours.

4 Nov 05, 2022

Deep Learning Datasets Maker is a QGIS plugin to make datasets creation easier for raster and vector data.

Deep Learning Dataset Maker Deep Learning Datasets Maker is a QGIS plugin to make datasets creation easier for raster and vector data. How to use Down

25 Dec 15, 2022

Clustering with variational Bayes and population Monte Carlo

pypmc pypmc is a python package focusing on adaptive importance sampling. It can be used for integration and sampling from a user-defined target densi

45 Feb 06, 2022

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

112 Nov 07, 2022

Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample

Related tags

Overview

Alias-Free-Torch

Requirements

TODO

Test results 1d

Test results 2d

References

Acknowledgement

Comments

Batched resampling for the new implementation

torch.speical.i1 typo

"if self.pad / self.padding" in LowPassFilter2d

Padding Bool typo

2D Filter Jinc appears to be wrong

Releases(v0.0.6)

v0.0.6(Jul 26, 2022)

v0.0.3(Jul 18, 2022)

v0.0.2(Jun 22, 2022)

v0.0.1(Nov 2, 2021)

Owner

이준혁(Junhyeok Lee)

The code of paper 'Learning to Aggregate and Personalize 3D Face from In-the-Wild Photo Collection'

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Implementation of the paper Recurrent Glimpse-based Decoder for Detection with Transformer.

A curated list of awesome projects and resources related fastai

Improving Object Detection by Estimating Bounding Box Quality Accurately

Animation of solving the traveling salesman problem to optimality using mixed-integer programming and iteratively eliminating sub tours

Deep Learning Datasets Maker is a QGIS plugin to make datasets creation easier for raster and vector data.

Clustering with variational Bayes and population Monte Carlo

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection

学习 python3 以来写的一些垃圾玩具……

Semantic Image Synthesis with SPADE

An example showing how to use jax to train resnet50 on multi-node multi-GPU

Symbolic Parallel Adaptive Importance Sampling for Probabilistic Program Analysis in JAX

ULMFiT for Genomic Sequence Data

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

Session-aware Item-combination Recommendation with Transformer Network

Complete the code of prefix-tuning in low data setting