A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

Last update: Jun 04, 2022

Overview

LASAFT-Net-v2

Listen, Attend and Separate by Attentively aggregating Frequency Transformation

Woosung Choi, Yeong-Seok Jeong, Jinsung Kim, Jaehwa Chung, Soonyoung Jung, and Joshua D. Reiss

Demonstration (under construction)

Experimental Results

MUSDB18

model	vocals	drums	bass	other	AVG
Meta-TasNet	6.40	5.91	5.58	4.19	5.52
AMSS-Net	6.78	5.92	5.10	4.51	5.58
LaSAFT-Net-v1	7.33	5.68	5.63	4.87	5.88
LASAFT-Net-v2	7.57	6.13	5.28	4.87	5.96
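
The AVG column appears to be the arithmetic mean of the four per-source scores. The short sketch below checks that reading against the rows above. The helper names and the tolerance (half a unit in the last printed digit, to allow for rounding) are assumptions made for illustration, not part of the repository.

```python
# Rows copied from the MUSDB18 table: (vocals, drums, bass, other, reported AVG).
ROWS = {
    "Meta-TasNet": (6.40, 5.91, 5.58, 4.19, 5.52),
    "AMSS-Net": (6.78, 5.92, 5.10, 4.51, 5.58),
    "LaSAFT-Net-v1": (7.33, 5.68, 5.63, 4.87, 5.88),
    "LASAFT-Net-v2": (7.57, 6.13, 5.28, 4.87, 5.96),
}


def mean_score(scores):
    """Arithmetic mean of the per-source scores."""
    return sum(scores) / len(scores)


def avg_matches(row, tol=0.005):
    """True if the reported AVG agrees with the mean up to rounding (assumed tolerance)."""
    *sources, reported = row
    return abs(mean_score(sources) - reported) <= tol + 1e-9


if __name__ == "__main__":
    for name, row in ROWS.items():
        status = "ok" if avg_matches(row) else "mismatch"
        print(f"{name}: mean={mean_score(row[:4]):.4f} reported={row[4]:.2f} {status}")
```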

MDX Challenge (Leaderboard A)

model	model type	vocals	drums	bass	other	AVG
KUILAB-MDX-Net	dedicated (1 source/ 1 model)	8.901	7.173	7.232	5.636	7.236
LaSAFT-Net-v1 (light)	conditioned (4 sources/ 1 model)	7.275	5.935	5.823	4.557	5.897
LASAFT-Net-v2 (light)	conditioned (4 sources/ 1 model)	7.324	5.976	5.884	4.642	5.957

How to reproduce

1. Environment

Ubuntu 20.04
wandb for logging

Create a .env file by copying .env.sample, then set the following environment variables in it:

wandb_api_key=[Your Key] # "xxxxxxxxxxxxxxxxxxxxxxxx"
data_dir=[Your Path] # "/home/ielab/repos/musdbHQ"

About wandb_api_key
- Only wandb is currently supported for logging.
- To get your API key, visit wandb, open Settings, and copy the key.
About data_dir
- The absolute path of the directory where the datasets are stored.
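
Before training, it can help to confirm that .env was actually filled in. The following is a minimal sketch using only the standard library. The function names are hypothetical, and it assumes the simple key=value layout shown above, with `#` starting a comment and the `[Your ...]` placeholders left over from .env.sample. It is not how the repository itself loads the file.

```python
from pathlib import Path

# The two keys named in the README.
REQUIRED_KEYS = ("wandb_api_key", "data_dir")


def parse_env_text(text):
    """Parse key=value lines; text after '#' is treated as a comment (assumption)."""
    values = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        values[key.strip()] = value.strip().strip('"')
    return values


def read_env_file(path=".env"):
    """Read and parse a .env file from disk."""
    return parse_env_text(Path(path).read_text())


def missing_settings(values):
    """Return keys that are absent, still a '[...]' placeholder, or not an absolute path."""
    problems = []
    for key in REQUIRED_KEYS:
        value = values.get(key, "")
        if not value or value.startswith("["):
            problems.append(key)
        elif key == "data_dir" and not Path(value).is_absolute():
            problems.append(key)
    return problems


if __name__ == "__main__":
    print("settings needing attention:", missing_settings(read_env_file()))
```

An empty list means both values look usable. It does not verify that the API key is valid or that the dataset directory exists.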

2. Installation (CUDA)

conda env create -f environment.yaml -n lasaftv2
conda activate lasaftv2
pip install -r requirements.txt
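
After the environment is activated, a quick sanity check can confirm that the key packages are importable. This sketch assumes that `torch` and `wandb` are among the installed dependencies (the README names PyTorch and wandb, but the exact contents of environment.yaml and requirements.txt are not shown here). The function name is hypothetical.

```python
import importlib.util


def check_install(packages=("torch", "wandb")):
    """Map each package name to True if it can be found in the current environment."""
    return {name: importlib.util.find_spec(name) is not None for name in packages}


if __name__ == "__main__":
    report = check_install()
    for name, found in report.items():
        print(f"{name}: {'found' if found else 'MISSING'}")

    if report.get("torch"):
        import torch

        # torch.cuda.is_available() reports whether PyTorch can see a CUDA device.
        print("CUDA available:", torch.cuda.is_available())
```

Run it inside the activated lasaftv2 environment. A missing package usually means the conda or pip step did not finish.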
