A fast implementation of bss_eval metrics for blind source separation

Last update: Dec 13, 2022

Overview

fast_bss_eval

Do you have a zillion BSS audio files to process and it is taking days? Is your simulation never ending?

Fear no more! fast_bss_eval is here to help you!

fast_bss_eval is a fast implementation of the bss_eval metrics for the evaluation of blind source separation. Compared to existing implementations, it has the following advantages.

seamlessly works with both numpy arrays and pytorch tensors
very fast
can be even faster by using an iterative solver (add use_cg_iter=10 option to the function call)
differentiable via pytorch
can run on GPU via pytorch

Author

Robin Scheibler

Quick Start

Install

# from pypi
pip install fast-bss-eval

# or from source
git clone https://github.com/fakufaku/fast_bss_eval
cd fast_bss_eval
pip install -e .

Use

Assuming you have multichannel signals for the estimated and reference sources stored in wav files named my_estimate_file.wav and my_reference_file.wav, respectively, you can quickly evaluate the bss_eval metrics as follows.

from scipy.io import wavfile
import fast_bss_eval

# open the files, we assume the sampling rate is known
# to be the same
fs, ref = wavfile.read("my_reference_file.wav")
_, est = wavfile.read("my_estimate_file.wav")

# compute the metrics
sdr, sir, sar, perm = fast_bss_eval.bss_eval_sources(ref.T, est.T)
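
The transposes in the call above are there because scipy.io.wavfile.read returns arrays shaped (n_samples, n_channels), while the metrics are called here with channels first. The following is a minimal sketch of a loading helper under the assumption of signed integer PCM input. The name to_channels_first is hypothetical and not part of fast_bss_eval.

```python
import numpy as np


def to_channels_first(wav_data):
    """Convert an array as returned by scipy.io.wavfile.read into a
    float array of shape (n_channels, n_samples).

    Hypothetical helper for illustration; not part of fast_bss_eval.
    """
    x = np.asarray(wav_data)
    if x.ndim == 1:
        # mono files come back as a 1-D array of samples
        x = x[:, None]
    if np.issubdtype(x.dtype, np.signedinteger):
        # scale signed integer PCM (e.g. int16) to [-1, 1)
        scale = float(np.iinfo(x.dtype).max) + 1.0
        x = x.astype(np.float64) / scale
    else:
        # float data (and unsigned 8-bit data) is only cast, not rescaled
        x = x.astype(np.float64)
    return x.T
```

With such a helper, to_channels_first(ref) and to_channels_first(est) could be passed in place of ref.T and est.T, which also covers the mono case where the array has no channel axis to transpose.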

Benchmark

This package is significantly faster than other packages that also compute the bss_eval metrics, such as mir_eval or sigsep/bsseval. We ran a benchmark using numpy/torch, single/double precision floating point arithmetic (fp32/fp64), and either Gaussian elimination or conjugate gradient descent (solve/CGD10).
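
The solve/CGD10 distinction is between solving a linear system directly and approximating its solution with a fixed number of conjugate gradient iterations. The following numpy sketch illustrates that trade-off on a small symmetric positive definite Toeplitz system. The matrix, the sizes, and the function name conjugate_gradient are illustrative assumptions and do not reproduce the package's internal solver.

```python
import numpy as np


def conjugate_gradient(A, b, n_iter=10):
    """Approximate the solution of A x = b, for symmetric positive
    definite A, with at most n_iter conjugate gradient iterations."""
    x = np.zeros_like(b)
    r = b - A @ x   # residual
    p = r.copy()    # search direction
    rs_old = r @ r
    for _ in range(n_iter):
        Ap = A @ p
        alpha = rs_old / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        if rs_new < 1e-30:
            # residual is numerically zero, stop early
            break
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x


# illustrative SPD Toeplitz matrix with entries 0.5 ** |i - j|
n = 64
lags = np.abs(np.arange(n)[:, None] - np.arange(n)[None, :])
A = 0.5 ** lags
rng = np.random.default_rng(0)
b = rng.standard_normal(n)

x_direct = np.linalg.solve(A, b)            # direct solve
x_cg = conjugate_gradient(A, b, n_iter=10)  # fixed iteration budget

rel_err = np.linalg.norm(x_cg - x_direct) / np.linalg.norm(x_direct)
print(f"relative error after 10 CG iterations: {rel_err:.2e}")
```

Each iteration costs one matrix-vector product, so a small fixed budget trades a little accuracy for speed when the system is well conditioned. How close the result is to the direct solution depends on the conditioning of the matrix, so the iteration count is worth checking on your own data.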

Citation

If you use this package in your own research, please cite our paper describing it.

@misc{scheibler_sdr_2021,
  title={SDR --- Medium Rare with Fast Computations},
  author={Robin Scheibler},
  year={2021},
  eprint={2110.06440},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}

License

2021 (c) Robin Scheibler, LINE Corporation

This code is released under the MIT License.
