The official implementation of Variable-Length Piano Infilling (VLI).

Last update: Sep 01, 2022

Overview

Variable-Length-Piano-Infilling

The official implementation of Variable-Length Piano Infilling (VLI). (paper: Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding)

VLI is a new Transformer-based model for music score infilling, i.e., to generate a polyphonic music sequence that fills in the gap between given past and future contexts. Our model can infill a variable number of notes for different time spans.

Installation

Clone and install the modified Huggingface Transformer package.
Clone this repo and install the required packages.

git clone https://github.com/reichang182/Variable-Length-Piano-Infilling.git
cd  Variable-Length-Piano-Infilling
pip install -r requirement.txt

Download and unzip the AIlabs.tw Pop1K7 dataset. (Download link: here).

Training & Testing

# Prepare data
python prepare_data.py \
	--midi-folder datasets/midi/midi_synchronized/ \
	--save-folder ./

# Train the model
python train.py --train

# Test the trained model
python train.py

Baselines

The codes to run baselines in our paper are in the baselines folder. We implement ILM and FELIX according to their paper (ILM and FELIX) and based on the implementation of Transformer-XL and BERT in Huggingface Transformer. They can also be trained and tested through the same command as our model does above.

# cd baselines/ILM or cd baselines/FELIX

# Train the model
python train.py --train \
	--dict-file ../../dictionary.pickle \
	--data-file ../../worded_data.pickle

# Test the trained model
python train.py \
	--dict-file ../../dictionary.pickle \
	--data-file ../../worded_data.pickle

Architecture

Results

The training NLL-loss curves of ours and the baseline models.

The objective metrics evaluated on the music pieces generated by VLI(ours), ILM, FELIX, and the real music.

Results of the user study: mean opinion scores in 1–5 in M(melodic fluency), R(rhythmic fluency), I(im-pression), and percentage of votes in F(favorite), from ‘all’ the participants or only the music ‘pro’-fessionals.

The official implementation of Variable-Length Piano Infilling (VLI).

Related tags

Overview

Variable-Length-Piano-Infilling

Installation

Training & Testing

Baselines

Architecture

Results

Owner

This is the code repository implementing the paper "TreePartNet: Neural Decomposition of Point Clouds for 3D Tree Reconstruction".

Virtual hand gesture mouse using a webcam

Only works with the dashboard version / branch of jesse

Semi-supervised learning for object detection

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

A Peer-to-peer Platform for Secure, Privacy-preserving, Decentralized Data Science

InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing

Hybrid Neural Fusion for Full-frame Video Stabilization

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

This is an implementation of Googles Yogi-Optimizer in Keras (tf.keras)

Replication of Pix2Seq with Pretrained Model

Contextual Attention Network: Transformer Meets U-Net

Contains a bunch of different python programm tasks

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"

Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness

A Learning-based Camera Calibration Toolbox

How to Predict Stock Prices Easily Demo