A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Last update: May 25, 2022

Related tags

Text Data & NLP midi_language

Overview

MIDI Language

Introduction

Reference

Paper: Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions: code

This is a modified version with an extension of multi-instrumental support.

Function

Convert Midi into event sequence, and represented by mapped integer array.

This could send to NLP models for AI auto music composition.

Due to this project considers more about music structures as well as its chord and melody on higher level, including note, drum, tempo, musical instrument (program in midi) and its expressions (tempo and velocity), rather than digging into too much details like sound source & direction, instrumental performing techniques (such as, bend sound, piano sustain pedal, violin overtones), the language of MIDI is design this way (see chapter Details below).

Usage

See language.py, it contains procedures:

load w2i (word to integer) and i2w (integer to word), for not calculating it every time;
encode midi to iteger array, each object handle one mid file;
decode integer array to midi, each object handle many results and export to mid files;

The code language.py has arguments:

input: input file of audio file to encode/decode;
output: output file of audio file to encode;
train: if have, it will switch to training mode with variations (data augmentation);

MidiEncoder data augmentation:

pitch_variation_range: a random pitch shift within a range for whole midi;
velocity_scale_variation_range: a random note/drum velocity scale for whole midi;
velocity_noise_scale_variation_range: a random note/drum velocity scale for each element within midi;
tempo_scale_variation_range: a random tempo change for whole midi;

MidiDecoder needs numerator and denominator time signatures for reconstructing midi files.

Details

Event Structure

Required:

Bar
Position (0~split-1)

Optional:

note:
- Note
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
drum:
- Drum
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
chord:
- Chord (chroma_name:chord_name)
tempo:
- Tempo_Class (T0~Ti)
- Tempo_Value (0~59)

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Related tags

Overview

MIDI Language

Introduction

Reference

Function

Usage

Details

Event Structure

Required:

Optional:

Owner

Robert Bogan Kang

Python module (C extension and plain python) implementing Aho-Corasick algorithm

Trains an OpenNMT PyTorch model and SentencePiece tokenizer.

Language-Agnostic SEntence Representations

Materials (slides, code, assignments) for the NYU class I teach on NLP and ML Systems (Master of Engineering).

Code for Discovering Topics in Long-tailed Corpora with Causal Intervention.

source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.

🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)

The aim of this task is to predict someone's English proficiency based on a text input.

Data manipulation and transformation for audio signal processing, powered by PyTorch

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Mesh TensorFlow: Model Parallelism Made Easier

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Code for our ACL 2021 (Findings) Paper - Fingerprinting Fine-tuned Language Models in the wild .

Exploration of BERT-based models on twitter sentiment classifications

Machine learning models from Singapore's NLP research community

Nested Named Entity Recognition

BeautyNet is an AI powered model which can tell you whether you're beautiful or not.

Anomaly Detection 이상치 탐지 전처리 모듈

NLP topic mdel LDA - Gathered from New York Times website