music-ai

Deep learning transformer model that generates unique music sequences.

Abstract

In 2017, a new state-of-the-art was published for natural language processing: the Transformer. Relying solely on attention mechanisms, the Transformer outperformed existing solutions based on recurrent and convolutional neural networks¹. However, recurrent neural networks, long short-term memory, and gated recurrent neural networks remain dominant in the field of generative music. I aim to introduce the Transformer into the field of music, with the goal of teaching the deep learning model to predict the second half of a composition given the first half. A Transformer equipped with 32 attention heads and sinusoidal positional encoding was trained on the Nottingham MIDI dataset for 5000 epochs over a period of 48 hours, optimized by stochastic gradient descent and measured with cross entropy loss, and regulated by an exponential learning rate decrease schedule. For the first thousand epochs, the model had noticeable improvement but lacked arrangement to the generated sequences. By five thousand epochs, the model clearly demonstrated the knowledge of general music trends used to better predict how classical composers write their pieces, and most tracks were melodic to the human ear. Future applications of this technique include generating tracks for various instruments, rating the quality of existing music tracks, and complete originality if combined with a generative network mapping melodies to latent space.

¹ Attention Is All You Need

Video

Hardware

Ubuntu

32 GB RAM
Intel Core i3-4170 CPU @3.70 GHz x4 (4 GB RAM)
NVIDIA GeForce GTX 1050 Ti

Deep learning transformer model that generates unique music sequences.

Related tags

Overview

music-ai

Abstract

Video

Hardware

Owner

xacer

DeepMusic is an easy to use Spotify like app to manage and listen to your favorites musics.

Klangbecken: The RaBe Endless Music Player

This is a realtime voice translator program which gets input from user at any language and converts it to the desired language that the user asks

Codes for "Efficient Long-Range Attention Network for Image Super-resolution"

This is an AI that runs in the terminal. It is a voice assistant that can do common activities and can also help in your coding doubts like

A Quick Music Player Made Fully in Python

Datamoshing with FFmpeg

SinGlow: Generative Flow for SVS tasks in Tensorflow 2

Python implementation of the Short Term Objective Intelligibility measure

A Python wrapper for the high-quality vocoder "World"

The project aims to develop a personal-assistant for Windows & Linux-based systems

Play any song directly into your group voice chat.

Vixtify - Python Controlled Music Player

Learn chords with your MIDI keyboard !

Open-Source bot to play songs in your Telegram's Group Voice Chat. Powered by @Akki_ThePro

A python script that can play .mp3 URLs upon the ringing or motion detection of a Ring doorbell. The sound plays through Sonos speakers.

This bot can stream audio or video files and urls in telegram voice chats

pyo is a Python module written in C to help digital signal processing script creation.

Gateware for the Terasic/Arrow DECA board, to become a USB2 high speed audio interface

A python wrapper for REAPER