PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "

Last update: Nov 03, 2022

Related tags

Overview

Foley Music: Learning to Generate Music from Videos

This repo holds the code for the framework presented on ECCV 2020.

Foley Music: Learning to Generate Music from Videos Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, and Antonio Torralba

paper

Usage Guide

Prerequisites

The training and testing in PGCN is reimplemented in PyTorch for the ease of use.

Pytorch 1.4

Other minor Python modules can be installed by running

pip install -r requirements.txt

Data Preparation

Download Datasets

The extracted pose and midi for training and audio generation can be downloaded here and unzip to ./data folder.

The original datasets (including videos) can be found:

URMP: can be downloaded here
MUSIC: can be downloaded here
AtinPiano: proposed by At Your Fingertips: Automatic Piano Fingering Detection. The dataset can be downloaded here

Training

For URMP

CUDA_VISIBLE_DEVICES=6 python train.py -c config/URMP/violin.conf -e exps/urmp-vn

For AtinPiano

CUDA_VISIBLE_DEVICES=6 python train.py -c config/AtinPiano.conf -e exps/atinpiano

For MUSIC

CUDA_VISIBLE_DEVICES=6 python train.py -c config/MUSIC/accordion.conf -e exps/music-accordion

Generating MIDI, sounds and videos

For URMP

VIDEO_PATH=/path/to/video
INSTRUMENT_NAME='Violin'
python test_URMP.py exps/urmp-vn/checkpoint.pth.tar -o exps/urmp-vn/generate -i Violin -v $VIDEO_PATH -i $INSTRUMENT_NAME

For AtinPiano

VIDEO_PATH=/path/to/video
INSTRUMENT_NAME='Acoustic Grand Piano'
python test_AtinPiano_MUSIC.py exps/atinpiano/checkpoint.pth.tar -o exps/atinpiano/generation -v $VIDEO_PATH -i $INSTRUMENT_NAME

For MUSIC

VIDEO_PATH=/path/to/video
INSTRUMENT_NAME='Accordion'
python test_AtinPiano_MUSIC.py exps/music-accordion/checkpoint.pth.tar -o exps/music-accordion/generation -v $VIDEO_PATH -i $INSTRUMENT_NAME

Notes:

Instrument name ($INSTRUMENT_NAME) can be found here
If you do not have the video file or you want to generate MIDI and audio only, you can add -oa flag to skip the generation of video.

Other Info

Citation

Please cite the following paper if you feel our work useful to your research.

@inproceedings{FoleyMusic2020,
  author    = {Chuang Gan and
               Deng Huang and
               Peihao Chen and
               Joshua B. Tenenbaum and
               Antonio Torralba},
  title     = {Foley Music: Learning to Generate Music from Videos},
  booktitle = {ECCV},
  year      = {2020},
}

PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "

Related tags

Overview

Foley Music: Learning to Generate Music from Videos

Usage Guide

Prerequisites

Data Preparation

Download Datasets

Training

Generating MIDI, sounds and videos

Other Info

Citation

Owner

Chuang Gan

Junction Tree Variational Autoencoder for Molecular Graph Generation (ICML 2018)

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

A very impractical 3D rendering engine that runs in the python terminal.

AI-based, context-driven network device ranking

Deep Residual Networks with 1K Layers

Torch implementation of various types of GAN (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN, LSGAN)

A script helps the user to update Linux and Mac systems through the terminal

An open-source, low-cost, image-based weed detection device for fallow scenarios.

Ranger deep learning optimizer rewrite to use newest components

Point Cloud Registration Network

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

MISSFormer: An Effective Medical Image Segmentation Transformer

🕺Full body detection and tracking

DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.

Generalizing Gaze Estimation with Outlier-guided Collaborative Adaptation

FS-Mol: A Few-Shot Learning Dataset of Molecules

Identifying Stroke Indicators Using Rough Sets

I created My own Virtual Artificial Intelligence named genesis, He can assist with my Tasks and also perform some analysis,,

Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Shape-Adaptive Selection and Measurement for Oriented Object Detection