Zalo AI challenge 2021 task hum to song

Last update: Dec 16, 2022

Overview

Zalo AI challenge 2021 task Hum to Song

Pipeline:

Prepare the data for training:

Edit the paths in config/preprocess.yaml:
- raw_path: path to the raw data
- preprocessed_path: output path for the extracted mel spectrograms
- temp_dir: path for the normalized mp3 data
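Before starting the preprocessing, it can help to confirm that all three paths are filled in. The following is a minimal sketch, not part of the repository: `missing_path_keys` is a hypothetical helper, the example values are placeholders, and it assumes the three keys sit at the top level of the loaded mapping, which the source does not confirm. The mapping itself would come from reading config/preprocess.yaml with a YAML loader of your choice.

```python
# Key names come from the list above; the helper and its flat-mapping
# assumption are hypothetical and may not match the real file layout.
REQUIRED_KEYS = ("raw_path", "preprocessed_path", "temp_dir")


def missing_path_keys(config):
    """Return the required keys that are absent or empty in a config mapping."""
    return [key for key in REQUIRED_KEYS if not config.get(key)]


# Placeholder values, not the project's real paths.
example = {
    "raw_path": "/data/train",
    "preprocessed_path": "/data/preprocessed",
    "temp_dir": "",
}
print(missing_path_keys(example))  # prints ['temp_dir']
```

An empty list means every path has a value; it does not check that the directories exist.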

Run the following commands in order:

        python preprocessing.py

        python utils/split_train_val_by_id.py
   
        python utils/augment_mp3.py
   
        python utils/preprocess_augment.py
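Because each step depends on the output of the previous one, it may be convenient to chain them so that a failure stops the sequence. The sketch below is one way to do that, assuming it is launched from the repository root; `run_in_order` and `PREPROCESS_STEPS` are hypothetical names, and only the script paths come from the commands above.

```python
import subprocess
import sys

# Script paths are taken from the steps above; the runner is a hypothetical convenience.
PREPROCESS_STEPS = [
    "preprocessing.py",
    "utils/split_train_val_by_id.py",
    "utils/augment_mp3.py",
    "utils/preprocess_augment.py",
]


def run_in_order(scripts, run=subprocess.run):
    """Run each script with the current interpreter, stopping at the first failure."""
    for script in scripts:
        # check=True raises CalledProcessError on a non-zero exit code,
        # so later steps never start after a failed one.
        run([sys.executable, script], check=True)


# Usage, from the repository root:
# run_in_order(PREPROCESS_STEPS)
```

The same helper could be called with `["convert_data.py", "train.py"]` for the training stage.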

Train model:

Edit the paths in config/config.py:
- meta_train: path to the train_meta.csv file inside preprocessed_path
- train_root: path to the preprocessed mel data
- train_list = 'full_data_train.txt'
- val_list = 'full_data_val.txt'
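The four settings can be derived from two base directories. The sketch below only illustrates how the values relate to each other; `training_config` is a hypothetical function, the real config/config.py may be organized differently, and the example directories are placeholders.

```python
import posixpath


def training_config(preprocessed_path, train_root):
    """Build the four settings named above from two base directories."""
    return {
        # train_meta.csv is described as living inside preprocessed_path.
        "meta_train": posixpath.join(preprocessed_path, "train_meta.csv"),
        "train_root": train_root,
        # File names as given in the list above.
        "train_list": "full_data_train.txt",
        "val_list": "full_data_val.txt",
    }


# Placeholder directories, not the project's real paths.
settings = training_config("/data/preprocessed", "/data/preprocessed/mel")
print(settings["meta_train"])  # prints /data/preprocessed/train_meta.csv
```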

Run the following commands in order:

        python convert_data.py

        python train.py

Infer public test:

Place the raw mp3 data in /data/public_test, which must contain two folders: full_song and hum.
Then run:

        ./predict.sh
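A quick layout check before running the script can catch a misplaced folder early. This is a minimal sketch: `missing_subdirs` is a hypothetical helper, and only the folder names come from the instructions above.

```python
from pathlib import Path

# Folder names come from the instructions above; the helper is a hypothetical pre-flight check.
REQUIRED_SUBDIRS = ("full_song", "hum")


def missing_subdirs(root):
    """Return the required subfolders that do not exist under root."""
    base = Path(root)
    return [name for name in REQUIRED_SUBDIRS if not (base / name).is_dir()]


# Usage: an empty list means both folders are present.
# missing_subdirs("/data/public_test")
```

The same check applies unchanged to /data/private_test.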

Infer private test:

Place the raw mp3 data in /data/private_test, which must contain two folders: full_song and hum.

Then run:

        ./predict_private_test.sh

Team:

Võ Văn Phúc

Nguyễn Văn Thiều

Lâm Bá Thịnh

Owner

Vo Van Phuc
