Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Last update: Dec 25, 2022

Related tags

Deep Learning multi-task_loss_optimizer

Overview

multi-task_losses_optimizer

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

已经实验过了，不会有cuda out of memory情况

##Pareto optimizer

from Pareto_fn import pareto_fn
w_list = [w1,w2,...]
c_list = [c1,c2,...]
[loss1,loss2,...] = model(inputs)
loss_list = [loss1,loss2,...]
# config is the superparameter for training
new_w_list = pareto_fn(w_list,c_list,config,loss_list)
loss = 0
for i in range(len(w_list)):
    loss += new_w_list[i]*loss_list[i]
model.zero_grad()

loss.backward()
optimizer.step()

##pcgrad optimizer

from pcgrad_fn import pcgrad_fn

[loss1,loss2,...] = model(inputs)
loss_list = [loss1,loss2,...]
# config is the superparameter for training

pcgrad_fn(model,loss_list,optimizer)

optimizer.step()

Reference

Please cite as:

@article{yu2020gradient,
  title={Gradient surgery for multi-task learning},
  author={Yu, Tianhe and Kumar, Saurabh and Gupta, Abhishek and Levine, Sergey and Hausman, Karol and Finn, Chelsea},
  journal={arXiv preprint arXiv:2001.06782},
  year={2020}
}

paper: "A Pareto-Efficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation". RecSys, 2019, Alibaba

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Related tags

Overview

multi-task_losses_optimizer

Reference

Owner

ML-Ensemble – high performance ensemble learning

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

PCGNN - Procedural Content Generation with NEAT and Novelty

Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loop Story Generation"

An end-to-end PyTorch framework for image and video classification

Introducing neural networks to predict stock prices

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

Reinforcement learning library in JAX.

[ACMMM 2021 Oral] Enhanced Invertible Encoding for Learned Image Compression

Official repository of the AAAI'2022 paper "Contrast and Generation Make BART a Good Dialogue Emotion Recognizer"

This repository contains answers of the Shopify Summer 2022 Data Science Intern Challenge.

This repo uses a combination of logits and feature distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset.

PyTorch implementation of Memory-based semantic segmentation for off-road unstructured natural environments.

Draw like Bob Ross using the power of Neural Networks (With PyTorch)!

Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

Script utilizando OpenCV e modelo Machine Learning para detectar o uso de máscaras.

A collection of models for image<->text generation in ACM MM 2021.

This repo tries to recognize faces in the dataset you created

Segmentation models with pretrained backbones. Keras and TensorFlow Keras.