Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022

Last update: Nov 19, 2022

Overview

Improving evidential deep learning via multi task learning

It is a repository of AAAI2022 paper, “Improving evidential deep learning via multi-task learning”, by Dongpin Oh and Bonggun Shin.

This repository contains the code to reproduce the Multi-task evidential neural network (MT-ENet), which uses the Lipschitz MSE loss function as the additional loss function of the evidential regression network (ENet). The Lipschitz MSE loss function can improve the accuracy of the ENet while preserving its uncertainty estimation capability, by avoiding gradient conflict with the NLL loss function—the original loss function of the ENet.

Setup

Please refer to "requirements.txt" for requring packages of this repo.

pip install -r requirements.txt

Training the ENet with the Lipschitz-MSE loss: example

from mtevi.mtevi import EvidentialMarginalLikelihood, EvidenceRegularizer, modified_mse
...
net = EvidentialNetwork() ## Evidential regression network
nll_loss = EvidentialMarginalLikelihood() ## original loss, NLL loss
reg = EvidenceRegularizer() ## evidential regularizer
mmse_loss = modified_mse ## lipschitz MSE loss
...
for inputs, labels in dataloader:
	gamma, nu, alpha, beta = net(inputs)
	loss = nll_loss(gamma, nu, alpha, beta, labels)
	loss += reg(gamma, nu, alpha, beta, labels)
	loss += mmse_loss(gamma, nu, alpha, beta, labels)
	loss.backward()

Quick start

Synthetic data experiment.

python synthetic_exp.py

UCI regression benchmark experiments.

python uci_exp_norm -p energy

Drug target affinity (DTA) regression task on KIBA and Davis datasets.

python train_evinet.py -o test --type davis -f 0 --evi # ENet
python train_evinet.py -o test --type davis -f 0  # MT-ENet

Gradient conflict experiment on the DTA benchmarks

python check_conflict.py --type davis -f 0 # Conflict between the Lipschitz MSE (proposed) and NLL loss. 
python check_conflict.py --type davis -f 0 --abl # Conflict between the simple MSE loss and NLL loss.

Characteristic of the Lipschitz MSE loss

The Lipschitz MSE loss function can support training the ENet to more accurately predicts target values.
It regularizes its gradient to prevent gradient conflict with the NLL loss--the original loss function--if the NLL loss increases predictive uncertainty of the ENet.
Please check our paper for details.

Repository for "Improving evidential deep learning via multi-task learning," published in AAAI2022

Related tags

Overview

Improving evidential deep learning via multi task learning

Setup

Training the ENet with the Lipschitz-MSE loss: example

Quick start

Characteristic of the Lipschitz MSE loss

Owner

deargen

A Python library that provides a simplified alternative to DBAPI 2

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

A library that allows for inference on probabilistic models

Real time Human Detection Counting

Practical tutorials and labs for TensorFlow used by Nvidia, FFN, CNN, RNN, Kaggle, AE

Pytorch port of Google Research's LEAF Audio paper

ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss (HDCWNet)

ML-based medical imaging using Azure

OpenVisionAPI server

Official public repository of paper "Intention Adaptive Graph Neural Network for Category-Aware Session-Based Recommendation"

GPU-Accelerated Deep Learning Library in Python

Training and Evaluation Code for Neural Volumes

Code for HLA-Face: Joint High-Low Adaptation for Low Light Face Detection (CVPR21)

The final project for "Applying AI to Wearable Device Data" course from "AI for Healthcare" - Udacity.

《Dual-Resolution Correspondence Network》(NeurIPS 2020)

Outlier Exposure with Confidence Control for Out-of-Distribution Detection

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Space Time Recurrent Memory Network - Pytorch

Keras implementation of Deeplab v3+ with pretrained weights

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction