Inferring Lexicographically-Ordered Rewards from Preferences

Code author: Alihan Hüyük ([email protected])

This repository contains the source code necessary to replicate the main experimental results in the AAAI 2022 paper "Inferring Lexicographically-Ordered Reward from Preferences." Our proposed method, LORI, is implemented in files src/main-lori.py and src/main-lori-liver.py for the problem settings considered in the paper: cancer treatment and organ transplantation respectively.

Usage

First, install the required python packages by running:

    python -m pip install -r requirements.txt

Then, the experiments in the paper can be replicated by running:

    ./src/run.sh        # generates the results in Tables 2 and 3
    ./src/run-liver.sh  # generates the reward functions in (10) and (11)

Note that, in order to run the experiments for the transplantation setting, you need to get access to the Organ Procurement and Transplantation Network (OPTN) dataset for liver transplantations as of December 4, 2020.

Citing

If you use this software please cite as follows:

@inproceedings{huyuk2022inferring,
  author={Alihan H\"uy\"uk and William R. Zame and Mihaela van der Schaar},
  title={Inferring lexicographically-ordered rewards from preferences},
  booktitle={Proceedings of the 36th AAAI Conference on Artificial Intelligence},
  year={2022}
}

Inferring Lexicographically-Ordered Rewards from Preferences

Related tags

Overview

Inferring Lexicographically-Ordered Rewards from Preferences

Usage

Citing

Owner

Alihan Hüyük

Pytorch Implementation of "Diagonal Attention and Style-based GAN for Content-Style disentanglement in image generation and translation" (ICCV 2021)

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery

Official implementation of "Generating 3D Molecules for Target Protein Binding"

The MATH Dataset

we propose EfficientDerain for high-efficiency single-image deraining

PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)

Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.

A sketch extractor for anime/illustration.

Domain Generalization with MixStyle, ICLR'21.

Decision Transformer: A brand new Offline RL Pattern

FedTorch is an open-source Python package for distributed and federated training of machine learning models using PyTorch distributed API

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Contrastive Loss Gradient Attack (CLGA)

Patient-Survival - Using Python, I developed a Machine Learning model using classification techniques such as Random Forest and SVM classifiers to predict a patient's survival status that have undergone breast cancer surgery.

ObjectDrawer-ToolBox: a graphical image annotation tool to generate ground plane masks for a 3D object reconstruction system

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Code to reproduce the results for Statistically Robust Neural Network Classification, published in UAI 2021

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

Deep Learning Head Pose Estimation using PyTorch.