Toward Model Interpretability in Medical NLP

LING380: Topics in Computational Linguistics, Final Project

James Cross ([email protected]) and Daniel Kim ([email protected]), December 2021

Code Organization

data: contains the medical report data [LINK TO THAT REPO] used in model fine-tuning and analysis, clinical stop words, and the accuracy and entropy metrics saved during evaluation

models: checkpoints of the best-performing BERT and BioBERT models after hyperparameter optimization

notebooks:

model_training.ipynb: code to train and fine-tune BERT and BioBERT

model_evaluation.ipynb: code to run various model evaluations, visualize word importances, perform post-training clinical stopword masking, and other analyses

scripts: the same functionality as the notebooks, provided as executable Python scripts and functions
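As a rough illustration of the clinical stopword masking mentioned under model_evaluation.ipynb, the sketch below replaces every word that appears in a stop-word list with a mask token. It is not the project's implementation: the function name `mask_stopwords`, the `[MASK]` default, the whitespace tokenization, and the example stop words are all assumptions made for this example.

```python
def mask_stopwords(text, stopwords, mask_token="[MASK]"):
    """Replace each whitespace-delimited word found in `stopwords` with `mask_token`."""
    stopset = {w.lower() for w in stopwords}
    masked = []
    for word in text.split():
        # Strip surrounding punctuation before the lookup so "patient," still matches.
        # Punctuation attached to a masked word is dropped along with it.
        core = word.strip(".,;:!?()[]\"'").lower()
        masked.append(mask_token if core in stopset else word)
    return " ".join(masked)


# Hypothetical stop words, for illustration only (not the project's list)
example_stopwords = ["patient", "history"]
print(mask_stopwords("The patient has a history of asthma.", example_stopwords))
```

In practice the masked text would then be passed through the model's tokenizer, so that predictions with and without the stop words can be compared.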

Dependencies

All packages needed to run the code are available in the default Google Colab environment (see the Colab documentation for the full list), with two exceptions: Hugging Face Transformers (transformers), used for loading transformer models, and Captum (captum), which provides access to a variety of model interpretation tools.
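One way to confirm that the two extra packages are present before running anything is sketched below. The helper name `missing_packages` is an assumption made for this example; the check relies only on the standard library and prints a pip command for whatever is missing.

```python
import importlib.util


def missing_packages(names):
    """Return the subset of `names` that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(["transformers", "captum"])
if missing:
    # In a Colab cell, prefix the command with "!" to run it.
    print("Install with: pip install " + " ".join(missing))
else:
    print("transformers and captum are already available.")
```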

How to run code

There are two ways to run the code: on Google Colab or locally on your machine.

Option 1) Google Colab

Model training notebook: [https://colab.research.google.com/drive/1uPIi-OVchs_8A-SNcQtLfwelr0ccsz19?usp=sharing]

Model evaluation/analysis notebook: [https://colab.research.google.com/drive/1Hfy58JvyPbx55lKKhQAzzrhJIbN_Io0j?usp=sharing]

Option 2) Local Machine

Notebooks: You can run the model_training.ipynb or model_evaluation.ipynb notebooks as is, changing directory paths when needed.
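When adjusting paths for a local run, it may help to define them once at the top of the notebook. The sketch below is only a suggestion: the variable names `PROJECT_ROOT`, `DATA_DIR`, and `MODEL_DIR` and the helper `check_dirs` are hypothetical, and the layout assumes the data and models directories described under Code Organization sit directly below the repository root.

```python
from pathlib import Path

# Hypothetical root: point this at wherever the repository lives on your machine
PROJECT_ROOT = Path(".").resolve()
DATA_DIR = PROJECT_ROOT / "data"      # medical reports, stop words, saved metrics
MODEL_DIR = PROJECT_ROOT / "models"   # BERT and BioBERT checkpoints


def check_dirs(*dirs):
    """Return the directories from `dirs` that do not exist."""
    return [d for d in dirs if not d.is_dir()]


for d in check_dirs(DATA_DIR, MODEL_DIR):
    print(f"Missing directory: {d}")
```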
