Anuvada: Interpretable Models for NLP using PyTorch

So, you want to know why your classifier arrived at a particular decision or why your flashy new deep learning classification model is not performing in the way which you would want it to perform? Or there could be bias in your dataset towards a particular class and you want to understand if there are any such edge cases.

One of the common criticisms of deep learning has been it's black box nature (life itself is a big black box, not at all interpretable, don't even ask me about love). To address this issue, researchers have developed many ways to visualise and explain the inference. It is not necessary that a model has to be explainable, but when important decisions like which jobs to recommend to a person or whether to give a person loan are being made, it would be helpful to cross-check the model's claims. In such domains, self-explainable models are necessary.

This library is an ongoing effort to provide a high-level access to such models by building on top of PyTorch.

Here is what you can expect to visualize from a trained model.

Note: This model is a convolutional neural network trained on IMDB sentiment analysis dataset. I trained the model using SGD till validation loss stopped improving. Here is sensitivity analysis on some sample inputs. You can find more details about training the model in the Jupyter notebooks from the examples directory.

Positive review

Negative review

Installing

Clone this repo and add it to your python library path.

Requirements

PyTorch
NumPy
Pandas
Spacy
Gensim
tqdm

To do list

Acknowledgments

https://github.com/henryre/pytorch-fitmodule

Anuvada: Interpretable Models for NLP using PyTorch

Related tags

Overview

Anuvada: Interpretable Models for NLP using PyTorch

Positive review

Negative review

Installing

Requirements

To do list

Acknowledgments

Owner

EDGE

A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP

A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)

Just a Basic like Language for Zeno INC

Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English

TLA - Twitter Linguistic Analysis

تولید اسم های رندوم فینگیلیش

GSoC'2021 | TensorFlow implementation of Wav2Vec2

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

NSFW A chatbot based on GPT2-chitchat

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

Final Project Bootcamp Zero

Knowledge Oriented Programming Language

Tool which allow you to detect and translate text.

(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.

Japanese NLP Library

Easy to start. Use deep nerual network to predict the sentiment of movie review.

Anuvada: Interpretable Models for NLP using PyTorch

Related tags

Overview

Anuvada: Interpretable Models for NLP using PyTorch

Positive review

Negative review

Installing

Requirements

To do list

Acknowledgments

Owner

EDGE

A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP

A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)

Just a Basic like Language for Zeno INC

Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English

TLA - Twitter Linguistic Analysis

تولید اسم های رندوم فینگیلیش

GSoC'2021 | TensorFlow implementation of Wav2Vec2

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

**NSFW** A chatbot based on GPT2-chitchat

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

Final Project Bootcamp Zero

Knowledge Oriented Programming Language

Tool which allow you to detect and translate text.

(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.

Japanese NLP Library

Easy to start. Use deep nerual network to predict the sentiment of movie review.

NSFW A chatbot based on GPT2-chitchat