A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation for the paper A Structured Self-Attentive Sentence Embedding, which was published in ICLR 2017: https://arxiv.org/abs/1703.03130 .

USAGE:

For binary sentiment classification on imdb dataset run : python classification.py "binary"

For multiclass classification on reuters dataset run : python classification.py "multiclass"

You can change the model parameters in the model_params.json file Other tranining parameters like number of attention hops etc can be configured in the config.json file.

If you want to use pretrained glove embeddings , set the use_embeddings parameter to "True" ,default is set to False. Do not forget to download the glove.6B.50d.txt and place it in the glove folder.

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights

Instead of pruning ,used averaging over the sentence embeddings.

Visualization:

After training, the model is tested on 100 test points. Attention weights for the 100 test data are retrieved and used to visualize over the text using heatmaps. A file visualization.html gets saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.

Below is a shot of the visualization on few datapoints.

Training accuracy 93.4% Tested on 1000 points with 90.2% accuracy

A Structured Self-attentive Sentence Embedding

Related tags

Overview

Structured Self-attentive sentence embeddings

USAGE:

Implemented:

Visualization:

Owner

Kaushal Shetty

An evaluation toolkit for voice conversion models.

A number of methods in order to perform Natural Language Processing on live data derived from Twitter

A flask application to predict the speech emotion of any .wav file.

Sploitus - Command line search tool for sploitus.com. Think searchsploit, but with more POCs

A simple chatbot based on chatterbot that you can use for anything has basic features

String Gen + Word Checker

Transformer training code for sequential tasks

Multilingual word vectors in 78 languages

Implementation of Multistream Transformers in Pytorch

Speech Recognition Database Management with python

An assignment on creating a minimalist neural network toolkit for CS11-747

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

VoiceFixer VoiceFixer is a framework for general speech restoration.

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

AMUSE - financial summarization

Dope Wars game engine on StarkNet L2 roll-up

Source code of the "Graph-Bert: Only Attention is Needed for Learning Graph Representations" paper

뉴스 도메인 질의응답 시스템 (21-1학기 졸업 프로젝트)

COVID-19 Chatbot with Rasa 2.0: open source conversational AI