Grading tools for Advanced NLP (11-711)

Installation

You'll need docker and unzip to use this repo. For docker, visit the official guide to get started. For unzip, you can install it on ubuntu via sudo apt-get install unzip.

Install the python package by

git clone https://github.com/ProKil/anlp-grading-tools
cd anlp-grading-tools
pip install -e .

Usage

To evaluate your code, you'll need to change the environment variables in test.sh.

ANLP_TMP_DIR: mkdir a new folder, e.g. mkdir tmp, and point this variable to the absolute path of the tmp folder.

SUBMISSION_DIR: this should point to the folder containing your submission zip file. Note that the toolkit will automatically evaluate all zip files in the folder.

SCORES_DIR: this should point to an empty folder. Your score will be logged in a text file there.

DATA_DIR: this should point to the data folder of minnn-assignment. Please copy the original minnn-assignment/classifier.py to minnn-assignment/data/classifier_orig.py to test if your code can be executed with the original classifier.

Example code to prepare the folders:

mkdir tmp
mkdir scores
cp -r path/to/minnn-assignment/data ./
cp path/to/minnn-assignment/classifier.py data/classifier_orig.py
mkdir submission
cp your/submission.zip submission

Now you can evaluate your code through bash test.sh, after which your scores are at SCORES_DIR/andrewid. It is normal to get 0s for the last two (correct labels for the imdb test set are not available), but you should get reasonable accuracies for the first two (~40).

Troubleshooting

You may find writing files inside ANLP_TMP_DIR and SCORE_DIR requiring permission. You can either use sudo or log into docker through docker run -v FOLDER_TO_WRITE:/mnt -it --entrypoint /bin/bash anlp and cd /mnt to write those files.
You may experience other permission issues with docker. Please refer to this page to use docker without sudo.

Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

Related tags

Overview

Grading tools for Advanced NLP (11-711)

Installation

Usage

Troubleshooting

Owner

Hao Zhu

End-to-end MLOps pipeline of a BERT model for emotion classification.

Conditional probing: measuring usable information beyond a baseline

Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization

FireFlyer Record file format, writer and reader for DL training samples.

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

translate using your voice

Text-Based zombie apocalyptic decision-making game in Python

TalkNet: Audio-visual active speaker detection Model

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

BERTopic is a topic modeling technique that leverages 🤗 transformers and c-TF-IDF to create dense clusters allowing for easily interpretable topics whilst keeping important words in the topic descriptions

Labelling platform for text using distant supervision

Traditional Chinese Text Recognition Dataset: Synthetic Dataset and Labeled Data

PyABSA - Open & Efficient for Framework for Aspect-based Sentiment Analysis

A curated list of efficient attention modules

Chinese NER with albert/electra or other bert descendable model (keras)

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

State of the art faster Natural Language Processing in Tensorflow 2.0 .

This repository contains Python scripts for extracting linguistic features from Filipino texts.

基于“Seq2Seq+前缀树”的知识图谱问答

Code for our paper "Transfer Learning for Sequence Generation: from Single-source to Multi-source" in ACL 2021.