FewShotText

This repository contains the code for the paper "A Neural Few-Shot Text Classification Reality Check".

Environment setup

# Create environment
python3 -m virtualenv .venv --python=python3.6

# Install environment
.venv/bin/pip install -r requirements.txt

# Activate environment
source .venv/bin/activate

Fine-tuning BERT on the MLM task

model_name=bert-base-cased
block_size=256
dataset=OOS
output_dir=transformer_models/${dataset}/fine-tuned

python scripts_transformers/run_language_modeling.py \
        --model_name_or_path ${model_name} \
        --output_dir ${output_dir} \
        --mlm \
        --do_train \
        --train_data_file data/${dataset}/full/full-train.txt  \
        --do_eval \
        --eval_data_file data/${dataset}/full/full-test.txt \
        --overwrite_output_dir \
        --evaluate_during_training \
        --logging_steps=1000 \
        --line_by_line \
        --logging_dir ${output_dir} \
        --block_size ${block_size} \
        --save_steps=1000 \
        --num_train_epochs 20 \
        --save_total_limit 20 \
        --seed 42
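
The command above reads its training and evaluation text from data/${dataset}/full/. As a minimal sketch, the check below confirms that both files exist before a 20-epoch run is launched. The check_inputs helper is hypothetical and not part of this repository; the file paths are the ones used in the command above.

```shell
# Hypothetical helper (not part of this repository):
# reports every missing file and returns non-zero if any is missing.
check_inputs() {
    _missing=0
    for f in "$@"; do
        if [ ! -f "$f" ]; then
            echo "missing input file: $f" >&2
            _missing=1
        fi
    done
    return $_missing
}

dataset=OOS
check_inputs \
    "data/${dataset}/full/full-train.txt" \
    "data/${dataset}/full/full-test.txt" \
    || echo "Prepare the data files before running run_language_modeling.py" >&2
```

Running this from the repository root prints one line per missing file to stderr; when both files are present it prints nothing.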

Training a few-shot model

To reproduce the paper's experiments, run the utils/scripts/runner.sh script.

Reference

If you use the data or code in this repository, please cite our paper:

@article{dopierre2021neural,
    title={A Neural Few-Shot Text Classification Reality Check},
    author={Dopierre, Thomas and Gravier, Christophe and Logerais, Wilfried},
    journal={arXiv preprint arXiv:2101.12073},
    year={2021}
}

Library of various Few-Shot Learning frameworks for text classification
