MetaNLI

Meta learning algorithms to train cross-lingual NLI (multi-task) models

Train (source task)

Reptile

To train the model using Reptile algorithm, run the command below:

python reptile.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --queue_len 4 \
    --temp 5.0 \
    --epochs 1 \
    --meta_lr 1e-5 \
    --scheduler \
    --gamma 0.5 \
    --step_size 4000 \
    --shot 4 \
    --meta_iteration 8000 \
    --log_interval 300

Prototypical

To train the model using Prototypical Networks algorithm, run the command below:

python prototype.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --target_task sc_fa \
    --epochs 1 \
    --meta_lr 1e-5 \
    --lambda_1 1 \
    --lambda_2 1 \
    --scheduler \
    --gamma 0.5 \
    --step_size 1000 \
    --shot 8 \
    --query_num 0 \
    --target_shot 8 \
    --meta_iteration 2500 \
    --log_interval 50

Zero-shot Test (on target task)

To perform a zero-shot test of the trained model on the target task, run the command below:

python zeroshot.py \
    --load saved/model_sc.pt \
    --task sc_fa

Fine-tune (target task)

To fine-tune the trained model on the target task, run the command below:

python finetune.py \
    --save saved \
    --model_filename fine.pt \
    --load saved/model_sc.pt \
    --task sc_fa \
    --epochs 5 \
    --lr 1e-5

Meta learning algorithms to train cross-lingual NLI (multi-task) models

Related tags

Overview

MetaNLI

Train (source task)

Reptile

Prototypical

Zero-shot Test (on target task)

Fine-tune (target task)

Owner

M.Hassan Mojab

SDL: Synthetic Document Layout dataset

Translate U is capable of translating the text present in an image from one language to the other.

DeLighT: Very Deep and Light-Weight Transformers

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

IMS-Toucan is a toolkit to train state-of-the-art Speech Synthesis models

Python library for Serbian Natural language processing (NLP)

Sequence Modeling with Structured State Spaces

天池中药说明书实体识别挑战冠军方案；中文命名实体识别；NER; BERT-CRF & BERT-SPAN & BERT-MRC；Pytorch

profile tools for pytorch nn models

Chinese Grammatical Error Diagnosis

CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training

Phrase-Based & Neural Unsupervised Machine Translation

A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Wrapper to display a script output or a text file content on the desktop in sway or other wlroots-based compositors

:P Some basic stuff I'm gonna use for my upcoming Agile Software Development and Devops

This repository contains (not all) code from my project on Named Entity Recognition in philosophical text

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.