This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

A quantum game modeling of pandemic (QHack 2022)

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

Equivariant Imaging: Learning Beyond the Range Space

InsTrim: Lightweight Instrumentation for Coverage-guided Fuzzing

This Jupyter notebook shows one way to implement a simple first-order low-pass filter on sampled data in discrete time.

This repository contains code to run experiments in the paper "Signal Strength and Noise Drive Feature Preference in CNN Image Classifiers."

CS50x-AI - Artificial Intelligence with Python from Harvard University

Deep Multimodal Neural Architecture Search

Towards Open-World Feature Extrapolation: An Inductive Graph Learning Approach

Sinkformers: Transformers with Doubly Stochastic Attention

Data for "Driving the Herd: Search Engines as Content Influencers" paper

Multi-Anchor Active Domain Adaptation for Semantic Segmentation (ICCV 2021 Oral)

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

This repository contains the code and models necessary to replicate the results of paper: How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Correspondence) and pre-trained model on ImageNet dataset

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark (ICCV 2021)

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Curved Projection Reformation

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line