Black-Box-Tuning

Source code for paper "Black-Box Tuning for Language-Model-as-a-Service".

Being busy recently, the code in this repo and this tutorial will be very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple, you can check our code and easily implement it in your own environment. Or you can create a new environment to run our implementation, which is based on Nevergrad, Transformers and FastNLP. Optionally, we use fitlog to monitor experimental results. You can uncomment the fitlog-related lines in our code to use it.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install sklearn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results will be saved in a directory named results/. In general, you will obtain the following results:

SST-2 split	Best Accuracy
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments of bbt.py, for example,

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-as-Service}, 
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Related tags

Overview

Black-Box-Tuning

Prepare your environment

Optimize your prompt without gradients

Cite

Owner

Tianxiang Sun

Planner_backend - Academic planner application designed for students and counselors.

Simulation-based performance analysis of server-less Blockchain-enabled Federated Learning

A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN)

Clean Machine Learning, a Coding Kata

Memory-Augmented Model Predictive Control

[WACV 2022] Contextual Gradient Scaling for Few-Shot Learning

A High-Performance Distributed Library for Large-Scale Bundle Adjustment

Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentation"

PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability

End-To-End Optimization of LiDAR Beam Configuration

A Python Reconnection Tool for alt:V

Embeds a story into a music playlist by sorting the playlist so that the order of the music follows a narrative arc.

PyTorch implementation for 3D human pose estimation

DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment

【Arxiv】Exploring Separable Attention for Multi-Contrast MR Image Super-Resolution

Transferable Unrestricted Attacks, which won 1st place in CVPR’21 Security AI Challenger: Unrestricted Adversarial Attacks on ImageNet.

Orthogonal Over-Parameterized Training

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

TACTO: A Fast, Flexible and Open-source Simulator for High-Resolution Vision-based Tactile Sensors

Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning models.