Fastseq

基于ONNXRUNTIME的文本生成加速框架

1. 环境配置

# 创建onnx conda环境
conda create -n onnx_py38 python=3.8
conda activate onnx_py38
conda install pytorch cudatoolkit=10.2 -c pytorch

# 安装onnxruntime-gpu(目前只有1.5.2版本测试成功)
pip install onnxruntime-gpu==1.5.2

# 安装transformers==3.1.0版本
pip install transformers==3.1.0

2. ONNX转换

# 将huggingface保存的 模型/checkpoint 转换为onnx格式。这里使用onnxruntime自带的转换工具。
python -m onnxruntime.transformers.convert_to_onnx \
    -m "path_to_checkpoint/model_name(gpt2)" \
    --model_class GPT2LMHeadModel \
    --output gpt2_fp32.onnx \
    -p fp32

3. DEMO测试

CUDA_VISIBLE_DEVICES=3 python demo.py \
    --onnx_model_path "./gpt2_fp32.onnx" \
    --model_name_or_path "path_to_checkpoint" \
    --prompt_text "here is an example of gpt2 model" \
    --do_sample_top_k 5

Fastseq 基于ONNXRUNTIME的文本生成加速框架

Related tags

Overview

Fastseq

1. 环境配置

2. ONNX转换

3. DEMO测试

4. TODO

Owner

Jun Gao

Simple Annotated implementation of GPT-NeoX in PyTorch

Wind Speed Prediction using LSTMs in PyTorch

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Persian Bert For Long-Range Sequences

Python functions for summarizing and improving voice dictation input.

a test times augmentation toolkit based on paddle2.0.

Chinese Grammatical Error Diagnosis

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

FedNLP: A Benchmarking Framework for Federated Learning in Natural Language Processing

File-based TF-IDF: Calculates keywords in a document, using a word corpus.

ChatBotProyect - This is an unfinished project about a simple chatbot.

Scene Text Retrieval via Joint Text Detection and Similarity Learning

Automatic privilege escalation for misconfigured capabilities, sudo and suid binaries

Translate - a PyTorch Language Library

GPT-3 command line interaction

Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.

STS Benchmark comprises a selection of the English datasets used in the STS tasks organized in the context of SemEval between 2012 and 2017. The selection of datasets include text from image captions, news headlines and user forums.

Speech Recognition Database Management with python

Material for GW4SHM workshop, 16/03/2022.

Harvis is designed to automate your C2 Infrastructure.