CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

此版本基于Pytorch后端的huggingface进行实现。由于此实现使用了Oneflow的dataloader作为数据读入的方式，因此也需要安装Oneflow。其它框架的数据读取可以参考OneflowDataloaderToPytorchDataset类的实现。

使用说明

安装依赖（前置要求：已在环境中安装好Pytorch和Oneflow）

pip install transformers pandas
git clone https://github.com/tea321000/hugging_face_competition
cd hugging_face_competition

运行train_BERT_base.sh和train_BERT_large.sh 单机单卡的baseline。保持其它参数不变，通过调节shell文件里的hidden_size参数，即可观察不同hidden_size所占显存的变化（可通过watch -n 0.1 nvidia-smi直观观察）

python train.py \
--ofrecord_path sample_seq_len_512_example \
--lr 1e-4 --epochs 10 \
--train_batch_size 2 \
--seq_length=512 \
--max_predictions_per_seq=80 \
--num_hidden_layers=24 \
--num_attention_heads=16 \
--hidden_size=1024 \#要调节的参数
--vocab_size=30522

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

Related tags

Overview

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

使用说明

Owner

Ziqi Zhou

WikiPron - a command-line tool and Python API for mining multilingual pronunciation data from Wiktionary

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Beyond the Imitation Game collaborative benchmark for enormous language models

Py65 65816 - Add support for the 65C816 to py65

Kurumi ChatBot

Code for EMNLP'21 paper "Types of Out-of-Distribution Texts and How to Detect Them"

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

Code for Discovering Topics in Long-tailed Corpora with Causal Intervention.

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

Blender addon - Scrub timeline from viewport with a shortcut

내부 작업용 django + vue(vuetify) boilerplate. 짠 하면 돌아감.

Auto translate textbox from Japanese to English or Indonesia

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

NLP: SLU tagging

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Materials (slides, code, assignments) for the NYU class I teach on NLP and ML Systems (Master of Engineering).

iBOT: Image BERT Pre-Training with Online Tokenizer

Creating a python chatbot that Starbucks users can text to place an order + help cut wait time of a normal coffee.

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Search for documents in a domain through Google. The objective is to extract metadata