Pytorch-Named-Entity-Recognition-with-BERT

Last update: Dec 25, 2022

Overview

BERT NER

Use google BERT to do CoNLL-2003 NER !

Train model using Python and Inference using C++

ALBERT-TF2.0

BERT-NER-TENSORFLOW-2.0

BERT-SQuAD

Requirements

python3
pip3 install -r requirements.txt

Run

python run_ner.py --data_dir=data/ --bert_model=bert-base-cased --task_name=ner --output_dir=out_base --max_seq_length=128 --do_train --num_train_epochs 5 --do_eval --warmup_proportion=0.1

Result

BERT-BASE

Validation Data

             precision    recall  f1-score   support

        PER     0.9677    0.9745    0.9711      1842
        LOC     0.9654    0.9711    0.9682      1837
       MISC     0.8851    0.9111    0.8979       922
        ORG     0.9299    0.9292    0.9295      1341

avg / total     0.9456    0.9534    0.9495      5942

Test Data

             precision    recall  f1-score   support

        PER     0.9635    0.9629    0.9632      1617
        ORG     0.8883    0.9097    0.8989      1661
        LOC     0.9272    0.9317    0.9294      1668
       MISC     0.7689    0.8248    0.7959       702

avg / total     0.9065    0.9209    0.9135      5648

Pretrained model download from here

BERT-LARGE

Validation Data

             precision    recall  f1-score   support

        ORG     0.9288    0.9441    0.9364      1341
        LOC     0.9754    0.9728    0.9741      1837
       MISC     0.8976    0.9219    0.9096       922
        PER     0.9762    0.9799    0.9781      1842

avg / total     0.9531    0.9606    0.9568      5942

Test Data

             precision    recall  f1-score   support

        LOC     0.9366    0.9293    0.9329      1668
        ORG     0.8881    0.9175    0.9026      1661
        PER     0.9695    0.9623    0.9659      1617
       MISC     0.7787    0.8319    0.8044       702

avg / total     0.9121    0.9232    0.9174      5648

Pretrained model download from here

Inference

from bert import Ner

model = Ner("out_base/")

output = model.predict("Steve went to Paris")

print(output)
'''
    [
        {
            "confidence": 0.9981840252876282,
            "tag": "B-PER",
            "word": "Steve"
        },
        {
            "confidence": 0.9998939037322998,
            "tag": "O",
            "word": "went"
        },
        {
            "confidence": 0.999891996383667,
            "tag": "O",
            "word": "to"
        },
        {
            "confidence": 0.9991968274116516,
            "tag": "B-LOC",
            "word": "Paris"
        }
    ]
'''

Inference C++

Pretrained and converted bert-base model download from here

Download libtorch from here

install cmake, tested with cmake version 3.10.2
unzip downloaded model and libtorch in BERT-NER

Compile C++ App

  cd cpp-app/
  cmake -DCMAKE_PREFIX_PATH=../libtorch

make

Runing APP
```
   ./app ../base
```

NB: Bert-Base C++ model is split in to two parts.

Bert Feature extractor and NER classifier.
This is done because jit trace don't support input depended for loop or if conditions inside forword function of model.

Deploy REST-API

BERT NER model deployed as rest api

python api.py

API will be live at 0.0.0.0:8000 endpoint predict

cURL request

curl -X POST http://0.0.0.0:8000/predict -H 'Content-Type: application/json' -d '{ "text": "Steve went to Paris" }'

Output

{
    "result": [
        {
            "confidence": 0.9981840252876282,
            "tag": "B-PER",
            "word": "Steve"
        },
        {
            "confidence": 0.9998939037322998,
            "tag": "O",
            "word": "went"
        },
        {
            "confidence": 0.999891996383667,
            "tag": "O",
            "word": "to"
        },
        {
            "confidence": 0.9991968274116516,
            "tag": "B-LOC",
            "word": "Paris"
        }
    ]
}

Pytorch-Named-Entity-Recognition-with-BERT

Related tags

Overview

BERT NER

Requirements

Run

Result

BERT-BASE

Validation Data

Test Data

Pretrained model download from here

BERT-LARGE

Validation Data

Test Data

Pretrained model download from here

Inference

Inference C++

Pretrained and converted bert-base model download from here

Download libtorch from here

Deploy REST-API

cURL request

cURL

Postman

C++ unicode support

Tensorflow version

Owner

Kamal Raj

My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensorflow

A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.

Code for "Generating Disentangled Arguments with Prompts: a Simple Event Extraction Framework that Works"

BiQE: Code and dataset for the BiQE paper

Tools and data for measuring the popularity & growth of various programming languages.

Pre-Training with Whole Word Masking for Chinese BERT

Under the hood working of transformers, fine-tuning GPT-3 models, DeBERTa, vision models, and the start of Metaverse, using a variety of NLP platforms: Hugging Face, OpenAI API, Trax, and AllenNLP

Mednlp - Medical natural language parsing and utility library

A Python module made to simplify the usage of Text To Speech and Speech Recognition.

ADCS cert template modification and ACL enumeration

TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

Yes it's true :broken_heart:

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Harvis is designed to automate your C2 Infrastructure.

Scene Text Retrieval via Joint Text Detection and Similarity Learning

NLP made easy

Minimal GUI for accessing the Watson Text to Speech service.

German Text-To-Speech Engine using Tacotron and Griffin-Lim

:P Some basic stuff I'm gonna use for my upcoming Agile Software Development and Devops