A python package to fine-tune transformer-based models for named entity recognition (NER).

Last update: Jul 30, 2022

Related tags

Text Data & NLP nerblackbox

Overview

nerblackbox

A python package to fine-tune transformer-based language models for named entity recognition (NER).

https://coveralls.io/repos/github/af-ai-center/nerblackbox/badge.svg?branch=master

Resources

Source Code: https://github.com/af-ai-center/nerblackbox
Documentation: https://af-ai-center.github.io/nerblackbox
PyPI: https://pypi.org/project/nerblackbox

About

Transformer-based language models like BERT have had a game-changing impact on Natural Language Processing.

In order to utilize Hugging Face's publicly accessible pretrained models for Named Entity Recognition, one needs to retrain (or "fine-tune") them using labeled text.

nerblackbox makes this easy.

You give it

a Dataset (labeled text)
a Pretrained Model (transformers)

and you get

the best Fine-tuned Model
its Performance on the dataset

Installation

pip install nerblackbox

Usage

see documentation: https://af-ai-center.github.io/nerblackbox

Citation

@misc{nerblackbox,
  author = {Stollenwerk, Felix},
  title  = {nerblackbox: a python package to fine-tune transformer-based language models for named entity recognition},
  year   = {2021},
  url    = {https://github.com/af-ai-center/nerblackbox},
}

A python package to fine-tune transformer-based models for named entity recognition (NER).

Related tags

Overview

nerblackbox

Resources

About

Installation

Usage

Citation

Owner

Felix Stollenwerk

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensorflow

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Uncomplete archive of files from the European Nopsled Team

Bnagla hand written document digiiztion

超轻量级bert的pytorch版本，大量中文注释，容易修改结构，持续更新

Spacy-ginza-ner-webapi - Named Entity Recognition API with spaCy and GiNZA

Tracking Progress in Natural Language Processing

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

An Explainable Leaderboard for NLP

中文医疗信息处理基准CBLUE: A Chinese Biomedical LanguageUnderstanding Evaluation Benchmark

An open source library for deep learning end-to-end dialog systems and chatbots.

Making text a first-class citizen in TensorFlow.

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Natural language computational chemistry command line interface.

CCKS-Title-based-large-scale-commodity-entity-retrieval-top1

RuCLIP-SB (Russian Contrastive Language–Image Pretraining SWIN-BERT) is a multimodal model for obtaining images and text similarities and rearranging captions and pictures. Unlike other versions of the model we use BERT for text encoder and SWIN transformer for image encoder.

Datasets of Automatic Keyphrase Extraction

Google AI 2018 BERT pytorch implementation