RaceBERT -- A transformer based model to predict race and ethnicty from names

Last update: Nov 02, 2022

Related tags

Overview

RaceBERT -- A transformer based model to predict race and ethnicty from names

Installation

pip install racebert

Using a virtual environment is highly recommended! You may need to install pytorch as instructed here: https://pytorch.org/get-started/locally/

Paper

Todo

Usage

raceBERT predicts race (U.S census race) and ethnicity from names.

from racebert import RaceBERT

model = RaceBERT()

# To predict race
model.predict_race("Barack Obama")

>>> {"label": "nh_black", "score": 0.5196923613548279}

The race categories are:

Race	Label
Non-hispanic White	nh_white
Hispanic	hispanic
Non-hispanic Black	nh_black
Asian & Pacific Islander	api
American Indian & Alaskan Native	aian

# Predict ethnicity
model.predict_ethnicty("Arjun Gupta")

>>> {"label": "Asian,IndianSubContinent", "score": 0.9612812399864197}

The ethnicity categories are:

Ethnicity
GreaterEuropean,British
GreaterEuropean,WestEuropean,French
GreaterEuropean,WestEuropean,Italian
GreaterEuropean,WestEuropean,Hispanic
GreaterEuropean,Jewish
GreaterEuropean,EastEuropean
Asian,IndianSubContinent
Asian,GreaterEastAsian,Japanese
GreaterAfrican,Muslim
Asian,GreaterEastAsian,EastAsian
GreaterEuropean,WestEuropean,Nordic
GreaterEuropean,WestEuropean,Germanic
GreaterAfrican,Africans

GPU

If you have a GPU, you can speed up the computation by specifying the CUDA device when you instantiate the model.

from racebert import RaceBERT

model = RaceBERT(device=0)

# predict race in batch
model.predict_race(["Barack Obama", "George Bush"])

>>>
[
        {"label": "nh_black", "score": 0.5196923613548279},
        {"label": "nh_white", "score": 0.8365859389305115}
]

# predict ethnicity in batch
model.predict_ethnicity(["Barack Obama", "George Bush"])

HuggingFace

Alternatively, you can work with the transformers models hosted on the huggingface hub directly.

Race Model: https://huggingface.co/pparasurama/raceBERT
Ethnicity Model: https://huggingface.co/pparasurama/raceBERT-ethnicity

Please refer to the transformers documentation.

RaceBERT -- A transformer based model to predict race and ethnicty from names

Related tags

Overview

RaceBERT -- A transformer based model to predict race and ethnicty from names

Installation

Paper

Usage

GPU

HuggingFace

Owner

Prasanna Parasurama

Self-Supervised Learning

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Scalable implementation of Lee / Mykland (2012) and Ait-Sahalia / Jacod (2012) Jump tests for noisy high frequency data

This is the accompanying toolbox for the paper "A Survey on GANs for Anomaly Detection"

This repository contains tutorials for the py4DSTEM Python package

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

Solutions of Reinforcement Learning 2nd Edition

PyTorch implementation for COMPLETER: Incomplete Multi-view Clustering via Contrastive Prediction (CVPR 2021)

Simple-Neural-Network From Scratch in Python

Iran Open Source Hackathon

Improved Fitness Optimization Landscapes for Sequence Design

A deep learning based semantic search platform that computes similarity scores between provided query and documents

The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.

Awesome Remote Sensing Toolkit based on PaddlePaddle.

Fully Connected DenseNet for Image Segmentation

Plotting points that lie on the intersection of the given curves using gradient descent.

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021)

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Privacy-Preserving Portrait Matting [ACM MM-21]

Training a deep learning model on the noisy CIFAR dataset