Python library for parsing resumes using natural language processing and machine learning

Last update: Jul 29, 2021

Overview

CVParser

Python library for parsing resumes using natural language processing and machine learning.

Setup

Installation on Linux and Mac OS

Follow the guide here on how to clone or fork a repo
Follow the guide here on how to create virtualenv

To create a normal virtualenv (example myvenv) and activate it (see Code below).

$ virtualenv --python=python3 myvenv

$ source myvenv/bin/activate

(myvenv) $ pip install -r requirements.txt

Usage

from cvparser.parser import CVParser

CVParser.download_nlk_data()


parser = CVParser(file_path="path/to/file.[pdf|doc|docx|png|jpeg]")
parser.parse()
print(parser.json())

Re-training the Model

cd into the train folder.
Delete the folder model and the file train.json.
Copy your new training data into the train folder. The train data must be in json. This can be generated using the data annotation tool called Dataturk. The file containing the training data must be named train.json.
Then, start re-training the model by execute the python script in the train folder named manual_training.py.
Then test your new model by #usage .

Python library for parsing resumes using natural language processing and machine learning

Related tags

Overview

CVParser

Setup

Installation on Linux and Mac OS

Usage

Re-training the Model

Owner

nafiu

DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

PyWorld3 is a Python implementation of the World3 model

[KBS] Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks

NLPretext packages in a unique library all the text preprocessing functions you need to ease your NLP project.

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

InferSent sentence embeddings

Chatbot for the Chatango messaging platform

Python library for parsing resumes using natural language processing and machine learning

Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.

Enterprise Scale NLP with Hugging Face & SageMaker Workshop series

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

Problem: Given a nepali news find the category of the news

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Perform sentiment analysis on textual data that people generally post on websites like social networks and movie review sites.

HAN2HAN : Hangul Font Generation

Mapping a variable-length sentence to a fixed-length vector using BERT model

Active learning for text classification in Python

2021 AI CUP Competition on Traditional Chinese Scene Text Recognition - Intermediate Contest

A high-level yet extensible library for fast language model tuning via automatic prompt search