Natural Language Processing

Last update: Oct 31, 2021

Related tags

Text Data & NLP, NLP

Overview

NLP

Natural Language Processing apps

Multilingual_NLP.py ################################################## start

#This script is a demonstration of a multilingual Natural Language Processing app built mainly with Stanza and Streamlit.

Documentation link for Stanza: https://stanfordnlp.github.io/stanza/

Dependencies can be installed using the commands below:

pip install streamlit==1.1.0
pip install stanza==1.3.0
pip install mtranslate==1.8
pip install PyAutoGUI==0.9.53
pip install pandas==1.2.4
pip install nltk==3.6.2
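Because the versions above are pinned, it can help to confirm that the active environment matches them before launching the app. The following is a minimal sketch using only the standard library; the names `PINNED` and `check_pins` are hypothetical and are not part of Multilingual_NLP.py.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the pip commands listed above.
PINNED = {
    "streamlit": "1.1.0",
    "stanza": "1.3.0",
    "mtranslate": "1.8",
    "PyAutoGUI": "0.9.53",
    "pandas": "1.2.4",
    "nltk": "3.6.2",
}


def check_pins(pins, get_version=version):
    """Return {package: (expected, found)} for every package that does not match.

    `found` is None when the package is not installed at all.
    `get_version` is injectable so the helper can be exercised without
    installing anything (hypothetical convenience, not from the script).
    """
    mismatches = {}
    for name, expected in pins.items():
        try:
            found = get_version(name)
        except PackageNotFoundError:
            found = None
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in check_pins(PINNED).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

An empty result from `check_pins(PINNED)` means every package is installed at the pinned version.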

On Windows, the downloaded language models are stored under: C:\Users \stanza_resources
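To see which models are already present, the resources folder can be inspected directly. The sketch below assumes that Stanza stores models in a `stanza_resources` folder under the user's home directory unless the `STANZA_RESOURCES_DIR` environment variable is set, and that each downloaded language gets its own sub-folder. The function names are hypothetical helpers, not Stanza APIs.

```python
import os


def stanza_resources_dir(env=None, home=None):
    """Best-guess location of the model directory.

    Assumption: STANZA_RESOURCES_DIR overrides the default, which is
    <home>/stanza_resources.
    """
    env = os.environ if env is None else env
    home = os.path.expanduser("~") if home is None else home
    return env.get("STANZA_RESOURCES_DIR") or os.path.join(home, "stanza_resources")


def downloaded_languages(resources_dir):
    """Return the sorted sub-folder names, assumed to be one per downloaded model."""
    if not os.path.isdir(resources_dir):
        return []
    return sorted(
        entry
        for entry in os.listdir(resources_dir)
        if os.path.isdir(os.path.join(resources_dir, entry))
    )


# Example: print(downloaded_languages(stanza_resources_dir()))
```

Plain files in the folder are ignored, so only per-language directories are reported.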

Refer to the Supported_Languages sheet in stanza_supported_languages.xlsx to check which languages you want to download.

#command prompt: sample Python code to download a language model is as follows:

import stanza

For example, to download the language model for Afrikaans, run the command below:

stanza.download('af')

For example, to download the language model for German, run the command below:

stanza.download('de')

To download the multilingual model, run the command below:

stanza.download("multilingual")
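When several languages are needed, the individual `stanza.download` calls above can be wrapped in a loop. This is a minimal sketch, not code from Multilingual_NLP.py: `download_models` and its `download` parameter are hypothetical, and Stanza is imported lazily so the helper can be tried without the package installed.

```python
def download_models(lang_codes, download=None):
    """Download each requested model once and return the codes processed, in order.

    `download` defaults to stanza.download; passing another callable is a
    hypothetical hook for dry runs and testing.
    """
    if download is None:
        import stanza  # lazy import: only needed for real downloads

        download = stanza.download
    done = []
    for code in lang_codes:
        if code in done:
            continue  # skip duplicates so each model is fetched only once
        download(code)
        done.append(code)
    return done


# Example (performs real downloads):
# download_models(["af", "de", "multilingual"])
```

The codes passed in should come from the Supported_Languages sheet mentioned earlier.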

Update the langtable sheet in stanza_supported_languages.xlsx if you wish to add or delete languages. In most cases nlp_langid and transid are the same; where they differ, look up the correct transid online.
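The relationship between the two columns can be sketched as a lookup that falls back to nlp_langid whenever no distinct transid is given. The column names come from the langtable sheet. The function name, the row format (plain dictionaries, for instance as produced from the sheet), and the sample codes are illustrative assumptions and should be verified against the actual sheet.

```python
def build_transid_map(rows):
    """Map each nlp_langid to its translator code.

    `rows` is an iterable of dicts with an 'nlp_langid' key and an optional
    'transid' key. A missing, blank, or non-string transid (such as an empty
    spreadsheet cell) falls back to the nlp_langid itself.
    """
    table = {}
    for row in rows:
        nlp_id = row["nlp_langid"]
        trans_id = row.get("transid")
        if isinstance(trans_id, str) and trans_id.strip():
            table[nlp_id] = trans_id.strip()
        else:
            table[nlp_id] = nlp_id
    return table


# Illustrative rows only; the codes are examples, not copied from the sheet.
sample_rows = [
    {"nlp_langid": "de", "transid": "de"},
    {"nlp_langid": "zh-hans", "transid": "zh-CN"},
    {"nlp_langid": "af"},
]
transid_map = build_transid_map(sample_rows)
```

Keeping the fallback in one place means only languages whose codes actually differ need an explicit transid entry.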

Multilingual_NLP.py ################################################## end

Owner

Ritesh Sharma

Deduplication is the task of combining different representations of the same real-world entity.

Official source for Spanish language models and resources made at BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Coreference resolution for English, German and Polish, optimised for limited training data and easily extensible for further languages

LUKE -- Language Understanding with Knowledge-based Embeddings

Seq2seq attn - Use the Seq2Seq method to implement machine translation and introduce Attention mechanism to improve the results

VampiresVsWerewolves - Our Implementation of a MiniMax algorithm with alpha beta pruning in the context of an in-class competition

Just a basic Telegram AI chat bot written in Python using Pyrogram.

Python bot created with Selenium that can guess the daily Wordle word correctly 96.8% of the time.

Plugin repository for Macast

Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

NLP command-line assistant powered by OpenAI

A python package to fine-tune transformer-based models for named entity recognition (NER).

Train GPT-3 model on V100(16GB Mem) Using improved Transformer.

A centralized web app to ensure road safety by monitoring the driver's activities and activating a label generator using NLP.

Rhythm-Finder is an unsupervised-ML-driven, Python-powered web application that can find songs that suit you.

Chinese generation tasks with MBart, the multilingual denoising pre-trained model

CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training

LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language