A set of methods for performing Natural Language Processing on live data from Twitter

Last update: Nov 24, 2021

Twitter_NLP

Link to Project: https://twitoff-amadou.herokuapp.com/

==Description==

This project integrates several methods for performing Natural Language Processing (NLP) on live data from Twitter. The goal is to demonstrate how NLP can be used at a basic level to classify a piece of text by which Twitter user is most likely to 'tweet' (or post) it. Twitter API access was granted for this project and is used through the Tweepy wrapper for Python.

The web app is built with the Flask framework and deployed on Heroku. Data is extracted from Twitter through its API with the Tweepy library and stored in SQLAlchemy tables backed by our PostgreSQL database. These tables hold the information we're concerned with, such as usernames and past tweets. The spaCy library then vectorizes the tweets into numeric components our models can operate on. Finally, a random forest classifier is trained on these vectors.
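The vectorize-then-classify step can be sketched roughly as follows. This is a minimal illustration, not the project's code: it assumes scikit-learn is installed, and it swaps in scikit-learn's `HashingVectorizer` for spaCy so the example runs without a downloaded language model. The function names `vectorize` and `predict_author` are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_extraction.text import HashingVectorizer

# Stand-in for spaCy (assumption): the project vectorizes tweets with spaCy.
# HashingVectorizer is used here only so the sketch is self-contained.
_vectorizer = HashingVectorizer(n_features=256, alternate_sign=False)


def vectorize(texts):
    """Map a list of strings to a dense 2-D array, one row per text."""
    return _vectorizer.transform(texts).toarray()


def predict_author(user_a, tweets_a, user_b, tweets_b, text):
    """Train on two users' tweets and return the likelier author of `text`."""
    # Stack both users' tweet vectors and label each row with its author.
    X = np.vstack([vectorize(tweets_a), vectorize(tweets_b)])
    y = np.array([user_a] * len(tweets_a) + [user_b] * len(tweets_b))
    model = RandomForestClassifier(n_estimators=100, random_state=0)
    model.fit(X, y)
    return str(model.predict(vectorize([text]))[0])
```

A call such as `predict_author("alice", alice_tweets, "bob", bob_tweets, "some new text")` returns one of the two usernames. With real spaCy vectors, only `vectorize` would need to change.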

The interface of the app is simple. There are two text boxes, one labeled "User to add" and the other "Tweet text to predict". Typing a Twitter username into the 'add' box has Tweepy fetch that user and their tweets and add them to our PostgreSQL database. The random forest then trains live on the stored tweets. Once at least two Twitter users are in the database, one can enter text into the 'predict' box, select the two users to compare, and let the model predict which of them is more likely to have tweeted it.
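The add-then-predict flow described above might look something like the Flask sketch below. Everything project-specific here is an assumption made for illustration: the route paths, the form field names (`user_name`, `user_a`, `user_b`, `tweet_text`), the in-memory `USERS` dict standing in for the PostgreSQL tables, the `fetch_tweets` stub standing in for Tweepy, and the word-overlap `classify` stub standing in for the trained random forest.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

# Hypothetical in-memory stand-in for the PostgreSQL tables: username -> tweets.
USERS = {}


def fetch_tweets(username):
    """Hypothetical placeholder for the Tweepy call that pulls a user's tweets."""
    return [f"sample tweet {i} from {username}" for i in range(3)]


def classify(text, user_a, user_b):
    """Hypothetical placeholder for the model: picks the user whose stored
    tweets share the most words with `text` (ties go to `user_a`)."""
    words = set(text.lower().split())

    def overlap(user):
        return sum(len(words & set(t.lower().split())) for t in USERS[user])

    return user_a if overlap(user_a) >= overlap(user_b) else user_b


@app.route("/user", methods=["POST"])
def add_user():
    """Handle the "User to add" box: fetch and store the user's tweets."""
    name = request.form.get("user_name", "").strip()
    if not name:
        return jsonify(error="user_name is required"), 400
    USERS[name] = fetch_tweets(name)
    return jsonify(added=name, tweet_count=len(USERS[name]))


@app.route("/predict", methods=["POST"])
def predict():
    """Handle the "Tweet text to predict" box for two selected users."""
    user_a = request.form.get("user_a", "")
    user_b = request.form.get("user_b", "")
    text = request.form.get("tweet_text", "")
    # Prediction needs two distinct users that were added beforehand.
    if user_a == user_b or user_a not in USERS or user_b not in USERS:
        return jsonify(error="select two different users that have been added"), 400
    return jsonify(prediction=classify(text, user_a, user_b))
```

The routes can be exercised without running a server through `app.test_client()`, for example by posting to `/user` twice and then to `/predict`.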


(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.

Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

Scikit-learn style model finetuning for NLP

BERN2: an advanced neural biomedical named entity recognition and normalization tool

Smart discord chatbot integrated with Dialogflow to manage different classrooms and assist in teaching!

Easy to use, state-of-the-art Neural Machine Translation for 100+ languages

🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴

PortaSpeech - PyTorch Implementation

A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Honor's thesis project analyzing whether the GPT-2 model can more effectively generate free-verse or structured poetry.

Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch

Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)

Beautiful visualizations of how language differs among document types.

CCF BDCI 2020 real estate industry chat question-answer matching track, leaderboard A rank 47/2985

Code for the paper "Flexible Generation of Natural Language Deductions"

Repository of the Code to Chatbots, developed in Python

Phomber is an information gathering tool that reverse-searches phone numbers and gets their details, written in Python 3.

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

Codename generator using WordNet parts of speech database

VampiresVsWerewolves - Our Implementation of a MiniMax algorithm with alpha beta pruning in the context of an in-class competition