In this project, we compared Spanish BERT and Multilingual BERT in the Sentiment Analysis task.

Last update: Jan 03, 2022

Overview

Applying BERT Fine Tuning to Sentiment Classification on Amazon Reviews

Abstract

Sentiment analysis has made great progress in recent years, due to the fact that companies want to have a better understanding of how their products are classified by their consumers. However, despite the great advances that emerge in the field of artificial intelligence to solve this task, the most robust models are found in the English language. In the present work, we compare two Artificial Intelligence models that have monolingual and Multilingual approaches, which are Spanish BERT and Multilingual BERT, models based on BERT's transformer Architecture, to which the fine tuned technique was applied for the task of Sentiment analysis on the Amazon reviews dataset in Spanish using the accuracy and F1 score metrics. Finally, it was found that the Spanish BERT model has the best results for the sentiment analysis task on the Amazon reviews dataset in Spanish.

this paper is available here

Pipeline

Prerequisites

Linux / Window
Python3

Clone this Repository

git clone https://github.com/alexliqu09/Sentiment-Analysis-on-Amazon-Reviews.git

Train model

If you want to train the models use the colab Notebooks

Beto
MBert

Run the work in local

If you want to proof the work , you should run the following commands:

First , Install requeriments file:

pip install -r requeriments.txt

Second , download the Weights of Beto & MBERT and put them in this directory
Third , Start Streamlit server:

streamlit run main.py

Note:

Local host : http://localhost:8501 
Network URL:  http://192.168.0.5:8501

Run with Docker 🐋

#Bulding docker image 

docker build -t bert .

#RUN container
docker run -t -p 5000:5000 --name betocontainer bert

open http://172.17.0.2:8501

If you find useful our work , please cite this paper:

@inproceedings{@lvrBERT,
  title={Applying BERT Fine Tuning to Sentiment Classification on Amazon Reviews},
  author={Lique, Alexander and Vásquez, Diego and Rios, Manuel },
  year={2021}
}

In this project, we compared Spanish BERT and Multilingual BERT in the Sentiment Analysis task.

Related tags

Overview

Applying BERT Fine Tuning to Sentiment Classification on Amazon Reviews

Abstract

Pipeline

Prerequisites

Clone this Repository

Train model

Run the work in local

Run with Docker 🐋

Owner

Alexander Leonardo Lique Lamas

Materials (slides, code, assignments) for the NYU class I teach on NLP and ML Systems (Master of Engineering).

Learning Spatio-Temporal Transformer for Visual Tracking

My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensorflow

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP

Generate product descriptions, blogs, ads and more using GPT architecture with a single request to TextCortex API a.k.a Hemingwai

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

Chinese version of GPT2 training code, using BERT tokenizer.

Contract Understanding Atticus Dataset

TextFlint is a multilingual robustness evaluation platform for natural language processing tasks,

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Blackstone is a spaCy model and library for processing long-form, unstructured legal text

Simple GUI where you can enter an article and get a crisp summarized version.

NLP codes implemented with Pytorch (w/o library such as huggingface)

Study German declensions (dER nettE Mann, ein nettER Mann, mit dEM nettEN Mann, ohne dEN nettEN Mann ...) Generate as many exercises as you want using the incredible power of SPACY!

Trains an OpenNMT PyTorch model and SentencePiece tokenizer.

Chinese named entity recognization (bert/roberta/macbert/bert_wwm with Keras)

NLTK Source

We have built a Voice based Personal Assistant for people to access files hands free in their device using natural language processing.