Natural Language Processing Specialization

Last update: Oct 06, 2022

Overview

Natural Language Processing Specialization

In this folder, Natural Language Processing Specialization projects and notes can be found.

WHAT I LEARNED

Use logistic regression, naïve Bayes, and word vectors to implement sentiment analysis, complete analogies & translate words.
Use dynamic programming, hidden Markov models, and word embeddings to implement autocorrect, autocomplete & identify part-of-speech tags for words.
Use recurrent neural networks, LSTMs, GRUs & Siamese networks in Trax for sentiment analysis, text generation & named entity recognition.
Use encoder-decoder, causal, & self-attention to machine translate complete sentences, summarize text, build chatbots & question-answering.

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

In the first course of the Natural Language Processing Specialization
I performed sentiment analysis of tweets using logistic regression and then naïve Bayes,
I used vector space models to discover relationships between words and used PCA to reduce the dimensionality of the vector space and visualize those relationships, and
I wrote a simple English to French translation algorithm using pre-computed word embeddings and locality-sensitive hashing to relate words via approximate k-nearest neighbor search.

Projects

Course 2 - Natural Language Processing with Probabilistic Models

In the second course of the Natural Language Processing Specialization
I wrote a simple auto-correct algorithm using minimum edit distance and dynamic programming,
I applied the Viterbi Algorithm for part-of-speech (POS) tagging, which is vital for computational linguistics,
I wrote a better auto-complete algorithm using an N-gram language model, and
I wrote my own Word2Vec model that uses a neural network to compute word embeddings using a continuous bag-of-words model.

Projects

Course 3 - Natural Language Processing with Sequence Models

In the third course of the Natural Language Processing Specialization
I trained a neural network with GLoVe word embeddings to perform sentiment analysis of tweets,
I generated synthetic Shakespeare text using a Gated Recurrent Unit (GRU) language model,
I trained a recurrent neural network to perform named entity recognition (NER) using LSTMs with linear layers, and
I used so-called ‘Siamese’ LSTM models to compare questions in a corpus and identify those that are worded differently but have the same meaning.

Projects

Course 4 - Natural Language Processing with Attention Models

In the fourth course of the Natural Language Processing Specialization
I translated complete English sentences into German using an encoder-decoder attention model,
I built a Transformer model to summarize text,
I used T5 and BERT models to perform question-answering, and
I built a chatbot using a Reformer model.

Projects

Disclaimer

DeepLearning.AI makes course notes available for educational purposes.
Project solutions are just for educational purposes. I highly recommend trying and solving project/program assignments on your own.

All the best 🤘

Natural Language Processing Specialization

Related tags

Overview

Natural Language Processing Specialization

WHAT I LEARNED

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

Projects

Course 2 - Natural Language Processing with Probabilistic Models

Projects

Course 3 - Natural Language Processing with Sequence Models

Projects

Course 4 - Natural Language Processing with Attention Models

Projects

Disclaimer

Owner

Kaan BOKE

Contains links to publicly available datasets for modeling health outcomes using speech and language.

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

ChatterBot is a machine learning, conversational dialog engine for creating chat bots

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

A script that automatically creates a branch name using google translation api and jira api

Python module (C extension and plain python) implementing Aho-Corasick algorithm

wxPython app for converting encodings, modifying and fixing SRT files

JaQuAD: Japanese Question Answering Dataset

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

VMD Audio/Text control with natural language

One Stop Anomaly Shop: Anomaly detection using two-phase approach: (a) pre-labeling using statistics, Natural Language Processing and static rules; (b) anomaly scoring using supervised and unsupervised machine learning.

Nested Named Entity Recognition

A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)

Subtitle Workshop (subshop): tools to download and synchronize subtitles

Write Alphabet, Words and Sentences with your eyes.

YACLC - Yet Another Chinese Learner Corpus

A simple word search made in python

Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing