Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

Last update: Nov 16, 2022

Overview

Neural G2P to portuguese language

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly essential role for natural language processing, text-to-speech synthesis and automatic speech recognition systems. This project was adapted from https://github.com/hajix/G2P.

Dependencies

The following libraries are used:
pytorch
tqdm
matplotlib

Install dependencies using pip:

pip3 install -r requirements.txt

Dataset

The dataset used here was taken from site http://www.portaldalinguaportuguesa.org/, as well as some insertions made by me so that the dataset would give more coverage to common words in the daily life of the Brazilian Portuguese. Some ambiguities were also resolved as the intent of this dataset is to contain a specific speaker bias. The dictionary based on São Paulo speakers was chosen.

As in https://github.com/hajix/G2P, on which this implementation was based, you could easily provide and use your own language specific pronunciatin doctionary for training G2P. More details about data preparation and contribution could be found in resources.
Feel free to provide resources for other languages.

Attention Model

Both encoder-decoder seq2seq model and attention model could handle G2P problem. Here we train attention based model. The encoder model get sequence of graphemes and produces states at each timestep. Encoder states used during attention decoding. The decoder attends to appropriate encoder state (according to its state) and produces phonemes.

Train

To start training the model run:

python train.py

You can also use tensorboard to check the training loss:

tensorboard --logdir log --bind_all

Training parameters could be found at config.py.

Inference

To get pronunciation of a word:

# PT-BR example
python inference.py --sentence 'olá, vamos testar esse projeto.'
o|l|a| |,| |v|a|m|ʊ|s| |t|e|s|t|a| |e|s|i| |p|ɾ|o|ʒ|e|t|ʊ| |.

You could also visualize the attention weights, using --visualize:

# PT-BR example
python inference.py --visualize --sentence 'olá, vamos testar esse projeto.'
o|l|a| |,| |v|a|m|ʊ|s| |t|e|s|t|a| |e|s|i| |p|ɾ|o|ʒ|e|t|ʊ| |.

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

Related tags

Overview

Neural G2P to portuguese language

Dependencies

Dataset

Attention Model

Train

Inference

Owner

fluz

Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch

Basic Utilities for PyTorch Natural Language Processing (NLP)

Study German declensions (dER nettE Mann, ein nettER Mann, mit dEM nettEN Mann, ohne dEN nettEN Mann ...) Generate as many exercises as you want using the incredible power of SPACY!

BookNLP, a natural language processing pipeline for books

The ability of computer software to identify words and phrases in spoken language and convert them to human-readable text

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

Data loaders and abstractions for text and NLP

AI-Broad-casting - AI Broad casting with python

📔️ Generate a text-based journal from a template file.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

Edge-Augmented Graph Transformer

Pytorch implementation of Tacotron

NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT

Shared code for training sentence embeddings with Flax / JAX

NeMo: a toolkit for conversational AI

Persian Bert For Long-Range Sequences

Fast, general, and tested differentiable structured prediction in PyTorch

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

L3Cube-MahaCorpus a Marathi monolingual data set scraped from different internet sources.

nlp基础任务

Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form.

Related tags

Overview

Neural G2P to portuguese language

Dependencies

Dataset

Attention Model

Train

Inference

Owner

fluz

Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch

Basic Utilities for PyTorch Natural Language Processing (NLP)

Study German declensions (dER nettE Mann, ein nettER Mann, mit dEM nettEN Mann, ohne dEN nettEN Mann ...) Generate as many exercises as you want using the incredible power of SPACY!

BookNLP, a natural language processing pipeline for books

The ability of computer software to identify words and phrases in spoken language and convert them to human-readable text

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

Data loaders and abstractions for text and NLP

AI-Broad-casting - AI Broad casting with python

📔️ Generate a text-based journal from a template file.

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

Edge-Augmented Graph Transformer

Pytorch implementation of Tacotron

NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT

Shared code for training sentence embeddings with Flax / JAX

NeMo: a toolkit for conversational AI

Persian Bert For Long-Range Sequences

Fast, general, and tested differentiable structured prediction in PyTorch

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

L3Cube-MahaCorpus a Marathi monolingual data set scraped from different internet sources.

nlp基础任务

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。