This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Last update: Dec 04, 2022

Related tags

Text Data & NLP proteno

Overview

Proteno

This is the data release associated with the corresponding NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems (https://arxiv.org/abs/2104.07777)

Security

See CONTRIBUTING for more information.

License

This project is released under CC-BY-NC-4.0 and other licenses:

English: CC-BY-SA
Spanish: CC-BY-SA
Tamil: CC-BY-NC-SA

Citation

If you use our data, please cite the following paper:

@inproceedings{tyagi-etal-2021-proteno,
    title = "Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems",
    author = "Tyagi, Shubhi  and
      Bonafonte, Antonio  and
      Lorenzo-Trueba, Jaime  and
      Latorre, Javier",
    booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2021.naacl-industry.10",
    pages = "72--79",
}

This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Related tags

Overview

Proteno

Security

License

Citation

Owner

Natural Language Processing Specialization

Coreference resolution for English, German and Polish, optimised for limited training data and easily extensible for further languages

Mesh TensorFlow: Model Parallelism Made Easier

An assignment on creating a minimalist neural network toolkit for CS11-747

Modeling cumulative cases of Covid-19 in the US during the Covid 19 Delta wave using Bayesian methods.

Plugin repository for Macast

A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python

Chatbot with Pytorch, Python & Nextjs

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

texlive expressions for documents

Repositório do trabalho de introdução a NLP

The entmax mapping and its loss, a family of sparse softmax alternatives.

Yomichad - a Japanese pop-up dictionary that can display readings and English definitions of Japanese words

NVDA, the free and open source Screen Reader for Microsoft Windows

Maix Speech AI lib, including ASR, chat, TTS etc.

🗣️ NALP is a library that covers Natural Adversarial Language Processing.

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

The RWKV Language Model

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

Crowd sourced training data for Rasa NLU models

This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems

Related tags

Overview

Proteno

Security

License

Citation

Owner

Natural Language Processing Specialization

Coreference resolution for English, German and Polish, optimised for limited training data and easily extensible for further languages

Mesh TensorFlow: Model Parallelism Made Easier

An assignment on creating a minimalist neural network toolkit for CS11-747

Modeling cumulative cases of Covid-19 in the US during the Covid 19 Delta wave using Bayesian methods.

Plugin repository for Macast

A Lightweight NLP Data Loader for All Deep Learning Frameworks in Python

Chatbot with Pytorch, Python & Nextjs

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

texlive expressions for documents

Repositório do trabalho de introdução a NLP

The entmax mapping and its loss, a family of sparse softmax alternatives.

Yomichad - a Japanese pop-up dictionary that can display readings and English definitions of Japanese words

NVDA, the free and open source Screen Reader for Microsoft Windows

Maix Speech AI lib, including ASR, chat, TTS etc.

🗣️ NALP is a library that covers Natural Adversarial Language Processing.

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

The RWKV Language Model

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

Crowd sourced training data for Rasa NLU models

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。