Machine translation models released by the Gourmet project

Last update: Dec 08, 2021

Related tags

Overview

Gourmet Models

Overview

The Gourmet project has released several machine translation models to translate low-resource languages. This repository contains information about the models, as well as sample code showing how the models can be used. The models themselves are available as dockers, and can be downloaded from a separate site (see below).

Some of the models are described in our public deliverables. See the integration report for details of model training, and the evaluation report for details of evaluation.

Using the Models

The model download links are here. Once downloaded, the model can be deployed using:

docker load <

to launch the translation server, use:

docker run -p 4000:4000 -i --rm

This exposes the model on port 4000. Change the second number above if you want to use a different port.

To test the model, use the sample client like this:

$ ./gourmet-client.py 
2021-11-15 14:54:10 DEBUG: __main__:  Connecting to translation server at localhost4000
2021-11-15 14:54:10 DEBUG: urllib3.connectionpool:  Starting new HTTP connection (1): localhost:4000
2021-11-15 14:54:14 DEBUG: urllib3.connectionpool:  http://localhost:4000 "POST /translation HTTP/1.1" 200 88
Translation: An gudanar da taron sauyin yanayi a Glasgow
Time: 2242
Error: None

This is using the English to Hausa model. You can use the -n and -p arguments to change the host and port. The source text is hard-coded but easily changed.

Some of the models support additional arguments which can be passed to the docker at launch time, using the -e argument to docker (for environemnent) variables. See the models page for more details.

Computational Requirements

All models are configured to use the CPU for translation. A GPU is not required, and the models will not use it.

Licence

CC-BY

Acknowledgements

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825299.

Machine translation models released by the Gourmet project

Related tags

Overview

Gourmet Models

Overview

Using the Models

Computational Requirements

Licence

Acknowledgements

Owner

Edinburgh NLP

Refactored version of FastSpeech2

code for modular summarization work published in ACL2021 by Krishna et al

Stand-alone language identification system

Tensorflow Implementation of A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Code examples for my Write Better Python Code series on YouTube.

Simple GUI where you can enter an article and get a crisp summarized version.

Materials (slides, code, assignments) for the NYU class I teach on NLP and ML Systems (Master of Engineering).

Codename generator using WordNet parts of speech database

Baseline code for Korean open domain question answering(ODQA)

Contract Understanding Atticus Dataset

In this workshop we will be exploring NLP state of the art transformers, with SOTA models like T5 and BERT, then build a model using HugginFace transformers framework.

nlabel is a library for generating, storing and retrieving tagging information and embedding vectors from various nlp libraries through a unified interface.

मराठी भाषा वाचविण्याचा एक प्रयास. इंग्रजी ते मराठीचा शब्दकोश. An attempt to preserve the Marathi language. A lightweight and ad free English to Marathi thesaurus.

Conversational-AI-ChatBot - Intelligent ChatBot built with Microsoft's DialoGPT transformer to make conversations with human users!

Proquabet - Convert your prose into proquints and then you essentially have Vogon poetry

DeLighT: Very Deep and Light-Weight Transformers

Arabic speech recognition, classification and text-to-speech.

TextFlint is a multilingual robustness evaluation platform for natural language processing tasks,

문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.

Russian GPT3 models.