Code Generation using a large neural network called GPT-J

Last update: Dec 31, 2022

Overview

CodeGenX

CodeGenX is a Code Generation system powered by Artificial Intelligence! It is delivered to you in the form of a Visual Studio Code Extension and is Free and Open-source!

Installation

You can find installation instructions and additional information about CodeGenX in the documentation here.

About CodeGenX

1. Languages Supported

CodeGenX currently only supports Python. We are planning to add additional languages in future releases.

2. Modules Trained On

CodeGenX was trained on Python code which covers many of its common uses. Some libraries which CodeGenX is specifically trained on are:

Tensorflow
Pytorch
Scikit-Learn
Pandas
NumPy
OpenCV
Django
Flask
PyGame

3. How CodeGenX Works

At the core of CodeGenX lies a large neural network called GPT-J. GPT-J is a 6 billion parameter transformer model which was trained on hundreds of gigabytes of text from the internet. We fine-tuned this model on a dataset of open-source python code. This fine-tuned model can now be used to generate code when given an input with the right instructions.

Contributors ✨

This project would not have been possible without the help of these wonderful people:

_{Arya Manjaramkar}	_{Matthias Wijnsma}	_{Thomas Houtrique}	_{Dominic Rampas}	_{Bilel Medimegh}	_{Josh Hills}	_Alex
_Tiimo

Acknowledgements

Many thanks to the support of the Google TPU Research Cloud for providing the precious compute needed for this project.

Code Generation using a large neural network called GPT-J

Related tags

Overview

CodeGenX

Installation

About CodeGenX

1. Languages Supported

2. Modules Trained On

3. How CodeGenX Works

Contributors ✨

Acknowledgements

Owner

DeepGenX

This github repo is for Neurips 2021 paper, NORESQA A Framework for Speech Quality Assessment using Non-Matching References.

A single model that parses Universal Dependencies across 75 languages.

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

The training code for the 4th place model at MDX 2021 leaderboard A.

A simple Speech Emotion Recognition (SER) API created using Flask and running in a Docker container.

To be a next-generation DL-based phenotype prediction from genome mutations.

News-Articles-and-Essays - NLP (Topic Modeling and Clustering)

texlive expressions for documents

Every Google, Azure & IBM text to speech voice for free

BPEmb is a collection of pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) and trained on Wikipedia.

Klexikon: A German Dataset for Joint Summarization and Simplification

An assignment on creating a minimalist neural network toolkit for CS11-747

Unet-TTS: Improving Unseen Speaker and Style Transfer in One-shot Voice Cloning

🏖 Easy training and deployment of seq2seq models.

A script that automatically creates a branch name using google translation api and jira api

Rootski - Full codebase for rootski.io (without the data)

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

This repository is home to the Optimus data transformation plugins for various data processing needs.

تولید اسم های رندوم فینگیلیش

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/