NLTK Source

Last update: Jan 04, 2023

Overview

Natural Language Toolkit (NLTK)

NLTK -- the Natural Language Toolkit -- is a suite of open source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing. NLTK requires Python version 3.5, 3.6, 3.7, or 3.8.

For documentation, please visit nltk.org.

Contributing

Do you want to contribute to NLTK development? Great! Please read CONTRIBUTING.md for more details.

Donate

Have you found the toolkit helpful? Please support NLTK development by donating to the project via PayPal, using the link on the NLTK homepage.

Citing

If you publish work that uses NLTK, please cite the NLTK book, as follows:

Bird, Steven, Edward Loper and Ewan Klein (2009).
Natural Language Processing with Python.  O'Reilly Media Inc.

Copyright

For license information, see LICENSE.txt.

AUTHORS.md contains a list of everyone who has contributed to NLTK.

Redistributing

NLTK source code is distributed under the Apache 2.0 License.
NLTK documentation is distributed under the Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States license.
NLTK corpora are provided under the terms given in the README file for each corpus; all are redistributable and available for non-commercial use.
NLTK may be freely redistributed, subject to the provisions of these licenses.

NLTK Source

Related tags

Overview

Natural Language Toolkit (NLTK)

Contributing

Donate

Citing

Copyright

Redistributing

Owner

Natural Language Toolkit

Yomichad - a Japanese pop-up dictionary that can display readings and English definitions of Japanese words

Sentiment Classification using WSD, Maximum Entropy & Naive Bayes Classifiers

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

Partially offline multi-language translator built upon Huggingface transformers.

In this Notebook I've build some machine-learning and deep-learning to classify corona virus tweets, in both multi class classification and binary classification.

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Unsupervised text tokenizer focused on computational efficiency

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools

DeepSpeech - Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Conversational-AI-ChatBot - Intelligent ChatBot built with Microsoft's DialoGPT transformer to make conversations with human users!

leaking paid token generator that was a shit lmao for 100$ haha

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

多语言降噪预训练模型MBart的中文生成任务

Generating new names based on trends in data using GPT2 (Transformer network)

I can help you convert your images to pdf file.

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Unsupervised Language Modeling at scale for robust sentiment classification

Stack based programming language that compiles to x86_64 assembly or can alternatively be interpreted in Python

Build Text Rerankers with Deep Language Models