A repo for materials relating to the tutorial of CS-332 NLP

Last update: Feb 15, 2022

Overview

CS-332-NLP

A repo for materials relating to the tutorial of CS-332 NLP

Tutorial 1:
- Introduction
- Corpus
- Regular expression
- Tokenization
Tutorial 2:
- Normalization
- Parsing
- Morpheme
- Stemming
- Lemmatization

Acknowledgements

Speech and Language Processing. Daniel Jurafsky & James H. Martin. (Edition 2 & 3)
Marcinkiewicz, M. A. (1994). Building a large annotated corpus of English: The Penn Treebank. Using Large Corpora, 273.
http://su.diva-portal.org/smash/record.jsf?pid=diva2%3A686162&dswid=9114

Owner

Alok singh

GitHub Repository

Creating a Feed of MISP Events from ThreatFox (by abuse.ch)

ThreatFox2Misp Creating a Feed of MISP Events from ThreatFox (by abuse.ch) What will it do? This will fetch IOCs from ThreatFox by Abuse.ch, convert t

17 Nov 22, 2022

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0 🤗 Transformers provides thousands of pretrained models to perform tasks o

77.3k Jan 03, 2023

Tensorflow implementation of paper: Learning to Diagnose with LSTM Recurrent Neural Networks.

Multilabel time series classification with LSTM Tensorflow implementation of model discussed in the following paper: Learning to Diagnose with LSTM Re

552 Nov 28, 2022

Utilities for preprocessing text for deep learning with Keras

Note: This utility is really old and is no longer maintained. You should use keras.layers.TextVectorization instead of this. Utilities for pre-process

180 Dec 09, 2022

Wikipedia-Utils: Preprocessing Wikipedia Texts for NLP

Wikipedia-Utils: Preprocessing Wikipedia Texts for NLP This repository maintains some utility scripts for retrieving and preprocessing Wikipedia text

44 Oct 19, 2022

Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

Deep-Learning-for-Text-Document-Classification Text classification is one of the popular tasks in NLP that allows a program to classify free-text docu

2 Mar 17, 2022

Leon is an open-source personal assistant who can live on your server.

Leon Your open-source personal assistant. Website :: Documentation :: Roadmap :: Contributing :: Story 👋 Introduction Leon is an open-source personal

11.7k Dec 30, 2022

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

T5: Text-To-Text Transfer Transformer The t5 library serves primarily as code for reproducing the experiments in Exploring the Limits of Transfer Lear

4.6k Jan 01, 2023

This is the source code of RPG (Reward-Randomized Policy Gradient)

RPG (Reward-Randomized Policy Gradient) Zhenggang Tang*, Chao Yu*, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Shaolei Du, Yu Wang, Yi Wu (

40 Nov 25, 2022

Klexikon: A German Dataset for Joint Summarization and Simplification

Klexikon: A German Dataset for Joint Summarization and Simplification Dennis Aumiller and Michael Gertz Heidelberg University Under submission at LREC

8 Jan 03, 2023

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

NATURAL-LANGUAGE-PROCESSING-AND-COMPUTATIONAL-LAB-II DLO8012: NLP & CSL804: CL-II [SEMESTER VIII] Syllabus NLP - Reference Books THE WALL MEGA SATISH

7 Apr 28, 2022

Python package for Turkish Language.

PyTurkce Python package for Turkish Language. Documentation: https://pyturkce.readthedocs.io. Installation pip install pyturkce Usage from pyturkce im

14 Oct 09, 2022

An attempt to map the areas with active conflict in Ukraine using open source twitter data.

Live Action Map (LAM) An attempt to use open source data on Twitter to map areas with active conflict. Right now it is used for the Ukraine-Russia con

171 Nov 21, 2022

The entmax mapping and its loss, a family of sparse softmax alternatives.

entmax This package provides a pytorch implementation of entmax and entmax losses: a sparse family of probability mappings and corresponding loss func

330 Dec 22, 2022

Generate a cool README/About me page for your Github Profile

Github Profile README/ About Me Generator 💯 This webapp lets you build a cool README for your profile. A few inputs + ~15 mins = Your Github Profile

179 Jan 07, 2023

Active learning for text classification in Python

Active Learning allows you to efficiently label training data in a small-data scenario.

375 Dec 28, 2022

txtai: Build AI-powered semantic search applications in Go

txtai: Build AI-powered semantic search applications in Go txtai executes machine-learning workflows to transform data and build AI-powered semantic s

49 Dec 06, 2022

Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Visualize, analyze, and explore NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BER

1.6k Dec 25, 2022

Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS)

TOPSIS implementation in Python Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) CHING-LAI Hwang and Yoon introduced TOPSIS

8 Dec 10, 2022

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents [Project Page] [Paper] [Video] Wenlong Huang1, Pieter Abbee

114 Dec 29, 2022

A repo for materials relating to the tutorial of CS-332 NLP

Related tags

Overview

CS-332-NLP

Contents

Acknowledgements

Owner

Alok singh

Creating a Feed of MISP Events from ThreatFox (by abuse.ch)

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

Tensorflow implementation of paper: Learning to Diagnose with LSTM Recurrent Neural Networks.

Utilities for preprocessing text for deep learning with Keras

Wikipedia-Utils: Preprocessing Wikipedia Texts for NLP

Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

Leon is an open-source personal assistant who can live on your server.

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

This is the source code of RPG (Reward-Randomized Policy Gradient)

Klexikon: A German Dataset for Joint Summarization and Simplification

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

Python package for Turkish Language.

An attempt to map the areas with active conflict in Ukraine using open source twitter data.

The entmax mapping and its loss, a family of sparse softmax alternatives.

Generate a cool README/About me page for your Github Profile

Active learning for text classification in Python

txtai: Build AI-powered semantic search applications in Go

Ecco is a python library for exploring and explaining Natural Language Processing models using interactive visualizations.

Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS)

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents