Awesome-NLP-Research (ANLP)

Overview

Awesome-NLP-Research (ANLP)

(Update on 2020-01-10: we have also added the presentations from the Fall 2020 installment of the course. Check for them under "slides2020".)

As part of the Fall 2018 course CPSC 677 "Advanced Natural Language Processing" at Yale, we developed, with the help of the students, a corpus of useful resources for NLP research. Bibliographies and Powerpoint Presentations for each topic are found below, in addition to several blog posts. We asked the students to also list relevant and prerequisite concepts for each topic, and these keywords are found here.

If you have any questions, would like to contribute further to this project or feel we are missing an important citation, please contact Alex Fabbri at alexander[dot]fabbri[at]yale.[first three letters of education]

Overview of papers presented in class

  • Capsule Networks for NLP by Will Merrill - BIB BLOG SLIDES
  • Commonsense Learning by Michihiro Yasunaga - BIB SLIDES
  • Dialogue Systems by Suyi Li - BIB SLIDES
  • Multilingual-Word-Embeddings by Davey Proctor - BIB SLIDES
  • Neural Embeddings By John Brandt - BIB SLIDES
  • Temporal and Dynamic Embeddings by Yavuz Nuzumlali - BIB SLIDES
  • NLP in Finance by Gaurav Pathak BIB SLIDES
  • Natural Language Generation by Tianwei She - BIB SLIDES
  • Knowledge Graphs by Tomoe Mizutani - BIB SLIDES
  • Cross-Lingual Information Retrieval by Rui Zhang - BIB BLOG SLIDES
  • Neural Information Retrieval by Danny Keller - BIB SLIDES
  • Character-Level Language Modeling by Angus Fong - BIB SLIDES
  • Latent Variable Models in NLP by Brian Kitano - BIB SLIDES
  • Unsupervised Machine Translation By Yongjie Lin - BIB SLIDES
  • Neural Computational Morphology by Garrett Bingham - BIB SLIDES
  • Network Methods by Noah Amsel - BIB SLIDES
  • Neural Semi-Supervised Learning by Alex Fabbri - BIB SLIDES
  • Question Answering by Talley Amir - BIB SLIDES
  • Attribute-Level Sentiment Analaysis by Ishita Chakraborty and Davey Proctor - BIB BLOG SLIDES
  • Semantic Parsing by Bo Pang - BIB SLIDES
  • Sequence2Sequence by Jack Koch - BIB SLIDES
  • Seq2SQL by Tao Yu - BIB SLIDES
  • Spectral Learning by Hannah Lawrence - BIB SLIDES
  • Single Document Summarization by Yi Chern Tan - BIB SLIDES
  • Transfer Learning by Irene Li - BIB SLIDES

Additionally, students from the class made blog posts on the following topics:

  • DARTS - BLOG
  • OpenAI Transformer - BLOG
Owner
Language, Information, and Learning at Yale
Language, Information, and Learning at Yale
Python library for processing Chinese text

SnowNLP: Simplified Chinese Text Processing SnowNLP是一个python写的类库,可以方便的处理中文文本内容,是受到了TextBlob的启发而写的,由于现在大部分的自然语言处理库基本都是针对英文的,于是写了一个方便处理中文的类库,并且和TextBlob

Rui Wang 6k Jan 02, 2023
Voilà turns Jupyter notebooks into standalone web applications

Rendering of live Jupyter notebooks with interactive widgets. Introduction Voilà turns Jupyter notebooks into standalone web applications. Unlike the

Voilà Dashboards 4.5k Jan 03, 2023
Findings of ACL 2021

Assessing Dialogue Systems with Distribution Distances [arXiv][code] We propose to measure the performance of a dialogue system by computing the distr

Yahui Liu 16 Feb 24, 2022
Codes for processing meeting summarization datasets AMI and ICSI.

Meeting Summarization Dataset Meeting plays an essential part in our daily life, which allows us to share information and collaborate with others. Wit

xcfeng 39 Dec 14, 2022
A collection of GNN-based fake news detection models.

This repo includes the Pytorch-Geometric implementation of a series of Graph Neural Network (GNN) based fake news detection models. All GNN models are implemented and evaluated under the User Prefere

SafeGraph 251 Jan 01, 2023
🐍 A hyper-fast Python module for reading/writing JSON data using Rust's serde-json.

A hyper-fast, safe Python module to read and write JSON data. Works as a drop-in replacement for Python's built-in json module. This is alpha software

Matthias 479 Jan 01, 2023
NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

This file contains the following documents sumbited for Baruch CIS9665 group 9 fall 2021. 1. Dataset: drug_reviews.csv 2. python codes for text classi

Aarif Munwar Jahan 2 Jan 04, 2023
2021语言与智能技术竞赛:机器阅读理解任务

LICS2021 MRC 1. 项目&任务介绍 本项目基于官方给定的baseline(DuReader-Checklist-BASELINE)进行二次改造,对整个代码框架做了简单的重构,对核心网络结构添加了注释,解耦了数据读取的模块,并添加了阈值确认的功能,一些小的细节也做了改进。 本次任务为202

roar 29 Dec 05, 2022
Fast, general, and tested differentiable structured prediction in PyTorch

Torch-Struct: Structured Prediction Library A library of tested, GPU implementations of core structured prediction algorithms for deep learning applic

HNLP 1.1k Dec 16, 2022
Chatbot for the Chatango messaging platform

BroiestBot The baddest bot in the game right now. Uses the ch.py framework for joining Chantango rooms and responding to user messages. Commands If a

Todd Birchard 3 Jan 17, 2022
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

VAENAR-TTS - PyTorch Implementation PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Keon Lee 67 Nov 14, 2022
The code from the whylogs workshop in DataTalks.Club on 29 March 2022

whylogs Workshop The code from the whylogs workshop in DataTalks.Club on 29 March 2022 whylogs - The open source standard for data logging (Don't forg

DataTalksClub 12 Sep 05, 2022
Script to download some free japanese lessons in portuguse from NHK

Nihongo_nhk This is a script to download some free japanese lessons in portuguese from NHK. It can be executed by installing the packages with: pip in

Matheus Alves 2 Jan 06, 2022
Knowledge Management for Humans using Machine Learning & Tags

HyperTag helps humans intuitively express how they think about their files using tags and machine learning. Represent how you think using tags. Find what you look for using semantic search for your t

Ravn Tech, Inc. 166 Jan 07, 2023
Must-read papers on improving efficiency for pre-trained language models.

Must-read papers on improving efficiency for pre-trained language models.

Tobias Lee 89 Jan 03, 2023
customer care chatbot made with Rasa Open Source.

Customer Care Bot Customer care bot for ecomm company which can solve faq and chitchat with users, can contact directly to team. 🛠 Features Basic E-c

Dishant Gandhi 23 Oct 27, 2022
NLP-based analysis of poor Chinese movie reviews on Douban

douban_embedding 豆瓣中文影评差评分析 1. NLP NLP(Natural Language Processing)是指自然语言处理,他的目的是让计算机可以听懂人话。 下面是我将2万条豆瓣影评训练之后,随意输入一段新影评交给神经网络,最终AI推断出的结果。 "很好,演技不错

3 Apr 15, 2022
ADCS cert template modification and ACL enumeration

Purpose This tool is designed to aid an operator in modifying ADCS certificate templates so that a created vulnerable state can be leveraged for privi

Fortalice Solutions, LLC 78 Dec 12, 2022
A Paper List for Speech Translation

Keyword: Speech Translation, Spoken Language Processing, Natural Language Processing

138 Dec 24, 2022
DataCLUE: 国内首个以数据为中心的AI测评(含模型分析报告)

DataCLUE 以数据为中心的AI测评(DataCLUE) DataCLUE: A Chinese Data-centric Language Evaluation Benchmark 内容导引 章节 描述 简介 介绍以数据为中心的AI测评(DataCLUE)的背景 任务描述 任务描述 实验结果

CLUE benchmark 135 Dec 22, 2022