[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Last update: Dec 16, 2022

Overview

Contextual Action Language Model (CALM) and the ClubFloyd Dataset

Code and data for paper Keep CALM and Explore: Language Models for Action Generation in Text-based Games at EMNLP 2020.

Overview

Our ClubFloyd dataset (calm/lm_data.zip) is crawled from the ClubFloyd website and contains 426 human gameplay transcripts, which cover 590 text-based games of diverse genres and styles.

The data consists of 223,527 context-action pairs in the format [CLS] observation [SEP] action [SEP] next observation [SEP] next action [SEP]. We use [CLS] observation [SEP] action [SEP] next observation [SEP] as the context to train language models (n-gram, GPT-2) to predict next action [SEP], and show that this action generation ability generalizes to unseen games and supports gameplay when combined with reinforcement learning.

Getting Started

Clone repo and install dependencies:

pip install torch==1.4 transformers==2.5.1 jericho fasttext wandb importlib_metadata
git clone https://github.com/princeton-nlp/calm-textgame && cd calm-textgame
ln -s ../lm calm && ln -s ../lm drrn

(If the pip installation fails for fasttext, try the build steps here: https://github.com/facebookresearch/fastText#building-fasttext-for-python)

Train CALM:

cd calm
unzip lm_data.zip
python train.py

Trained model weights can be downloaded here for both GPT-2 and n-gram models.

Then train DRRN using the trained CALM:

cd ../drrn
python train.py --rom_path ../games/${GAME} --lm_path ${PATH_TO_CALM} --lm_type ${gpt_or_ngram}

To quickly try out the GPT-2 CALM model:

from lm import GPT2LM
model = GPT2LM("model_weights/gpt2")
print(model.generate("[CLS] observation [SEP] action [SEP] next observation [SEP]", k=30))

Citation

@inproceedings{yao2020calm,
    title={Keep CALM and Explore: Language Models for Action Generation in Text-based Games},
    author={Yao, Shunyu and Rao, Rohan and Hausknecht, Matthew and Narasimhan, Karthik},
    booktitle={Empirical Methods in Natural Language Processing (EMNLP)},
    year={2020}
}

Acknowledgements

Thanks Jacqueline for hosting the wonderful ClubFloyd website and granting our use!

The code borrows from TDQN (for the RL part) and Huggingface Transformers (for the CALM part).

For any questions please contact Shunyu Yao <[email protected]>.

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Related tags

Overview

Contextual Action Language Model (CALM) and the ClubFloyd Dataset

Overview

Getting Started

Citation

Acknowledgements

Owner

Princeton Natural Language Processing

code and models for "Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation"

Code for ICCV2021 paper PARE: Part Attention Regressor for 3D Human Body Estimation

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

Safe Control for Black-box Dynamical Systems via Neural Barrier Certificates

Experimental solutions to selected exercises from the book [Advances in Financial Machine Learning by Marcos Lopez De Prado]

A PyTorch Implementation of the Luna: Linear Unified Nested Attention

Official implementation for the paper "Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection"

Pytorch implementation of our paper accepted by NeurIPS 2021 -- Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme

Deep Reinforcement Learning based autonomous navigation for quadcopters using PPO algorithm.

Contains a bunch of different python programm tasks

The goal of the exercises below is to evaluate the candidate knowledge and problem solving expertise regarding the main development focuses for the iFood ML Platform team: MLOps and Feature Store development.

This is an example of object detection on Micro bacterium tuberculosis using Mask-RCNN

Direct application of DALLE-2 to video synthesis, using factored space-time Unet and Transformers

A Fast Sequence Transducer Implementation with PyTorch Bindings

Unofficial PyTorch Implementation for HifiFace (https://arxiv.org/abs/2106.09965)

Single-Stage Instance Shadow Detection with Bidirectional Relation Learning (CVPR 2021 Oral)

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation

An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

This is the offical website for paper ''Category-consistent deep network learning for accurate vehicle logo recognition''