NaturalProofs: Mathematical Theorem Proving in Natural Language

Last update: Jan 05, 2023

Related tags

Overview

NaturalProofs: Mathematical Theorem Proving in Natural Language

NaturalProofs: Mathematical Theorem Proving in Natural Language
Sean Welleck, Jiacheng Liu, Ronan Le Bras, Hannaneh Hajishirzi, Yejin Choi, Kyunghyun Cho

This repo contains:

The NaturalProofs Dataset and the mathematical reference retrieval task data.
Preprocessing NaturalProofs and the retrieval task data.
Training and evaluation for mathematical reference retrieval.
Pretrained models for mathematical reference retrieval.

Please cite our work if you found the resources in this repository useful:

@article{welleck2021naturalproofs,
  title={NaturalProofs: Mathematical Theorem Proving in Natural Language},
  author={Welleck, Sean and Liu, Jiacheng and Le Bras, Ronan and Hajishirzi, Hannaneh and Choi, Yejin and Cho, Kyunghyun},
  year={2021}
}

Section	Subsection
NaturalProofs Dataset	Dataset
	Preprocessing
Mathematical Reference Retrieval	Dataset
	Setup
	Preprocessing
	Pretrained models
	Training
	Evaluation

NaturalProofs Dataset

We provide the preprocessed NaturalProofs Dataset (JSON):

NaturalProofs Dataset
dataset.json [zenodo]

Preprocessing

To see the steps used to create the NaturalProofs dataset.json from raw ProofWiki data:

Download the ProofWiki XML.
Preprocess the data using notebooks/parse_proofwiki.ipynb.
Form the data splits using notebooks/dataset_splits.ipynb.

Mathematical Reference Retrieval

Dataset

The Mathematical Reference Retrieval dataset contains (x, r, y) examples with theorem statements x, positive and negative references r, and 0/1 labels y, derived from NaturalProofs.

We provide the version used in the paper (bert-based-cased tokenizer, 200 randomly sampled negatives):

Reference Retrieval Dataset
`bert-base-cased` 200 negatives

Pretrained Models

Pretrained models
`bert-base-cased`
`lstm`

These models were trained with the "bert-base-cased 200 negatives" dataset provided above.

Setup

python setup.py develop

You can see the DockerFile for additional version info, etc.

Generating and tokenizing

To create your own version of the retrieval dataset, use python utils.py.

This step is not needed if you are using the reference retrieval dataset provided above.

Example:

python utils.py --filepath /path/to/dataset.json --output-path /path/to/out/ --model-type bert-base-cased
# => Writing dataset to /path/to/out/dataset_tokenized__bert-base-cased_200.pkl

Evaluation

Using the retrieval dataset and a model provided above, we compute the test evaluation metrics in the paper:

Predict the rankings:

python naturalproofs/predict.py \
--method bert-base-cased \      # | lstm
--model-type bert-base-cased \  # | lstm
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--checkpoint-path /path/to/best.ckpt \
--output-dir /path/to/out/ \
--split test  # use valid during model development

Compute metrics over the rankings:

python naturalproofs/analyze.py \
--method bert-base-cased \      # | lstm
--eval-path /path/to/out/eval.pkl \
--analysis-path /path/to/out/analysis.pkl

Training

python naturalproofs/model.py \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--default-root-dir /path/to/out/

Classical Retrieval Baselines

TF-IDF example:

python naturalproofs/baselines.py \
--method tfidf \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--savedir /path/to/out/

Then use analyze.py as shown above to compute metrics.

NaturalProofs: Mathematical Theorem Proving in Natural Language

Related tags

Overview

NaturalProofs: Mathematical Theorem Proving in Natural Language

NaturalProofs Dataset

Preprocessing

Mathematical Reference Retrieval

Dataset

Pretrained Models

Setup

Generating and tokenizing

Evaluation

Training

Classical Retrieval Baselines

Owner

Sean Welleck

Protect against subdomain takeover

Repositório para arquivos sobre o Módulo 1 do curso Top Coders da Let's Code + Safra

Code for "Learning to Segment Rigid Motions from Two Frames".

Colab notebook and additional materials for Python-driven analysis of redlining data in Philadelphia

PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL)

Unofficial implementation of Fast-SCNN: Fast Semantic Segmentation Network

Code for EMNLP 2021 paper Contrastive Out-of-Distribution Detection for Pretrained Transformers.

Self-Supervised Image Denoising via Iterative Data Refinement

Machine learning library for fast and efficient Gaussian mixture models

Keyword spotting on Arm Cortex-M Microcontrollers

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Discord bot-CTFD-Thread-Parser - Discord bot CTFD-Thread-Parser

Official code for the publication "HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder".

CondenseNet: Light weighted CNN for mobile devices

Fast and robust certifiable relative pose estimation

LSTM Neural Networks for Spectroscopic Studies of Type Ia Supernovae

Deep Networks with Recurrent Layer Aggregation

This repository contains the re-implementation of our paper deSpeckNet: Generalizing Deep Learning Based SAR Image Despeckling

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval