This repository is the official implementation of Open Rule Induction. This paper has been accepted to NeurIPS 2021.

Last update: Nov 14, 2022

Related tags

Overview

Open Rule Induction

This repository is the official implementation of Open Rule Induction. This paper has been accepted to NeurIPS 2021.

Abstract

Rules have a number of desirable properties. It is easy to understand, infer new knowledge, and communicate with other inference systems. One weakness of the previous rule induction systems is that they only find rules within a knowledge base (KB) and therefore cannot generalize to more open and complex real-world rules. Recently, the language model (LM)-based rule generation are proposed to enhance the expressive power of the rules. In this paper, we revisit the differences between KB-based rule induction and LM-based rule generation. We argue that, while KB-based methods inducted rules by discovering data commonalitiess, the current LM-based methods are “learning rules from rules”. This limits these methods to only produce “canned” rules whose patterns are constrained by the annotated rules, while discarding the rich expressive power of LMs for free text.

Therefore, in this paper, we propose the open rule induction problem, which aims to induce open rules utilizing the knowledge in LMs. Besides, we propose the Orion (open rule induction) system to automatically mine open rules from LMs without supervision of annotated rules. We conducted extensive experiments to verify the quality and quantity of the inducted open rules. Surprisingly, when applying the open rules in downstream tasks (i.e. relation extraction), these automatically inducted rules even outperformed the manually annotated rules.

Dependencies

To install requirements:

conda env create -f environment.yml
conda activate orion

Download the Orion

We have released the continue trained models for $P(ins|r_p)$ and $P(r_h|ins)$, you could just download them following the steps:

mkdir models
cd models

Then you should download two parts of Orion to here.

Download model for $P(ins|r_p)$ from here
Download model for $P(r_h|ins)$ from here

Evaluate for OpenRule155

To evaluate Orion's performance on OpenRule155 or other relation extraction datasets, run this command:

python evaluation.py --task openrule155 --inductor rule --mlm_training True --bart_training True --group_beam True

Evaluate for Relation Extraction

To evaluate Orion's performance on other relation extraction datasets, run this command:

python evaluation.py --task <task> --inductor rule --mlm_training True --bart_training True --group_beam True

Evaluate for costomize rule

If you want to experience it with your costomize rules, follow this:

from inductor import BartInductor

inductor = BartInductor()

rule = '<mask> is the capital of <mask>.'
generated_texts = inductor.generate(rule)

print('output generated rules:')
for text in generated_texts:
    print(text)

# output generated rules:
# <mask> is the capital and largest city of <mask>.
# <mask> is the largest city in <mask>.
# <mask> is the most populous state in <mask>.
# <mask> is the capital of <mask>.
# <mask> is a state in <mask>.
# <mask> is a capital of <mask>.
# <mask> has one of the highest rates of poverty in <mask>.
# <mask> is a major commercial and financial centre of <mask>.
# <mask> was then a part of <mask>.
# <mask>, the capital of the country, is the largest city in <mask>.

This repository is the official implementation of Open Rule Induction. This paper has been accepted to NeurIPS 2021.

Related tags

Overview

Open Rule Induction

Abstract

Dependencies

Download the Orion

Evaluate for OpenRule155

Evaluate for Relation Extraction

Evaluate for costomize rule

Owner

Xingran Chen

Visualizing Yolov5's layers using GradCam

Trajectory Variational Autoencder baseline for Multi-Agent Behavior challenge 2022

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Learning from graph data using Keras

This is the repo for the paper "Improving the Accuracy-Memory Trade-Off of Random Forests Via Leaf-Refinement".

A PyTorch-centric hybrid classical-quantum machine learning framework

Anagram Generator in Python

A Python library for working with arbitrary-dimension hypercomplex numbers following the Cayley-Dickson construction of algebras.

Fastquant - Backtest and optimize your trading strategies with only 3 lines of code!

Pytorch implementation of PCT: Point Cloud Transformer

WTTE-RNN a framework for churn and time to event prediction

Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)

A user-friendly research and development tool built to standardize RL competency assessment for custom agents and environments.

Code for "Diffusion is All You Need for Learning on Surfaces"

Proposed n-stage Latent Dirichlet Allocation method - A Novel Approach for LDA

Survival analysis (SA) is a well-known statistical technique for the study of temporal events.

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

QRec: A Python Framework for quick implementation of recommender systems (TensorFlow Based)

NumQMBasic - A mini-course offered to Undergrad physics students

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"