Pytorch implementation of "A simple neural network module for relational reasoning" (Relational Networks)

Overview

Pytorch implementation of Relational Networks - A simple neural network module for relational reasoning

Implemented & tested on Sort-of-CLEVR task.

Sort-of-CLEVR

Sort-of-CLEVR is a simplified version of CLEVR. It consists of 10000 images and 20 questions (10 relational and 10 non-relational) per image. In each image, 6 objects with distinct colors (red, green, blue, orange, gray, yellow) and randomly chosen shapes (square or circle) are placed at random positions.
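
The dataset itself is produced by sort_of_clevr_generator.py; the snippet below is only a minimal sketch of how such a scene could be laid out, with the image size and the object format assumed purely for illustration:

    import random

    COLORS = ['red', 'green', 'blue', 'orange', 'gray', 'yellow']
    IMG_SIZE = 75  # assumed image size; the real value is defined by sort_of_clevr_generator.py

    def generate_scene():
        """One object per color, each with a random shape and a random center position."""
        objects = []
        for color in COLORS:
            objects.append({
                'color': color,
                'shape': random.choice(['square', 'circle']),
                'center': (random.randint(5, IMG_SIZE - 5), random.randint(5, IMG_SIZE - 5)),
            })
        return objects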

Non-relational questions are composed of 3 subtypes:

  1. Shape of a certain colored object
  2. Horizontal location of a certain colored object: whether it is on the left or right side of the image
  3. Vertical location of a certain colored object: whether it is in the upper or lower half of the image

These questions are "non-relational" because the agent only needs to focus on a single object.
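
As a minimal sketch, assuming the illustrative object format from the snippet above, each subtype can be answered from a single object's attributes (the coordinate convention is also an assumption):

    IMG_SIZE = 75  # assumed image size, as above

    def answer_non_relational(obj, subtype):
        """obj: dict with 'shape' and 'center' = (x, y); illustrative format only."""
        x, y = obj['center']
        if subtype == 'shape':
            return obj['shape']
        if subtype == 'horizontal':
            return 'left' if x < IMG_SIZE / 2 else 'right'
        if subtype == 'vertical':
            # assumes image coordinates, i.e. y grows downwards
            return 'up' if y < IMG_SIZE / 2 else 'down'
        raise ValueError(subtype)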

Relational questions are composed of 3 subtypes:

  1. Shape of the object closest to a certain colored object
  2. Shape of the object furthest from a certain colored object
  3. Number of objects with the same shape as a certain colored object

These questions are "relational" because the agent has to consider the relations between objects.
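
Correspondingly, a sketch of the relational subtypes, which require the positions and shapes of all objects in the scene (same illustrative object format as above; whether the reference object counts itself in the third subtype is a detail of the actual generator):

    import math

    def answer_relational(objects, target_color, subtype):
        """objects: list of dicts with 'color', 'shape', 'center'; illustrative format only."""
        target = next(o for o in objects if o['color'] == target_color)
        others = [o for o in objects if o['color'] != target_color]

        def dist(o):
            return math.hypot(o['center'][0] - target['center'][0],
                              o['center'][1] - target['center'][1])

        if subtype == 'closest':
            return min(others, key=dist)['shape']
        if subtype == 'furthest':
            return max(others, key=dist)['shape']
        if subtype == 'count_same_shape':
            return sum(1 for o in others if o['shape'] == target['shape'])
        raise ValueError(subtype)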

Questions are encoded into vectors of size 11: a 6-dimensional one-hot vector for the object's color (out of 6 colors), a 2-dimensional one-hot vector indicating whether the question is relational or non-relational, and a 3-dimensional one-hot vector for the question subtype.
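
A minimal sketch of that encoding; the exact ordering of the three one-hot segments inside the repository's question vectors is an assumption here:

    import numpy as np

    COLORS = ['red', 'green', 'blue', 'orange', 'gray', 'yellow']

    def encode_question(color, relational, subtype):
        """Return an 11-d vector: 6 (color) + 2 (relational / non-relational) + 3 (subtype)."""
        q = np.zeros(11)
        q[COLORS.index(color)] = 1
        q[6 + (1 if relational else 0)] = 1  # which question family
        q[8 + subtype] = 1                   # subtype index 0, 1 or 2
        return q

    # e.g. "What is the shape of the object closest to the red object?"
    print(encode_question('red', relational=True, subtype=0))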

For example, given a sample image, we can generate non-relational questions like:

  1. What is the shape of the red object? => Circle (even though it does not really look like a "circle"...)
  2. Is the green object placed on the left side of the image? => yes
  3. Is the orange object placed on the upper side of the image? => no

And relational questions:

  1. What is the shape of the object closest to the red object? => square
  2. What is the shape of the object furthest from the orange object? => circle
  3. How many objects have the same shape as the blue object? => 3

Setup

Create a conda environment from the environment.yml file:

$ conda env create -f environment.yml

Activate the environment:

$ conda activate RN3

If you don't use conda, install Python 3 normally and use pip to install the remaining dependencies. The list of dependencies can be found in the environment.yml file.

Usage

$ ./run.sh

or

$ python sort_of_clevr_generator.py

to generate the Sort-of-CLEVR dataset, and

 $ python main.py 

to train the binary RN model. Alternatively, use

 $ python main.py --relation-type=ternary

to train the ternary RN model.

Modifications

In the original paper, the Sort-of-CLEVR task used a different model from the CLEVR task. However, because the model used for CLEVR requires much less time to compute (the network is much smaller), that model is used here for the Sort-of-CLEVR task.
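
For reference, the relational core shared by the models in the paper computes RN(O) = f_phi( sum over all object pairs (i, j) of g_theta(o_i, o_j, q) ). The sketch below shows only this core; the class name, layer sizes and dimensions are placeholders rather than the values used in this repository's model. In the paper, the objects are CNN feature-map cells tagged with their coordinates, and q is the question embedding.

    import torch
    import torch.nn as nn

    class RelationalCore(nn.Module):
        """f_phi( sum_{i,j} g_theta(o_i, o_j, q) ) over all ordered object pairs."""
        def __init__(self, obj_dim, q_dim, hidden=256, n_classes=10):
            super().__init__()
            self.g = nn.Sequential(
                nn.Linear(2 * obj_dim + q_dim, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
            )
            self.f = nn.Sequential(
                nn.Linear(hidden, hidden), nn.ReLU(),
                nn.Linear(hidden, n_classes),
            )

        def forward(self, objects, question):
            # objects: (batch, n_obj, obj_dim), question: (batch, q_dim)
            b, n, d = objects.shape
            o_i = objects.unsqueeze(2).expand(b, n, n, d)      # first object of each pair
            o_j = objects.unsqueeze(1).expand(b, n, n, d)      # second object of each pair
            q = question.unsqueeze(1).unsqueeze(2).expand(b, n, n, question.size(-1))
            pairs = torch.cat([o_i, o_j, q], dim=-1)           # (batch, n, n, 2*obj_dim + q_dim)
            relations = self.g(pairs).sum(dim=(1, 2))          # sum g over all pairs
            return self.f(relations)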

Result

                          Relational Networks (20th epoch)   CNN + MLP (without RN, 100th epoch)
Non-relational question   99%                                66%
Relational question       89%                                66%

The CNN + MLP model overfit the training data.

The Relational Network shows far better results on both relational and non-relational questions.

Contributions

@gngdb sped up the model by a factor of 10.

Owner
Kim Heecheol
University of Tokyo, Intelligent systems & Informatics Lab.