Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

Last update: Dec 28, 2022

Overview

MKGFormer

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

Model Architecture

Illustration of MKGformer for (a) Unified Multimodal KGC Framework and (b) Detailed M-Encoder.

Requirements

To run the codes, you need to install the requirements:

pip install -r requirements.txt

Data Collection

The datasets that we used in our experiments are as follows:

Twitter2017

You can download the twitter2017 dataset via this link (https://drive.google.com/file/d/1ogfbn-XEYtk9GpUECq1-IwzINnhKGJqy/view?usp=sharing)

For more information regarding the dataset, please refer to the UMT repository.
MRE

The MRE dataset comes from MEGA, many thanks.

You can download the MRE dataset with detected visual objects using folloing command:
```
cd MRE
wget 120.27.214.45/Data/re/multimodal/data.tar.gz
tar -xzvf data.tar.gz
```
MKG
- FB15K-237-IMG
  
  For more information regarding the dataset, please refer to the mmkb and kg-bert repositories.
- WN18-IMG
  
  For more information regarding the dataset, please refer to the RSME repository.

The expected structure of files is:

MKGFormer
 |-- MKG	# Multimodal Knowledge Graph
 |    |-- dataset       # task data
 |    |-- data          # data process file
 |    |-- lit_models    # lightning model
 |    |-- models        # mkg model
 |    |-- scripts       # running script
 |    |-- main.py   
 |-- MNER	# Multimodal Named Entity Recognition
 |    |-- data          # task data
 |    |-- models        # mner model
 |    |-- modules       # running script
 |    |-- processor     # data process file
 |    |-- utils
 |    |-- run_mner.sh
 |    |-- run.py
 |-- MRE    # Multimodal Relation Extraction
 |    |-- data          # task data
 |    |-- models        # mre model
 |    |-- modules       # running script
 |    |-- processor     # data process file
 |    |-- run_mre.sh
 |    |-- run.py

How to run

MKG Task
- First run Image-text Incorporated Entity Modeling to train entity embedding.
```
    cd MKG
    bash scripts/pretrain_fb15k-237-image.sh
```
- Then do Missing Entity Prediction.
```
    bash scripts/fb15k-237-image.sh
```
MNER Task

To run mner task, run this script.
```
cd MNER
bash run_mner.py
```
MRE Task

To run mre task, run this script.
```
cd MRE
bash run_mre.py
```

Acknowledgement

The acquisition of image data for the multimodal link prediction task refer to the code from https://github.com/wangmengsd/RSME, many thanks.

Papers for the Project & How to Cite

If you use or extend our work, please cite the paper as follows:

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

Related tags

Overview

MKGFormer

Model Architecture

Requirements

Data Collection

How to run

MKG Task

MNER Task

MRE Task

Acknowledgement

Papers for the Project & How to Cite

Owner

ZJUNLP

Deepfake Scanner by Deepware.

Classic Papers for Beginners and Impact Scope for Authors.

A PyTorch Implementation of Single Shot MultiBox Detector

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021.

On the Limits of Pseudo Ground Truth in Visual Camera Re-Localization

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

This repo is about implementing different approaches of pose estimation and also is a sub-task of the smart hospital bed project :smile:

Graph Analysis From Scratch

Deep learning for spiking neural networks

Pytorch implementation of the paper "Topic Modeling Revisited: A Document Graph-based Neural Network Perspective"

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

CLIP2Video: Mastering Video-Text Retrieval via Image CLIP

Pytorch Implementation of Auto-Compressing Subset Pruning for Semantic Image Segmentation

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

(3DV 2021 Oral) Filtering by Cluster Consistency for Large-Scale Multi-Image Matching

Lingvo is a framework for building neural networks in Tensorflow, particularly sequence models.

[ICCV' 21] "Unsupervised Point Cloud Pre-training via Occlusion Completion"

Character-Input - Create a program that asks the user to enter their name and their age