HeCo

This repo contains the source code for the KDD 2021 paper "Self-supervised Heterogeneous Graph Neural Network with Co-contrastive Learning".
Paper Link: https://arxiv.org/abs/2105.09111

Environment Settings

python==3.8.5
scipy==1.5.4
torch==1.7.0
numpy==1.19.2
scikit_learn==0.24.2

GPU: GeForce RTX 2080 Ti
CPU: Intel(R) Xeon(R) Silver 4210 CPU @ 2.20GHz
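
To compare an existing environment against the pinned package versions above, a minimal sketch along these lines may help. It is not part of the repo: the helper name `compare_versions` is hypothetical, and the distribution names (for example "scikit-learn" rather than "scikit_learn") are assumed to be the usual PyPI names.

```python
from importlib import metadata

# Pinned versions copied from the list above. The distribution names are
# the usual PyPI names, which is an assumption about how they were installed.
PINNED = {
    "scipy": "1.5.4",
    "torch": "1.7.0",
    "numpy": "1.19.2",
    "scikit-learn": "0.24.2",
}


def compare_versions(pinned):
    """Map each package to (expected, installed); installed is None if missing."""
    report = {}
    for name, expected in pinned.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        report[name] = (expected, installed)
    return report


if __name__ == "__main__":
    for name, (expected, installed) in compare_versions(PINNED).items():
        status = "ok" if installed == expected else "differs"
        print(f"{name}: expected {expected}, found {installed} ({status})")
```

A build-specific version string such as a CUDA-tagged torch wheel is reported as "differs" by this exact comparison, even when the base version matches.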

Usage

First, go into ./code; then you can use the following command to run our model:

python main.py acm --gpu=0

Here, "acm" can be replaced by "dblp", "aminer" or "freebase".
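
To run all four datasets in sequence, the command above can be wrapped in a small driver. This is only a sketch: `build_command` and `run_all` are hypothetical helpers, not part of the repo, and the script assumes it is started from inside ./code so that main.py is found.

```python
import subprocess
import sys

# Dataset names taken from the usage text above.
DATASETS = ["acm", "dblp", "aminer", "freebase"]


def build_command(dataset, gpu=0):
    """Build the documented command line: python main.py <dataset> --gpu=<id>."""
    return [sys.executable, "main.py", dataset, f"--gpu={gpu}"]


def run_all(gpu=0, dry_run=True):
    """Print each command and, unless dry_run is set, execute it in order."""
    commands = [build_command(name, gpu) for name in DATASETS]
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # Assumes the current working directory is ./code.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    run_all(gpu=0, dry_run=True)
```

With `dry_run=True` the commands are only printed, which is a cheap way to check them before starting the actual training runs.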

Some tips in parameters

We suggest carefully selecting "pos_num" (set in ./data/pos.py), which is the threshold on the number of positives for every node. This is very important to the final results. Of course, more effective ways to select positives are welcome.
In ./code/utils/params.py, besides "lr" and "patience", careful tuning of dropout and tau is also worthwhile.
In our experiments, we assign original features only to nodes of the target type, and one-hot features to nodes of the other types. This is because most of the datasets used provide features only for target nodes in their original version. We therefore believe that if high-quality features are provided for the other node types, the overall results will improve a lot. The AMiner dataset is an example: it has no original features, so nodes of every type are assigned one-hot features. In other words, every node has the same quality of features, and in this case HeCo is far ahead of the other baselines. So if you have high-quality features for the other node types, we strongly suggest trying them!
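
To illustrate what the "pos_num" threshold in the first tip controls, here is a minimal sketch of top-k positive selection. It assumes each node has a non-negative relatedness score to every other node (for example, a count of connecting meta-paths) and keeps at most pos_num of the highest-scoring candidates per node. The function name `select_positives` and the toy scores are hypothetical, and the actual logic in ./data/pos.py may differ.

```python
import numpy as np


def select_positives(scores, pos_num):
    """Return a 0/1 mask with at most pos_num positives per row.

    scores[i, j] is an assumed non-negative relatedness score between
    nodes i and j; a score of zero means j is never a positive of i.
    """
    scores = np.asarray(scores, dtype=float)
    mask = np.zeros(scores.shape, dtype=np.int8)
    for i in range(scores.shape[0]):
        candidates = np.flatnonzero(scores[i] > 0)
        # Sort candidates by descending score; a stable sort keeps the
        # lower node index first when two scores tie.
        order = candidates[np.argsort(-scores[i, candidates], kind="stable")]
        mask[i, order[:pos_num]] = 1
    return mask


# Toy scores for four nodes (made up for illustration only).
toy_scores = np.array([
    [0, 3, 1, 0],
    [3, 0, 2, 2],
    [1, 2, 0, 0],
    [0, 2, 0, 0],
])
toy_mask = select_positives(toy_scores, pos_num=2)
print(toy_mask)
```

A larger pos_num admits weaker candidates as positives, while a smaller one keeps only the strongest, which is one reason the setting can affect the final results so much.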

Cite

Contact

If you have any questions, please feel free to contact me at [email protected]

Owner

Nian Liu
