Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

Last update: Dec 20, 2022

Overview

LMSOC: An Approach for Socially Sensitive Pretraining

Code for reproducing the paper LMSOC: An Approach for Socially Sensitive Pretraining to appear at 2021 Conference on Empirical Methods in Natural Language Processing: Findings.

Abstract

While large-scale pretrained language models have been shown to learn effective linguistic representations for many NLP tasks, there remain many real-world contextual aspects of language that current approaches do not capture. For instance, consider a cloze-test "I enjoyed the ____ game this weekend": the correct answer depends heavily on where the speaker is from, when the utterance occurred, and the speaker's broader social milieu and preferences. Although language depends heavily on the geographical, temporal, and other social contexts of the speaker, these elements have not been incorporated into modern transformer-based language models. We propose a simple but effective approach to incorporate speaker social context into the learned representations of large-scale language models. Our method first learns dense representations of social contexts using graph representation learning algorithms and then primes language model pretraining with these social context representations. We evaluate our approach on geographically-sensitive language-modeling tasks and show a substantial improvement (more than 100% relative lift on MRR) compared to baselines.

Citation

Please cite as:

Kulkarni, V., Mishra, S., & Haghighi, A. (2021). LMSOC: An Approach for Socially Sensitive Pretraining. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings. arXiv

@inproceedings{kulkarni2021lmsoc,
  title={LMSOC: An Approach for Socially Sensitive Pretraining},
  author={Kulkarni, Vivek and Mishra, Shubhanshu and Haghighi, Aria},
  booktitle={Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: Findings},
  year={2021}
  address={Online},
  publisher={Association for Computational Linguistics},
  pages={1--9},
  eprint={2110.10319},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

Reproducibility

NOTE: Dependencies are specified in the notebooks. But we have also encluded an requirements.txt and environment.yml files to install dependencies using pip or conda.

Create Social Context Embeddings via the example notebook embed_time_toy_task.ipynb which contains the implementation of how to embed time for Task 1 in the paper.
Upload the files in data/ to the location where you will run the next notebook.
The notebook lmsoc_train_and_eval_toy_task.ipynb contains the LMSOC training code.
- NOTE: This notebook assumes you have already trained social context embeddings for the data you have (for example, here the social context is time).
- It is a runnable colab notebook which demonstrates the entire process of training and evaluating LMSOC as described in the paper.
- If run, it will reproduce the experimental setup for Task 1 and ultimately yield Figure 2.
- In order to run this notebook in colab, open this notebook in Google Colab and upload the files in "data" directory to your colab workspace.

Security Issues?

Please report sensitive security issues via Twitter's bug-bounty program (https://hackerone.com/twitter) rather than GitHub.

Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining

Related tags

Overview

LMSOC: An Approach for Socially Sensitive Pretraining

Abstract

Citation

Reproducibility

Security Issues?

Owner

Twitter Research

Public Implementation of ChIRo from "Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations"

Syllabic Quantity Patterns as Rhythmic Features for Latin Authorship Attribution

Re-implememtation of MAE (Masked Autoencoders Are Scalable Vision Learners) using PyTorch.

dataset for ECCV 2020 "Motion Capture from Internet Videos"

Code for ViTAS_Vision Transformer Architecture Search

Optical machine for senses sensing using speckle and deep learning

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

Implementing yolov4 target detection and tracking based on nao robot

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

This repository contains the source code of Auto-Lambda and baselines from the paper, Auto-Lambda: Disentangling Dynamic Task Relationships.

Neural network for recognizing the gender of people in photos

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Tools for investing in Python

PyTorch implementation of MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

LRBoost is a scikit-learn compatible approach to performing linear residual based stacking/boosting.

CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution

HINet: Half Instance Normalization Network for Image Restoration

Pixel Consensus Voting for Panoptic Segmentation (CVPR 2020)

Unrolled Generative Adversarial Networks

Deploy optimized transformer based models on Nvidia Triton server