RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Last update: Feb 10, 2022

Overview

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation

This is the implementation of RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation.

Code

To run our code, please use the following commands:

g++ RATE.cpp -o RATE -std=c++11
./RATE [Training File] [Test File] [L, optional, default = 30] [T, optional, default = 1]

For example,

g++ RATE.cpp -o RATE -std=c++11
./RATE Dataset/train.txt Dataset/test.txt 40 1

The prediction results will be in ./result.txt (the first row is the classification result). Then you can run

python eval.py

to obtain evaluation metrics.

Dataset

We release the Europe dataset (Dataset/data.json), where each line is a json file with tweet text and metadata. Due to privacy issues, we have anonymized the whole dataset by representing each word/feature as an integer. An example is shown below.

{ 
   "label":0,
   "language":"3",
   "timezone":"5",
   "offset":"7",
   "userlang":"5",
   "latitude":"36.8901",
   "longitude":"30.6809",
   "text":"3332 2608 29"
}

Given the json file, one can run

cd Dataset/
python preprocess.py

to get training and testing data (Dataset/train.txt and Dataset/test.txt).

Result

Method	Micro-F1 (Acc)	Macro-F1	Mean Distance Error (km)	[email protected]
RATE	0.8905	0.5230	365.16	0.4315

Citation

@inproceedings{zhang2017rate,
  title={RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation},
  author={Zhang, Yu and Wei, Wei and Huang, Binxuan and Carley, Kathleen M and Zhang, Yan},
  booktitle={Proceedings of the 2017 ACM on Conference on Information and Knowledge Management},
  pages={2423--2426},
  year={2017},
  organization={ACM}
}

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Related tags

Overview

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation

Code

Dataset

Result

Citation

Owner

Yu Zhang

The AWS Certified SysOps Administrator

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

A Loss Function for Generative Neural Networks Based on Watson’s Perceptual Model

BLEND: A Fast, Memory-Efficient, and Accurate Mechanism to Find Fuzzy Seed Matches

SPT_LSA_ViT - Implementation for Visual Transformer for Small-size Datasets

KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch

In the AI for TSP competition we try to solve optimization problems using machine learning.

PyTorch implementaton of our CVPR 2021 paper "Bridging the Visual Gap: Wide-Range Image Blending"

Facial Expression Detection In The Realtime

GDSC-ML Team Interview Task

A task-agnostic vision-language architecture as a step towards General Purpose Vision

Framework for evaluating ANNS algorithms on billion scale datasets.

PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks

Autoencoders pretraining using clustering

School of Artificial Intelligence at the Nanjing University (NJU)School of Artificial Intelligence at the Nanjing University (NJU)

Minimal fastai code needed for working with pytorch

Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)

Very deep VAEs in JAX/Flax

Deep metric learning methods implemented in Chainer

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation