Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"

Last update: Oct 08, 2022

CSCBLI

Code for our ACL Findings 2021 paper,
"Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction".

Requirements

python >= 3.6
numpy >= 1.9.0
pytorch >= 1.0

Supervised

How to train

CUDA_VISIBLE_DEVICES=0 python train.py --src_lang $lg --tgt_lang en \
        --static_src_emb_path $ssemb --static_tgt_emb_path $stemb \
        --context_src_emb_path $csemb --context_tgt_emb_path $ctemb \
        --train_data_path $data_path --save_path $save_path

--static_src_emb_path   aligned source static embedding path 
--static_tgt_emb_path   aligned target static embedding path
--context_src_emb_path  source context embedding path
--context_tgt_emb_path  target context embedding path
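The README does not state the on-disk format of the embedding files. As a minimal sketch, assuming they use the common word2vec/fastText text layout (a "count dim" header line, then one word and its vector per line), a loader could look like the following. The function name `load_text_embeddings` and the `max_vocab` parameter are hypothetical and are not part of this repository.

```python
import io

import numpy as np


def load_text_embeddings(handle, max_vocab=None):
    """Parse embeddings from a text stream.

    Assumed layout (not confirmed by the repository): first line is
    "<count> <dim>", every following line is "<word> <v1> ... <v_dim>".
    """
    header = handle.readline().split()
    dim = int(header[1])
    words, rows = [], []
    for line in handle:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip lines that do not match the declared dimension
        words.append(parts[0])
        rows.append([float(value) for value in parts[1:]])
        if max_vocab is not None and len(words) >= max_vocab:
            break
    matrix = np.asarray(rows, dtype=np.float32).reshape(len(words), dim)
    return words, matrix


# Usage with an in-memory sample instead of a real embedding file.
sample = io.StringIO("2 3\nhello 0.1 0.2 0.3\nworld 0.4 0.5 0.6\n")
words, matrix = load_text_embeddings(sample)
print(words, matrix.shape)
```

With a real file, the same function could be called as `load_text_embeddings(open(path, encoding="utf-8"))`, provided the file actually follows the assumed layout.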

How to test

CUDA_VISIBLE_DEVICES=0 python test_on_all_word.py --src_lang $lg \
        --tgt_lang en --model_path $model_path \
        --dict_path $dict_path \
        --vecmap_context_src_emb_path $vcpath \
        --vecmap_context_tgt_emb_path $vspath \
        --vecmap

--vecmap_context_src_emb_path   aligned source context embedding path
--vecmap_context_tgt_emb_path   aligned target context embedding path
--vecmap                        use the interpolation method; omit this flag to use the unified method
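The test command evaluates against the dictionary given by `--dict_path`. As a rough sketch of how such an evaluation is commonly done in bilingual lexicon induction, the code below computes precision@1, assuming the dictionary holds one whitespace-separated "source target" pair per line. The file format, the metric, and the function names are assumptions; the repository's test scripts may score results differently.

```python
from collections import defaultdict


def read_dictionary(lines):
    """Group gold translations by source word.

    Assumes one "source target" pair per line; other lines are ignored.
    """
    gold = defaultdict(set)
    for line in lines:
        parts = line.split()
        if len(parts) == 2:
            gold[parts[0]].add(parts[1])
    return gold


def precision_at_1(gold, predictions):
    """Share of source words whose top prediction is a gold translation.

    Only source words that have a prediction are scored.
    """
    scored = [src for src in gold if src in predictions]
    if not scored:
        return 0.0
    hits = sum(predictions[src] in gold[src] for src in scored)
    return hits / len(scored)


# Usage with a toy dictionary; a source word may have several translations.
gold = read_dictionary(["gato cat", "perro dog", "perro hound"])
print(precision_at_1(gold, {"gato": "cat", "perro": "wolf"}))
```

A source word counts as correct when its top-ranked prediction matches any of its gold translations, which is why the translations are stored as a set per source word.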

Unsupervised

How to train

lg=ar
CUDA_VISIBLE_DEVICES=0 python train.py --src_lang en --tgt_lang $lg \
        --static_src_emb_path $ssemb --static_tgt_emb_path $stemb \
        --context_src_emb_path $csemb --context_tgt_emb_path $ctemb \
        --save_path $save_path

--static_src_emb_path   aligned source static embedding path 
--static_tgt_emb_path   aligned target static embedding path
--context_src_emb_path  source context embedding path
--context_tgt_emb_path  target context embedding path

How to test

src=ar
tgt=en
model_path=../checkpoints/$src-$tgt-add_orign_nw.pkl_last
CUDA_VISIBLE_DEVICES=0 python test.py --model_path $model_path \
        --dict_path ../$src-$tgt.5000-6500.txt --mode v2 \
        --src_lang $src --tgt_lang $tgt \
        --reload_src_ctx $path1 \
        --reload_tgt_ctx $path2 --lambda_w1 0.11

--mode             v1 for the unified method, v2 for the interpolation method
--lambda_w1        the weight used for interpolation
--reload_src_ctx   aligned source context embedding
--reload_tgt_ctx   aligned target context embedding

Owner

Jinpeng Zhang
