A PyTorch implementation of QANet.

Last update: Nov 03, 2022

Related tags

Deep Learning QANet-pytorch

Overview

QANet-pytorch

NOTICE

I'm very busy these months. I'll return to this repo in about 10 days.

Introduction

An implementation of QANet with PyTorch.

Any contributions are welcome!

Current performance

F1	EM	Got by
66	?	InitialBug
64	50	BangLiu

Usage

Install pytorch 0.4 for Python 3.6+
Run pip install -r requirements.txt to install python dependencies.
Run download.sh to download the dataset.
Run python preproc.py to build tensors from the raw dataset.
Run python main.py --mode train to train the model. After training, log/model.pt will be generated.
Run python main.py --mode test to test an pretrained model. Default model file is log/model.pt

Structure

preproc.py: downloads dataset and builds input tensors.

main.py: program entry; functions about training and testing.

models.py: QANet structure.

config.py: configurations.

Differences from the paper

The paper doesn't mention which activation function they used. I use relu.
I don't set the embedding of <UNK> trainable.
The connector between embedding layers and embedding encoders may be different from the implementation of Google, since the description in the paper is inconsistent (residual block can't be used because the dimensions of input and output are different) and they don't say how they implemented it.

TODO

Reduce memory usage
Improve converging speed (to reach 60 F1 scores in 1000 iterations)
Reach state-of-art scroes of the original paper
Performance analysis
Test on SQuAD 2.0

Contributors

InitialBug: found two bugs: (1) positional encodings require gradients; (2) wrong weight sharing among encoders.
linthieda: fixed one issue about dependencies and offered computing resources.
BangLiu: tested the model.
wlhgtc: (1) improved the calculation of Context-Question Attention; (2) fixed a bug that is compacting embeddings before highway nets.

A PyTorch implementation of QANet.

Related tags

Overview

QANet-pytorch

NOTICE

Introduction

Current performance

Usage

Structure

Differences from the paper

TODO

Contributors

Owner

H. Z.

Using VideoBERT to tackle video prediction

Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions

Multimodal Co-Attention Transformer (MCAT) for Survival Prediction in Gigapixel Whole Slide Images

Implemented fully documented Particle Swarm Optimization algorithm (basic model with few advanced features) using Python programming language

StackNet is a computational, scalable and analytical Meta modelling framework

Explaining neural decisions contrastively to alternative decisions.

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Self-Learned Video Rain Streak Removal: When Cyclic Consistency Meets Temporal Correspondence

10x faster matrix and vector operations

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

This repository is the offical Pytorch implementation of ContextPose: Context Modeling in 3D Human Pose Estimation: A Unified Perspective (CVPR 2021).

Create time-series datacubes for supervised machine learning with ICEYE SAR images.

[ECCV 2020] XingGAN for Person Image Generation

Low-dose Digital Mammography with Deep Learning

Official Pytorch Implementation of Relational Self-Attention: What's Missing in Attention for Video Understanding

Detect roadway lanes using Python OpenCV for project during the 5th semester at DHBW Stuttgart for lecture in digital image processing.

PyTorch implementation of the Deep SLDA method from our CVPRW-2020 paper "Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis"

Image reconstruction done with untrained neural networks.

A PyTorch implementation of a Factorization Machine module in cython.

CLOOB training (JAX) and inference (JAX and PyTorch)