Neural Caption Generator with Attention

Last update: Nov 30, 2022

Overview

Neural Caption Generator with Attention

Tensorflow implementation of "Show, attend and Tell" http://arxiv.org/abs/1502.03044
Borrowed most of the idea from the author's source code https://github.com/kelvinxu/arctic-captions

Code

make_flickr_dataset.py: Extracts conv5_3 layer activations of VGG Network for flickr30k images, and save them in 'data/feats.npy'
model_tensorflow.py: Main codes

Usage

Download flickr30k Dataset.
Extract VGG conv5_3 features using make_flickr_dataset.py
Train: run train() in model_tensorflow.py
Test: run test() in model_tensorflow.py

Owner

Taeksoo Kim

GitHub Repository

2021 National Underwater Robotics Vision Optics

2021-National-Underwater-Robotics-Vision-Optics 2021年全国水下机器人算法大赛-光学赛道-B榜精度第18名 (Kilian_Di的团队：A榜[email pro

9 Nov 04, 2022

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

126 Dec 21, 2022

A testcase generation tool for Persistent Memory Programs.

PMFuzz PMFuzz is a testcase generation tool to generate high-value tests cases for PM testing tools (XFDetector, PMDebugger, PMTest and Pmemcheck) If

14 Jul 24, 2022

Official implement of "CAT: Cross Attention in Vision Transformer".

CAT: Cross Attention in Vision Transformer This is official implement of "CAT: Cross Attention in Vision Transformer". Abstract Since Transformer has

100 Dec 15, 2022

Multi Task Vision and Language

12-in-1: Multi-Task Vision and Language Representation Learning Please cite the following if you use this code. Code and pre-trained models for 12-in-

712 Dec 19, 2022

TensorFlow implementation of "Variational Inference with Normalizing Flows"

[TensorFlow 2] Variational Inference with Normalizing Flows TensorFlow implementation of "Variational Inference with Normalizing Flows" [1] Concept Co

7 Jun 08, 2022

Package for working with hypernetworks in PyTorch.

71 Jan 05, 2023

Trustworthy AI related projects

Trustworthy AI This repository aims to include trustworthy AI related projects from Huawei Noah's Ark Lab. Current projects include: Causal Structure

589 Dec 30, 2022

Rank 1st in the public leaderboard of ScanRefer (2021-03-18)

InstanceRefer InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual Referring

63 Dec 07, 2022

Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.

openpifpaf Continuously tested on Linux, MacOS and Windows: New 2021 paper: OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Te

50 Dec 29, 2022

🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"

SGLKT-VisDial Pytorch Implementation for the paper: Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer Gi-Cheon Kang, Junseok P

9 Jul 05, 2022

Neural Caption Generator with Attention

Related tags

Overview

Neural Caption Generator with Attention

Code

Usage

Owner

Taeksoo Kim

2021 National Underwater Robotics Vision Optics

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

A testcase generation tool for Persistent Memory Programs.

Official implement of "CAT: Cross Attention in Vision Transformer".

Multi Task Vision and Language

TensorFlow implementation of "Variational Inference with Normalizing Flows"

Package for working with hypernetworks in PyTorch.

Trustworthy AI related projects

Rank 1st in the public leaderboard of ScanRefer (2021-03-18)

Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.

The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"

Code for our paper "Sematic Representation for Dialogue Modeling" in ACL2021

Supporting code for "Autoregressive neural-network wavefunctions for ab initio quantum chemistry".

NumQMBasic - A mini-course offered to Undergrad physics students

Adversarial Graph Augmentation to Improve Graph Contrastive Learning

Dataset and codebase for NeurIPS 2021 paper: Exploring Forensic Dental Identification with Deep Learning

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

Space-event-trace - Tracing service for spaceteam events

code for our ECCV 2020 paper "A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation"

🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"