Image Captioning using CNN, LSTM and Attention

Last update: Dec 16, 2021

Related tags

Deep Learning imagecaptioningproject

Overview

Image Captioning using CNN, LSTM and Attention

This is a deep learning model that generates a short text caption summarizing an image.

Installation

Install this project with pip3. Use Python 3.7.

  pip3 install -r requirements.txt
  python3 app.py

These commands apply if you want to try the website on localhost.

You can also install Docker, build an image from the Dockerfile, and run it.

  docker build -f Dockerfile -t imagecaptioning:api .
  docker run -p 8080:8080 -ti imagecaptioning:api

Deployment

To deploy this project on Google Cloud App Engine, first create a project in App Engine. Install the Google Cloud SDK on your local machine so you can push projects, then run the following commands.

  gcloud init
  gcloud app deploy

Choose the right project and then push the application to the cloud. This is a monolithic application, so a single Docker image is built on App Engine.

Demo

Link to demo: https://lucky-dahlia-333406.el.r.appspot.com/index

FAQ

Why is this project implemented in TensorFlow?

TensorFlow is actively maintained by Google and is convenient to deploy on a server. It automatically uses a GPU for training if it finds one.
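To see which GPUs TensorFlow would pick up, a small check along these lines may help. It is a minimal sketch, assuming TensorFlow 2.x; the helper name `visible_gpu_names` is hypothetical and not part of this project.

```python
# Assumption: TensorFlow 2.x. The import is guarded so the snippet also
# runs (and reports no GPUs) on a machine where TensorFlow is not installed.
try:
    import tensorflow as tf
except ImportError:
    tf = None


def visible_gpu_names():
    """Return the names of the GPUs TensorFlow can see (empty list if none)."""
    if tf is None:
        return []
    return [device.name for device in tf.config.list_physical_devices("GPU")]


if __name__ == "__main__":
    gpus = visible_gpu_names()
    print("GPUs visible to TensorFlow:", gpus if gpus else "none, running on CPU")
```

An empty list means training falls back to the CPU.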

What is the BLEU score?

BLEU, or the Bilingual Evaluation Understudy, is a score for comparing a candidate translation of text to one or more reference translations. Although developed for translation, it can be used to evaluate text generated for a range of natural language processing tasks.

In this project, the BLEU score is used to evaluate and score candidate text, computed with the NLTK library in Python.
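To make the definition concrete, here is a minimal pure-Python sketch of sentence-level BLEU (clipped n-gram precision combined with a brevity penalty, uniform weights, no smoothing). It is an illustration only, not this project's evaluation code; the function names are hypothetical. In practice, NLTK provides `nltk.translate.bleu_score.sentence_bleu` for this purpose.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(references, candidate, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it occurs in any single reference."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)
    clipped = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def bleu(references, candidate, max_n=4):
    """Sentence-level BLEU with uniform weights and no smoothing."""
    precisions = [modified_precision(references, candidate, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0  # without smoothing, any zero precision gives a zero score
    c = len(candidate)
    # Use the reference length closest to the candidate length.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(sum(math.log(p) for p in precisions) / max_n)


if __name__ == "__main__":
    reference = "a dog runs on the grass".split()
    candidate = "a dog runs on grass".split()
    print(bleu([reference], candidate, max_n=2))
```

Short captions often share no 4-grams with the reference, so an unsmoothed score can drop to zero; lowering `max_n` or applying a smoothing method is a common workaround.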

Authors

License

MIT

Owner

ASUTOSH GHANTO
