Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Last update: Jul 27, 2021

Related tags

Overview

Segmentation from Natural Language Expressions

This repository contains the Caffe reimplementation of the following paper:

R. Hu, M. Rohrbach, T. Darrell, Segmentation from Natural Language Expressions. in arXiv:1603.06180, 2016. (PDF)

@article{hu2016segmentation,
  title={Segmentation from Natural Language Expressions},
  author={Hu, Ronghang and Rohrbach, Marcus and Darrell, Trevor},
  journal={arXiv preprint arXiv:1603.06180},
  year={2016}
}

Project Page: http://ronghanghu.com/text_objseg

Installation

Install Caffe following the instructions here.
Download this repository or clone with Git, and then cd into the root directory of the repository.

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Download ReferIt dataset:

./referit/referit-dataset/download_referit_dataset.sh

Download the caffemodel for VGG-16 network parameters trained on ImageNET 1000 classes.

Training

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=/path/to/text_objseg_caffe/:$PYTHONPATH

Build training batches for bounding boxes:

python referit/build_training_batches_det.py

Build training batches for segmentation:

python referit/build_training_batches_seg.py

Configure the config.py file in the directory det_model and train the language-based bounding box localization model:

python det_model/train_det_model.py

Configure the config.py file in the directory seg_low_res_model and train the low resolution language-based segmentation model (from the previous bounding box localization model):

python seg_low_res_model/train_low_res_model.py

Configure the config.py file in the directory seg_model and train the high resolution language-based segmentation model (from the previous low resolution segmentation model):

python seg_model/train_seg_model.py

Evaluation

You may need to add the repository root directory to Python's module path:

export PYTHONPATH=path/to/text_objseg_caffe:$PYTHONPATH

Configure the test_config.py file in the directory seg_model and run evaluation for the high resolution language-based segmentation model:

python seg_model/test_seg_model.py

This should reproduce the results in the paper. You may also evaluate the language-based bounding box localization model:

python det_model/test_det_model.py

The results can be compared to this paper.

Demo

There is a demo that you can try! Run the demo in ./demo/text_objseg_demo.ipynb with Jupyter Notebook (IPython Notebook).

Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

Related tags

Overview

Segmentation from Natural Language Expressions

Installation

Training and evaluation on ReferIt Dataset

Download dataset and VGG network

Training

Evaluation

Demo

Owner

Caffe-like explicit model constructor. C(onfig)Model

Invariant Causal Prediction for Block MDPs

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

TLXZoo - Pre-trained models based on TensorLayerX

Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" [BMVC 2021].

SpeechNAS Better Trade off between Latency and Accuracy for Large Scale Speaker Verification

Open-source python package for the extraction of Radiomics features from 2D and 3D images and binary masks.

Bald-to-Hairy Translation Using CycleGAN

TimeSHAP explains Recurrent Neural Network predictions.

How Effective is Incongruity? Implications for Code-mix Sarcasm Detection.

The code release of paper Low-Light Image Enhancement with Normalizing Flow

So-ViT: Mind Visual Tokens for Vision Transformer

Official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

A curated list of resources for Image and Video Deblurring

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

Composable transformations of Python+NumPy programsComposable transformations of Python+NumPy programs