Deep Learning for Human Part Discovery in Images - Chainer implementation

Last update: Sep 25, 2022

Overview

NOTE: This is not the official implementation. The original paper is "Deep Learning for Human Part Discovery in Images".

We are currently reproducing the experiments from the original paper. Contributions are welcome!

Requirements

- Python 2.7.11+
- Chainer 1.10+
- numpy 1.9+
- scipy 0.16+
- six
- matplotlib
- tqdm
- cv2 (opencv)
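
As a quick sanity check before training, the sketch below tries to import each package from the list above and reports the ones that fail. It is not part of the repository: the names `REQUIRED_MODULES` and `find_missing` are illustrative, and it checks only that each module can be imported, not the minimum versions.

```python
import importlib

# Import names for the packages listed under Requirements.
REQUIRED_MODULES = ["chainer", "numpy", "scipy", "six", "matplotlib", "tqdm", "cv2"]


def find_missing(modules):
    """Return the module names from `modules` that cannot be imported."""
    missing = []
    for name in modules:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing
```

Calling `find_missing(REQUIRED_MODULES)` returns an empty list when everything is installed.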

Preparation

Data

bash prepare.sh

This script downloads the VOC 2010 dataset (http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar) and the original authors' dataset (http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz).
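
If bash is not available, the same two downloads can be approximated in Python. The following is only a sketch in Python 3 (the repository itself targets Python 2.7), and it is not a port of prepare.sh, whose contents are not shown here. The destination directory `data` and the function names are assumptions, and the directory layout that train.py expects may differ.

```python
import os
import tarfile
import urllib.request

# URLs copied from the description of prepare.sh above.
DATASET_URLS = [
    "http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar",
    "http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz",
]


def archive_name(url):
    """Return the file name at the end of a URL."""
    return url.rstrip("/").rsplit("/", 1)[-1]


def fetch_and_extract(url, dest_dir="data"):
    """Download one archive into dest_dir (a hypothetical location) and unpack it there."""
    os.makedirs(dest_dir, exist_ok=True)
    archive_path = os.path.join(dest_dir, archive_name(url))
    # Skip the download if the archive is already on disk.
    if not os.path.exists(archive_path):
        urllib.request.urlretrieve(url, archive_path)
    # tarfile detects plain .tar and .tar.gz automatically in the default mode.
    with tarfile.open(archive_path) as archive:
        archive.extractall(dest_dir)
    return archive_path
```

To fetch both datasets, call `fetch_and_extract(url)` for each entry in `DATASET_URLS`.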

Model

You can download a pre-trained FCN model from here.

We will use the weights of this model and train a new model on the VOC dataset.

Start training

python train.py -g 0 -b 3 -e 3000 -l on -s on

Possible options

python train.py --help

GPU memory requirement

Citation from the original paper:

Each minibatch consists of just one image. The learning rate and momentum are fixed to 1e-10 and 0.99, respectively. We train the refinement layer by layer, which takes two days per refinement layer. Thus, the overall training starting from the pre-trained VGG network took 10 days on a single GPU.
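
The learning rate and momentum in the quote are the two parameters of SGD with momentum. As a rough illustration, here is one update step in plain Python, assuming the common formulation v = momentum * v - lr * grad followed by w = w + v. The quote does not say which variant the authors used, and the function name is illustrative rather than taken from train.py.

```python
def momentum_sgd_step(weight, grad, velocity, lr=1e-10, momentum=0.99):
    """Apply one momentum SGD update and return (new_weight, new_velocity).

    The defaults are the values quoted from the paper. The update rule itself
    is an assumed, commonly used formulation.
    """
    velocity = momentum * velocity - lr * grad
    return weight + velocity, velocity
```

With momentum 0.99, each step carries over 99% of the previous velocity, so gradients are effectively averaged over many single-image minibatches.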

The current maximum batch size is 3 on a GPU with 12 GB of memory.

It has also been confirmed that a MacBook Pro (Late 2016, 16 GiB of memory) can run with a batch size of 1.

Result

In preparation.

Visualize Prediction

python visualize.py -f PATH_TO_IMAGE_FILE

LICENSE

MIT License.

Author

shiba24, August 2016.

Contributors

bobye

Owner

Shintaro Shiba
